Vaibhav Srivastav's picture

Vaibhav Srivastav PRO

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

liked a model about 24 hours ago

ICTNLP/llava-mini-llama-3.1-8b

upvoted a paper 2 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

liked a model 3 days ago

StephanST/WALDO30

View all activity

Articles

Faster Text Generation with Self-Speculative Decoding

Llama can now see and run on your device - welcome Llama 3.2

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

WWDC 24: Running Mistral 7B with Core ML

Welcome Gemma 2 - Google's new open LLM

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

CodeGemma - an official Google release for code LLMs

TTS Arena: Benchmarking Text-to-Speech Models in the Wild

AI Watermarking 101: Tools and Techniques

Deploy MusicGen in no time with Inference Endpoints

Jupyter X Hugging Face

Swift Diffusers: Fast Stable Diffusion for Mac

Organizations

reach-vb's activity

upvoted a paper 2 days ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 3 days ago • 175

upvoted a collection 3 days ago

Sa2VA model zoo

3 items • Updated 3 days ago • 20

upvoted a collection 4 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated about 9 hours ago • 206

upvoted a paper 8 days ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published 11 days ago • 15

upvoted a collection 8 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 5 days ago • 74

upvoted 2 collections 9 days ago

Yi VL

2 items • Updated May 11, 2024 • 2

Falcon2

5 items • Updated 3 days ago • 5

upvoted 5 collections 11 days ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 10 days ago • 39

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 106

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 457

Chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. • 2 items • Updated Jul 9, 2024 • 28

Stable Diffusion 3

Stable Diffusion 3 and related models for text-to-image and image-to-image • 2 items • Updated 2 days ago • 92

upvoted a collection 16 days ago

DeepSeek-V3

3 items • Updated 6 days ago • 108

upvoted 2 collections 19 days ago

NeMo Audio Codecs

A series of Neural Audio Codecs • 5 items • Updated about 9 hours ago • 10

InternVL2.5-MPO

Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 1 day ago • 24

upvoted an article 19 days ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

•

19 days ago

• 12

upvoted a collection 22 days ago

📐 FineMath

FineMath datasets and ablation models • 14 items • Updated 5 days ago • 17

upvoted 2 collections 23 days ago

Bamba

Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 24 days ago • 18

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated 24 days ago • 47

upvoted a collection 25 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 3 days ago • 78