jina-embeddings-v3 Collection Multilingual multi-task general text embedding model • 6 items • Updated Sep 19, 2024 • 20
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 10 days ago • 91
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published 17 days ago • 89
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 24 days ago • 121
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 23 days ago • 122
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 3 days ago • 71
Whisper Release Collection Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large. • 12 items • Updated Sep 13, 2023 • 93
Whisper Collection OpenAI Whisper speech recognition models in MLX format • 48 items • Updated Oct 1, 2024 • 22
Llama 3.3 Evals Collection This collection provides detailed information on how we derived the reported benchmark metrics for the Llama 3.3 models, including the configurations • 1 item • Updated Dec 6, 2024 • 12
Llama 3.3 Collection This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 106
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 20 days ago • 30
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 26 days ago • 30
Ovis1.6 Collection With 29B parameters, Ovis1.6-Gemma2-27B achieves exceptional performance in the OpenCompass benchmark, ranking among the top-tier open-source MLLMs. • 5 items • Updated Nov 26, 2024 • 10