Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP Paper • 2408.04303 • Published Aug 8, 2024 • 14
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published 9 days ago • 45
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 24 days ago • 121
📚 FineWeb-Edu Collection FineWeb-Edu datasets, classifier and ablation model • 5 items • Updated Jun 12, 2024 • 13
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 40
view article Article Advanced Flux Dreambooth LoRA Training with 🧨 diffusers By linoyts • Oct 21, 2024 • 32
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14, 2024 • 61
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 23 items • Updated 3 days ago • 46
WebInstruct 🌐 Embeddings 🧱 Models Collection A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated Sep 4, 2024 • 11
LLaVA-OneVision Collection a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 20
embeddings-spanish-models 🎯 Collection A collection with embeddings models I fine-tuned for better performance in Spanish texts. • 4 items • Updated about 21 hours ago • 2
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging By akjindal53244 • Aug 19, 2024 • 75
view article Article Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth By mlabonne • Jul 29, 2024 • 262