view article Article Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI β’ 4 days ago β’ 17
view article Article Synthetic Data Generation with FastData and Hugging Face By asoria β’ 4 days ago β’ 12
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 9 days ago β’ 36
view article Article β΄οΈ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang β’ 8 days ago β’ 11
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 8 days ago β’ 29
view article Article Superposition in Transformers: A Novel Way of Building Mixture of Experts By BenChaliah β’ 7 days ago β’ 14
view article Article AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI By Kseniase β’ 7 days ago β’ 3
view article Article Building Effective Agents with Anthropicβs Best Practices and smolagents β€οΈ By Sri-Vigneshwar-DJ β’ 7 days ago β’ 4
view article Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods β’ 6 days ago β’ 15
view article Article How to Automate Reddit Comment Generation with AI Agents in KaibanJS By darielnoel β’ 5 days ago β’ 2
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw β’ 5 days ago β’ 21
view article Article Accelerating Language Model Inference with Mixture of Attentions By hba123 β’ 4 days ago β’ 24
Datasets: A Community Library for Natural Language Processing Paper β’ 2109.02846 β’ Published Sep 7, 2021 β’ 11
view article Article π¦Έπ»#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase β’ 14 days ago β’ 9
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 12 days ago β’ 22
view article Article Finetuning Falcon 7b in a hybrid distributed fashion By Neo111x β’ 11 days ago β’ 4
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper β’ 2412.17498 β’ Published 19 days ago β’ 21