view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 12 days ago β’ 22
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 9 days ago β’ 36
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 8 days ago β’ 29
view article Article Accelerating Language Model Inference with Mixture of Attentions By hba123 β’ 4 days ago β’ 24