Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required Nov 4, 2024 • 41
Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality Jun 24, 2024 • 33
view article Article Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI • 4 days ago • 17
view article Article FineWeb2-C: Help Build Better Language Models in Your Language By davanstrien • 19 days ago • 12
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 12 days ago • 22
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation Paper • 2412.03304 • Published Dec 4, 2024 • 17
view article Article Let’s make a generation of amazing image generation models By burtenshaw • Nov 26, 2024 • 34
Dataset Creation Collection Spaces and utilities for creating datasets and getting them on the Hub • 3 items • Updated Nov 10, 2024 • 10
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • Nov 19, 2024 • 101
view article Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17, 2024 • 55
view article Article How to optimize your data labelling project with custom interfaces By burtenshaw • Oct 16, 2024 • 18
LLM Reasoning Papers Collection Papers to improve reasoning capabilities of LLMs • 18 items • Updated 1 day ago • 99
Critique-out-Loud Reward Models Collection Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud • 7 items • Updated Sep 5, 2024 • 3
view article Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 By manu • Jul 5, 2024 • 186