Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models Paper • 2409.17146 • Published Sep 25, 2024 • 106
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer Paper • 2308.15459 • Published Aug 29, 2023 • 1
Large Language Models Can Self-Improve At Web Agent Tasks Paper • 2405.20309 • Published May 30, 2024 • 2
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings Paper • 2406.15586 • Published Jun 21, 2024 • 2
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25, 2024 • 90
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows Paper • 2402.10379 • Published Feb 16, 2024 • 30
Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package Paper • 1810.11190 • Published Oct 26, 2018
Low-Resource Authorship Style Transfer with In-Context Learning Paper • 2212.08986 • Published Dec 18, 2022
Learning Interpretable Style Embeddings via Prompting LLMs Paper • 2305.12696 • Published May 22, 2023 • 3
Distributed Inference and Fine-tuning of Large Language Models Over The Internet Paper • 2312.08361 • Published Dec 13, 2023 • 25
Git-Theta: A Git Extension for Collaborative Development of Machine Learning Models Paper • 2306.04529 • Published Jun 7, 2023 • 1
Crosslingual Generalization through Multitask Finetuning Paper • 2211.01786 • Published Nov 3, 2022 • 2
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 27