38 72 309

Edoardo Federici

efederici

https://banda-larga.github.io

AI & ML interests

llms, ir, graphs & co

Recent Activity

liked a model about 23 hours ago

llamaindex/vdr-2b-multi-v1

liked a dataset 22 days ago

argilla/ifeval-like-data

updated a dataset 28 days ago

mii-llm/train_eval_mix

View all activity

Organizations

efederici's activity

upvoted a paper about 2 months ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 53

upvoted an article 3 months ago

Article

Visually Multilingual: Introducing mcdse-2b

•

Oct 27, 2024

• 37

upvoted 4 papers 3 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 83

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 12

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted 2 papers 4 months ago

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5, 2024 • 27

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 47

upvoted an article 4 months ago

Article

Selective fine-tuning of Language Models with Spectrum

•

Sep 3, 2024

• 30

upvoted a paper 5 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

upvoted a collection 5 months ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 36

upvoted 2 papers 6 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1, 2024 • 39

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 127

upvoted a paper 7 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 66

upvoted 6 papers 8 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 31

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 87

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 37

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 108

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 47