Manuel Romero's picture

Manuel Romero PRO

mrm8488

·

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

updated a model about 20 hours ago

mrm8488/ModernBERT-base-ft-fineweb-multilingual-sentiment-es-2k

updated a collection about 21 hours ago

embeddings-spanish-models 🎯

updated a model about 21 hours ago

mrm8488/modernbert-embed-base-ft-sts-spanish-matryoshka-768-64

View all activity

Organizations

mrm8488's activity

upvoted a paper 2 days ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 14

upvoted a paper 6 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 9 days ago • 45

upvoted a paper 22 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 24 days ago • 121

upvoted a collection about 1 month ago

📚 FineWeb-Edu

FineWeb-Edu datasets, classifier and ablation model • 5 items • Updated Jun 12, 2024 • 13

upvoted a paper about 1 month ago

GEITje 7B Ultra: A Conversational Model for Dutch

Paper • 2412.04092 • Published Dec 5, 2024 • 3

upvoted a paper about 2 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 40

upvoted 3 articles 3 months ago

Article

Allegro: Advanced Video Generation Model

By

•

Oct 22, 2024

• 57

Article

Advanced Flux Dreambooth LoRA Training with 🧨 diffusers

By

•

Oct 21, 2024

• 32

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14, 2024

• 61

upvoted an article 4 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 180

upvoted 2 collections 4 months ago

Llama 3.2

Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 23 items • Updated 3 days ago • 46

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 5 days ago • 292

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 215

upvoted 2 collections 4 months ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated Sep 4, 2024 • 11

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5, 2024 • 20

upvoted an article 4 months ago

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29, 2024

• 27

upvoted a collection 4 months ago

embeddings-spanish-models 🎯

A collection with embeddings models I fine-tuned for better performance in Spanish texts. • 4 items • Updated about 21 hours ago • 2

upvoted 3 articles 5 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19, 2024

• 75

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 262