Miguel Guerrero

apol

http://miguelguerrero.me/

AI & ML interests

nlp, avatars, gans, time series, alzheimer, education

Recent Activity

liked a model about 3 hours ago

unsloth/phi-4

liked a model 3 days ago

microsoft/phi-4

liked a model 3 days ago

cognitivecomputations/Dolphin3.0-Llama3.1-8B

apol's activity

upvoted 2 collections 4 days ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 3 days ago • 21

Cosmos

The collection of Cosmos models • 31 items • Updated about 13 hours ago • 206

upvoted a collection 12 days ago

Smol but mighty

A collection of smoll but mighty models • 10 items • Updated 24 days ago • 4

upvoted a collection 15 days ago

DeepSeek-V3

3 items • Updated 6 days ago • 108

upvoted a paper 22 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 23 days ago • 339

upvoted a collection 23 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 23 days ago • 122

upvoted a paper 23 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 24 days ago • 121

upvoted a collection 24 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 3 days ago • 78

upvoted a paper 29 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published about 1 month ago • 101

upvoted a paper about 1 month ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 22

upvoted 2 collections about 1 month ago

Dec 6 Releases 🎄

28 items • Updated Dec 9, 2024 • 10

Llama 3.3 (All Versions)

Meta's new Llama 3.3 (70B) model in all formats. Includes GGUF, 4-bit bnb and original versions. • 3 items • Updated 3 days ago • 29

upvoted a paper about 1 month ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 121

upvoted a collection about 1 month ago

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 29 days ago • 125

upvoted a paper about 1 month ago

MALT: Improving Reasoning with Multi-Agent LLM Training

Paper • 2412.01928 • Published Dec 2, 2024 • 40

upvoted an article about 2 months ago

Halo: Open Source Health Tracking with Wearables

Article • Nov 19, 2024 • 101

upvoted a collection about 2 months ago

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 3 days ago • 21

upvoted a collection 2 months ago

🇫🇷 Calme-3

Here you can find all the new Calme-3 models • 27 items • Updated 10 days ago • 10

upvoted a paper 2 months ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 53

upvoted an article 2 months ago

Introducing GGUF-my-LoRA

Article • Nov 1, 2024 • 13