Muhtasham Oblokulov's picture

Muhtasham Oblokulov PRO

muhtasham

·

https://www.linkedin.com/in/muhtasham/

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

OS-Copilot/OS-Atlas-Base-7B

liked a dataset 3 days ago

MariusHobbhahn/swe-bench-verified-mini

upvoted a collection 4 days ago

Deepseek V3 (All Versions)

View all activity

Organizations

muhtasham's activity

upvoted 2 collections 4 days ago

Deepseek V3 (All Versions)

Deepseek V3 - available in bf16, original, and GGUF formats, with support for 2, 3, 4, 5, 6 and 8-bit quantized versions. • 3 items • Updated 3 days ago • 21

Cosmos

The collection of Cosmos models • 31 items • Updated about 12 hours ago • 206

upvoted a paper 5 days ago

Maya: An Instruction Finetuned Multilingual Multimodal Model

Paper • 2412.07112 • Published Dec 10, 2024 • 26

upvoted a collection 6 days ago

marc

5 items • Updated Nov 11, 2024 • 2

upvoted a collection 8 days ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated 8 days ago • 16

upvoted a collection 22 days ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 3 days ago • 78

upvoted a collection 23 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 23 days ago • 122

upvoted an article about 1 month ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

By

•

Dec 8, 2024

• 21

upvoted a collection about 1 month ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated 20 days ago • 208

upvoted 2 collections about 2 months ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 5 days ago • 74

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated about 12 hours ago • 25

upvoted 2 collections 2 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 20 days ago • 198

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 101

upvoted a paper 2 months ago

MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding

Paper • 2408.11049 • Published Aug 20, 2024 • 12

upvoted an article 3 months ago

Article

How to build a custom text classifier without days of human labeling

By

•

Oct 17, 2024

• 55

upvoted 3 collections 5 months ago

⛈️ Llama-3.1 Storm Models

Fine-tuned Llama 3.1 8B model with superior reasoning, conversation abilities, and function calling! • 3 items • Updated Aug 25, 2024 • 15

Tower

Model weights and SFT data for Tower. • 11 items • Updated Nov 15, 2024 • 26

Code Evaluation

Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15

upvoted an article 5 months ago

Article

Mixture of Depth is Vibe

By

•

Apr 22, 2024

• 44

upvoted a collection 5 months ago

Llama-3.1 Quantization

Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 42