
Victor Mustar PRO

victor · victormustar

AI & ML interests

Building the UX of this website

Recent Activity

liked a model 8 minutes ago

NovaSky-AI/Sky-T1-32B-Preview

liked a Space about 17 hours ago

fffiloni/Sa2VA-simple-demo

liked a Space about 23 hours ago

declare-lab/TangoFlux


Articles

Inference for PROs

Organizations

victor's activity

upvoted a paper 2 days ago

DarkIR: Robust Low-Light Image Restoration

Paper • 2412.13443 • Published 25 days ago • 4

upvoted a collection 2 days ago

Phi-4 (All Versions)

Microsoft's new Phi-4 model in all formats. Includes GGUF, 4-bit bnb and original versions. Includes Unsloth's bug fixes. • 4 items • Updated 3 days ago • 22

upvoted a collection 3 days ago

Sa2VA model zoo

3 items • Updated 3 days ago • 20

upvoted a paper 4 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 5 days ago • 33

upvoted a collection 4 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated about 10 hours ago • 206

upvoted 3 papers 5 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 10 days ago • 91

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Paper • 2501.01895 • Published 8 days ago • 43

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 40

upvoted an article 8 days ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Article • 9 days ago • 36

upvoted a paper 10 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 18 days ago • 69

upvoted 4 papers 12 days ago

How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation

Paper • 2412.18573 • Published 18 days ago • 1

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 33

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 23 days ago • 85

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 17 days ago • 89

upvoted a paper 15 days ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 19 days ago • 61

upvoted a collection 16 days ago

DeepSeek-V3

3 items • Updated 6 days ago • 108

upvoted a paper 18 days ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 23 days ago • 50

upvoted a collection 20 days ago

Vision Language Models

Grounding, chat • 5 items • Updated 11 days ago • 10

upvoted a paper 21 days ago

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 24 days ago • 49

upvoted a paper 22 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12