rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking • Paper • 2501.04519 • Published 3 days ago • 176
Agent Laboratory: Using LLM Agents as Research Assistants • Paper • 2501.04227 • Published 4 days ago • 59
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos • Paper • 2501.04001 • Published 4 days ago • 34
Cosmos World Foundation Model Platform for Physical AI • Paper • 2501.03575 • Published 5 days ago • 52
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation • Paper • 2412.21059 • Published 12 days ago • 17
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM • Paper • 2501.01904 • Published 8 days ago • 27
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction • Paper • 2501.01957 • Published 8 days ago • 32
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation • Paper • 2501.03059 • Published 5 days ago • 17
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models • Paper • 2501.01423 • Published 9 days ago • 34
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM • Paper • 2501.00599 • Published 11 days ago • 40
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis • Paper • 2412.19723 • Published 15 days ago • 78
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search • Paper • 2412.18319 • Published 18 days ago • 35
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization • Paper • 2412.17739 • Published 19 days ago • 39
Outcome-Refining Process Supervision for Code Generation • Paper • 2412.15118 • Published 23 days ago • 19
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners • Paper • 2412.17256 • Published 20 days ago • 45
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up • Paper • 2412.16112 • Published 22 days ago • 21
No More Adam: Learning Rate Scaling at Initialization is All You Need • Paper • 2412.11768 • Published 26 days ago • 41
ColorFlow: Retrieval-Augmented Image Sequence Colorization • Paper • 2412.11815 • Published 26 days ago • 26