rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 3 days ago • 176
Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision Paper • 2407.06189 • Published Jul 8, 2024 • 26
Applied Machine Learning Papers Collection Reading List (Mainly Focused on VLMs and Diffusion Models) • 48 items • Updated 25 days ago • 1
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 30 days ago • 85
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 29 days ago • 136
AnimateLCM: Accelerating the Animation of Personalized Diffusion Models and Adapters with Decoupled Consistency Learning Paper • 2402.00769 • Published Feb 1, 2024 • 22
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper • 2412.00174 • Published Nov 29, 2024 • 22
VEnhancer: Generative Space-Time Enhancement for Video Generation Paper • 2407.07667 • Published Jul 10, 2024 • 14
VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models Paper • 2411.13503 • Published Nov 20, 2024 • 30