Diving into Self-Evolving Training for Multimodal Reasoning Paper • 2412.17451 • Published 19 days ago • 42
Investigating Copyright Issues of Diffusion Models under Practical Scenarios Paper • 2311.12803 • Published Sep 15, 2023 • 1
Subclass-balancing Contrastive Learning for Long-tailed Recognition Paper • 2306.15925 • Published Jun 28, 2023
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20, 2024 • 15
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20, 2024 • 15
Locality Sensitive Sparse Encoding for Learning World Models Online Paper • 2401.13034 • Published Jan 23, 2024
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning Paper • 2402.03046 • Published Feb 5, 2024 • 6
Bootstrapping Language Models with DPO Implicit Rewards Paper • 2406.09760 • Published Jun 14, 2024 • 38
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? Paper • 2407.10956 • Published Jul 15, 2024 • 6
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published Nov 7, 2024 • 113