Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 2 days ago • 43
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs Paper • 2412.21187 • Published 12 days ago • 34
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents Paper • 2310.11667 • Published Oct 18, 2023 • 3
SDPO: Segment-Level Direct Preference Optimization for Social Agents Paper • 2501.01821 • Published 8 days ago • 17
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Paper • 2501.01904 • Published 8 days ago • 27
Test-time Computing: from System-1 Thinking to System-2 Thinking Paper • 2501.02497 • Published 6 days ago • 31
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Paper • 2501.03226 • Published 5 days ago • 33