Kai Zuberbühler's picture

413 290

Kai Zuberbühler

kaizuberbuehler

·

k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

updated a collection 1 day ago

LM Prompt Engineering

updated a collection 1 day ago

upvoted a paper 1 day ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

View all activity

Organizations

None yet

kaizuberbuehler's activity

updated 2 collections 1 day ago

LM Prompt Engineering

28 items • Updated 1 day ago

Reasoning

37 items • Updated 1 day ago

upvoted a paper 1 day ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 2 days ago • 43

updated a collection 2 days ago

Reasoning

37 items • Updated 1 day ago

upvoted a paper 2 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 12 days ago • 34

updated 2 collections 2 days ago

Benchmarks

48 items • Updated 2 days ago • 1

Agents

67 items • Updated 2 days ago • 3

upvoted a paper 2 days ago

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Paper • 2310.11667 • Published Oct 18, 2023 • 3

updated a collection 2 days ago

Agents

67 items • Updated 2 days ago • 3

upvoted a paper 2 days ago

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Paper • 2501.01821 • Published 8 days ago • 17

updated 3 collections 2 days ago

Vision Language Models

53 items • Updated 2 days ago • 5

Reasoning

37 items • Updated 1 day ago

LM Training

64 items • Updated 2 days ago • 1

upvoted a paper 2 days ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 8 days ago • 27

updated a collection 2 days ago

LM Training

64 items • Updated 2 days ago • 1

upvoted a paper 2 days ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published 7 days ago • 20

updated a collection 2 days ago

Reasoning

37 items • Updated 1 day ago

upvoted a paper 2 days ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 6 days ago • 31

updated a collection 2 days ago

Reasoning

37 items • Updated 1 day ago

upvoted a paper 2 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 5 days ago • 33