Collection of papers and resources on enhancing LLM performance by scaling test-time compute.
-
Trading Inference-Time Compute for Adversarial Robustness.
Wojciech Zaremba, Evgenia Nitishinskaya, Boaz Barak, Stephanie Lin, Sam Toyer, Yaodong Yu, Rachel Dias, Eric Wallace, Kai Xiao, Johannes Heidecke, Amelia Glaese. Preprint'25
-
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.
Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar. Preprint'24
-
VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers.
Jianing Qi, Hao Tang, Zhigang Zhu. Preprint'24
-
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning.
Zhenni Bi, Kai Han, Chuanjian Liu, Yehui Tang, Yunhe Wang. Preprint'24
-
Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation.
Rohin Manvi, Anikait Singh, Stefano Ermon. Preprint'24
-
s1: Simple test-time scaling.
Niklas Muennighoff, Zitong Yang, Weijia Shi, Xiang Lisa Li, Li Fei-Fei, Hannaneh Hajishirzi, Luke Zettlemoyer, Percy Liang, Emmanuel Candès, Tatsunori Hashimoto. Preprint'25
-
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps.
Nanye Ma, Shangyuan Tong, Haolin Jia, Hexiang Hu, Yu-Chuan Su, Mingda Zhang, Xuan Yang, Yandong Li, Tommi Jaakkola, Xuhui Jia, Saining Xie. Preprint'25
-
SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling.
Jiefeng Chen, Jie Ren, Xinyun Chen, Chengrun Yang, Ruoxi Sun, Sercan Ö Arık. Preprint'25
-
CodeMonkeys: Scaling Test-Time Compute for Software Engineering. [code]
Ryan Ehrlich, Bradley Brown, Jordan Juravsky, Ronald Clark, Christopher Ré, Azalia Mirhoseini. Preprint'25
- [Blogpost] Scaling Test-Time Compute with Open Models
- Search and Learn Recipes to scale inference-time compute of open models
- Awesome-LLM-Reasoning Collection of papers and resources on LLM Reasoning.
- Awesome-LLM-Synthetic-Data Collection of papers and resources on LLM based Synthetic Data Generation.
- Awesome-Controllable-Generation Collection of papers and resources on Controllable Generation using Diffusion Models.
- Chain-of-ThoughtsPapers A trend that starts from "Chain-of-Thought Prompting Elicits Reasoning in Large Language Models".
- LM-reasoning Collection of papers and resources on Reasoning in Large Language Models.
- Prompt4ReasoningPapers Repository for the paper "Reasoning with Language Model Prompting: A Survey".
- ReasoningNLP Paper list on reasoning in NLP
- Awesome-LLM Curated list of Large Language Models.
- Awesome-LLM-Self-Consistency Curated list of Self-consistency in Large Language Models.
- Deep-Reasoning-Papers Recent Papers including Neural-Symbolic Reasoning, Logical Reasoning, and Visual Reasoning.
- Add a new paper or update an existing paper, thinking about which category the work should belong to.
- Use the same format as existing entries to describe the work.
- Add the abstract link of the paper (`/abs/` format if it is an arXiv publication).
Don't worry if you get something wrong; it will be fixed for you!