Skip to content

wasiahmad/Awesome-LLM-Test-Time-Scaling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

a67931b · Feb 3, 2025

History

14 Commits
Feb 3, 2025
Feb 3, 2025

Repository files navigation

Stargazers Forks Contributors MIT License

Awesome LLM Test-Time Scaling

Collection of papers and resources on enhancing LLM performance by scaling test-time compute.

🗂️ Table of Contents
  1. Survey
  2. Analysis
  3. Technique
  4. Other Useful Resources
  5. Other Awesome Lists
  6. Contributing

Survey

↑ Back to Top ↑

Analysis

2025

  1. Trading Inference-Time Compute for Adversarial Robustness.

    Wojciech Zaremba, Evgenia Nitishinskaya, Boaz Barak, Stephanie Lin, Sam Toyer, Yaodong Yu, Rachel Dias, Eric Wallace, Kai Xiao, Johannes Heidecke, Amelia Glaese. Preprint'25

2024

  1. Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.

    Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar. Preprint'24

↑ Back to Top ↑

Technique

2024

  1. VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers.

    Jianing Qi, Hao Tang, Zhigang Zhu. Preprint'24

  2. Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning.

    Zhenni Bi, Kai Han, Chuanjian Liu, Yehui Tang, Yunhe Wang. Preprint'24

  3. Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation.

    Rohin Manvi, Anikait Singh, Stefano Ermon. Preprint'24

2025

  1. s1: Simple test-time scaling.

    Niklas Muennighoff, Zitong Yang, Weijia Shi, Xiang Lisa Li, Li Fei-Fei, Hannaneh Hajishirzi, Luke Zettlemoyer, Percy Liang, Emmanuel Candès, Tatsunori Hashimoto. Preprint'25

  2. Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps.

    Nanye Ma, Shangyuan Tong, Haolin Jia, Hexiang Hu, Yu-Chuan Su, Mingda Zhang, Xuan Yang, Yandong Li, Tommi Jaakkola, Xuhui Jia, Saining Xie. Preprint'25

  3. SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling.

    Jiefeng Chen, Jie Ren, Xinyun Chen, Chengrun Yang, Ruoxi Sun, Sercan Ö Arık. Preprint'25

  4. CodeMonkeys: Scaling Test-Time Compute for Software Engineering. [code]

    Ryan Ehrlich, Bradley Brown, Jordan Juravsky, Ronald Clark, Christopher Ré, Azalia Mirhoseini. Preprint'25

↑ Back to Top ↑

Other Useful Resources

Other Awesome Lists

↑ Back to Top ↑

Contributing

  • Add a new paper or update an existing paper, thinking about which category the work should belong to.
  • Use the same format as existing entries to describe the work.
  • Add the abstract link of the paper (/abs/ format if it is an arXiv publication).

Don't worry if you do something wrong, it will be fixed for you!

Contributors

Star History

Star History Chart

About

LLM Test-Time Compute Scaling: Papers and Resources 🔥

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published