Scaling LLM Test-Time Compute can be More Effective than Scaling Parameters

📅 September 13, 2024

Enabling LLMs to improve their outputs by using more test-time computation is a critical step towards building generally self-improving agents that can operate on open-ended natural language. In this paper, we study the scaling of inference-time computation in LLMs, with a focus on answering the que