Scaling LLM Test-Time Compute More Effective Than Scaling Model Parameters

Join the discussion on this paper page

Read more here: External Link