When LLM agents can do a task, they can often do so at a fraction of human cost

An update on our general capability evaluations

Read more here: External Link