Wolfram LLM Benchmarking Project

Results from Wolfram's ongoing tracking of LLM performance. The benchmark is based on a Wolfram Language code generation task.

Read more here: External Link