Everyone Is Judging AI by These Tests. Experts Say They're Close to Meaningless

Benchmarks used to rank AI models are several years old, often sourced from amateur websites, and, experts worry, lending automated systems a dubious sense of authority

Read more here: External Link