Basic Arithmetic Test

A new math benchmark just dropped and leading AI models can solve 'less than 2%' of its problems... oh dear

While today's AI models don't tend to struggle with other mathematical benchmarks such as GSM-8k and MATH, according to Epoch ...

New secret math benchmark stumps AI models and PhDs alike

FrontierMath's performance results, revealed in a preprint research paper, paint a stark picture of current AI model ...

Testing AI systems on hard math problems shows they still perform very poorly

A team of AI researchers and mathematicians affiliated with several institutions in the U.S. and the U.K. has developed a ...

Epoch AI Launches FrontierMath AI Benchmark to Test Capabilities of AI Models

Epoch AI highlighted that to measure AI's aptitude, benchmarks should be created on creative problem-solving where the AI has ...

AI’s math problem: FrontierMath benchmark shows how far technology still has to go

FrontierMath, a new benchmark from Epoch AI, challenges advanced AI systems with complex math problems, revealing how far AI still has to go before achieving true human-level reasoning.

Data Shows A Dramatic Difference In Test Scores Between Hawaii Schools

Statewide numbers suggest student test scores have flatlined in Hawaii in recent years, but results for individual schools ...

NOLA.com, 21d

Editorial: New Orleans Public Schools front office fails a basic math test

But that doesn’t mean we shouldn’t be alarmed when one of those mistakes reveals an appalling failure in what should be one of the most basic areas of government operation. So while we’re ...

Australian Broadcasting Corporation, 21d

Calls for changes to Australia's citizenship test after Thai migrant fails five times

"People can seek special assistance with the test, and it is regularly reviewed to ensure the language and questions are clear, fair, and accord with the legal standard of basic English," the ...

TechRadar, 29d

Apple’s latest study proves that AI can’t even solve basic grade-school math problems

... especially when it comes to basic grade-school math. According to a recently published paper from six Apple researchers, 'GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in ...
