Very Hard Math Problem

AI scores a ‘C-’ on its hardest math test yet

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...

Hosted on MSN

Top AI models are failing hard at solving fresh math problems

Top artificial intelligence systems now ace many textbook-style math questions, yet they still fall apart on genuinely new problems. The gap between polished performance on familiar benchmarks and ...

Education Week

The Future of Math Class: How AI Could Transform Instruction

Clarification: This story has been updated to clarify how University of Colorado researchers handle their data collection. A student digs into a math problem that references his favorite superhero, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

AI scores a ‘C-’ on its hardest math test yet

Top AI models are failing hard at solving fresh math problems

The Future of Math Class: How AI Could Transform Instruction

Trending now