Maths Project Work Models New

News

OpenAI o1 Model Sets New Math and Complex Reasoning Records

OpenAI o1 is a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding ...

Ars Technica10mon

New secret math benchmark stumps AI models and PhDs alike

New secret math benchmark stumps AI models and PhDs alike FrontierMath's difficult questions remain unpublished so that AI companies can't train against it.

VentureBeat4y

Researchers find that large language models struggle with math

In a new paper, researchers show that even the most sophisticated general-purpose AI language models struggle to solve math problems.

Ars Technica1y

Telling AI model to “take a deep breath” causes math scores to soar ...

Telling AI model to “take a deep breath” causes math scores to soar in study DeepMind used AI models to optimize their own prompts, with surprising results.

Results that may be inaccessible to you are currently showing.

Hide inaccessible results