News

OpenAI o1 is a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding ...
New secret math benchmark stumps AI models and PhDs alike FrontierMath's difficult questions remain unpublished so that AI companies can't train against it.
In a new paper, researchers show that even the most sophisticated general-purpose AI language models struggle to solve math problems.
Telling AI model to “take a deep breath” causes math scores to soar in study DeepMind used AI models to optimize their own prompts, with surprising results.