back to top
HomeTechOpenAI Says Its AI Solved an 80-Year-Old Math Problem. The Proof Surprised...

OpenAI Says Its AI Solved an 80-Year-Old Math Problem. The Proof Surprised Mathematicians.

- Advertisement -

OpenAI says one of its internal reasoning models has solved a math problem that has been there on mathematicians’ desks since 1946.

The problem, first posed by legendary mathematician Paul Erdős, looks almost absurdly simple. Given a set of points on a flat plane, how many pairs can be exactly one unit apart? People have spent nearly 80 years trying to pin down the answer.

OpenAI’s model didn’t just make progress on the problem. According to the company, it disproved a longstanding conjecture that many researchers believed was essentially correct.

A problem mathematicians thought they understood

The unit distance problem openai model solved
via: OpenAI

The unit distance problem sounds almost simple. Take a large number of points on a flat plane. How many pairs can sit exactly one unit apart?

Paul Erdős posed the question in 1946. For the next 80 years, nobody could fully solve it.

The strange part is that mathematicians thought they had a pretty good idea of what the answer looked like.

The best-known constructions all revolved around variations of the same idea: arrange points in carefully scaled grid patterns and count how many unit-length connections appear. Researchers improved the math around those constructions, tightened bounds, and explored related versions of the problem.

But the overall picture barely changed. The prevailing belief was that the square-grid approach was probably close to the truth. Maybe not perfect, but close enough that any future improvement would be marginal.

Then OpenAI’s model took a different approach.. Instead of refining the existing approach, it found a construction that breaks away from the grid entirely. The result produces substantially more unit-distance pairs than mathematicians thought possible.

For years, the assumption was that the best-known constructions were already close to optimal. This result suggests there was more room for improvement than researchers realized.

The proof nobody was looking for

According to OpenAI, the proof relies on tools from algebraic number theory, a field concerned with the properties of numbers and abstract algebraic structures. These ideas were never developed to solve questions about points on a plane.

That’s part of why mathematicians found the result surprising.

The solution comes from connecting two areas of mathematics that most people wouldn’t naturally put together.

Several of the researchers who reviewed the proof focused on this point. The result suggests there may be connections between number theory and discrete geometry that mathematicians haven’t fully explored.

You May Like: Best AI Coding Models for Consumer Hardware

Why mathematicians are paying attention

Mathematicians aren’t reacting this way simply because an AI solved an old problem.

Open problems get solved all the time. What caught people’s attention here is the nature of the solution.

The proof appears to contain an idea that experts didn’t expect.

Noga Alon, one of the world’s leading combinatorialists, described the result as an outstanding achievement and said the answer itself was surprising. The prevailing assumption had been that the number of unit distances would stay close to the rate Erdős originally conjectured.

Instead, the new construction shows that assumption was wrong.

Tim Gowers, a Fields Medal winner and one of the mathematicians who reviewed the work, called it a milestone in AI mathematics. Other researchers pointed to the same thing: the proof didn’t just combine existing steps mechanically. It connected ideas from different parts of mathematics in a way that appears genuinely novel.

Most recent AI successes in mathematics have involved solving competition-style problems, checking proofs, or helping researchers explore possibilities. This is one of the first cases where an AI-generated result is being discussed as a meaningful contribution to an active research question.

What changed here?

AI systems have been getting better at mathematics for years. They moved from solving school-level problems to competition problems, then to helping researchers with specific tasks.

This result sits a little further along that path.

The model wasn’t trained specifically for the unit distance problem. It wasn’t built around a custom search system for this proof. According to OpenAI, a general-purpose reasoning model produced the argument.

That’s what makes the reaction from mathematicians notable.

The headline isn’t that a machine solved an 80-year-old problem. Open problems eventually fall. The unusual part is that experts appear to view the proof as containing an idea worth paying attention to.

Whether this turns out to be a one-off result or the beginning of something larger is still unclear. But for the first time, the conversation is about how much of that research it might eventually do on its own.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

YOU MAY ALSO LIKE
MiniCPM5-1B Shows Why the Small-Model Race Isn't Over

MiniCPM5-1B Shows Why the Small-Model Race Isn’t Over

0
A 1B model scoring 40.42 on AIME 2025 should not be possible. AIME is the American Invitational Mathematics Examination, the kind of test that filters out most humans who attempt it. Qwen3-0.6B scores 16.25 on the same benchmark. LFM2.5-1.2B, a larger model, scores 31.88. MiniCPM5-1B, at roughly one billion parameters, beats both. OpenBMB just dropped MiniCPM5-1B, the first model in their MiniCPM5 series, and it's built specifically for the scenarios like on-device deployment, resource-constrained environments, local inference on consumer hardware. The AIME score is surprising. The telecom agent benchmark is even more surprising. And then there's the desktop pet. We'll get to that.
Step 3.7 Flash

StepFun Says Step 3.7 Flash Matches 97% of Claude Opus 4.6’s Coding Performance at...

0
$0.19 vs $1.76. That's the per-task cost of running Step 3.7 Flash with Advisor Mode enabled versus Claude Opus 4.6 on SWE-Bench Verified. The Flash model scores 76.3% to Opus 4.6's 78.7%. Two percentage points of difference. Nine times cheaper to get there. For anyone building agentic coding workflows at scale that math changes the decision about which model actually belongs in production. Frontier performance has been getting cheaper for a while but this is a specific, benchmarked claim with a specific cost figure attached.
Microsoft Threatened Legal Action Against a Security Researcher. The Security Community Pushed Back

Microsoft Threatened Legal Action Against a Security Researcher. The Security Community Pushed Back.

0
Finding bugs in Microsoft products used to come with a clear social contract. You find it, you report it privately, you wait for a fix, then you publish. Microsoft gets to patch quietly. You get credit and maybe a bug bounty. Nowadays that contract seem to get complicated. A researcher going by Nightmare Eclipse published a series of unpatched vulnerabilities in Microsoft products including Windows Defender and BitLocker, along with working exploit code, without giving Microsoft a chance to fix them first. Microsoft responded with a blog post threatening criminal referrals and invoking its Digital Crimes Unit. The cybersecurity community, the same community Microsoft depends on to find these bugs before actual criminals do, reacted about as well as you'd expect.

Don’t miss any Tech Story

Subscribe To Firethering NewsLetter

You Can Unsubscribe Anytime! Read more in our privacy policy