back to top
HomeTechClaude Just Doubled Its Usage Limits. The Real Story Is the SpaceX...

Claude Just Doubled Its Usage Limits. The Real Story Is the SpaceX Deal Behind It

- Advertisement -

Claude users have spent months playing a very specific game, how much work can you squeeze out of Opus before the rate limits slam shut?

Anthropic is finally loosening things up. The company says it’s doubling Claude Code limits, removing peak-hour reductions for paid users, and significantly raising Opus API caps. The reason is also there in the same announcement. Anthropic now has access to all the compute capacity at SpaceX’s Colossus 1 data center. That’s over 220,000 NVIDIA GPUs.

That’s the kind of announcement that makes you realize AI companies aren’t just shipping models anymore. They’re building power infrastructure.

This Isn’t really about Chatbot Anymore

The easiest way to understand Anthropic’s announcement is to look at what people are actually using Claude for now.

A normal chatbot session doesn’t burn through compute like this. You ask a few questions, maybe upload a file, then leave. That’s not the workload forcing companies to secure hundreds of thousands of GPUs.

Claude Code is different. People are leaving it running for hours inside terminals. They’re asking it to refactor projects, debug broken dependencies, analyze huge codebases, write tests, retry failed tasks, call tools, and keep context alive across long sessions. One coding agent can easily consume far more tokens than a casual chat user ever would.

That’s probably why the double limits part of Anthropic’s announcement matters more than it first appears.

For months, Claude had this weird split reputation among developers. The model itself was excellent, especially for coding, but heavy users kept running into walls. Long sessions would suddenly slow down. Opus users learned to ration prompts. Some developers even changed workflows entirely around avoiding rate limits.

Now Anthropic is signaling something pretty clearly, they expect usage to keep getting heavier And honestly, that tracks with where AI tooling is heading. The industry keeps talking about chatbots, but the real compute monster may end up being autonomous agents quietly running in the background for hours at a time.

TierMaximum Input Tokens per MinuteMaximum Output Tokens per Minute
130,000 -> 500,0008,000 -> 80,000
2450,000 -> 2,000,00090,000 -> 200,000
3800,000 -> 5,000,000160,000 -> 400,000
42,000,000 -> 10,000,000400,000 -> 800,000

Those new limits aren’t small bumps either. Some Opus tiers are seeing output caps increase by 10x. Which also explains why Anthropic immediately followed the announcement, a compute partnership with SpaceX.

You May Like Best AI Coding Models for Consumer Hardware

The real flex wasn’t the higher limits

The company says its SpaceX partnership gives it access to all of the compute capacity at Colossus 1, adding more than 300 megawatts of capacity and over 220,000 NVIDIA GPUs within the month.

That’s an absurd amount of compute for what most people still casually describe as “a chatbot.”

And Anthropic isn’t talking like this in isolation anymore. Over the past year, major AI companies have increasingly started announcing infrastructure deals the way they used to announce model launches. Amazon, Google, Microsoft, NVIDIA, xAI everybody suddenly sounds half software company, half utility provider.

The shift makes sense once you look at where AI usage is heading. Reasoning models are expensive to run. Coding agents stay active for long stretches. Enterprise workloads don’t disappear after a few prompts. The more capable these systems become, the more compute they consume in the background.

A year ago, companies competed on benchmark screenshots. Now they’re competing on who can secure enough GPUs to keep the systems online at scale.

This feels like the next phase of the AI race

For a while, AI companies competed mostly on model quality. Better benchmarks, reasoning, coding. But now the conversation is shifting towards, who can actually keep these systems running at scale without constantly slamming users into limits.

Anthropic partnering with SpaceX for compute sounds unusual today. It probably won’t a year from now. Because at this point, the bottleneck isn’t really ideas anymore. It’s power, GPUs, and who can secure enough infrastructure before everyone else does.

Don’t miss any Tech Story

Subscribe To Firethering NewsLetter

You Can Unsubscribe Anytime! Read more in our privacy policy

LEAVE A REPLY

Please enter your comment!
Please enter your name here

YOU MAY ALSO LIKE
glm 5.2 ai open weights

GLM-5.2 Is the Closest an Open Model Has Come to Claude

0
What does it take for an open-weight model to stop chasing Claude and actually beat it? Every open-weight release for two years has told some version of the same story: closer, but not quite. The chart shrinks, the wording softens to "competitive with," and the conversation moves on until the next model repeats the cycle. GLM-5.2 breaks that pattern. The model is built to survive long, messy coding work, the kind that runs for hours without losing the thread. That's the pitch its maker is leading with. But scroll down their own benchmark table and something else is sitting there quietly: on a couple of standard math evals, this open model isn't approaching Claude Opus 4.8, GPT-5.5, or Gemini 3.1 Pro. It's beating all three, on the same table. It loses plenty of ground elsewhere, and that part matters just as much as the wins. But a model anyone can download under an MIT license, with no usage restrictions attached, coming out ahead of the lab everyone else measures themselves against, is worth pausing on before getting to what the rest of the numbers actually say.
Open-Source AI Tools Worth Trying Right Now

5 Open-Source AI Tools You Probably Haven’t Tried Yet

0
Every week brings another open source AI release, and most of them require setting up a Python environment. Find out the model card lied about VRAM requirements. By the time something actually runs, the appeal has mostly worn off. The five tools below skip most of that. One turns image and video generation into something closer to a desktop app. One gives DeepSeek an actual workspace instead of a browser tab. One builds UI prototypes using coding agents you probably already have installed. One quietly builds a memory system out of your own apps. And one is, literally, a desktop pet.
Claude Mythos 5 and Claude Fable 5

Claude Mythos 5 Was Too Powerful to Ship. Anthropic Released Fable 5 Instead.

0
Anthropic gave stripe early access to Fable 5 and set it loose on a 50 million line Ruby codebase. The migration that would have taken a full engineering team over two months got done in a day. That's a real company's real codebase and a task with real consequences if it goes wrong. Anthropic leads with it because it's the kind of result that's hard to argue with & because it sets up everything else they need to tell you about why this launch looks the way it does. Because here's the thing. The model Anthropic actually built Claude Mythos 5, isn't what most people are getting today. What's going live for general use is Claude Fable 5. Same underlying model. Different version. The parts Anthropic decided were too dangerous for public release got a separate wrapper, a separate name, and a separate approval process controlled in part by the US government.