back to top
HomeTechClaude Just Doubled Its Usage Limits. The Real Story Is the SpaceX...

Claude Just Doubled Its Usage Limits. The Real Story Is the SpaceX Deal Behind It

- Advertisement -

Claude users have spent months playing a very specific game, how much work can you squeeze out of Opus before the rate limits slam shut?

Anthropic is finally loosening things up. The company says it’s doubling Claude Code limits, removing peak-hour reductions for paid users, and significantly raising Opus API caps. The reason is also there in the same announcement. Anthropic now has access to all the compute capacity at SpaceX’s Colossus 1 data center. That’s over 220,000 NVIDIA GPUs.

That’s the kind of announcement that makes you realize AI companies aren’t just shipping models anymore. They’re building power infrastructure.

This Isn’t really about Chatbot Anymore

The easiest way to understand Anthropic’s announcement is to look at what people are actually using Claude for now.

A normal chatbot session doesn’t burn through compute like this. You ask a few questions, maybe upload a file, then leave. That’s not the workload forcing companies to secure hundreds of thousands of GPUs.

Claude Code is different. People are leaving it running for hours inside terminals. They’re asking it to refactor projects, debug broken dependencies, analyze huge codebases, write tests, retry failed tasks, call tools, and keep context alive across long sessions. One coding agent can easily consume far more tokens than a casual chat user ever would.

That’s probably why the double limits part of Anthropic’s announcement matters more than it first appears.

For months, Claude had this weird split reputation among developers. The model itself was excellent, especially for coding, but heavy users kept running into walls. Long sessions would suddenly slow down. Opus users learned to ration prompts. Some developers even changed workflows entirely around avoiding rate limits.

Now Anthropic is signaling something pretty clearly, they expect usage to keep getting heavier And honestly, that tracks with where AI tooling is heading. The industry keeps talking about chatbots, but the real compute monster may end up being autonomous agents quietly running in the background for hours at a time.

TierMaximum Input Tokens per MinuteMaximum Output Tokens per Minute
130,000 -> 500,0008,000 -> 80,000
2450,000 -> 2,000,00090,000 -> 200,000
3800,000 -> 5,000,000160,000 -> 400,000
42,000,000 -> 10,000,000400,000 -> 800,000

Those new limits aren’t small bumps either. Some Opus tiers are seeing output caps increase by 10x. Which also explains why Anthropic immediately followed the announcement, a compute partnership with SpaceX.

You May Like Best AI Coding Models for Consumer Hardware

The real flex wasn’t the higher limits

The company says its SpaceX partnership gives it access to all of the compute capacity at Colossus 1, adding more than 300 megawatts of capacity and over 220,000 NVIDIA GPUs within the month.

That’s an absurd amount of compute for what most people still casually describe as “a chatbot.”

And Anthropic isn’t talking like this in isolation anymore. Over the past year, major AI companies have increasingly started announcing infrastructure deals the way they used to announce model launches. Amazon, Google, Microsoft, NVIDIA, xAI everybody suddenly sounds half software company, half utility provider.

The shift makes sense once you look at where AI usage is heading. Reasoning models are expensive to run. Coding agents stay active for long stretches. Enterprise workloads don’t disappear after a few prompts. The more capable these systems become, the more compute they consume in the background.

A year ago, companies competed on benchmark screenshots. Now they’re competing on who can secure enough GPUs to keep the systems online at scale.

This feels like the next phase of the AI race

For a while, AI companies competed mostly on model quality. Better benchmarks, reasoning, coding. But now the conversation is shifting towards, who can actually keep these systems running at scale without constantly slamming users into limits.

Anthropic partnering with SpaceX for compute sounds unusual today. It probably won’t a year from now. Because at this point, the bottleneck isn’t really ideas anymore. It’s power, GPUs, and who can secure enough infrastructure before everyone else does.

LEAVE A REPLY

Please enter your comment!
Please enter your name here

YOU MAY ALSO LIKE
Elon Musk Lost His OpenAI Lawsuit. The Jury Never Actually Decided If He Was Right

Elon Musk Lost His OpenAI Lawsuit. The Bigger Question Was Never Put to the...

0
Elon Musk spent months in a California courtroom trying to prove that Sam Altman stole a charity. He got nine jurors, weeks of testimony from some of the biggest names in Silicon Valley, and a front row seat to the most revealing airing of OpenAI's founding history ever put on public record. Then the jury came back in under two hours and told him he'd filed too late. Not that he was wrong. Not that Altman and Brockman acted properly. Just that whatever happened between them and Musk, the legal clock had already run out before he decided to do something about it. The question of whether OpenAI actually betrayed its founding mission, the question that made this case worth following in the first place never got answered.
Apple New Siri Could Auto-Delete Chats. Google Gemini Is Reportedly Under the Hood

Apple’s New Siri Could Auto-Delete Chats. Google Gemini Is Reportedly Under the Hood.

0
Apple has a Siri problem and everyone knows it. ChatGPT became a verb. Gemini is powering half the Android ecosystem. Claude is showing up in enterprise workflows. Meanwhile Siri is still struggling to set timers reliably. WWDC is in June and Apple is reportedly planning its biggest Siri overhaul yet. A standalone app, a proper chatbot experience, and a privacy pitch front and center. According to Bloomberg's Mark Gurman, Apple executives plan to argue they're taking a more privacy-friendly approach than every other AI company out there. That argument gets complicated quickly. The model powering this new Siri is Google Gemini.
zero language for ai agents

Vercel Built a Programming Language for AI Agents. The Compiler Speaks JSON.

0
Every serious coding agent including Claude Code, Cursor, Copilot, whatever you're using shares the same quiet problem. The agent writes code, the compiler throws an error, and the agent has to read text written for a human engineer to figure out what went wrong and how to fix it. That sounds like a minor inconvenience. In practice it's one of the main reasons agentic coding loops break down. Error message formats change between compiler versions. The same underlying problem gets described differently depending on context. There's no built-in concept of a repair action, just prose that an agent has to parse and hope it understood correctly. Vercel Labs just released Zero, an experimental systems language built from day one around the idea that the compiler should talk to agents as clearly as it talks to humans. Its Apache 2.0 licensed, available now and genuinely interesting even at v0.1.1.

Don’t miss any Tech Story

Subscribe To Firethering NewsLetter

You Can Unsubscribe Anytime! Read more in our privacy policy