
Apple’s New Siri Could Auto-Delete Chats. Google Gemini Is Reportedly Under the Hood.

Apple has a Siri problem and everyone knows it. ChatGPT became a verb. Gemini is powering half the Android ecosystem. Claude is showing up in enterprise workflows. Meanwhile Siri is still struggling to set timers reliably.

WWDC is in June and Apple is reportedly planning its biggest Siri overhaul yet. A standalone app, a proper chatbot experience, and a privacy pitch front and center. According to Bloomberg’s Mark Gurman, Apple executives plan to argue they’re taking a more privacy-friendly approach than every other AI company out there.

That argument gets complicated quickly. The model powering this new Siri is Google Gemini.

What Apple is actually building

The revamp has a few concrete pieces, according to Gurman. The first is a standalone Siri app, which would be the first time Siri has existed as something you open rather than something you invoke. The experience is described as reminiscent of ChatGPT, a significant shift from what Siri has been for the last fifteen years.

One of the more specific details Gurman surfaced is an auto-delete option for conversations, similar to how the Messages app handles retention. Users would be able to set chats to delete after 30 days, after a year, or keep them indefinitely. That kind of user control over data retention is meaningful and more explicit than what most AI chatbots currently offer.

Apple’s framing will reportedly center on the argument that it stores less, retains less, and gives users more control than competitors. For a company that has built significant brand equity around privacy, leaning into that for its AI relaunch makes obvious sense.

The Google problem

The model doing the actual work inside this new Siri is Google Gemini, under the deal Apple struck earlier this year, reportedly paying roughly $1 billion annually. When you ask the new Siri something that requires real intelligence, that query is going to Google’s infrastructure.

Apple can control the retention policy on its end. It can build auto-delete into the app. It can limit how long conversation data lives on Apple’s servers. What it has significantly less control over is what happens on Google’s end once the request gets there.

Gurman flagged this tension directly, suggesting Apple’s privacy emphasis might also be functioning as cover for Siri’s limitations compared to ChatGPT and other dedicated AI products. A privacy-first framing is genuinely differentiating. It’s also a convenient way to explain why the product does less: privacy constraints wouldn’t allow more.

Whether that’s the full story or a convenient one is something WWDC will at least partially answer.

What to actually expect at WWDC

June is close enough that this is less speculation and more preview. The broad strokes Gurman is describing (a standalone app, a chatbot experience, privacy controls, and a Gemini backend) are consistent with everything that’s been reported about Apple’s AI direction over the past several months.

The more interesting question WWDC will answer is how Apple handles the Google disclosure. Right now the privacy pitch and the Gemini backend exist in separate conversations. On stage in June they’ll have to coexist in the same product demo. How Apple explains that to a general audience, not to journalists who already know about the deal, but to the average iPhone user who assumes Siri is an Apple product end to end, will say a lot about how honest this privacy relaunch actually is.

The auto-delete feature is useful regardless of the backend question. User control over conversation retention is something every AI chatbot should offer and most don’t. Apple building that in from launch is worth acknowledging.

But the gap between “Apple is taking privacy seriously” and “Apple is routing your queries through Google” is a noticeable one. June is when Apple has to close it or explain why it doesn’t need to be closed.

We’ll be watching.
