Tech Stories

Nucleus-Image: AI Image MoE Model
The mixture-of-experts trick changed how people think about LLMs. Instead of running every parameter on every token, you activate a small fraction of the network per forward pass, and somehow the quality stays competitive while the compute drops. It's the reason models like Mixtral punched above their weight. Everyone in the LLM space understood it immediately. Nobody had done it openly for image generation. Until now. Nucleus-Image is a 17B-parameter diffusion transformer that activates roughly 2B parameters per forward pass. It beats Imagen4 on OneIG-Bench, sits at number one on DPG-Bench overall, and matches Qwen-Image on GenEval. It's also a base model: no fine-tuning, reinforcement learning, or human preference tuning. What you're seeing in those benchmarks is raw pre-training performance. That's either impressive or a caveat depending on what you need it for, probably both.
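To make the "activate a small fraction of the network" idea concrete, here is a minimal sketch of top-k expert routing in plain Python. It is not Nucleus-Image's actual code. The names `moe_forward` and `make_expert`, the toy router weights, and the choice of three experts with `top_k=2` are all assumptions made for illustration.

```python
import math


def softmax(scores):
    # Numerically stable softmax over a short list of scores.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_expert(scale):
    # Hypothetical stand-in for an expert feed-forward block:
    # it simply scales the token vector.
    def expert(token):
        return [scale * v for v in token]
    return expert


def moe_forward(token, router_weights, experts, top_k=2):
    # One router score per expert: dot product of the token
    # with that expert's router row.
    logits = [sum(w * v for w, v in zip(row, token)) for row in router_weights]
    # Keep only the top_k highest-scoring experts; the rest never run.
    chosen = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Normalize the gate weights over the chosen experts only.
    gates = softmax([logits[i] for i in chosen])
    out = [0.0] * len(token)
    for gate, idx in zip(gates, chosen):
        expert_out = experts[idx](token)
        out = [o + gate * v for o, v in zip(out, expert_out)]
    return out, chosen


# Toy setup (assumed values): 3 experts, 2-dimensional tokens.
router = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
experts = [make_expert(1.0), make_expert(2.0), make_expert(3.0)]
output, active = moe_forward([2.0, 1.0], router, experts, top_k=2)
print(active)  # experts 0 and 1 score highest here, so expert 2 is skipped
```

Only the chosen experts execute, so compute per token scales with `top_k` rather than with the total number of experts. That is the general shape of the trick; how Nucleus-Image routes inside its diffusion transformer is not described here.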
Foundation-1 Is the Open Source AI Model That Thinks Like a Music Producer
There are genuinely impressive open source music generation models out there right now. ACE Step, YuE, HeartMuLa: models that generate full songs with vocals, structure and emotion. If you want a complete track from a single prompt, those are worth exploring. Foundation-1 does not compete with them. It does not try to. What it does instead is something more specific and honestly more useful for anyone who actually makes music. It generates individual loops and samples that are tempo-synced, key-locked, bar-aware, and built to drop straight into a production without fixing anything first. Just clean, structured instrumental loops that behave like something a producer built rather than something an AI guessed at. If you have ever spent twenty minutes trying to make an AI-generated loop fit your track, you already understand why that matters.
Voxtral TTS: Mistral Is Pushing Voice AI Off the Cloud
Voxtral TTS supports nine languages: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. That by itself isn’t unusual anymore. A lot of models claim multilingual support. The interesting part is how it handles switching between them. Mistral says it can move between languages mid-sentence without changing the speaker’s voice. So you don’t get that awkward reset where the tone or identity shifts when the language changes. If that holds up, it’s actually useful for real scenarios: think support calls where people naturally switch languages, or content that mixes languages without warning.
5 Open-Source AI World Models You Can Use for Free
We've watched open source absolutely run through video generation, image generation, audio. Every few months another closed model gets matched, then beaten, by something free on GitHub. But world generation always felt different. Like that was the one thing that needed a Google-sized lab behind it. I thought so too, until I actually went looking. Turns out there are open source models right now that take a text prompt and build you an explorable, interactive world. Some go even further — hand them a single image and they'll construct an entire environment around it. The quality on a few of these genuinely caught me off guard.
ERNIE-Image: Open-Source 8B Text-to-Image Model for Posters, Comics and Control
Text rendering in open source AI image generation has been broken for a long time. Ask most models to put readable words on a poster, lay out a comic panel, or generate anything where the text actually has to make sense, and only a few can do it accurately. From the rest you get something that looks like it was written by someone who learned the alphabet from a fever dream. ERNIE-Image is Baidu's answer to that specific problem. It's an 8B open-weight text-to-image model built on a Diffusion Transformer, and it's genuinely good at dense text, structured layouts, posters, infographics and multi-panel compositions. It can run on a 24GB consumer GPU, it's on Hugging Face right now, and it comes in two versions: a full-quality model and a turbo variant that gets there in 8 steps instead of 50.
GPT-5.4 Is Outperforming Humans at Work. But the Real Story Is What OpenAI Isn't Telling You
OpenAI dropped their latest model yesterday, and buried inside the benchmarks is a number that deserves more attention than it's getting. On GDPval, a test that puts AI agents through real professional tasks across 44 actual occupations, GPT-5.4 matched or outperformed human professionals 83% of the time. The previous version sat at 71%. That's not a small jump. And this isn't GPT writing emails or summarizing documents anymore. This version can move a mouse, click buttons, fill out forms, and work across applications the way a person sitting at a desk would. It scored 75% on OSWorld, a benchmark that tests exactly that. The average office worker scores 72.4%. The model is already better at operating a computer than most people who use one for a living, and 83% is just the beginning of what this release actually means.
Open Source AI Video Models for Editing and Generation
If you have been looking for open source tools to work with video using AI, you have probably noticed something. Most of what gets covered is generation: creating new videos from scratch. The editing side, actually modifying existing footage with AI, has been much quieter. That is starting to change. There are now open source models that can swap outfits, replace backgrounds, remove objects, change characters and apply styles to existing video using plain text instructions. Some are built specifically for editing. Others are generation models that fit naturally into a creative video workflow. This list covers both: three models built specifically for video editing and two generation models worth knowing about if you are working with video content. All open source, all available today.

Discover AI Apps

OpenSwarm: The Open-Source AI Workspace for Everything Beyond Claude Code

There are countless AI tools that still revolve around one assistant doing everything inside a chat window. OpenSwarm feels closer to assigning work across a small team. The research agent handles analysis. The slides agent builds presentations. The data analyst creates charts. Video and image agents manage media generation separately. Single-agent systems tend to hallucinate once projects become larger or more visual. OpenSwarm keeps tasks separated, which usually makes the outputs feel more structured and usable. It also fits naturally beside tools like Claude Code instead of replacing them. You might still use Claude Code for engineering work, debugging, or architecture decisions while OpenSwarm handles the surrounding deliverables like reports, presentations, marketing assets, research, documentation, and media generation.

Ovi AI Video + Audio Generator in ComfyUI — Best Open-Source Alternative to Veo 3 & Sora 2

Experience next-generation AI video and audio generation locally with Ovi in ComfyUI — the most powerful open-source workflow that rivals Google’s Veo 3 and OpenAI’s Sora 2. With Ovi’s multimodal fusion engine and seamless integration into ComfyUI, you can create AI-generated videos with synchronized sound, all without depending on cloud services. It’s inspired by Character.AI’s Ovi and integrates seamlessly into the ComfyUI node environment, offering a fully modular, GPU-accelerated, and privacy-friendly experience.

LibreChat: Top Open-Source ChatGPT Alternative for Self-Hosting AI Models Like GPT-OSS, LLaMA, Mistral & More

LibreChat is a game-changer in the world of AI chat interfaces. Designed with inspiration from OpenAI's ChatGPT and supercharged with cutting-edge enhancements, LibreChat offers a modern, clean, and highly customizable interface to run your own LLMs. Whether you're a developer, researcher, or just someone who wants full control over their AI assistant experience, LibreChat gives you everything you need, without the need for third-party subscriptions or cloud lock-in.

Krita AI Diffusion: Powerful Free Tool for Image Creation, Generative Fill & AI Editing

This plugin is for digital artists, illustrators, concept designers, and hobbyists alike. Krita with generative AI delivers an end-to-end creative environment with deep AI integration that feels native and intuitive. With support for LoRAs, negative prompts, and advanced nodes like Ultimate SD Upscale and IP-Adapter, the tool matches and in some cases exceeds the capabilities of more complex standalone AI UIs.

Content Creation

3 Simple Steps to Find Your Niche as a Content Creator

If you're thinking of starting your content creation journey, the first question that comes to mind is probably "What should I create?" When you scroll through Instagram, YouTube, and LinkedIn, you see creators with a clear focus on niches like fitness, finance, coding, fashion, and motivation. At this point most new creators wonder: if everything is already being created, what is left for us to create?

10 Faceless YouTube Channel Ideas In 2026

Finding the perfect niche can feel challenging if you don't want to show your face in YouTube videos.

5 Proven Ways to Boost Your Instagram Reels Reach in 2025

Instagram is continuously evolving, and so are we. When I created my first page, my reels were barely getting views in the initial stages,...