Realtime AI worlds, GPT-5, Qwen Image, GPT-OSS, Claude 4.1, Grok Imagine, Gemini Storybooks, Seed Diffusion
Welcome to the AI Search newsletter. Here are the top highlights in AI this week.
Genie 3 is a new AI model that can generate interactive 3D worlds in real-time, allowing users to explore and interact with virtual environments. It can simulate a wide range of scenarios, from natural environments to fantastical worlds, and can even allow users to change the environment with text-based inputs. Read more
OpenAI's gpt-oss is a series of open-weight language models designed for powerful reasoning and versatile developer use cases. These models, gpt-oss-120b and gpt-oss-20b, are released under a permissive Apache 2.0 license, allowing for free use and modification. They offer features like configurable reasoning effort, full chain-of-thought, and fine-tuning capabilities. Read more
FastWan is a new video generation model that can produce a 5-second video in just 5 seconds, thanks to a technique called sparse distillation. This model is powered by FastVideo and can generate high-quality videos at a much faster rate than previous models. Read more
Do you prefer to watch instead of read? Check out this video covering all the highlights in AI this week
Claude Opus 4.1 is an upgraded version of Claude Opus 4, with improved performance in agentic tasks, real-world coding, and reasoning. It achieves state-of-the-art coding performance with a score of 74.5% on SWE-bench Verified and also excels in pinpointing exact corrections within large codebases. This upgrade is available to paid Claude users and can be accessed through the API, Amazon Bedrock, and Google Cloud's Vertex AI. Read more
Skywork UniPic is a unified autoregressive model that combines image understanding, text-to-image generation, and image editing capabilities within a single architecture. This 1.5B-parameter model can generate images from text prompts, edit images, and describe images in detail. Read more
Qwen-Image is a powerful image generation foundation model that can create complex images with precise text rendering and editing capabilities. It can generate high-fidelity images with diverse artistic styles, from photorealistic scenes to impressionist paintings, and supports advanced image editing operations like style transfer and object manipulation. Qwen-Image is also capable of image understanding tasks, including object detection and semantic segmentation. Read more
To celebrate reaching 500K subscribers on Youtube, we are giving a away a DJI Mini 4 Pro! This is a small, versatile, powerful drone equipped with a high resolution 48MP sensor and can capture 4K/60fps HDR video. Enter for FREE!
OpenAI has launched GPT-5, their smartest AI model yet. It handles everything from coding to creative writing, is less likely to make mistakes or “hallucinate” facts, and can switch between quick or deep thinking based on the task. The new model is available to all users, with more features unlocked for paying subscribers. Read more
Gemini Storybooks lets you create personalized, illustrated storybooks with audio narration based on any story idea or even your own photos. It's simple to use, works in 45+ languages, and you get a unique 10-page book you can read, listen to, share, or print. This tool is available for free globally on mobile and desktop through the Gemini app. Read more
xAI’s Grok Imagine is an AI tool for generating creative images and short 15-second videos from text or images, now available free in the Grok app. Just upload or create an image, and use the app to turn it into a video with audio; the tool also includes a “Spicy” mode for NSFW content (with moderation). It’s accessible to everyone—no subscription required. Read more
ByteDance’s Seed Diffusion Preview is a breakthrough AI model for fast code generation using “discrete diffusion” instead of traditional step-by-step methods. It generates code 5.4 times faster than similar models—about 2,146 tokens per second—while keeping quality high, marking a big step for AI coding tools. Read more
With Monica, you can use the top AI models, image generators, and video generators, all in one integrated platform. Use code AISEARCH10 to get 25% OFF 'Unlimited Annual Plan' within 24h of registration, or enjoy 10% OFF. Try it for free today!
ElevenLabs has launched Eleven Music, an AI platform for making studio-quality songs from simple text prompts. You can pick the genre, style, structure, and add vocals or just keep it instrumental; plus, you can edit the sound and lyrics for different song sections. Read more
Alibaba releases new AI models, Qwen3-4B-Instruct-2507 and Qwen3-4B-Thinking-2507. The Instruct version is great for everyday chatbot use—it's fast and follows instructions for tasks like info lookup and basic problem-solving, while the Thinking model digs deeper, showing all its thought steps and excelling at tough reasoning, math, coding, and complex questions. Both make building smarter, more helpful apps easier for developers. Read more