DeepSeek Janus, OpenAI o3-mini, new music generators, new top AI models
Welcome to the AI Search newsletter. Here are the top highlights in AI this week.
OpenAI has launched o3-mini, a new AI model that excels at problem-solving and reasoning. This model, which is faster and more proficient in math, science, and coding than its predecessors, is now freely accessible to ChatGPT users. o3-mini introduces features like adjustable reasoning levels and enhanced safety measures, making it a powerful tool for various applications. Read more
Do you prefer to watch instead of read? Check out this video covering all the highlights in AI this week:
Qwen2.5-Max is a new state of the art AI model by Alibaba. It's designed to compete with other top AI models like GPT-4 and Claude, and it's showing impressive results in various tests of intelligence and problem-solving. Read more
AI2 introduced Tülu 3 405B, the first fully open post-training approach applied to the largest open-weight models. As an instruction-following model family, Tülu 3 provides open-source data, code, and recipes, achieving performance on par with or surpassing DeepSeek v3 and GPT-4o. Read more
uPix is an AI Selfie Generator that allows users to turn into anyone in just one click. Select from a vast array of templates, ranging from superheroes to business portraits, and even anime characters. Try it out today!
Researchers at NVIDIA have developed a new AI model called DiffusionRenderer. This model uses a technique called diffusion-based rendering to generate high-quality images and videos. It can be used for a variety of applications, including computer vision and robotics. Read more
YuE is an AI model that can generate music from lyrics. This model uses a technique called lyrics-to-song generation to create music that is highly realistic and engaging. It can be used for a variety of applications, including music composition and audio production. Read more
Lumina-Image-2.0 is an open-source AI model for generating images. This model is designed to be efficient, unified, and transparent, making it a powerful tool for a variety of applications. It includes checkpoints, fine-tuning and inference code, as well as a demo and website. Read more
Riffusion unveiled a free web app powered by their latest music generation model, FUZZ, which creates full songs from text or audio clips and adapts to a user’s unique style over time. Read more
Turbotype is a Chrome extension that allows you to type faster by setting customizable shortcuts, designed to boost productivity and save time. Easily create, save, and use keyboard shortcuts for frequently used phrases. It’s free forever - try it out today!
DeepSeek unveiled Janus-Pro-7B, an open-source multimodal LLM designed for both visual analysis and image generation. Read more
Hailuo AI (MiniMax) introduced Hailuo T2V-01-Director, a text-to-video model that enables users to control camera movements with natural language or simple commands. Read more
Pika 2.1, the latest video generation model, is now available with realistic physics simulations, enhanced character and object movement control, full HD resolution, new animation styles, and more. Read more
Alibaba Cloud has released Wanx 2.1, an AI model that can create amazing videos from text descriptions. This new version is really good at making realistic videos with complex movements and following instructions precisely. Read more