AI makes any video game • Image to 3D world • New open-source video • New OpenAI models • New face animator • AI news this week
Welcome to the AI Search newsletter. Here are the top highlights in AI this week.
Google DeepMind has introduced Genie 2, an AI model that creates interactive 3D environments from a single image or text prompt. This advanced model allows users to explore dynamic virtual worlds for up to a minute, featuring realistic physics, animated characters, and various perspectives like first-person and isometric views. Read more
Do you prefer to watch instead of read? Check out this video covering all the highlights in AI this week:
Tencent has launched HunyuanVideo, an open-source text-to-video generation tool that rivals OpenAI's Sora. This model features over 13 billion parameters and can create high-quality videos from text prompts in both English and Chinese, making it the largest of its kind available for free. Read more.
Moondream has launched Moondream 0.5B, the world's smallest Vision-Language Model (VLM) featuring just 0.5 billion parameters. This model is specifically optimized for edge devices and mobile platforms. Read more.
uPix is an AI Selfie Generator that allows users to turn into anyone in just one click. Select from a vast array of templates, ranging from superheroes to business portraits, and even anime characters. Try it out today!
Amazon has launched a new series of AI models called Amazon Nova, designed to enhance generative AI capabilities for businesses. These models, available through Amazon Bedrock, include various options like Nova Micro for fast text processing and Nova Premier for complex reasoning tasks. Read more
Hailuo AI has introduced a new AI model called I2V-01-Live that animates static 2D illustrations into lively videos. This technology uses deep learning to add smooth animations and subtle expressions, making characters appear more dynamic and realistic. Read more
OpenAI has officially released its new AI model, o1, which offers significant improvements in reasoning and problem-solving capabilities. Additionally, a new ChatGPT Pro plan allows users to access the o1 model with enhanced features for $200 a month, making it a powerful tool for tackling complex challenges. Read more
xAlerts is a powerful tool that helps you track the activity of your favorite accounts. Stay informed from the latest activities of investors, celebrities, influencers, athletes, and more. Try it for free!
Fish Audio has released Fish Speech 1.5, an advanced open-source Text-to-Speech model that can generate lifelike speech from text. It has been trained on over 1 million hours of audio data and supports 13 languages, making it highly versatile and accurate in speech synthesis. Read more
Google has launched two new AI models, Imagen 3 and Veo, on its Vertex AI platform, enhancing image and video generation capabilities. Imagen 3 is designed to create high-quality, photorealistic images from text prompts with improved detail and fewer artifacts, while Veo can animate static images or generate videos from text. Read more
Google DeepMind has introduced GenCast, a new AI model that significantly improves weather forecasting accuracy and speed, predicting conditions up to 15 days in advance. This model outperforms existing systems, like the European Centre for Medium-Range Weather Forecasts, by providing probabilistic forecasts that help users understand the likelihood of various weather scenarios. Read more
Microsoft has launched a limited preview of Copilot Vision, an AI tool for its Edge browser that helps users by analyzing and responding to content on web pages in real time. This feature acts like a virtual assistant, allowing users to ask questions about what they see, such as finding recipes or product deals, while browsing the internet. Read more