AGI is here, realtime 3D faces, insane AI video model, image to 3D worlds
Welcome to the AI Search newsletter. Here are the top highlights in AI this week.
Google has unveiled Gemini Flash Thinking, an experimental model that provides a unique glimpse into its thought process. It is designed to leverage its own thought patterns to enhance its reasoning capabilities. You can now access this powerful tool through Google AI Studio and the Gemini API. Read more
Do you prefer to watch instead of read? Check out this video covering all the highlights in AI this week:
Meta has announced Apollo, a groundbreaking family of open-source models designed to efficiently process hour-long videos. The Apollo-7B model has achieved state-of-the-art performance on the Multi-Modal Video Understanding (MLVU) benchmark and Video-Multi-Modal Evaluation (Video-MME) metric. Read more
Genesis, an open-source physics simulation environment, has been released for general-purpose applications in Robotics, Embodied AI, and Physical AI. At the heart of Genesis lies a cutting-edge, generative physics engine capable of creating complex, 4D dynamical worlds, all powered by a robust physics simulation platform. Read more.
uPix is an AI Selfie Generator that allows users to turn into anyone in just one click. Select from a vast array of templates, ranging from superheroes to business portraits, and even anime characters. Try it out today!
Google has released an updated version of its video generation model, Veo 2. This new version can produce videos at resolutions up to 4K, featuring ultra high quality, enhanced consistency, and a better prompt understanding. Read more
Odyssey has launched Explorer, an image-to-world model that converts any image into a fully realized, detailed 3D environment. Explorer is specifically optimized for creating photorealistic worlds. Read more
Nvidia has launched the Jetson Orin Nano Super, a compact AI supercomputer priced at $249. This new device offers significant performance improvements, including a 1.7x increase in generative AI capabilities and enhanced memory bandwidth, making it ideal for developers and hobbyists interested in AI applications. Read more
xAlerts is a powerful tool that helps you track the activity of your favorite accounts. Stay informed from the latest activities of investors, celebrities, influencers, athletes, and more. Try it for free!
Pika has released Pika 2.0, an advanced AI video generation tool with a new feature called Scene Ingredients. This feature allows users to upload images of people, objects, and places to create customized video scenes, enhancing creative control and personalization. Read more
Researchers have developed a new method for creating long volumetric videos from multi-view videos using a technique called Temporal Gaussian Hierarchy. This approach allows for efficient reconstruction of longer videos while maintaining high quality, overcoming limitations of previous methods that struggled with memory and rendering speed. Read more
Researchers have created a system called Wonderland that generates 3D scenes from just a single image. It uses advanced techniques to build high-quality, detailed 3D environments quickly and efficiently, overcoming limitations of previous methods that required multiple images or took a long time to process. Read more
CAP4D is a new AI system that creates realistic 4D avatars from reference images. It uses a morphable multi-view diffusion model to generate various views and expressions, allowing for real-time rendering and animation of these avatars. This technology can produce high-quality avatars even from a single image, making it useful for gaming and virtual reality. Read more