Live AI assistants, INSANE 3D models, SORA is out, Gemini 2.0, AI animates images, AI makes full comics - AI news this week
Welcome to the AI Search newsletter. Here are the top highlights in AI this week.
A new 3D generation model called TRELLIS can create high-quality 3D assets from text or image prompts. It uses a structured latent representation that captures both structural and textural information. The model can also generate variants of a given 3D asset and edit targeted regions. Read more
Do you prefer to watch instead of read? Check out this video covering all the highlights in AI this week:
Researchers at DeepMind have developed a way to control video generation using motion trajectories. This gives fine-grained control over objects and cameras in videos, enabling object manipulation, camera control, and motion transfer from one video to another. Read more
Google has introduced Gemini 2.0, a new AI model that can understand and respond to user requests across multiple formats, including images, audio, and text. It is designed to be more capable and helpful, and it can take actions on the user's behalf under their supervision. Read more
uPix is an AI selfie generator that lets users turn themselves into anyone in just one click. Select from a vast array of templates, ranging from superheroes and business portraits to anime characters. Try it out today!
Meta has launched Meta Motivo, an AI model that enables virtual humanoid agents to perform complex tasks with human-like movements. This model can handle various actions like motion tracking and reaching for goals without needing extra training. Read more
OpenAI has launched Sora, a new AI tool that turns text prompts into videos, making video creation easier and more accessible. With features like storyboards for planning scenes and the ability to animate images, Sora allows users to generate high-quality videos up to 1080p resolution and 20 seconds long. Read more
Google has launched a new feature called Deep Research, designed to assist users with web research. This AI tool can create a detailed research plan, gather information from the internet, and produce comprehensive reports based on user queries. It's aimed at making complex research tasks quicker and easier for users by mimicking human browsing and analysis processes. Read more
xAlerts is a powerful tool that helps you track the activity of your favorite accounts. Stay informed about the latest activity from investors, celebrities, influencers, athletes, and more. Try it for free!
OpenAI has upgraded its ChatGPT mobile app with new features that let users share live video or their screen during voice conversations. This means you can show ChatGPT what you're looking at or working on in real time, making it easier to get help or feedback without needing to describe everything. Read more
Google has introduced Project Mariner, an AI agent that can automate tasks in your web browser. Built on Gemini 2.0, it can understand elements on the screen, such as text and images, and perform actions like searching for information or filling out forms based on user commands. Read more
xAI has launched Aurora, a new image generation model integrated into its Grok assistant on X. Aurora can create photorealistic images from text prompts and edit existing visuals, making it a powerful tool for users looking to generate high-quality content. Read more
Researchers have developed a new AI framework called DiffSensei that can generate customized manga pages with consistent characters. It combines diffusion models with multimodal LLMs to control character appearances and interactions across multiple panels and pages. Read more