New Open-source AI Animator, Google AI Overviews Update, Ultra-realistic AI Avatars, & more!
Welcome to the AI Search newsletter. Here are the top updates in AI this week.
Google’s AI Overviews Update: Addressing Errors and Improving Accuracy
Google's AI Overviews feature, launched at Google I/O, aims to provide users with accurate information and links to relevant web content. However, some users reported odd and erroneous results, which were amplified on social media. Google explains how AI Overviews work, acknowledging that they can make mistakes due to misinterpreted queries, language nuances, or lack of quality information. The company has made over a dozen technical improvements to address these issues, including better detection of nonsensical queries, limiting satire and user-generated content, and enhancing quality protections for news and health topics.
ToonCrafter: Free Open-source Cartoon Interpolation Generation
ToonCrafter is a generative model that can create cartoon-style animations by interpolating between given frames. It uses a dual-reference-based 3D VAE decoder and a sparse sketch guidance system to generate high-quality, coherent animations. The model can be applied to various tasks, including cartoon sketch interpolation, reference-based sketch colorization, and sparse-sketch-guided generation. The authors demonstrate the model's capabilities through various showcases and comparisons with baseline methods, highlighting its ability to generate realistic and smooth animations.
AI-Powered Drones to the Rescue: Finding Lost Hikers with Machine Learning
Researchers have developed an AI-based drone system to assist in search and rescue operations for lost hikers. The system uses machine learning algorithms to analyze data on hiking paths, geographical features, and hiker characteristics to predict the most likely paths a lost hiker would take. The AI model was trained on a large dataset of lost hiker scenarios and was found to be more effective than traditional search methods, locating lost hikers 19% of the time compared to 8-12% with traditional methods.
ChatLLM by Abacus
ChatLLM by Abacus AI is an integrated platform that allows enterprises to use multiple LLMs, deploy custom agents, and collaborate with team members. Choose from state-of-the-art LLMs such as GPT-4o, Claude 3 Opus, and their new open source Smaug.
Data-Driven Model Mimics Realistic Human Motions for Virtual Avatars
Researchershave developed WANDR, a data-driven model that generates natural human motions for virtual avatars. The model unifies different data sources to achieve more realistic motions, allowing avatars to interact with their virtual environment. WANDR uses a purely data-driven approach, without reinforcement learning, to learn general navigation skills from large datasets and specialized reaching motions from smaller datasets.
Neural Parametric Gaussian Avatars: A Data-Driven Approach to Creating High-Fidelity Digital Humans
NPGA (Neural Parametric Gaussian Avatars) is a data-driven method for creating high-fidelity, controllable digital avatars from multi-view video recordings. It combines 3D Gaussian splatting with neural parametric head models (NPHM) to achieve photo-realistic and real-time rendering performance. NPGA distills the backward deformation field of NPHM into forward deformations compatible with rasterization-based rendering, and learns fine-scale, expression-dependent details from multi-view videos. The method also incorporates per-primitive latent features to govern dynamic behavior and regularize expressivity.
ALOGIC
ALOGIC offers a wide range of professional home/office hardware, including their Clarity series of professional 4K touchscreen monitors, wireless chargers, docking stations, and more. Enhance your productivity and elevate your digital lifestyle with ALOGIC!
AI System Developed to Reduce Emotional Burden of Monitoring Hate Speech on Social Media
Researchers have developed an AI system, called the multi-modal discussion transformer (mDT), that can detect hate speech on social media platforms with 88% accuracy, reducing the emotional toll on humans who would otherwise have to manually monitor and identify such content. The mDT system can understand the relationship between text and images, as well as contextualize comments, making it more effective than previous methods.ALOGIC!
Revolutionary AI Model Can Alter Material Properties in Images
Researchers from MIT and Google have developed an AI-powered diffusion model, called Alchemist, that can change the material properties of objects in images. This model allows users to alter four attributes - roughness, metallicity, albedo, and transparency - of objects in images with a simple slider-based interface. This technology has the potential to revolutionize various fields, including video game design, visual effects, and robotics, by enabling precise control over material properties.