Stable Diffusion 3 is out! Reddit's $60M AI deal with Google, Huawei’s PanGu-π Beats All Tiny Language Models, & More!
Welcome to the AI Search newsletter. Here are the top updates in AI this week.
Reddit strikes $60M deal with Google to train AI
Reddit has partnered with Google, allowing the tech giant to utilize posts from the popular online discussion platform to train its artificial intelligence models and enhance services like Google Search. The deal, valued at approximately $60 million, also grants Reddit access to Google's AI models to improve its internal site search and other features. This partnership comes as Reddit unveiled plans for its initial public offering, reporting a net income of $18.5 million in the last quarter of 2020. The collaboration marks a significant step for Reddit, which is known for its user-driven content and unique approach to content moderation. Unlike other social media platforms, Reddit does not rely on algorithmic processes to predict user preferences, instead focusing on user-driven discussions organized by topic. Google, on the other hand, views Reddit as a valuable source of authentic human conversations and experiences to help improve the quality of information provided through its products.
Stable Diffusion 3 is Here! A New Era in AI Image Generation
This new model, currently available for early preview, has been lauded for its capabilities in handling multi-subject prompts, enhancing image quality, and improving spelling accuracy. With a parameter range from 800 million to 8 billion, Stable Diffusion 3 offers users unparalleled flexibility in scalability and quality, catering to a wide range of creative needs. The company has emphasized its commitment to safety and preventing misuse of the technology, pledging to collaborate with experts to enhance integrity and safety features. The model's innovative architecture, combining diffusion transformer and flow matching techniques, promises to revolutionize the field of AI image generation.
Murf AI
Introducing Murf, the most versatile AI text-to-speech generator. Create studio-quality voice overs in minutes using lifelike AI voices suitable for podcasts, videos, presentations, and more. Choose from 120+ text to speech voices in 20+ languages, with the ability to add video, music, or images and sync them to the voiceover.
Huawei’s New PanGu-π Beats All Tiny Language Models
Huawei has introduced a cutting-edge approach to developing tiny language models (TLMs) optimized for mobile devices. Traditional large language models, though powerful, are not well-suited for mobile use due to their high computational and memory requirements. The researchers' innovative solution, PanGu-π Pro, utilizes a carefully crafted architecture and advanced training techniques to deliver exceptional efficiency and effectiveness. Notably, the team compressed the tokenizer to reduce the model's size without compromising its language understanding and generation capabilities. Architectural adjustments were also made, including parameter inheritance from larger models and a multi-round training strategy for enhanced learning efficiency. The results of training PanGu-π Pro on a massive multilingual corpus were impressive, with the 1B and 1.5B parameter versions showcasing significant improvements in benchmark evaluation sets.
Amazon’s BASE TTS: The Largest Text-to-Speech Model Ever
Amazon AGI has recently unveiled the creation of BASE TTS, which stands for Big Adaptive Streamable TTS with Emergent abilities. This is the largest text-to-speech model ever made, boasting an impressive 980 million parameters and being trained using 100,000 hours of recorded speech from public sources. While the majority of the data used for training was in English, the model was also exposed to examples of spoken words and phrases in other languages. This extensive training has enabled BASE TTS to develop emergent qualities, breaking through to a higher level of intelligence and demonstrating advanced language attributes such as the correct pronunciation of well-known phrases and the ability to use compound nouns, express emotions, use foreign words, apply paralinguistics and punctuation, and ask questions with emphasis placed on the right word in a sentence.
OpenAI Now Valued at $86 Billion
OpenAI, the renowned artificial intelligence research lab, is currently in discussions to allow its employees to sell their shares at an astounding valuation of $86 billion. This move signifies a momentous milestone for the company, highlighting the significant value and potential perceived in its advanced AI technologies. The valuation serves as a testament to the groundbreaking work being carried out by OpenAI and the high esteem in which the company's contributions to the AI field are held. It also emphasizes the growing recognition of the economic value of AI technologies and the strong investor interest in companies leading the way in this sector. The deal to enable employees to sell their shares at such a high valuation is a noteworthy development, providing a chance for employees to reap the financial rewards of their contributions to the company's success.