Released on, December 15 under an MIT license, the 350-million-parameter model delivers up to six times faster-than-real-time speech on GPUs. It excels in blind tests, beating rivals like ElevenLabs T...
OpenAI has released a major upgrade to its image generation system with ChatGPT Images powered by GPT-Image-1.5, offering faster performance, stronger instruction-following, and more precise editing. ...
DoorDash has launched Zesty, a new AI-powered social app focused on helping users discover local restaurants quickly and easily. Initially available in the San Francisco Bay Area and New York, the app...
Satya Nadella’s keynote centers on a powerful theme: AI is transforming what individuals and organizations can achieve, and the frontier of innovation is moving rapidly. He stresses that embracing new...
The beta feature in Google Translate uses the new Gemini 2.5 Flash Native Audio model to deliver live speech-to-speech translation across more than 70 languages. Users just connect any headphones to a...
Gemini 3 flash is a frontier intelligence built for speed having pro-grade reasoning at a fraction of the cost and it's google's most impressive model for agentic workflows. Gemini 3 Flash offers exce...
HY World 1.5 WorldPlay model by Tencent, person can create detailed 3D scenes like cozy cabins and misty mountains with incredible realism.
Simple text prompts to accurately separate any sound from any audio or audio-visual source
The Nemotron 3 family of open models — in Nano, Super and Ultra sizes.