Google unveiled Gemini 2.0 yesterday, almost exactly one year after Google’s initial Gemini launch. The new release offers enhanced multimodal capabilities like native image and audio output, real-time tool use, and advanced reasoning to enable agentic experiences, such as acting as a universal assistant or research companion. VentureBeat reports: During a recent press conference, Tulsee Doshi, director of product management for Gemini, outlined the system’s enhanced capabilities while demonstrating real-time image generation and multilingual conversations. “Gemini 2.0 brings enhanced performance and new capabilities like native image and multilingual audio generation,” Doshi explained. “It also has native intelligent tool use, which means that it can directly access Google products like search or even execute code.”
The initial release centers on Gemini 2.0 Flash, an experimental version that Google claims operates at twice the speed of its predecessor while surpassing the capabilities of more powerful models. This represents a significant technical achievement,
Leave a Reply