Google has demonstrated AI systems that can decipher text in spoken language, movies, and photographs captured by a phone’s camera.
In one demonstration, a phone-based prototype AI assistant was able to respond to the common query, “Where did I put my glasses?”
It happens one day after competitor OpenAI unveiled GPT-4o, its most recent AI system, in a visually striking presentation that saw it reading facial expressions from a phone camera and having a smooth conversation—including flirtation.
Google is eager to emphasize that its technologies can perform this type of “multimodal” understanding on par with those of its competitor.
Google had hinted at the potential as a part of this “anything you can do, I can do better” kind of rivalry.