OpenAI Unveils New ChatGPT That Listens, Looks and Talks

From The New York Times: 2024-05-14 01:12:00

Apple and Google are evolving their voice assistants into chatbots, whereas OpenAI is turning its chatbot into a voice assistant. OpenAI has introduced a new version of ChatGPT that can process voice commands, images, and videos much faster using the AI system GPT-4o, available for free on smartphones and desktops.

This move reflects a broader trend of merging conversational chatbots with voice assistants, with Google integrating its Gemini chatbot with the Google Assistant and Apple enhancing Siri’s conversational abilities. OpenAI plans to roll out the technology gradually over the coming weeks, marking the first desktop application release for ChatGPT.

OpenAI’s ChatGPT has garnered praise for its ability to generate text, answer questions, write papers, and even produce computer code in response to conversational prompts. By analyzing vast amounts of text online, including Wikipedia articles and chat logs, ChatGPT developed its skills and became a potential alternative to search engines and voice assistants.

Newer versions of the technology embrace multimodal AI by learning from sounds, images, and videos. This approach combines chatbots with AI image, audio, and video generators, advancing the capabilities of AI-powered systems. Companies like OpenAI are at the forefront of innovating in this space by combining different modalities for seamless user interactions.



Read more at The New York Times: OpenAI Unveils New ChatGPT That Listens, Looks and Talks