OpenAI introduces GPT-4o, a versatile AI model that integrates text, audio, and image processing

From The National: 2024-05-14 04:45:36

OpenAI introduces GPT-4o, its new flagship artificial intelligence model that can process audio, vision, and text in real time. The model’s ability to accept various inputs and generate different outputs makes it stand out. GPT-4o can respond to audio inputs in as little as 232 milliseconds, similar to human conversation speed.

GPT-4o simplifies the process of converting input into output by merging text, audio, and image processing into a single model. OpenAI has addressed previous limitations by refining the model’s behaviors and implementing safety systems to mitigate risks. The company aims to attract more users by offering free access to GPT-4o.

OpenAI faces stiff competition in the generative AI landscape, with rivals like Google and Anthropic launching their own advanced models. Google’s Gemini and Gemma, and Anthropic’s Claude 3 offer varying capabilities at different price points. Despite the fierce competition, OpenAI remains dedicated to advancing its technology and maintaining its position as a leader in the field.

Read more at The National: What’s in OpenAI’s new free generative AI model GPT-4o

More Live News

Newmont (NEM) Q2 2025 Earnings: Stock Surges 6%

Ulta Beauty (ULTA) Downgraded to Hold by Loop Capital

Charter Communications (CHTR) Plunges 18% on Disappointing Results