ummid logo
Welcome Guest! You are here: Home » Science & Tecnhnology

'GPT-4o': OpenAI unpacks more capable, conversational yet cost-effective AI model

Microsoft backed OpenAI Tuesday May 14, 2024 launched GPT-4o, a more capable, conversational yet cost-effective and affordable version of its flagship AI model.

Tuesday May 14, 2024 3:30 PM, ummid.com News Network

'GPT-4o': OpenAI unpacks more capable, conversational yet cost-effective AI model

San Francisco: Microsoft backed OpenAI Tuesday May 14, 2024 launched GPT-4o, a more capable, conversational yet cost-effective and affordable version of its flagship AI model.

GPT-4o features real-time voice interaction and improved text, audio, and image handling.

"We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time", OpenAI said.

The launch of GPT-4o reflects the intensifying competition among tech companies to expand their user base and monetise their generative AI technology, with OpenAI aiming to maintain its lead in the market.

GPT-4o Features

"GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs", OpenAI said.

"It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation."

"GPT-4o matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models", OpenAI said.

Supported Languages

As many as 20 languages, including Urdu, Arabic, Persian, Marathi, Telugu, Gujarat, Tamil, Turkish and Chinese were chosen as representative of the new tokenizer's compression across different language families.

GPT-4o has safety built-in by design across modalities, through techniques such as filtering training data and refining the model’s behavior through post-training.

"We have also created new safety systems to provide guardrails on voice outputs", OpenAI said.

Until now, OpenAI’s most advanced latest Large Language Model (LLM) was the GPT-4, which was only available to paid users. However, the GPT-4o will be freely available.

Developers can also now access GPT-4o in the API as a text and vision model.


Select Language To Read in Urdu, Hindi, Marathi or Arabic.

 

 

For all the latest News, Opinions and Views, download ummid.com App.

Google News

 Post Comments
Note: By posting your comments here you agree to the terms and conditions of www.ummid.com

..