OpenAI is set to launch the new flagship model for its chatbot, named ChatGPT-4o (officially called GPT-4o). What can it do, and how can it make a difference in your life? Let’s talk about that.
What is ChatGPT-4o?
First of all, let me clear up any confusion: it isn’t ChatGPT-5, but it is a significant upgrade over the existing GPT-4 model behind ChatGPT, with more interesting and practical capabilities. It can work with several modes of prompting: vision (using the device’s camera to see and reason), voice (that’s an obvious one), and regular text input.
The model is complex, but surprisingly it is faster than the older version 4, at least in the demo video (I hope it will be the same when it goes live for public use).
What Is It Capable of Doing?
We haven’t had the chance to get our hands on the 4o yet, so these are the features shown in the demo video and by some of the geeky Twitter accounts that have covered ChatGPT extensively.
So, here we go:
Live Translation
Since the advent of ChatGPT, every tech giant has tried to get its fair share of the AI market. A good example is the integration of AI into the Pixel and Galaxy S phones, which can translate live calls into different languages.
While they do that on a phone call, the 4o can do it in a regular conversation, and pretty seamlessly at that. There were no significant lags during the demo. The chatbot quickly translated the Italian spoken by the host, Mira Murati (Chief Technology Officer at OpenAI), into English and vice versa.
That said, the seamlessness depends on the quality of your internet connection. To keep the experience smooth when it rolls out publicly, make sure you have a connection from a dependable ISP like Xfinity, which offers affordable plans, supreme network stability, and the super-friendly Xfinity customer service.
It Seems As if We Are Talking to a Human
I was amazed by some of the responses from the 4o.
They had such a realistic, human-like quality that it felt like the presenters were actually talking to a human. The host and the guests interacted with the chatbot by voice, so it is no longer limited to text input: users can talk to it, interrupt it when needed, and change the tone of its voice or the speed of its response. Not to forget, it can also sing its response when asked.
It’s not as melodious as Adele, but it works.
This unlocks tons of possibilities for diverse use cases in our daily lives. For example, I will be using it to write and sing for Steve (my baby boy) while I am babysitting and writing articles like this one for you.
So, it’s a pretty fun and useful tool to have in your pocket.
No More Interaction Through Just the Web
According to Mira Murati, this version of ChatGPT is going native.
People have been going to OpenAI’s official website to use ChatGPT, and until now there hasn’t been a desktop application for the chatbot.
By native, she means that OpenAI has developed an application for ChatGPT-4o to make it easy for users to integrate it into their daily workflow. In the demo they were showing a Mac, so it’s likely that the application will roll out for Macs first and then for Windows PCs.
So, that’s good, as it is going to be much better optimized for the native operating system.
It Is Capable of Processing Images and Videos
ChatGPT-4o can interact through the videos and pictures that users upload.
To demonstrate that, one of the co-hosts in the demo uploaded a video of himself smiling and asked what the guy was feeling. The response from the 4o was pretty accurate, and even witty at the end, when the co-host told it how amazing it was and that they were presenting its capabilities in front of an audience.
I think this is by far the most human-like response I have seen from an AI-powered tool and I loved it.
It’s Free
Last but not least, and certainly the best feature of ChatGPT-4o: it’s going to be free.
You heard it right! Anyone can use ChatGPT-4o, as OpenAI aims to lessen the friction between AI-based chatbots and users.
However, there will be a paid version that provides similar features, just without the caps on capacity and the number of prompts. So, users of the paid version will be all set too.