OpenAI New GPT-4o (omni) Model Demo In Mobile App With Audio Conversation - It Is Just Amazing

Published 2024-05-15
GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction: it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.
------------------------------------------------------------------------------------------------
Support me by joining the channel membership so that I can upload more videos of this kind
youtube.com/channel/UCNU_lfiiWBdtULKOw6X0Dig/join
-----------------------------------------------------------------------------------
►GenAI on AWS Cloud Playlist:    • Generative AI In AWS-AWS Bedrock Cras...  
►LlamaIndex Playlist:    • Announcing LlamaIndex Gen AI Playlist...  

►Google Gemini Playlist:    • Google Is On Another Level- Check Out...  
►Langchain Playlist:    • Amazing Langchain Series With End To ...  
►Data Science Projects:    • Now you Can Crack Any ML Interviews- ...  

►Learn In One Tutorials

Statistics in 6 hours:    • Complete Statistics For Data Science ...  

End To End RAG LLM APP Using LlamaIndex And OpenAI- Indexing And Querying Multiple Pdf's

Machine Learning In 6 Hours:    • Complete Machine Learning In 6 Hours|...  

Deep Learning in 5 hours:    • Deep Learning Indepth Tutorials In 5 ...  

►Learn In a Week Playlist

Statistics:   • Live Day 1- Introduction To statistic...  

Machine Learning:    • Announcing 7 Days Live Sessions On Ma...  

Deep Learning:   • 5 Days Live Deep Learning Community S...  

NLP:    • Announcing NLP Live community Sessions  
---------------------------------------------------------------------------------------------------
My Recording Gear
Laptop: amzn.to/4886inY
Office Desk : amzn.to/48nAWcO
Camera: amzn.to/3vcEIHS
Writing Pad: amzn.to/3OuXq41
Monitor: amzn.to/3vcEIHS
Audio Accessories: amzn.to/48nbgxD
Audio Mic: amzn.to/48nbgxD

All Comments (21)
  • @moxes8237
    That’s not the upgraded voice conversation shown in the demo; that’s the old one. The new voice doesn’t come out until a few weeks later.

    You will know it’s the new voice model if you test it by asking it to change its voice to a southern or robotic one and it is able to do so.
  • @pranaypaul6361
    The last touch was amazing... we are already living in the future!!!
  • Thanks Krish! Can you please make a video on using GPT-4o in a use case? For example: a multimodal bot that can handle text, vision, and audio all in one place. Thanks a lot! Keep doing great things.
  • @planbislive
    Hello sir...
    Do you have any idea when everyone can use it, that is, when it's going to be available for free users?
  • @NoDoglapan
    Thanks for the video Krish, but I hope you don't get in trouble with Miss GPT-4o by constantly interrupting her while she is speaking😂
  • @firdousbhat123
    But Krish, as far as I know, this audio interaction feature was already there with ChatGPT 4. It's not from the omni version.
  • @suraj-gg1qv
    it seems different from the demo that the founders showed us
  • @bhupeshmahara
    Will the omni model be free to use, or do we have to purchase the $20 subscription?
  • @anurag9767
    Not available yet for free users, though it was said it would be available to free users as well.
  • @mudangkano5267
    Sir, right now I don't have the voice feature in the ChatGPT app. If I pay Rs. 1950 for a month, will I get the voice feature?