OpenAI debuts GPT-4o 'omni

by DWolfe
5 replies
Just an FYI - https://techcrunch.com/2024/05/13/op...del-is-gpt-4o/

GPT-4 Turbo, OpenAI's previous "leading "most advanced" model, was trained on a combination of images and text and could analyze images and text to accomplish tasks like extracting text from images or even describing the content of those images. But GPT-4o adds speech to the mix.

GPT-4o greatly improves the experience in OpenAI's AI-powered chatbot, ChatGPT. The platform has long offered a voice mode that transcribes the chatbot's responses using a text-to-speech model, but GPT-4o supercharges this, allowing users to interact with ChatGPT more like an assistant.

"For example, users can ask the GPT-4o-powered ChatGPT a question and interrupt ChatGPT while it's answering. The model delivers "real-time" responsiveness, OpenAI says, and can even pick up on nuances in a user's voice, in response generating voices in "a range of different emotive styles" (including singing). "

Anybody going to try using this feature as an assistant? Will this eventually put some of the low quality Virtual Assistants out of work?
#‘omni #debuts #gpt4o #openai
Avatar of Unregistered
  • Profile picture of the author dtaylor4
    If anyone can get the voice generation to sound real I'd love to know how, I can't seem to get it right.
    {{ DiscussionBoard.errors[11792255].message }}
  • Profile picture of the author Monetize
    Originally Posted by DWolfe View Post

    Just an FYI - https://techcrunch.com/2024/05/13/op...del-is-gpt-4o/

    GPT-4 Turbo, OpenAI's previous "leading "most advanced" model, was trained on a combination of images and text and could analyze images and text to accomplish tasks like extracting text from images or even describing the content of those images. But GPT-4o adds speech to the mix.

    GPT-4o greatly improves the experience in OpenAI's AI-powered chatbot, ChatGPT. The platform has long offered a voice mode that transcribes the chatbot's responses using a text-to-speech model, but GPT-4o supercharges this, allowing users to interact with ChatGPT more like an assistant.

    "For example, users can ask the GPT-4o-powered ChatGPT a question and interrupt ChatGPT while it's answering. The model delivers "real-time" responsiveness, OpenAI says, and can even pick up on nuances in a user's voice, in response generating voices in "a range of different emotive styles" (including singing). "

    Anybody going to try using this feature as an assistant? Will this eventually put some of the low quality Virtual Assistants out of work?

    Thank you for this.

    I noticed the upgrade last night when I engaged in a chat.

    I did not know it talks, I don't think I will use voice because
    I prefer typing my inquiries and receiving written outputs.

    I may talk to it in the future, but not now, although I do use
    AI voiceovers for my videos.
    {{ DiscussionBoard.errors[11792262].message }}
  • Profile picture of the author max5ty
    AI is expanding and growing.

    Unfortunately, those who are the most excited about using it on their websites, etc., are the ones who will be hurt the most.

    I heard most websites will see about 2/3 less traffic coming from search engines because of AI like Google is using now...and expanding in the short future for searches.

    Added: In case anyone is interested in where I got the 2/3 thing from, here is the article from May 13 that I read:

    https://www.washingtonpost.com/techn...search-io-sge/
    {{ DiscussionBoard.errors[11792302].message }}
    • Profile picture of the author CyberSEO
      Originally Posted by max5ty View Post

      Unfortunately, those who are the most excited about using it on their websites, etc., are the ones who will be hurt the most.
      Such a thing as SEO will cease to exist in the near future. As well as many other processional fields. This is an inevitable process and everyone needs to realize this.
      Signature
      CyberSEO Pro - the ultimate AI autoblogging and RSS, XML, HTML, JSON and CSV import plugin for WordPress with support for OpenAI o1, Claude, Gemini, Llama 3, Midjourney, DALL-E, Stable Diffusion and more.
      {{ DiscussionBoard.errors[11792400].message }}
  • Profile picture of the author aronprins
    This is a epic model - the craziest implementation I've seen so far is over at https://mychatbots.ai

    They can train a AI chatbot on my own data, and use that to write, develop, and offer support to my users!
    {{ DiscussionBoard.errors[11804717].message }}
Avatar of Unregistered

Trending Topics