Open AI GPT-4o - audio, vision, text combined reasoning

  • 1 Replies
  • 35800 Views
*

MikeB

  • Autobot
  • ******
  • 224
Open AI GPT-4o - audio, vision, text combined reasoning
« on: May 14, 2024, 05:37:05 am »
Quote
GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

https://openai.com/index/hello-gpt-4o/


*

MikeB

  • Autobot
  • ******
  • 224
Re: Open AI GPT-4o - audio, vision, text combined reasoning
« Reply #1 on: May 14, 2024, 05:46:48 am »
They are actually claiming reduced tokens in language. "English 1.1x fewer tokens (from 27 to 24)", with some languages compressing more.

Although there's little to say how it works, as Noam Chomsky points out, nothing is learned from language if a computer arbitrarily uses language and maths to predict with, including arbitrary pre-compression...

Response times are good however.

ChatGPT 3.5 is fast as well. https://chatgpt.com/

 


Requirements for functional equivalence to conscious processing?
by DaltonG (General AI Discussion)
November 19, 2024, 11:56:05 am
Will LLMs ever learn what is ... is?
by HS (Future of AI)
November 10, 2024, 06:28:10 pm
Who's the AI?
by frankinstien (Future of AI)
November 04, 2024, 05:45:05 am
Project Acuitas
by WriterOfMinds (General Project Discussion)
October 27, 2024, 09:17:10 pm
Ai improving AI
by infurl (AI Programming)
October 19, 2024, 03:43:29 am
Atronach's Eye
by WriterOfMinds (Home Made Robots)
October 13, 2024, 09:52:42 pm
Running local AI models
by spydaz (AI Programming)
October 07, 2024, 09:00:53 am
Hi IM BAA---AAACK!!
by MagnusWootton (Home Made Robots)
September 16, 2024, 09:49:10 pm
LLaMA2 Meta's chatbot released
by spydaz (AI News )
August 24, 2024, 02:58:36 pm
ollama and llama3
by spydaz (AI News )
August 24, 2024, 02:55:13 pm
AI controlled F-16, for real!
by frankinstien (AI News )
June 15, 2024, 05:40:28 am
Open AI GPT-4o - audio, vision, text combined reasoning
by MikeB (AI News )
May 14, 2024, 05:46:48 am
OpenAI Speech-to-Speech Reasoning Demo
by MikeB (AI News )
March 31, 2024, 01:00:53 pm
Say good-bye to GPUs...
by MikeB (AI News )
March 23, 2024, 09:23:52 am
Google Bard report
by ivan.moony (AI News )
February 14, 2024, 04:42:23 pm
Elon Musk's xAI Grok Chatbot
by MikeB (AI News )
December 11, 2023, 06:26:33 am

Users Online

185 Guests, 0 Users

Most Online Today: 405. Most Online Ever: 2369 (November 21, 2020, 04:08:13 pm)

Articles