VQA based on image captions

  • 5 Replies
  • 1856 Views
*

hainaa

  • Roomba
  • *
  • 4
VQA based on image captions
« on: April 22, 2019, 07:28:47 am »
Hi,

Is there some paper related to the task of answering an image based question given as input an image and its description.
I tried to search online but I didn't find any but I remember some days back I saw such work. If anyone know of some work related to it, please share the link.

Thanks!

*

LOCKSUIT

  • Emerged from nothing
  • Trusty Member
  • *******************
  • Prometheus
  • *
  • 4659
  • First it wiggles, then it is rewarded.
    • Main Project Thread
Re: VQA based on image captions
« Reply #1 on: April 22, 2019, 08:46:11 am »
like google deepmind or somethin? I seen something like white images with circles and patterns of problem solving in a video i seen online, on I think it was Two Minute Papers's youtube channel. They make it generally aswer questions like what is the most likely pattern that comes next in this array of circles...
Emergent          https://openai.com/blog/

*

ivan.moony

  • Trusty Member
  • ************
  • Bishop
  • *
  • 1729
    • mind-child
Re: VQA based on image captions
« Reply #2 on: April 22, 2019, 10:52:11 am »

*

hainaa

  • Roomba
  • *
  • 4
Re: VQA based on image captions
« Reply #3 on: April 23, 2019, 05:59:01 am »
No, It's different. It's great work but I want one where input is (question, image , image description) and output is the answer for the question.

*

WriterOfMinds

  • Trusty Member
  • ********
  • Replicant
  • *
  • 617
    • WriterOfMinds Blog
Re: VQA based on image captions
« Reply #4 on: April 23, 2019, 06:53:11 am »
The closest thing I can think of is Visual Chatbot. It generates the image description rather than taking it as input, but I think it can use dialogue history (including the generated caption?) to help answer followup questions.

https://arxiv.org/abs/1611.08669

*

hainaa

  • Roomba
  • *
  • 4
Re: VQA based on image captions
« Reply #5 on: April 23, 2019, 06:57:59 am »
Actually this is really interesting and maybe I could get ideas from it. Thanks for the link  :)

 


Requirements for functional equivalence to conscious processing?
by DaltonG (General AI Discussion)
November 19, 2024, 11:56:05 am
Will LLMs ever learn what is ... is?
by HS (Future of AI)
November 10, 2024, 06:28:10 pm
Who's the AI?
by frankinstien (Future of AI)
November 04, 2024, 05:45:05 am
Project Acuitas
by WriterOfMinds (General Project Discussion)
October 27, 2024, 09:17:10 pm
Ai improving AI
by infurl (AI Programming)
October 19, 2024, 03:43:29 am
Atronach's Eye
by WriterOfMinds (Home Made Robots)
October 13, 2024, 09:52:42 pm
Running local AI models
by spydaz (AI Programming)
October 07, 2024, 09:00:53 am
Hi IM BAA---AAACK!!
by MagnusWootton (Home Made Robots)
September 16, 2024, 09:49:10 pm
LLaMA2 Meta's chatbot released
by spydaz (AI News )
August 24, 2024, 02:58:36 pm
ollama and llama3
by spydaz (AI News )
August 24, 2024, 02:55:13 pm
AI controlled F-16, for real!
by frankinstien (AI News )
June 15, 2024, 05:40:28 am
Open AI GPT-4o - audio, vision, text combined reasoning
by MikeB (AI News )
May 14, 2024, 05:46:48 am
OpenAI Speech-to-Speech Reasoning Demo
by MikeB (AI News )
March 31, 2024, 01:00:53 pm
Say good-bye to GPUs...
by MikeB (AI News )
March 23, 2024, 09:23:52 am
Google Bard report
by ivan.moony (AI News )
February 14, 2024, 04:42:23 pm
Elon Musk's xAI Grok Chatbot
by MikeB (AI News )
December 11, 2023, 06:26:33 am

Users Online

405 Guests, 1 User
Users active in past 15 minutes:
WriterOfMinds
[Trusty Member]

Most Online Today: 490. Most Online Ever: 2369 (November 21, 2020, 04:08:13 pm)

Articles