Efficiency!

  • 10 Replies
  • 620 Views
*

frankinstien

  • Starship Trooper
  • *******
  • 396
    • Knowledgeable Machines
Efficiency!
« on: March 25, 2021, 04:30:05 pm »
I'm working on sound processing and using FFT to break the signals into frequency and energy. The computational horsepower to do this is best applied to the GPU even though there are tons of CPU-based solutions out there for FFT. But when you realize how biology does it, where hairs in the cochlea vibrate and the cochlea is tuned for various frequencies the ability to identify the frequencies and energy from each is instant! The moment the sound is captured by the ear it is already identified and accessed to its energy level. With that said perhaps there is a better approach for A.I. hearing where some kind of tuned pipe or pipes identify the frequency and its energy accessed as well. Miniaturizing this is also a challenge.

Does anyone have any ideas?
« Last Edit: March 26, 2021, 02:29:30 pm by frankinstien »

*

MagnusWootton

  • Nomad
  • ***
  • 69
Re: Efficiency!
« Reply #1 on: March 27, 2021, 04:02:11 pm »
Cpus can run lots of fast fourier transforms,  maybe back before we had 3 gigahert quad core cpus you could only have a few running at once, but now you can pretty much spam them into your VST host when your making music.

The mamallian ear is a mystery of nature,   how can we percieve in all directions all sounds from everywhere, just from two diaphram/microphone sources?.    To make a robot my best bet would be to do it more like an eye, and have a separate receptor for every direction.

Another thing, is god could give us "hear vision" if he wanted, and maybe all you need is an ear out front and it can actually see as well,   only god truly knows tho.

*

LOCKSUIT

  • Emerged from nothing
  • Trusty Member
  • *******************
  • Prometheus
  • *
  • 4501
  • First it wiggles, then it is rewarded.
    • Main Project Thread
Re: Efficiency!
« Reply #2 on: March 27, 2021, 05:04:51 pm »
.......What's wrong with usig a microphone? You should be able to get both volume and pitch per "pixel" timestep.
Emergent

*

MagnusWootton

  • Nomad
  • ***
  • 69
Re: Efficiency!
« Reply #3 on: March 27, 2021, 08:46:04 pm »
Yeh but how do you get all the different sounds separated per direction with just 2 microphones.

*

frankinstien

  • Starship Trooper
  • *******
  • 396
    • Knowledgeable Machines
Re: Efficiency!
« Reply #4 on: March 28, 2021, 05:48:27 pm »
Yeh but how do you get all the different sounds separated per direction with just 2 microphones.

There is an ability to remove voices from audio tracks with DFT/FFT:
2DFT
Extract Vocals

But here's a paper that uses a different approach to sound seperation

And here are some A.I. approaches.

Here are some localization approaches using A.I.
Simulation of Human Ear Recognition Sound
Direction


Robotics

*

LOCKSUIT

  • Emerged from nothing
  • Trusty Member
  • *******************
  • Prometheus
  • *
  • 4501
  • First it wiggles, then it is rewarded.
    • Main Project Thread
Re: Efficiency!
« Reply #5 on: March 28, 2021, 06:55:58 pm »
Yeh but how do you get all the different sounds separated per direction with just 2 microphones.

Same way you separate things in an image....they are spaced by timesteps....look a bit to the left and you get another object....an you may even see 2 objects overlayered! :)
Emergent

*

MagnusWootton

  • Nomad
  • ***
  • 69
Re: Efficiency!
« Reply #6 on: March 28, 2021, 08:48:35 pm »
Yeh but how do you get all the different sounds separated per direction with just 2 microphones.

There is an ability to remove voices from audio tracks with DFT/FFT:
2DFT
Extract Vocals

But here's a paper that uses a different approach to sound seperation

And here are some A.I. approaches.

Here are some localization approaches using A.I.
Simulation of Human Ear Recognition Sound
Direction


Robotics

Yes, thats quite a fancy bit of audio engineering there.   the magic filter that separates the instruments from each other is something audio engineers dream about, and its happening now.   quite amazing.

*

frankinstien

  • Starship Trooper
  • *******
  • 396
    • Knowledgeable Machines
Re: Efficiency!
« Reply #7 on: March 30, 2021, 08:10:24 pm »
Yes, thats quite a fancy bit of audio engineering there.   the magic filter that separates the instruments from each other is something audio engineers dream about, and its happening now.   quite amazing.

I just finished a meeting on MS Teams and had the TV on and noticed how I was able to differentiate the voices from the computer speakers and the TV and it was based on localization. Then I thought that even when you're in a crowded room you can focus on voices or sounds by their direction, so it doesn't matter if one is whispering while there are other sounds or noises around because we can detect the location of a sound we can focus on just that information. FFT does have the ability to do phase-detection so if I drop everything other than the phase component of the sound that is of interest it should allow for better speech recognition as well as sound recognition.

This is why I'm migrating a DFT/FFT library to OpenCL, I need to do all kinds of fancy stuff to sound. I have seen some sites that have OpenCL code but no real examples of it working, but if someone here has seen such material please do not hesitate to post it here.

*

MagnusWootton

  • Nomad
  • ***
  • 69
Re: Efficiency!
« Reply #8 on: March 31, 2021, 07:18:44 am »
By the look of the deep learning going into it these days,  maybe its more in the mind in the ear,  but just shows, once u have a full intelligence, u only need a simple ear...

*

infurl

  • Administrator
  • **********
  • Millennium Man
  • *
  • 1172
  • Humans will disappoint you.
    • Home Page
Re: Efficiency!
« Reply #9 on: March 31, 2021, 07:24:50 am »
This is why I'm migrating a DFT/FFT library to OpenCL, I need to do all kinds of fancy stuff to sound. I have seen some sites that have OpenCL code but no real examples of it working, but if someone here has seen such material please do not hesitate to post it here.

Maybe you will find someone knowledgeable on https://www.reddit.com/r/OpenCL/.

*

MikeB

  • Nomad
  • ***
  • 97
Re: Efficiency!
« Reply #10 on: April 01, 2021, 07:37:15 am »
Many microphones are already tuned for voice (300hz-3000hz), so the louder the pickup the more likely it's voice...

A software equaliser can adjust the dba for that range, but uses 1-5% cpu on a 3ghz cpu.