Meta AI builds speech recognition that uses lip sync to filter noise

  • 0 Replies
  • 551 Views
*

MikeB

  • Mechanical Turk
  • *****
  • 154
Quote
“By combining visual cues, such as the movement of the lips and teeth during speaking, along with auditory information for representation learning, AV-HuBERT can capture nuanced associations between the two input streams efficiently even with much smaller amounts of untranscribed video data for pretraining,” Meta AI’s researchers explained.

https://siliconangle.com/2022/01/07/meta-ai-built-speech-recognition-platform-relies-visual-cues-filter-background-noise/