Microsoft's AI is getting crazily good at speech recognition

matt smith doctor who actor british speaking microphone mouth

Microsoft's speech recognition efforts have hit a significant milestone.

It can now transcribe human speech with a 5.1% error rate, Microsoft technical fellow Xuedong Huang wrote in a blog post— the same error rate as humans.

Microsoft actually thought it hit this point last year, when it reached 5.9%, the word error rate it had measured for humans. But then other researchers carried out separate studies and pegged the human error level as slightly lower, 5.1%.

But it has now achieved this — reducing its error rate by 12%, and using AI techniques like "neural-net based acoustic and language models." Another innovation was to take into account the context of the speech to make better guesses as to what unclear words are, like humans do.

For example: It might not be clear from the audio whether someone is saying "that's not fair" or "that's not fur." Traditionally, this ambiguity might lead to transcription errors. But now the speech recognition tech can look at context for clues. If it's a speech about the risks of gambling, then it's probably "that's not fair"; if it's a conversation about fabrics, "that's not fur" probably fits better.

"Reaching human parity with an accuracy on par with humans has been a research goal for the last 25 years," Xuedong Huang wrote. But in practice, Microsoft still faces significant challenges. "such as achieving human levels of recognition in noisy environments with distant microphones, in recognizing accented speech, or speaking styles and languages for which only limited training data is available."

So while Microsoft's tech is impressive, it won't be on a par with humans in all real-world situations just yet.

The researcher added: "Moreover, we have much work to do in teaching computers not just to transcribe the words spoken, but also to understand their meaning and intent. Moving from recognizing to understanding speech is the next major frontier for speech technology."

Join the conversation about this story »

NOW WATCH: Everything we know about the new iPhone that Apple will announce in September

Microsoft's AI is getting crazily good at speech recognition

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

Neem Baba Extra Questions Answer Class 6 English Poorvi

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Lowe faces four theft charges

Practice Sheet of Right form of verbs for HSC Students

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

The 10 Tennessee Cities With The Largest Black Population For 2021

Materials Around Us Class 6 Worksheet Science Chapter 6

デスクトップヒープの枯渇

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

Kanulanu Thaake Lyrics and translation | Manam (2014)

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

Teen Shot In Miami Drive-By Dies From Injuries

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

Mahakal Attitude Status

Property developer set up cannabis factory to help pay off debts...

♡

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...