Machine Learning Times
  • Jul 26, 2020

Speech Recognition Tech Is Yet Another Example of Bias

 
Originally published in Scientific American, July 5, 2020.

Siri, Alexa and other programs sometimes have trouble with the accents and speech patterns of people from many underrepresented groups.

“Clow-dia,” I say once. Twice. A third time. Defeated, I say the Americanized version of my name: “Claw-dee-ah.” Finally, Siri recognizes it.

Having to adapt our way of speaking to interact with speech recognition technologies is a familiar experience for people whose first language is not English or who do not have conventionally American-sounding names. I have even stopped using Siri because of it.

The spread of speech recognition technologies over the last few decades has exposed a serious problem ingrained in them: racial bias. One recent study, published in PNAS, showed that speech recognition programs are biased against Black speakers. On average, all five programs from leading technology companies like Apple and Microsoft showed significant racial disparities; they were twice as likely to incorrectly transcribe audio from Black speakers as audio from white speakers.

In normal conversations with other people, we might choose to code-switch, alternating between languages, accents or ways of speaking, depending on our audience. But with automated speech recognition programs, there is no code-switching: either you assimilate, or you are not understood. This effectively censors voices that are not part of the “standard” languages or accents used to create these technologies.

