Many people are excited about creating usable speech technology. However, most of the audio data used by large companies isn’t available to the majority of people, and that data is often biased in terms of language, accent, and gender. Jenny, Josh, and Remy from Mozilla join us to discuss how Mozilla is building an open-source voice database that anyone can use to make innovative apps for devices and the web (Common Voice). They also discuss efforts through Mozilla fellowship program to develop speech tech for African languages and understand bias in data sets.

Many people are excited about creating usable speech technology. However, most of the audio data used by large companies isn’t available to the majority of people, and that data is often biased in terms of language, accent, and gender. Jenny, Josh, and Remy from Mozilla join us to discuss how Mozilla is building an open-source voice database that anyone can use to make innovative apps for devices and the web (Common Voice). They also discuss efforts through Mozilla fellowship program to develop speech tech for African languages and understand bias in data sets.

Leave us a comment

Changelog++ members get a bonus 2 minutes at the end of this episode and zero ads. Join today!

Sponsors:



Linode – Our cloud of choice and the home of Changelog.com. Deploy a fast, efficient, native SSD cloud server for only $5/month. Get 4 months free using the code changelog2019 OR changelog2020. To learn more and get started head to linode.com/changelog.
Pace.dev – Minimalist web based management tool for your teams. Async by default communication and simplistic task management gives you everything you need to build your next thing. Brought to you by Go Time panelist Mat Ryer. Try it out today!
Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com.
Rollbar – We move fast and fix things because of Rollbar. Resolve errors in minutes. Deploy with confidence. Learn more at rollbar.com/changelog.

Featuring:


Jenny Zhang – Twitter, WebsiteRemy Muhire – Twitter, GitHubJosh Meyer – Twitter, GitHubChris Benson – Twitter, GitHub, LinkedIn, WebsiteDaniel Whitenack – Twitter, GitHub, Website

Show Notes:



Mozilla Common Voice
Announcement of Josh and Remy’s fellowship work on speech tech for African languages
Artie Bias Corpus
Readings on Demographic Bias in ASR:

Voice recognition still has significant race and gender biases
Gender and Dialect Bias in YouTube’s Automatic Captions
Racial disparities in automated speech recognition

Common Voice LREC Paper
Common Voice + DeepSpeech collaborators for Low-resource languages:

Digital Umuganda
AI Lab, Makerere University
Language Technologies Unit, Bangor University
Linguistics Department, Indiana University Bloomington

“under-sampled majority” is a quote from Joy Boulamwini (see this article)

Something missing or broken? PRs welcome!

Twitter Mentions