Faculty Publications

A study of automatic speech recognition in noisy classroom environments for automated dialog analysis

Nathaniel Blanchard, University of Notre DameFollow
Michael Brady, University of Notre Dame
Andrew M. Olney, University of Memphis
Marci Glaus, University of Wisconsin-Madison
Xiaoyi Sun, University of Wisconsin-Madison
Martin Nystrand, University of Wisconsin-Madison
Borhan Samei, University of Memphis
Sean Kelly, University of Pittsburgh

Abstract

The development of large-scale automatic classroom dialog analysis systems requires accurate speech-to-text translation. A variety of automatic speech recognition (ASR) engines were evaluated for this purpose. Recordings of teachers in noisy classrooms were used for testing. In comparing ASR results, Google Speech and Bing Speech were more accurate with word accuracy scores of 0.56 for Google and 0.52 for Bing compared to 0.41 for AT&T Watson, 0.08 for Microsoft, 0.14 for Sphinx with the HUB4 model, and 0.00 for Sphinx with the WSJ model. Further analysis revealed both Google and Bing engines were largely unaffected by speakers, speech class sessions, and speech characteristics. Bing results were validated across speakers in a laboratory study, and a method of improving Bing results is presented. Results provide a useful understanding of the capabilities of contemporary ASR engines in noisy classroom environments. Results also highlight a list of issues to be aware of when selecting an ASR engine for difficult speech recognition tasks.

Publication Title

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Recommended Citation

Blanchard, N., Brady, M., Olney, A., Glaus, M., Sun, X., Nystrand, M., Samei, B., & Kelly, S. (2015). A study of automatic speech recognition in noisy classroom environments for automated dialog analysis. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 9112, 23-33. https://doi.org/10.1007/978-3-319-19773-9_3

Link to Full Text

COinS

Faculty Publications

A study of automatic speech recognition in noisy classroom environments for automated dialog analysis

Abstract

Publication Title

Recommended Citation

Search

Browse

Author Corner

Libraries

Faculty Publications

A study of automatic speech recognition in noisy classroom environments for automated dialog analysis

Authors

Abstract

Publication Title

Recommended Citation

Share

Search

Browse

Author Corner

Libraries