Faculty Publications

Improved speech inversion using general regression neural network

Shamima Najnin, University of Memphis
Bonny Banerjee, University of MemphisFollow

Abstract

The problem of nonlinear acoustic to articulatory inversion mapping is investigated in the feature space using two models, the deep belief network (DBN) which is the state-of-the-art, and the general regression neural network (GRNN). The task is to estimate a set of articulatory features for improved speech recognition. Experiments with MOCHA-TIMIT and MNGU0 databases reveal that, for speech inversion, GRNN yields a lower root-mean-square error and a higher correlation than DBN. It is also shown that conjunction of acoustic and GRNN-estimated articulatory features yields state-of-the-art accuracy in broad class phonetic classification and phoneme recognition using less computational power.

Publication Title

Journal of the Acoustical Society of America

Recommended Citation

Najnin, S., & Banerjee, B. (2015). Improved speech inversion using general regression neural network. Journal of the Acoustical Society of America (3), EL229-EL235. https://doi.org/10.1121/1.4929626

Link to Full Text

COinS

Faculty Publications

Improved speech inversion using general regression neural network

Abstract

Publication Title

Recommended Citation

Search

Browse

Author Corner

Libraries

Faculty Publications

Improved speech inversion using general regression neural network

Authors

Abstract

Publication Title

Recommended Citation

Share

Search

Browse

Author Corner

Libraries