Electronic Theses and Dissertations

Decoding Perception of Speech from Behavioral Responses using Spatio-Temporal CNNs

Identifier

6663

Kazi Ashraf Moinuddin

Date

2020

Document Type

Thesis

Degree Name

Master of Science

Major

Electrical and Computer Engr

Concentration

Computer Engineering

Committee Chair

Mohammed Yeasin

Committee Member

Gavin Bidelman

Committee Member

Faruk Ahmed

Abstract

Categorical perception (CP) of speech is a complex process reflecting individuals’ ability to perceive sound and is measured using response time (RT). The cognitive processes involved in mapping neural activities to behavioral response are stochastic and further compounded by individuality and variations. This thesis presents a data-driven approach and develops parameter optimized models to understand the relationship between cognitive events and behavioral response (e.g., RT). We introduce convolutional neural networks (CNN) to learn the representation from EEG recordings. In addition, we develop parameter optimized and interpretable models in decoding CP using two representations: 1) spatial-spectral topomaps and 2) evoked response potentials (ERP). We adopt state-of-the-art class discriminative visualization (GradCAM) tools to gain insights (as oppose to the’black box’ models) and building interpretable models. In addition, we develop a diverse set of models to account for the stochasticity and individual variations. We adopted weighted saliency scores of all models to quantify the learned representations’ effectiveness and utility in decoding CP manifested through behavioral response. Empirical analysis reveals that the γ band and early (∼ 0 - 200ms) and late (∼ 300 - 500ms) right hemisphere IFG engagement is critical in determining individuals’ RT. Our observations are consistent with prior findings, further validating the efficacy of our data-driven approach and optimized interpretable models.

Comments

Data is provided by the student.

Library Comment

Dissertation or thesis originally submitted to the local University of Memphis Electronic Theses & dissertation (ETD) Repository.

Recommended Citation

Moinuddin, Kazi Ashraf, "Decoding Perception of Speech from Behavioral Responses using Spatio-Temporal CNNs" (2020). Electronic Theses and Dissertations. 2147.
https://digitalcommons.memphis.edu/etd/2147

Download

COinS

Electronic Theses and Dissertations

Decoding Perception of Speech from Behavioral Responses using Spatio-Temporal CNNs

Identifier

Date

Document Type

Degree Name

Major

Concentration

Committee Chair

Committee Member

Committee Member

Abstract

Comments

Library Comment

Recommended Citation

Search

Browse

Author Corner

Libraries

Electronic Theses and Dissertations

Decoding Perception of Speech from Behavioral Responses using Spatio-Temporal CNNs

Identifier

Author

Date

Document Type

Degree Name

Major

Concentration

Committee Chair

Committee Member

Committee Member

Abstract

Comments

Library Comment

Recommended Citation

Share

Search

Browse

Author Corner

Libraries