Faculty Publications

Improving continuous gesture recognition with spoken prosody

Sanshzar Kettebekov, Pennsylvania State University
Mohammed Yeasin, Pennsylvania State University
Rajeev Sharma, Pennsylvania State University

Abstract

Despite recent advances in gesture recognition, reliance on the visual signal alone to classify unrestricted continuous gesticulation is inherently error-prone. Some attempts have been made to use speech cues to improve gesture recognition, e.g., keyword-gesture co-analysis. The use of such schemes is burdened by the complexity of natural language understanding. This paper offers a novel "signal-level" perspective by exploring prosodic phenomena of spontaneous gesture and speech co-production. We present a computational framework for improving continuous gesture recognition based on two phenomena that capture voluntary (co-articulation) and involuntary (physiological) contributions of prosodic synchronization. Physiological constraints, manifested as signal interruptions in multimodal production, are exploited in an audio-visual feature integration framework using Hidden Markov Models (HMMs). Co-articulation is analyzed using a Bayesian network of naïve classifiers to explore the alignment of intonationally prominent speech segments and hand kinematics. The efficacy of the proposed approach was demonstrated on a multimodal corpus created from the Weather Channel broadcast. Both schemas were found to contribute uniquely by reducing different error types, which subsequently improves the performance of continuous gesture recognition.

Publication Title

Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

Recommended Citation

Kettebekov, S., Yeasin, M., & Sharma, R. (2003). Improving continuous gesture recognition with spoken prosody. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Retrieved from https://digitalcommons.memphis.edu/facpubs/13897

This document is currently not available here.
