Abstract
Gaze and speech are two channels that humans naturally use to communicate with each other. However, both the eye-tracking and the speech-recognition techniques available today are still far from perfect. Our goal is to find out how to make effective use of the error-prone information from both modes, so that one mode can correct the errors of the other, compensating for the immaturity of the recognition techniques, resolving the ambiguity of the user's speech, and improving interaction speed. Our integration strategies and evaluation experiment demonstrate that these two modalities can be combined to improve the usability and efficiency of a user interface in ways that are not available to speech-only or gaze-only systems.