Whole-Word Recognition from Articulatory Movements for Silent Speech Interfaces

Jun Wang1,  Ashok Samal1,  Jordan R. Green1,  Frank Rudzicz2

1University of Nebraska-Lincoln, United States, 2University of Toronto, Canada


Articulation-based silent speech interfaces convert silently produced speech movements into audible words. These systems are still in their experimental stages, but have significant potential for facilitating oral communication in persons with laryngectomy or speech impairments. In this paper, we report the result of a novel, real-time algorithm that recognizes whole-words based on articulatory movements. This approach differs from prior work that has focused primarily on phoneme-level recognition based on articulatory features. On average, our algorithm missed 1.93 words in a sequence of twenty-five words with an average latency of 0.79 seconds for each word prediction using a data set of 5,500 isolated word samples collected from ten speakers. The results demonstrate the effectiveness of our approach and its potential for building a real-time articulation-based silent speech interface for health applications.

Thank you to our Sponsors






“Microsoft is a trademark of the Microsoft group of companies and is used under license from Microsoft.”









“Intel” and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other Countries.