Home >

news ヘルプ

論文・著書情報


タイトル
和文: 
英文:Multimodal recognition of speech and electrocorticogram 
著者
和文: ミタリ, Shuji Komeiji, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, 篠田 浩一, 田中 聡久.  
英文: Mitali, Shuji Komeiji, Takumi Mitsuhashi, Yasushi Iimura, Hiroharu Suzuki, Hidenori Sugano, Koichi Shinoda, Toshihisa Tanaka.  
言語 English 
掲載誌/書名
和文: 
英文:2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC) 
巻, 号, ページ         pp. 546-550
出版年月 2023年11月20日 
出版者
和文: 
英文:IEEE 
会議名称
和文: 
英文:Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2023 
開催地
和文: 
英文:Taipei 
ファイル
公式リンク https://www.apsipa2023.org/
 
DOI https://doi.org/10.1109/APSIPAASC58517.2023.10317527
アブストラクト Brain-Computer Interface (BCI) provides a novel way of communicating with computers but their poor performance in terms of speed and accuracy in comparison to the other modes of communication has been the biggest obstacle in their usage for practical applications. In this work, we aim to enhance the performance of BCI by utilizing speech data collected along with the electrocorticogram (ECoG) recordings when the person is speaking. While some BCI users may have difficulty in speaking at all, many of them can speak, even though their speech may be unclear. We propose that information from this speech data can help in improving accuracy and to employ such speech context to assist the decoding process, we apply a multimodal recognition method. We use speech data contaminated by noise in our evaluation to simulate the cases where the available speech quality is low. Our experiments using data from five subjects suffering from Epilepsy show that our method of using multimodal input has a significant improvement, an absolute reduction in phrase error rate by 1.1 points from recognition using speech alone and by 51.3 points from recognition using ECoG alone.

©2007 Institute of Science Tokyo All rights reserved.