Publication Information


Title
Japanese: 
English: Recurrent out-of-vocabulary word detection based on distribution of features
Authors
Japanese: 浅見 太一, 増村 亮, 青野 裕司, 篠田 浩一.
English: Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda.
Language English
Journal/Book title
Japanese: 
English: Computer Speech & Language
Volume, issue, pages Volume 58, Pages 247-259
Publication date May 9, 2019
Publisher
Japanese: 
English: Elsevier Ltd.
Conference name
Japanese: 
English: 
Venue
Japanese: 
English: 
DOI https://doi.org/10.1016/j.csl.2019.04.007
Abstract The repeated use of out-of-vocabulary (OOV) words in a spoken document seriously degrades speech recognizer performance. Even though such recurrent OOV words are often important keywords in a spoken document, they are never correctly recognized. We propose a novel method for robustly detecting recurrent OOV words that focuses on the degree of consistency among them. The method first detects recurrent segments, that is, recurrent phoneme sub-sequences in the output of a phoneme sequence decoder. It then measures the degree of consistency using the mean and variance (distribution) of features (DOF) derived from the recurrent segments, and uses the DOF for in-vocabulary (IV)/OOV classification. Experiments on academic lectures show that the proposed DOF-based method can robustly detect recurrent OOV words in spontaneous speech and achieves over 60% relative reduction in false alarms. The experiments also confirm that detection performance improves as the OOV words are repeated more often.

©2007 Tokyo Institute of Technology All rights reserved.