
Publication Information


Title
Japanese: 
English: Video Semantic Indexing and Localization 
Author
Japanese: 篠田浩一.  
English: Koichi Shinoda.  
Language: English 
Journal/Book Title
Japanese: 
English: 5th Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan 
Volume, Issue, Pages: vol. 140, no. 4, p. 3009
Publication Date: November 28, 2016 
Publisher
Japanese: 
English: The Acoustical Society of America through AIP Publishing LLC 
Conference Name
Japanese: 第5回日米音響学会ジョイントミーティング 
English: 5th Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan 
Venue
Japanese: ハワイ州ホノルル 
English: Honolulu, Hawaii 
File
Official link: http://acousticalsociety.org/content/5th-joint-meeting-acoustical-society-america-and-acoustical-society-japan
 
Abstract: Consumer video now accounts for a large share of Internet traffic, but most of it is not accompanied by text tags for search. Hence, video semantic indexing, which extracts visual concepts such as objects, scenes, and actions directly from video content, has been intensively studied. Fundamentally, this task consists of two problems: localization and recognition. Until recently these two problems were studied independently, but emerging end-to-end deep learning techniques using convolutional neural networks (CNNs) and recurrent neural networks (RNNs) offer effective ways to solve them simultaneously. These techniques are closely related to spoken word detection techniques in the speech field. In this talk, we overview recent progress in this area and discuss potential directions for future research.
