Consumer video now accounts for a large share of Internet traffic, but most of it lacks the text tags needed for search. Hence, video semantic indexing, which extracts visual concepts such as objects, scenes, and actions directly from video content, has been studied intensively. Fundamentally, this task consists of two problems: localization and recognition. Although these two problems were until recently studied independently, emerging end-to-end deep learning techniques based on convolutional neural networks (CNNs) and recurrent neural networks (RNNs) offer effective ways to solve them simultaneously. These techniques are closely related to spoken-word detection techniques in the speech field. In this talk, we review recent progress in this area and discuss potential directions for future research.