Home >

news ヘルプ

論文・著書情報


タイトル
和文: 
英文:Semantics for Large-Scale Multimedia: New Challenges for NLP 
著者
和文: メッツェ フロリアン, 篠田 浩一.  
英文: Florian Metze, Koichi Shinoda.  
言語 English 
掲載誌/書名
和文: 
英文: 
巻, 号, ページ        
出版年月 2014年6月22日 
出版者
和文: 
英文: 
会議名称
和文: 
英文:ACL2014 
開催地
和文:ボルチモア 
英文:Baltimore 
公式リンク http://acl2014.org/Metze.htm
 
アブストラクト Thousands of videos are constantly being uploaded to the web, creating a vast resource, and an ever-growing demand for methods to make them easier to retrieve, search, and index. As it becomes feasible to extract both low-level as well as high-level (symbolic) audio, speech, and video features from this data, these need to be processed further, in order to learn and extract meaningful relations between these. The language processing community has made huge process in analyzing the vast amounts of very noisy text data that is available on the Internet. While it is very difficult to create semantic units of low-level image descriptors or non-speech sounds by themselves, it is comparatively easy to ground semantics in the word output of a speech recognizer, or text data that is loosely associated with a video. This creates an opportunity for NLP researchers to use their unique skills, and make significant contributions to solve tasks on data that is even noisier than web text, but (we argue) even more interesting and challenging.

©2007 Institute of Science Tokyo All rights reserved.