Semantics for Large-Scale Multimedia: New Challenges for NLP

Florian Metze; Koichi Shinoda

論文・著書情報

タイトル

和文:
英文:	Semantics for Large-Scale Multimedia: New Challenges for NLP

著者

和文:	メッツェフロリアン, 篠田浩一.
英文:	Florian Metze, Koichi Shinoda.

言語

English

掲載誌/書名

和文:
英文:

巻, 号, ページ

出版年月

2014年6月22日

出版者

和文:
英文:

会議名称

和文:
英文:	ACL2014

開催地

和文:	ボルチモア
英文:	Baltimore

公式リンク

http://acl2014.org/Metze.htm

アブストラクト

Thousands of videos are constantly being uploaded to the web, creating a vast resource, and an ever-growing demand for methods to make them easier to retrieve, search, and index. As it becomes feasible to extract both low-level as well as high-level (symbolic) audio, speech, and video features from this data, these need to be processed further, in order to learn and extract meaningful relations between these. The language processing community has made huge process in analyzing the vast amounts of very noisy text data that is available on the Internet. While it is very difficult to create semantic units of low-level image descriptors or non-speech sounds by themselves, it is comparatively easy to ground semantics in the word output of a speech recognizer, or text data that is loosely associated with a video. This creates an opportunity for NLP researchers to use their unique skills, and make significant contributions to solve tasks on data that is even noisier than web text, but (we argue) even more interesting and challenging.

Home

各種検索

サポート

T2R2について

関連リンク

論文・著書情報