Publication Information


Title
Japanese: 
English: A Path-sequence Based Method for Solving the One-to-multiple Matching Problem in Leaf-Clustering Based Approximate XML Join Algorithms
Authors
Japanese: 梁 文新, 横田 治夫.
English: Wenxin LIANG, Haruo YOKOTA.
Language: English
Journal/Book title
Japanese: DEWS2006 論文集
English: Proc. of DEWS2006
Volume, Number, Pages: pp. 4A-i10
Publication date: March 2006
Publisher
Japanese: 電子情報通信学会
English: IEICE
Conference name
Japanese: DEWS2006
English: DEWS2006
Venue
Japanese: 沖縄コンベンションセンター
English: Okinawa
File
Official link: http://www.ieice.org/iss/de/DEWS/DEWS2006/doc/4A-i10.pdf
 
Abstract: In previous work, we proposed approximate XML join algorithms based on clustered leaf nodes for measuring the approximate similarity between XML documents and integrating them at the subtree classes. However, in a join loop, one base subtree may occasionally be matched with multiple target subtrees. In this paper, we propose a path-sequence based method to solve this one-to-multiple matching problem in leaf-clustering based approximate XML join algorithms. In our method, each path sequence from the top node to the matched leaf in the base and target subtrees is extracted, and the most similar target subtree for the base subtree is determined by the path-sequence based subtree similarity degree. We conduct experiments to evaluate our method using real bibliography and bioinformatics XML documents. The experimental results show that our method effectively solves the one-to-multiple matching problem for both bibliography and bioinformatics XML data, and hence improves the precision of the leaf-clustering based approximate XML join algorithms.
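
The idea of choosing among several candidate target subtrees by comparing root-to-leaf path sequences can be illustrated with a minimal sketch. This is not the paper's definition of the similarity degree, which the abstract does not give. As assumptions for illustration only: leaves are treated as "matched" when their text is identical (the paper relies on leaf clustering instead), the similarity of two path sequences is taken from Python's `difflib.SequenceMatcher`, and the subtree score is the mean over matched leaves. The function names `leaf_paths`, `path_similarity`, and `best_target` are hypothetical.

```python
import xml.etree.ElementTree as ET
from difflib import SequenceMatcher


def leaf_paths(subtree):
    """Map each leaf's text to the tag sequence from the subtree root to that leaf.

    Assumption: leaf text identifies a leaf; if two leaves share the same
    text, the later one overwrites the earlier one.
    """
    paths = {}

    def walk(node, prefix):
        path = prefix + [node.tag]
        children = list(node)
        if not children:
            text = (node.text or "").strip()
            if text:
                paths[text] = path
        for child in children:
            walk(child, path)

    walk(subtree, [])
    return paths


def path_similarity(base, target):
    """Hypothetical score: mean path-sequence similarity over leaves with equal text."""
    base_paths = leaf_paths(base)
    target_paths = leaf_paths(target)
    matched = base_paths.keys() & target_paths.keys()
    if not matched:
        return 0.0
    total = sum(
        SequenceMatcher(None, base_paths[key], target_paths[key]).ratio()
        for key in matched
    )
    return total / len(matched)


def best_target(base, targets):
    """Resolve a one-to-multiple match by keeping the highest-scoring target."""
    return max(targets, key=lambda t: path_similarity(base, t))


# Illustrative data (invented for this sketch, not taken from the paper).
base = ET.fromstring(
    "<article><author><name>Liang</name></author><title>XML Join</title></article>"
)
same_shape = ET.fromstring(
    "<article><author><name>Liang</name></author><title>XML Join</title></article>"
)
other_shape = ET.fromstring(
    "<book><editor><name>Liang</name></editor><title>XML Join</title></book>"
)
chosen = best_target(base, [other_shape, same_shape])
```

Both candidates contain the same leaf values, so a comparison on leaves alone cannot separate them. Under the assumptions above, the path sequences differ and `chosen` is the candidate whose structure mirrors the base subtree.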

©2007 Tokyo Institute of Technology All rights reserved.