A mapping scheme of XML documents into relational databases using schema-based path identifiers

Kei Fujimoto, Toshiyuki Shimizu, Masatoshi Yoshikawa, Dao Dinh Kha, Toshiyuki Amagasa

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution

13 Citations (Scopus)

Abstract

In this paper, we propose a scheme for mapping XML documents into relational databases. The scheme enables us to store, retrieve, and update XML documents efficiently. When XML documents are stored in relational databases, their tree structures must be preserved explicitly. To this end, a label is assigned to each node in the XML tree. In general, document retrieval and update performance depends on the node labeling scheme. We use SPIDER (Schema based Path IDentifiER), a labeling scheme for XML documents that utilizes DTDs to make retrieval and update more efficient. SPIDER identifies only the path from the root node to a node. Thus, multiple nodes appearing on the same path cannot be distinguished using SPIDER alone. We introduce Sibling Dewey Order to identify such nodes. Generally, when a new node is inserted into an XML document, some other nodes need to be relabeled to preserve the order of nodes. In our method, only Sibling Dewey Order is relabeled; SPIDER is not affected. Since the range of relabeling is small, documents can be updated efficiently. We stored documents utilizing SPIDER in a relational database and translated various XPath expressions into SQL using SPIDER. We performed experiments and demonstrate that the proposed scheme outperforms conventional methods in both retrieval and update.
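
The two-part labeling described in the abstract can be illustrated with a small sketch. It is not the authors' implementation, and it rests on assumptions: path identifiers are taken to be small integers, one per distinct root-to-node path (collected here from the document itself as a stand-in for deriving them from a DTD), and Sibling Dewey Order is taken to number each node among its same-named siblings. The function name label_nodes and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET


def label_nodes(root, path_ids):
    """Label every element with (path_id, sibling_order).

    path_ids maps a root-to-node path string to an integer. It is an
    assumed stand-in for schema-derived path identifiers; new paths get
    the next free integer. sibling_order is a tuple whose components
    count position among siblings with the same tag (an assumption
    about how Sibling Dewey Order numbers nodes).
    """
    rows = []

    def visit(elem, path, order):
        path_id = path_ids.setdefault(path, len(path_ids) + 1)
        rows.append((path_id, order))
        seen = {}  # per-tag counter among this element's children
        for child in elem:
            seen[child.tag] = seen.get(child.tag, 0) + 1
            visit(child, f"{path}/{child.tag}", order + (seen[child.tag],))

    visit(root, f"/{root.tag}", (1,))
    return rows


path_ids = {}
doc = ET.fromstring(
    "<book><title/><chapter><sec/><sec/></chapter>"
    "<chapter><sec/></chapter></book>"
)
before = label_nodes(doc, path_ids)
ids_before = dict(path_ids)

# Insert a new chapter right after the title, then relabel.
doc.insert(1, ET.Element("chapter"))
after = label_nodes(doc, path_ids)

# The path identifiers are untouched by the insertion; only the
# sibling-order tuples of the chapters that follow (and their
# descendants) have shifted.
print(path_ids == ids_before)
for row in after:
    print(row)
```

In this sketch the two `sec` elements under one chapter share a path identifier and differ only in their sibling-order tuple, which mirrors the abstract's point that the path label alone cannot distinguish them. After the insertion, the path table is unchanged and only sibling-order values move.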

Original language: English
Title of host publication: Proceedings - International Workshop on Challenges in Web Information Retrieval and Integration, WIRI'05
Pages: 82-90
Number of pages: 9
Publication status: Published - 2005
Externally published: Yes
Event: International Workshop on Challenges in Web Information Retrieval and Integration, WIRI'05 - Tokyo, Japan
Duration: Apr 8, 2005 to Apr 9, 2005

Publication series

Name: Proceedings - International Workshop on Challenges in Web Information Retrieval and Integration, WIRI'05
2005

Conference

Conference: International Workshop on Challenges in Web Information Retrieval and Integration, WIRI'05
Country/Territory: Japan
City: Tokyo
Period: 4/8/05 to 4/9/05

All Science Journal Classification (ASJC) codes

  • General Engineering
