[1] Qian Gang, Dong Yisheng. Approach to query reformulation in data integration[J]. Journal of Southeast University (Natural Science Edition), 2004, 34(4): 441-445. [doi:10.3969/j.issn.1001-0505.2004.04.005]

Approach to query reformulation in data integration
Journal of Southeast University (Natural Science Edition) [ISSN:1001-0505/CN:32-1178/N]

Volume:
34
Issue:
No. 4, 2004
Pages:
441-445
Section:
Computer Science and Engineering
Publication date:
2004-07-20

Article Info

Title:
Approach to query reformulation in data integration
Author(s):
Qian Gang (钱钢), Dong Yisheng (董逸生)
Department of Computer Science and Engineering, Southeast University, Nanjing 210096, China
Keywords:
query reformulation; data integration; path mapping; extensible markup language (XML)
CLC number:
TP311
DOI:
10.3969/j.issn.1001-0505.2004.04.005
Abstract:
XML data integration systems based on path mappings may generate unreasonable subqueries during query reformulation. To keep the attributes of entities consistent within every generated subquery, the concept of mapping dependence is proposed on the basis of the path mappings between schemas, and a query reformulation method is designed. During reformulation, the method traverses the nodes of the query tree in turn, records a PC-context for each intermediate result, and uses heuristic rules to judge whether the PC-context is consistent with the dependence of the currently selected mapping. The time complexity of the method is linear in the number of data sources.

References:

[1] Lenzerini M. Data integration: a theoretical perspective [A]. In: ACM Symposium on Principles of Database Systems [C]. Wisconsin, USA, 2002. 233-246.
[2] Papakonstantinou Y. Query processing in heterogeneous information sources [D]. Department of Computer Science, Stanford University, 1996.
[3] Levy A Y, Rajaraman A, Ordille J J. Querying heterogeneous information sources using source descriptions [A]. In: Proc of VLDB [C]. Bombay, India, 1996. 251-262.
[4] Cluet S, Veltri P, Vodislav D. Views in a large scale XML repository [A]. In: Proc of VLDB [C]. Roma, Italy, 2001. 271-280.
[5] Rahm E, Bernstein P A. A survey of approaches to automatic schema matching [J]. VLDB Journal, 2001, 10(4): 334-350.
[6] Renaud C, Sirot J P, Vodislav D. Semantic integration of XML heterogeneous data sources [EB/OL]. http://osage.inria.fr/gemo/Gemo/PUBLI/all-bykey.php?mytexte-xyleme.2003-11.
[7] Deutsch A, Tannen V. Containment and integrity constraints for XPath fragments [EB/OL]. http://db.cis.uppen.edu/cgi-bin/person.perl?adeutsch.2001-09-15/2003-11.

Memo

About the authors: Qian Gang (b. 1975), male, Ph.D. candidate; Dong Yisheng (corresponding author), male, professor, doctoral supervisor, ysdong@seu.edu.cn.
Last Update: 2004-07-20