期刊文献+

Matching dependencies: semantics and query answering 被引量:2

Matching dependencies: semantics and query answering
原文传递
导出
摘要 Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of cer- tain attribute values in pairs of database tuples when some similarity conditions on other values are satisfied. Their en- forcement can be seen as a natural generalization of entity resolution. In what we call the pure case of MD enforce- ment, an arbitrary value from the underlying data domain can be used for the value in common that is used for a match- ing. However, the overall number of changes of attribute val- ues is expected to be kept to a minimum. We investigate this case in terms of semantics and the properties of data clean- ing through the enforcement of MDs. We characterize the in- tended clean instances, and also the clean answers to queries, as those that are invariant under the cleaning process. The complexity of computing clean instances and clean query an- swering is investigated. Tractable and intractable cases de- pending on the MDs are identified and characterized. Matching dependencies (MDs) are used to declaratively specify the identification (or matching) of cer- tain attribute values in pairs of database tuples when some similarity conditions on other values are satisfied. Their en- forcement can be seen as a natural generalization of entity resolution. In what we call the pure case of MD enforce- ment, an arbitrary value from the underlying data domain can be used for the value in common that is used for a match- ing. However, the overall number of changes of attribute val- ues is expected to be kept to a minimum. We investigate this case in terms of semantics and the properties of data clean- ing through the enforcement of MDs. We characterize the in- tended clean instances, and also the clean answers to queries, as those that are invariant under the cleaning process. The complexity of computing clean instances and clean query an- swering is investigated. Tractable and intractable cases de- pending on the MDs are identified and characterized.
出处 《Frontiers of Computer Science》 SCIE EI CSCD 2012年第3期278-292,共15页 中国计算机科学前沿(英文版)
关键词 DATABASES data cleaning duplicate and entityresolution integrity constraints matching dependencies databases, data cleaning, duplicate and entityresolution, integrity constraints, matching dependencies
  • 相关文献

参考文献28

  • 1Elmagarmid A,Ipeirotis P,Verykios V. Duplicate record detection:a survey[J].IEEE Transactions on Knowledge and Data Engineering,2007,(01):1-16.doi:10.1109/TKDE.2007.250581.
  • 2Bleiholder J,Naumann F. Data fusion[J].ACM Computing Surveys,2008,(01):1-41.
  • 3Benjelloun O,Garcia-Molina H,Menestrina D,Su Q Whang S Widom J. Swoosh:a genetic approach to entity resolution[J].VLDB Journal:Very Large Data Bases,2009.255-276.doi:10.1007/s00778-008-0098-x.
  • 4Fan W. Dependencies revisited for improving data quality[A].2008.159-170.doi:10.1111/j.1476-5381.2008.00102.x.
  • 5Fan W,Jia X,Li J,Ma S. Reasoning about record matching rules[J].Proceedings of the VLDB Endowment,2009,(01):407-418.
  • 6Arenas M,Bertossi L,Chomicki J. Consistent query answers in inconsistent databases[A].1999.68-79.
  • 7Bertossi L. Consistent queryanswering in databases[J].ACM SIGMOD Record,2006,(02):68-76.doi:10.1145/1147376.1147391.
  • 8Chomicki J. Consistent query answering:five easy pieces[A].Springer-verlag,2007.1-17.
  • 9Bertossi L. Database repairing and consistent query answering[A].2011.
  • 10Bertossi L,Bravo L. Consistent query answers in virtual data integration systems[A].Beilin:Springer-Verlag,2005.42-83.doi:10.1097/OPX.0b013e318229635a.

同被引文献22

引证文献2

二级引证文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部