期刊文献+

面向多数据源的数据清洗关键技术的研究 被引量:5

Research on Key Technologies of Data-cleaning With Multi-source
下载PDF
导出
摘要 对于各个领域的信息资源管理而言,数据质量一直是一个非常关键的问题。现实世界中的数据往往存在着各种各样的问题,从简单的拼写错误到复杂的语义不一致错误。数据清洗的目标就是检测并去除数据中存在的各种错误和不一致,提高数据的质量。该文归纳、总结了数据清洗相关研究的现状,提出一个面向多数据源的数据清洗框架的定义。框架实现了术语模型、处理描述文件和共享库等概念和技术。 Data quality is crucial for information management systems in all domains. The real world is often dirty due to various data quality problems, which range from the simple data entry errors to the complex inconsistencies.Data cleaning deals with detecting and removing errors and inconsistencies from data to improve the quality of data.After providing a classification of data quality problems and a survey of data cleaning, this article presents a specification of an extensible data-cleaning framework.The framework realize features like term model,processing description file and rule&Dic base.
出处 《科技资讯》 2009年第1期13-15,共3页 Science & Technology Information
关键词 数据质量 数据清洗 面向多数据源的数据清洗框架 Data quality Data cleaning multi-source Data-cleaning framework
  • 相关文献

参考文献10

  • 1H.Galhardas,D.Florescu,D.Shasha,E.Simon.AJAX:An Extensible Data Cleaning Tool[].In SIGMOD(demonstration paper).
  • 2H.Galhardas,D.Florescu,D.Shasha,E.Simon,CA.Saita.Declarative Data Cleaning:Language,Model,and Algorithms[].Extended version of the VLDB paper.2001
  • 3L.V.S.Lakshmanan,F.Sadri,I.N.Subramanian.Schem-ma SQL-A Language for Interoperability in Rela-tional Multi-database Systems[].ProcOf VLDB.1999
  • 4Object Management Group(OMG).CWM Meta Store Creation Script.Zip. http://www.wiley.com/legacy/c o m p b o o k s/p o o l e/C W M G u i d e/software.htmo . 2002
  • 5Xi nwen Z hang,Sehong Oh,Ravi Sandhu.PBDM:A flexible delegation model in RBAC[].Theth ACM Symp on Aess Control Models and Technologis(SACMAT).2003
  • 6Object Management Group(OMG).Com-mon Warehouse Metamodel(CWM)[].ht tp://wwwomgor g/cgi-bin/doc?Ad.2001
  • 7H.H.Do,E.Rahm.On Mtada Interoperability in Data Warehouses[].TechReport-Department of Comouter ScienceUniversity of Leipzig.
  • 8Raman V,Hellerstein J. M.Potter’s Wheel: An interactive data cleaning system, Proc[C].of the th VLDB Conference Roma Italy.2001
  • 9M. L. Lee,T. W. Ling and W. L. Low.A Knowledge-Based Framework for Intelligent Data Cleaning[]..2001
  • 10Rahm, E,Do, H.H.Data cleaning: problems and current approaches[].IEEE Transactions on Knowledge and Data Engineering.2000

同被引文献28

引证文献5

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部