
Kaphta: Text mining web tool to extract information on the anticancer activity of polyphenols

Abstract: In this paper, we describe the application of the Kaphta architecture, a resource for text mining of the anticancer activity of polyphenols. The anticancer activity of these compounds against different types of cancer has been widely reported in the literature, and they are among the most promising molecules for the development of anticancer drugs. The architecture, which comprises four sequential and well-defined steps, uses a hybrid approach combining a dictionary, rules and machine learning to identify abstracts containing sentences with associations between polyphenol, cancer and gene entities. Applying the architecture to 23,826 PubMed abstracts generated a knowledge base of indexed abstracts with 172,169 sentences containing polyphenol-cancer and polyphenol-gene associations. A web tool was implemented that allows the user to search for information on 2,006 polyphenol, 240 cancer and 3,121 gene entities, and on 11,750 polyphenol-cancer and 9,160 polyphenol-gene associations indexed in the knowledge base. A ranking algorithm calculates a score for each indexed abstract, considering the number and type of sentences in which entities and rules were recognized. A test with users demonstrated that the visualization resources of the web tool contribute to the understanding of the associations between polyphenols, genes and cancers, compared with the PubMed tool. The Kaphta architecture and web tool make it possible to extract knowledge on the anticancer activity of polyphenols and can thus contribute to the exploration of these molecules in the development of anticancer therapies.
Source: Journal of Polyphenols (多酚杂志, English edition), 2022, Issue 2, pp. 87-100 (14 pages)
Funding: Supported by the Biotechnology Unit, Universidade de Ribeirão Preto, Brazil; the Federal Institute of Education, Science and Technology of South of Minas Gerais (IFSULDEMINAS), Brazil; and the São Paulo Research Foundation (FAPESP) [grant no. 17/03237-2].
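
The abstract mentions dictionary-based recognition of entities within sentences and a ranking score that depends on the number and type of sentences found. The sketch below illustrates that general idea only. It is not the Kaphta implementation: the dictionaries, the weights in `WEIGHTS`, the sentence splitter, and the function names are all assumptions made for illustration, and the rule-based and machine learning steps are left out.

```python
import re
from collections import Counter

# Hypothetical mini-dictionaries; the actual Kaphta dictionaries are not
# described in the abstract.
POLYPHENOLS = {"resveratrol", "quercetin", "curcumin"}
CANCERS = {"breast cancer", "colon cancer"}
GENES = {"tp53", "bcl-2", "casp3"}

# Assumed weights per association type (illustrative values only).
WEIGHTS = {"polyphenol-cancer": 2.0, "polyphenol-gene": 1.0}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_terms(sentence, dictionary):
    """Return the dictionary terms that occur as whole tokens in the sentence."""
    lowered = sentence.lower()
    found = set()
    for term in dictionary:
        # Lookarounds keep 'bcl-2' from matching inside 'bcl-2l1', for example.
        pattern = r"(?<![\w-])" + re.escape(term) + r"(?![\w-])"
        if re.search(pattern, lowered):
            found.add(term)
    return found


def classify_sentence(sentence):
    """Label a sentence by the entity co-occurrences it contains."""
    types = set()
    if find_terms(sentence, POLYPHENOLS):
        if find_terms(sentence, CANCERS):
            types.add("polyphenol-cancer")
        if find_terms(sentence, GENES):
            types.add("polyphenol-gene")
    return types


def score_abstract(text):
    """Score an abstract as a weighted count of its association sentences."""
    counts = Counter()
    for sentence in split_sentences(text):
        counts.update(classify_sentence(sentence))
    score = sum(WEIGHTS[kind] * n for kind, n in counts.items())
    return score, dict(counts)


def rank_abstracts(abstracts):
    """Sort (identifier, text) pairs by descending score."""
    scored = [(ident, score_abstract(text)[0]) for ident, text in abstracts]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank_abstracts` on a list of `(identifier, text)` pairs returns the identifiers ordered by score. Simple co-occurrence of this kind over-counts sentences that mention two entities without relating them, which is presumably one reason a hybrid approach adds rules and machine learning on top of the dictionary.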