摘要
古籍文献普遍存在着引书现象,因而构建一套针对地方志引书的挖掘识别系统,对古籍的研究以及目录学史、藏书史、科技史,都具有重要意义。本文以地方志资料汇编《方志物产》为语料,设计并构建了一个古籍引书挖掘系统。重点讨论了引书的模式提取、N-gram分词识别等功能算法。
There are many citations in Chinese ancient books, so it has significant value to construct a digging system about the cited books for ancient books. Based on the research about Products in Local Chronicles of Guangdong, the paper designs and implements a digging system about the cited books through computer technology. Especially, pattern obtaining methods and N-gram are discussed.
出处
《图书馆杂志》
CSSCI
北大核心
2008年第8期50-54,58,共6页
Library Journal
关键词
古籍数字化整理
地方志
引书
内容挖掘
N-gram算法
Digital collation of ancient books
Local chronicles
Cited books
Content digging
N -gram