Abstract
Contextual advertising is a major revenue source for today's companies. Keyword extraction is a key step in this kind of advertising: appropriate advertising keywords are extracted from Web pages so that the corresponding ads can be triggered. This paper describes a system that learns how to extract keywords from Web pages for advertisement targeting. First, a text network is built for a single Web page; then PageRank is applied to the network to determine the importance of each word; finally, the top-ranked words are selected as the keywords of the page. The algorithm is tested on a corpus of blog pages, and the experimental results show that it is practical and effective.
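The three steps of the pipeline (build a text network, run PageRank, keep the top-ranked words) can be illustrated with a minimal Python sketch. The abstract does not state how edges in the text network are defined, so the sketch assumes an undirected co-occurrence graph over a sliding window. The function names, the `window` parameter, and the damping factor of 0.85 are likewise illustrative assumptions, not the configuration used in the paper.

```python
import re
from collections import defaultdict


def build_text_network(text, window=2, stopwords=frozenset()):
    """Build an undirected word graph for one page.

    Assumption: two words are linked when they occur within `window`
    positions of each other. The paper's actual edge rule is not given.
    """
    words = [w for w in re.findall(r"[a-z]+", text.lower()) if w not in stopwords]
    graph = defaultdict(set)
    for i, word in enumerate(words):
        for other in words[i + 1 : i + 1 + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Plain power-iteration PageRank on an undirected graph."""
    nodes = list(graph)
    n = len(nodes)
    if n == 0:
        return {}
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        new_rank = {}
        for node in nodes:
            # Each neighbour shares its current rank equally among its edges.
            incoming = sum(rank[nb] / len(graph[nb]) for nb in graph[node])
            new_rank[node] = (1 - damping) / n + damping * incoming
        rank = new_rank
    return rank


def extract_keywords(text, top_k=5, **network_options):
    """Return the `top_k` highest-ranked words of a page."""
    scores = pagerank(build_text_network(text, **network_options))
    # Sort by descending score; break ties alphabetically for determinism.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]
```

For example, `extract_keywords(page_text, top_k=10, stopwords=frozenset({"the", "a", "of"}))` would return up to ten candidate advertising keywords for a page whose visible text is held in the hypothetical variable `page_text`.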
Funding
This study is supported by the Beijing Natural Science Foundation (4092029) and the Fundamental Research Funds for the Central Universities (2009RC0217).