摘要
提出了一个通用的层次化新闻视频处理框架,将新闻视频处理分为句法分段、语义标注以及视频摘要三个层次,并给出了三个层次中涉及的故事单元探测、字幕探测、视频摘要等关键技术的解决方案。框架突破了传统的新闻视频处理框架仅局限于句法分段以及单媒体特征进行处理的缺陷,通过对视音频特征进行多模态的综合分析来获取新闻视频高层的语义内容。实验通过一个新闻视频处理原型系统NVPS验证了框架的可行性,重点对故事单元探测、标题探测以及口播帧探测三个算法进行了实验,实验结果分别达到88%,86%和86%的探测准确率,从而进一步证实了层次框架在新闻视频处理方面的有效性。
A general hierarchical framework of news video process is presented. It divides the news video process into three levels: syntax segmentation level, semantic labeling level and abstraction level. Some key techniques related to these levels are described and solutions of them are introduced. The proposed framework overcomes the shortcomings of traditional news video process methods, which are limited to the content-based segmentation and process based on the single media feature. It acquires the semantic content by the analysis of audio-visual features synthetically. Experiments are carried out on a news video process prototype called NVPS, which validates the feasibility of the framework. Three methods, namely story detection, caption detection and anchor detection methods are tested on NVPS. The results reach to the detection precision of 88%, 86% and 86% respectively, which prove the efficiency of the layered framework in the semantic content analysis of news videos.
出处
《国防科技大学学报》
EI
CAS
CSCD
北大核心
2004年第5期99-103,共5页
Journal of National University of Defense Technology
基金
国家863高技术资助项目(2001AA115123)
关键词
新闻视频
句法分段
语义标注
视频摘要
news video
syntax segmentation
semantic labeling
video abstraction