层次化新闻视频处理框架的设计与实现被引量：3

The Design and Realization of Hierarchical Framework of News Video Process

下载PDF

导出

摘要提出了一个通用的层次化新闻视频处理框架,将新闻视频处理分为句法分段、语义标注以及视频摘要三个层次,并给出了三个层次中涉及的故事单元探测、字幕探测、视频摘要等关键技术的解决方案。框架突破了传统的新闻视频处理框架仅局限于句法分段以及单媒体特征进行处理的缺陷,通过对视音频特征进行多模态的综合分析来获取新闻视频高层的语义内容。实验通过一个新闻视频处理原型系统NVPS验证了框架的可行性,重点对故事单元探测、标题探测以及口播帧探测三个算法进行了实验,实验结果分别达到88%,86%和86%的探测准确率,从而进一步证实了层次框架在新闻视频处理方面的有效性。 A general hierarchical framework of news video process is presented. It divides the news video process into three levels: syntax segmentation level, semantic labeling level and abstraction level. Some key techniques related to these levels are described and solutions of them are introduced. The proposed framework overcomes the shortcomings of traditional news video process methods, which are limited to the content-based segmentation and process based on the single media feature. It acquires the semantic content by the analysis of audio-visual features synthetically. Experiments are carried out on a news video process prototype called NVPS, which validates the feasibility of the framework. Three methods, namely story detection, caption detection and anchor detection methods are tested on NVPS. The results reach to the detection precision of 88%, 86% and 86% respectively, which prove the efficiency of the layered framework in the semantic content analysis of news videos.

作者谢毓湘栾悉道吴玲达老松杨王卫威

机构地区国防科技大学人文与管理学院

出处《国防科技大学学报》 EI CAS CSCD 北大核心 2004年第5期99-103,共5页 Journal of National University of Defense Technology

基金国家863高技术资助项目(2001AA115123)

关键词新闻视频句法分段语义标注视频摘要 news video syntax segmentation semantic labeling video abstraction

分类号 TN941.2 [电子电信—信号与信息处理]

引文网络
相关文献

参考文献8

1Michael R L, Edward Yan, Sam Sze. A Multilingual Multimodal Digital Video Library System[A]. JCDL'02, July 13-17,2002, Portland, Oregon, USA.145-153.
2Shin'ichi Satoh, Name-It: Naming and Detecting Faces in News Videos[J]. IEEE Multimedia, 1999:22-35.
3Lienhart R, Pfeiffer S,Effelsberg W. Video Abstracting[J]. Communications of the ACM, 55-62, Dec. 1997.
4Ma Y F, Lu L, Zhang H J, et al. A User Attention Model for Video Summarization[A]. Proceeding of ACM Multimedia'02, Juan-les-Pins, France, December, 2002.
5Christel M G, Hauptmann A G, Wactlar H D, et al. Collages as Dynamic Summaries for News Video[A]. Proceeding of ACM Multimedia'02, Juan-les-Pins, France, December, 2002.
6谢毓湘,栾悉道,吴玲达,老松杨.一种基于解压的镜头探测方法[J].系统工程与电子技术,2003,25(8):1028-1031. 被引量：8
7马宇飞,白雪生,徐光祐,史元春.新闻视频中口播帧检测方法的研究[J].软件学报,2001,12(3):377-382. 被引量：24
8Hua X S, Chen X R, Liu W Y, et al. Automatic Location of Text in Video Frames[A]. Proceeding of ACM Multimedia 2001 Workshops: MIR2001, Ottawa, Canada, October 5, 2001:24-27.

二级参考文献14

1Ardizzaae E, Caseia M. Automatic Video Database Indexing and Retrieval[J]. Multimedia Tools and Applications, 1997:29 - 56.
2Yu H, Wolf W. A Visual Search System for Video and Image Databases[C]. in Proc.IEEE Int'l Conf. on Multimedia Computing and Systems(Ottawa, Canada), June, 1997: 517-524.
3Zabih R, Miller J, Mai K. A Feature-Based Algorithm for Detecting and Classifying Scene Breaks[ C ]. in Proc. of ACM Multimedia' 95, ( San Francisco, CA), 1995 : 189 - 200.
4J Oh, Hua K A, Liang N. A Content-Based Scene Change Detection and Classification Technique Using Background Tracking[C]. in SPIE Conf.on Multimedia Computing and Networking 2000(San Jose, CA), Jan.2000: 254-265.
5Oh J, Hua K A. An Efficient and Cost-Effective Technique for Browsing and Indexing Large Video Databases[C]. in Prec. of 2000 ACM SIGMOD Intl. Conf. on Management of Data(Dallas, TX), May 2000:415- 426.
6Annan F, Hsu A, Chiu M Y. Image Processing on Compressed Data for Large Video Databases [ C ]. in Proc ACM Multimedia 93 ( Anaheim,CA), 1993:267 - 272.
7Yeo B L, Liu B. An Unified Approach to Temporal Segmentation of Motion JPEG and MPEG Compressed Video[C]. in Proc IEEE Intl. Conf.on Multimedia Computing and Systems(Washington, DC), 1995:81 -90.
8Zhang H J, Chien Y L, Smoliar S W. Video Parsing and Browsing Using Compressed Data[J]. Multimedia Tools and Applications, 1995 (1) : 89- 113.
9Hampapur A, Jain R, Weym outh T E. Production Model Based Digital Video Segmentation[J]. Multimedia Tools and Applications, 1995(1) :9- 47.
10Jongho Nang, Seungwook Hong, Youngin Ibm. An Efficient Video Segmentation Scheme for MPEG Video Stream Using Macroblock Information[C]. in Proc. of ACM Multimedia'99, 1999: 23-26.

共引文献28

1谢毓湘,栾悉道,吴玲达,老松杨.NVPS：一个多模态的新闻视频处理系统[J].情报学报,2004,23(4):404-409. 被引量：5
2于俊清,汤旸,周向东.基于主色特征识别的新闻视频口播帧[J].计算机工程与科学,2004,26(8):28-31. 被引量：3
3于俊清,汤旸,周向东.利用主色模板匹配检测新闻视频口播帧[J].计算机辅助设计与图形学学报,2005,17(3):558-562. 被引量：2
4李默,李弼程,邓子健.新闻视频主持人镜头的半屏幕检测算法[J].计算机工程与应用,2005,41(15):183-185. 被引量：4
5朱志辉.基于视频摘要生成技术的研究[J].微电子学与计算机,2006,23(2):76-78. 被引量：4
6高健,叶静,陈莹莹,聂藩.基于动态自动提取模板的实时口播帧检测方法[J].电视技术,2006,30(12):71-73.
7老松杨,白亮,胡艳丽,陈剑赟.基于领域本体的新闻视频检索[J].小型微型计算机系统,2007,28(8):1470-1476. 被引量：4
8刘宇驰,栾悉道,谢毓湘,吴玲达.新闻视频中非新闻段的去除[J].小型微型计算机系统,2007,28(10):1837-1841. 被引量：2
9文军,曾璞,徐建军,栾悉道,吴玲达.多模态特征融合的新闻视频故事分割方法[J].小型微型计算机系统,2008,29(1):171-174. 被引量：1
10文军,曾璞,栾悉道,吴玲达.自适应的新闻视频播音员镜头探测方法[J].计算机工程,2008,34(3):244-246. 被引量：1

同被引文献19

1谢毓湘,栾悉道,吴玲达,老松杨.NVPS：一个多模态的新闻视频处理系统[J].情报学报,2004,23(4):404-409. 被引量：5
2蒋杰,老松杨,吴玲达,王辰.面向视频服务器的基于内容视频分析与检索系统的设计与实现[J].小型微型计算机系统,2004,25(9):1628-1631. 被引量：1
3刘宇驰,谢毓湘,吴玲达,雷震,戴端辉.一种开放式视频管理框架[J].国防科技大学学报,2006,28(1):73-76. 被引量：4
4吴玲达老松杨王辰等.多媒体信息系统[M].北京：电子工业出版社,2002..
5[5]M Bertini,A D Bimbo,Content-Based Annotation and Retrieval of News Videos[A].IEEE Multimedia and Expo,(ICME'00)[C].2000.483-486.
6[6]CNN[EB/OL].http://www.intel.com/comm-net/cnn_work/index.html,2006-02.
7[7]Informedia[EB/OL].http://www.informedia.cs.emu.edu/,2006-02.
8[8]Fischlár News[EB/OL].http://www.cdvp.dcu.ie/,2006-02.
9[9]The SRI MAESTRO Team.MAESTRO:Conductor of Multimedia Analysis Technologies[J].Communications of the ACM,2000,43(2):57-63.
10[10]P van Beck,A B Benitez,J Heuer,et al.MPEG-7:Multimedia Description Schemes[S].ISO/IEC FDIS 15938-5:2001,2001.

引证文献3

1刘宇驰,栾悉道,吴玲达,谢毓湘.面向Web的新闻视频检索系统[J].计算机工程与科学,2006,28(z2):56-58.
2陈丹雯,徐建军,谢毓湘,吴玲达.虚拟新闻自动生成系统的设计与实现[J].系统仿真学报,2006,18(z1):157-160.
3张鸿雁.基于故事的新闻视频事件专题分析方法[J].科技传播,2012,4(19):28-28. 被引量：2

二级引证文献2

1梁惠敏.解析基于故事的新闻视频事件专题分析方法[J].艺术科技,2013,26(6):65-65. 被引量：2
2王久凌.基于故事的新闻事件专题分析方法[J].新闻传播,2015(5X).

1阿呆.中国移动:第五媒体登上奥运舞台[J].通讯世界,2008(8):29-30.
2高健,叶静,陈莹莹,聂藩.基于动态自动提取模板的实时口播帧检测方法[J].电视技术,2006,30(12):71-73.
3马宇飞,白雪生,徐光祐,史元春.新闻视频中口播帧检测方法的研究[J].软件学报,2001,12(3):377-382. 被引量：24
4吴焕瑞.图书领域的语义标注[J].无线互联科技,2013,10(3):130-131.
5刘蓬侠,曾芷德,李思昆.VLSI并行测试生成系统的一种动态层次框架[J].计算机工程与科学,2001,23(2):79-83. 被引量：2
6文军,吴玲达,曾璞,谢毓湘.新闻视频数据库基于故事单元的“多线程”管理技术研究[J].国防科技大学学报,2010,32(1):116-121. 被引量：2
7赵辉.新闻报道技术研究[J].魅力中国,2011(21):145-146.
8管云林.试论手机报的媒体特征和发展趋势[J].江苏科技信息,2013(12):7-9.
9魏晗,李弼程,张瑞杰,唐永旺.图像语义提取方法研究[J].现代电子技术,2011,34(24):103-106. 被引量：6
10袁凌云,王兴超.语义技术在物联网中的应用研究综述[J].计算机科学,2014,41(S1):239-246. 被引量：6

国防科技大学学报

2004年第5期

浏览历史

内容加载中请稍等...

层次化新闻视频处理框架的设计与实现被引量：3

参考文献8

二级参考文献14

共引文献28

同被引文献19

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

层次化新闻视频处理框架的设计与实现 被引量：3

参考文献8

二级参考文献14

共引文献28

同被引文献19

引证文献3

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

层次化新闻视频处理框架的设计与实现被引量：3