期刊文献+
共找到214篇文章
< 1 2 11 >
每页显示 20 50 100
Unifying Convolution and Transformer Decoder for Textile Fiber Identification
1
作者 许罗力 李粉英 常姗 《Journal of Donghua University(English Edition)》 CAS 2023年第4期357-363,共7页
At present,convolutional neural networks(CNNs)and transformers surpass humans in many situations(such as face recognition and object classification),but do not work well in identifying fibers in textile surface images... At present,convolutional neural networks(CNNs)and transformers surpass humans in many situations(such as face recognition and object classification),but do not work well in identifying fibers in textile surface images.Hence,this paper proposes an architecture named FiberCT which takes advantages of the feature extraction capability of CNNs and the long-range modeling capability of transformer decoders to adaptively extract multiple types of fiber features.Firstly,the convolution module extracts fiber features from the input textile surface images.Secondly,these features are sent into the transformer decoder module where label embeddings are compared with the features of each type of fibers through multi-head cross-attention and the desired features are pooled adaptively.Finally,an asymmetric loss further purifies the extracted fiber representations.Experiments show that FiberCT can more effectively extract the representations of various types of fibers and improve fiber identification accuracy than state-of-the-art multi-label classification approaches. 展开更多
关键词 non-destructive textile fiber identification transformer decoder asymmetric loss
下载PDF
Viterbi Decoder ACS单元中路径度量值存储空间的优化
2
作者 郭正伟 赵勇 《现代电子技术》 2007年第17期71-73,共3页
ACS单元的设计及路径度量(PM)值的存储是Viterbi Decoder硬件实现的重要部分之一。介绍了一种码率为1/2的硬判决Viterbi Decoder的ACS部分的硬件实现方法。采用了一种全新的设计与存储方式,即原位运算旋转地址的方式,极大地节省了在ACS... ACS单元的设计及路径度量(PM)值的存储是Viterbi Decoder硬件实现的重要部分之一。介绍了一种码率为1/2的硬判决Viterbi Decoder的ACS部分的硬件实现方法。采用了一种全新的设计与存储方式,即原位运算旋转地址的方式,极大地节省了在ACS运算过程中用以存储路径度量值的RAM空间,大量的实验证明,设计的译码器在资源消耗上有较大优势。 展开更多
关键词 卷积码 VITERBI decoder ACS单元 路径度量 分支度量 幸存路径 回溯
下载PDF
Low-loss belief propagation decoder with Tanner graph in quantum error-correction codes 被引量:1
3
作者 颜丹丹 范兴奎 +1 位作者 陈祯羽 马鸿洋 《Chinese Physics B》 SCIE EI CAS CSCD 2022年第1期143-149,共7页
Quantum error-correction codes are immeasurable resources for quantum computing and quantum communication.However,the existing decoders are generally incapable of checking node duplication of belief propagation(BP)on ... Quantum error-correction codes are immeasurable resources for quantum computing and quantum communication.However,the existing decoders are generally incapable of checking node duplication of belief propagation(BP)on quantum low-density parity check(QLDPC)codes.Based on the probability theory in the machine learning,mathematical statistics and topological structure,a GF(4)(the Galois field is abbreviated as GF)augmented model BP decoder with Tanner graph is designed.The problem of repeated check nodes can be solved by this decoder.In simulation,when the random perturbation strength p=0.0115-0.0116 and number of attempts N=60-70,the highest decoding efficiency of the augmented model BP decoder is obtained,and the low-loss frame error rate(FER)decreases to 7.1975×10^(-5).Hence,we design a novel augmented model decoder to compare the relationship between GF(2)and GF(4)for quantum code[[450,200]]on the depolarization channel.It can be verified that the proposed decoder provides the widely application range,and the decoding performance is better in QLDPC codes. 展开更多
关键词 tanner graph belief propagation decoder augmented model fourier transform
原文传递
Determination of quantum toric error correction code threshold using convolutional neural network decoders 被引量:1
4
作者 王浩文 薛韵佳 +2 位作者 马玉林 华南 马鸿洋 《Chinese Physics B》 SCIE EI CAS CSCD 2022年第1期136-142,共7页
Quantum error correction technology is an important solution to solve the noise interference generated during the operation of quantum computers.In order to find the best syndrome of the stabilizer code in quantum err... Quantum error correction technology is an important solution to solve the noise interference generated during the operation of quantum computers.In order to find the best syndrome of the stabilizer code in quantum error correction,we need to find a fast and close to the optimal threshold decoder.In this work,we build a convolutional neural network(CNN)decoder to correct errors in the toric code based on the system research of machine learning.We analyze and optimize various conditions that affect CNN,and use the RestNet network architecture to reduce the running time.It is shortened by 30%-40%,and we finally design an optimized algorithm for CNN decoder.In this way,the threshold accuracy of the neural network decoder is made to reach 10.8%,which is closer to the optimal threshold of about 11%.The previous threshold of 8.9%-10.3%has been slightly improved,and there is no need to verify the basic noise. 展开更多
关键词 quantum error correction toric code convolutional neural network(CNN)decoder
原文传递
Low-complexity MP3 decoder based on Broadcom embedded platform
5
作者 冉川 沈庭芝 《Journal of Beijing Institute of Technology》 EI CAS 2011年第1期94-99,共6页
A low-complexity MP3 decoder based on Broadcom embedded platform was proposed.C-code level optimization algorithms on inverse quantization,stereo decoding and alias reduction based on PC were proposed to further reduc... A low-complexity MP3 decoder based on Broadcom embedded platform was proposed.C-code level optimization algorithms on inverse quantization,stereo decoding and alias reduction based on PC were proposed to further reduce the amount of memory usage and the computational complexity.Furthermore,the executable file of the optimized MP3 decoder was generated under the Linux environment,and transplanted to the set-top box based on Broadcom embedded platform.Experiment results showed that the total time for decoding was reduced on the embedded platform,and the goal of real time and fluent playing of audio files was fulfilled,which demonstrated the effectiveness of the proposed MP3 decoder.The proposed MP3 decoder could be applied in fields such as the set-top box based on Broadcom embedded platform and other portable devices. 展开更多
关键词 MP3 decoder algorithm optimization LINUX BROADCOM
下载PDF
Program Error Mitigation in MLC NAND Flash Memory with Soft Decision Decoders
6
作者 Zequn Fang Zheng Ma +2 位作者 Xiaohu Tang Yue Xiao Youhua Tang 《China Communications》 SCIE CSCD 2021年第4期76-87,共12页
Benefiting from strong decoding capabilities,soft-decision decoding has been used to replace hard-decision decoding in various communication systems,and NAND flash memory systems are no exception.However,soft-decision... Benefiting from strong decoding capabilities,soft-decision decoding has been used to replace hard-decision decoding in various communication systems,and NAND flash memory systems are no exception.However,soft-decision decoding relies heavily on accurate soft information.Owing to the incremental step pulse programming(ISPP),program errors(PEs)in multi-level cell(MLC)NAND flash memory have different characteristics compared to other types of errors,which is very difficult to obtain such accurate soft information.Therefore,the characteristics of the log-likelihood ratio(LLR)of PEs are investigated first in this paper.Accordingly,a PE-aware statistical method is proposed to determine the usage of PE mitigation schemes.In order to reduce the PE estimating workload of the controller,an adaptive blind clipping(ABC)scheme is proposed subsequently to approximate the PEs contaminated LLR with different decoding trials.Finally,simulation results demonstrate that(1)the proposed PE-aware statistical method is effective in practice,and(2)ABC scheme is able to provide satisfactory bit error rate(BER)and frame error rate(FER)performance in a penalty of negligible increasing of decoding latency. 展开更多
关键词 program errors soft-decision decoder NAND flash memory clipping approximation
下载PDF
Radiation Tolerant Viterbi Decoders for On-Board Processing(OBP) in Satellite Communications
7
作者 Zhen Gao Lina Yan +3 位作者 Jinhua Zhu Ruishi Han Ullah Anees Reviriego Pedro 《China Communications》 SCIE CSCD 2020年第1期140-150,共11页
Modern satellite communication systems require on-board processing(OBP)for performance improvements,and SRAM-FPGAs are an attractive option for OBP implementation.However,SRAM-FPGAs are sensitive to radiation effects,... Modern satellite communication systems require on-board processing(OBP)for performance improvements,and SRAM-FPGAs are an attractive option for OBP implementation.However,SRAM-FPGAs are sensitive to radiation effects,among which single event upsets(SEUs)are important as they can lead to data corruption and system failure.This paper studies the fault tolerance capability of a SRAM-FPGA implemented Viterbi decoder to SEUs on the user memory.Analysis and fault injection experiments are conducted to verify that over 97%of the SEUs on user memory would not lead to output errors.To achieve a better reliability,selective protection schemes are then proposed to further improve the reliability of the decoder to SEUs on user memory with very small overhead.Although the results are obtained for a specific FPGA implementation,the developed reliability estimation model and the general conclusions still hold for other implementations. 展开更多
关键词 viterbi decoder on-board processing FPGA user memory fault tolerance single event upsets
下载PDF
Design and Implementation of the Motion Compensation Module for HDTV Video Decoder
8
作者 王涛 郑世宝 +1 位作者 邱琳 王峰 《Journal of Shanghai Jiaotong university(Science)》 EI 2006年第1期1-8,共8页
This paper presented a new solution for motion compensation module in the high definition television (HDTV) video decoder. The overall architecture and the design of the major functional units, such as the motion vect... This paper presented a new solution for motion compensation module in the high definition television (HDTV) video decoder. The overall architecture and the design of the major functional units, such as the motion vector decoder, the predictor, and the mixer, were discussed. Based on the exploitation of the special characteristics inherent in the motion compensation algorithm, the motion compensation module and its functional units adopt various novel architectures in order to allow the module to meet real-time constraints. This solution resolves the problem of high hardware costs, low bus efficiency and complex control schemes in conventional designs. 展开更多
关键词 asynchronous macroblock pipelining architecture HDTV video decoder motion compensation module MPEG-2
下载PDF
A Highly Compatible Circular-Shifting Network for Partially Parallel QC-LDPC Decoder
9
作者 Yanzhi Wang Zhenzhi Wu +2 位作者 Peipei Liu Ning Guan Hua Wang 《International Journal of Communications, Network and System Sciences》 2017年第5期24-34,共11页
The conventional methodology for designing QC-LDPC decoders is applied for fixed configurations used in wireless communication standards, and the supported largest expansion factor Z (the parallelism of the layered de... The conventional methodology for designing QC-LDPC decoders is applied for fixed configurations used in wireless communication standards, and the supported largest expansion factor Z (the parallelism of the layered decoding) is a fixed number. In this paper, we study the circular-shifting network for decoding LDPC codes with arbitrary Z factor, especially for decoding large Z (Z P) codes, where P is the decoder parallelism. By buffering the P-length slices from the memory, and assembling the shifted slices in a fixed routine, the P-parallelism shift network can process Z-parallelism circular-shifting tasks. The implementation results show that the proposed network for arbitrary sized data shifting consumes only one times of additional resource cost compared to the traditional solution for only maximum P sized data shifting, and achieves significant saving on area and routing complexity. 展开更多
关键词 PARTIALLY PARALLEL Layered Decoding Circular-Shifting NETWORK QC-LDPC decoder Arbitrary Expansion Factor
下载PDF
Comparison and Design of Decoder in B3G Mobile Communication System
10
作者 Mingxiang GUAN Mingchuan YANG 《Communications and Network》 2009年第1期20-24,共5页
Turbo code has been shown to have ability to achieve performance that is close to Shannon limit. It has been adopted by various commercial communication systems. Both universal mobile telecommunications system (UMTS) ... Turbo code has been shown to have ability to achieve performance that is close to Shannon limit. It has been adopted by various commercial communication systems. Both universal mobile telecommunications system (UMTS) TDD and FDD have also employed turbo code as the error correction coding scheme. It outperforms convolutional code in large block size, but because of its time delay, it is often only used in the non-real-time service. In this paper, we discuss the encoder and decoder structure of turbo code in B3G mobile communication System. In addition, various decoding techniques, such as the Log-MAP, Max-log-MAP and SOVA algorithm for non-real-time service are deduced and compared. The performance results of decoder and algorithms in different configurations are also shown. 展开更多
关键词 decoder BEYOND 3G MOBILE COMMUNICATION system DECODING algorithm
下载PDF
Serial Genetic Algorithm Decoder for Low Density Parity Check Codes
11
作者 Hasna Chaibi 《International Journal of Communications, Network and System Sciences》 2015年第9期358-366,共9页
Genetic algorithms are successfully used for decoding some classes of error correcting codes, and offer very good performances for solving large optimization problems. This article proposes a new decoder based on Seri... Genetic algorithms are successfully used for decoding some classes of error correcting codes, and offer very good performances for solving large optimization problems. This article proposes a new decoder based on Serial Genetic Algorithm Decoder (SGAD) for decoding Low Density Parity Check (LDPC) codes. The results show that the proposed algorithm gives large gains over sum-product decoder, which proves its efficiency. 展开更多
关键词 SERIAL Genetic Algorithm Sum-Product decoder Sigmoidal Function LDPC Code Error CORRECTING CODES
下载PDF
Service Adaptable 3G Turbo Decoder for Indoor/Low Range Outdoor Environment
12
作者 Costas CHAIKALIS Nicholas S. SAMARAS 《International Journal of Communications, Network and System Sciences》 2009年第8期704-713,共10页
For the well-known 3G mobile communications standard UMTS, four different service classes have been specified. Considering two turbo decoding algorithms, like SOVA and log-MAP, it would be desirable to use an efficien... For the well-known 3G mobile communications standard UMTS, four different service classes have been specified. Considering two turbo decoding algorithms, like SOVA and log-MAP, it would be desirable to use an efficient turbo decoder. In this paper this decoder is shown to adapt dynamically to different service scenarios, considering parameters like performance and complexity for indoor/low range outdoor operating en-vironment. The scenarios show that for streaming service class real-time class applications the proposed de-coding algorithm depends on data rate;for the majority of scenarios SOVA is proposed, whereas log-MAP is optimal for increased data rates and medium-sized frames. On the other hand, conversational service class real-time applications cannot be established. For the majority of non real-time applications (interactive and background service classes) either algorithm can be used, while log-MAP is proposed for medium data rates and frame lengths. 展开更多
关键词 Reconfigurable Systems Turbo decoder UMTS Flat Reyleigh FADING Indoor/Low RANGE Out-door Operating Environment
下载PDF
A Low Power and High Speed Viterbi Decoder Based on Deep Pipelined, Clock Blocking and Hazards Filtering
13
作者 C. ARUN V. RAJAMANI 《International Journal of Communications, Network and System Sciences》 2009年第6期575-582,共8页
A high speed and low power Viterbi decoder architecture design based on deep pipelined, clock gating and toggle filtering has been presented in this paper. The Add-Compare-Select (ACS) and Trace Back (TB) units and it... A high speed and low power Viterbi decoder architecture design based on deep pipelined, clock gating and toggle filtering has been presented in this paper. The Add-Compare-Select (ACS) and Trace Back (TB) units and its sub circuits of the decoder have been operated in deep pipelined manner to achieve high transmission rate. The Power dissipation analysis is also investigated and compared with the existing results. The techniques that have been employed in our low-power design are clock-gating and toggle filtering. The synthesized circuits are placed and routed in the standard cell design environment and implemented on a Xilinx XC2VP2fg256-6 FPGA device. Power estimation obtained through gate level simulations indicated that the proposed design reduces the power dissipation of an original Viterbi decoder design by 68.82% and a speed of 145 MHz is achieved. 展开更多
关键词 VITERBI decoder Convolutional Codes High-Speed Low Power Consumption Parallel Processing DEEP PIPELINING
下载PDF
FPGA Design and Implementation of a Convolutional Encoder and a Viterbi Decoder Based on 802.11a for OFDM
14
作者 Yan Sun Zhizhong Ding 《Wireless Engineering and Technology》 2012年第3期125-131,共7页
In this paper, a modified FPGA scheme for the convolutional encoder and Viterbi decoder based on the IEEE 802.11a standards of WLAN is presented in OFDM baseband processing systems. The proposed design supports a gene... In this paper, a modified FPGA scheme for the convolutional encoder and Viterbi decoder based on the IEEE 802.11a standards of WLAN is presented in OFDM baseband processing systems. The proposed design supports a generic, robust and configurable Viterbi decoder with constraint length of 7, code rate of 1/2 and decoding depth of 36 symbols. The Viterbi decoder uses full-parallel structure to improve computational speed for the add-compare-select (ACS) modules, adopts optimal data storage mechanism to avoid overflow and employs three distributed RAM blocks to complete cyclic trace-back. It includes the core parts, for example, the state path measure computation, the preservation and transfer of the survivor path and trace-back decoding, etc. Compared to the general Viterbi decoder, this design can effectively decrease the 10% of chip logic elements, reduce 5% of power consumption, and increase the encoder and decoder working performance in the hardware implementation. Lastly, relevant simulation results using Verilog HDL language are verified based on a Xinlinx Virtex-II FPGA by ISE 7.1i. It is shown that the Viterbi decoder is capable of decoding (2, 1, 7) convolutional codes accurately with a throughput of 80 Mbps. 展开更多
关键词 FPGA Convolutional ENCODER VITERBI decoder IEEE 802.11a OFDM
下载PDF
Design of 2-to-4 All-Optical Decoder with the Help of Terahertz Optical Asymmetric Demultiplexer
15
作者 Arunava Bhattacharyya Dilip Kumar Gayen Tanay Chattopadhyay 《International Journal of Modern Nonlinear Theory and Application》 2016年第1期67-72,共6页
An all-optical 2-to-4 decoder unit with the assist of terahertz optical asymmetric demultiplexer (TOAD) is presented. The all-optical 2-to-4 decoder with a set of all-optical switches is designed which can be used to ... An all-optical 2-to-4 decoder unit with the assist of terahertz optical asymmetric demultiplexer (TOAD) is presented. The all-optical 2-to-4 decoder with a set of all-optical switches is designed which can be used to achieve a high-speed central processor unit using optical hardware. The unique output lines can be used for all-optical header processing. We attempt to develop an integrated all-optical circuit which can perform decoding of signal. This scheme is very simple and flexible for performing different logic operation and to design advanced complex logic. Simulated results are confirming the described methods. 展开更多
关键词 Terahertz Optical Asymmetric Demultiplexer Semiconductor Optical Amplifier All-Optical decoder
下载PDF
Lowering the Error Floor of ADMM Penalized Decoder for LDPC Codes 被引量:1
16
作者 Jiao Xiaopeng Mu Jianjun 《China Communications》 SCIE CSCD 2016年第8期127-135,共9页
Decoding by alternating direction method of multipliers(ADMM) is a promising linear programming decoder for low-density parity-check(LDPC) codes. In this paper, we propose a two-step scheme to lower the error floor of... Decoding by alternating direction method of multipliers(ADMM) is a promising linear programming decoder for low-density parity-check(LDPC) codes. In this paper, we propose a two-step scheme to lower the error floor of LDPC codes with ADMM penalized decoder.For the undetected errors that cannot be avoided at the decoder side, we modify the code structure slightly to eliminate low-weight code words. For the detected errors induced by small error-prone structures, we propose a post-processing method for the ADMM penalized decoder. Simulation results show that the error floor can be reduced significantly over three illustrated LDPC codes by the proposed two-step scheme. 展开更多
关键词 LDPC codes linear programming decoding alternating direction method of multipliers(ADMM) error floor
下载PDF
Improved Polar Decoder Utilizing Neural Network in Fast Simplified Successive-Cancellation Decoding
17
作者 Jiaxin Fang Chunwu Liu 《Journal of Computer and Communications》 2020年第7期90-99,共10页
<div style="text-align:justify;"> Polar codes using successive-cancellation decoding always suffer from high latency for its serial nature. Fast simplified successive-cancellation decoding algorithm im... <div style="text-align:justify;"> Polar codes using successive-cancellation decoding always suffer from high latency for its serial nature. Fast simplified successive-cancellation decoding algorithm improves the situation in theoretically but not performs well as expected in practical for the workload of nodes identification and the existence of many short blocks. Meanwhile, Neural network (NN) based decoders have appeared as potential candidates to replace conventional decoders for polar codes. But the exponentially increasing training complexity with information bits is unacceptable which means it is only suitable for short codes. In this paper, we present an improvement that increases decoding efficiency without degrading the error-correction performance. The long polar codes are divided into several sub-blocks, some of which can be decoded adopting fast maximum likelihood decoding method and the remained parts are replaced by several short codes NN decoders. The result shows that time steps the proposed algorithm need only equal to 79.8% of fast simplified successive-cancellation decoders require. Moreover, it has up to 21.2 times faster than successive-cancellation decoding algorithm. More importantly, the proposed algorithm decreases the hardness when applying in some degree. </div> 展开更多
关键词 Polar Codes Decoding Latency Fast Simplified Successive-Cancellation Decoding (Fast-SSC) Neural Network (NN)
下载PDF
A Novel Decoder Based on Parallel Genetic Algorithms for Linear Block Codes
18
作者 Abdeslam Ahmadi Faissal El Bouanani +1 位作者 Hussain Ben-Azza Youssef Benghabrit 《International Journal of Communications, Network and System Sciences》 2013年第1期66-76,共11页
Genetic algorithms offer very good performances for solving large optimization problems, especially in the domain of error-correcting codes. However, they have a major drawback related to the time complexity and memor... Genetic algorithms offer very good performances for solving large optimization problems, especially in the domain of error-correcting codes. However, they have a major drawback related to the time complexity and memory occupation when running on a uniprocessor computer. This paper proposes a parallel decoder for linear block codes, using parallel genetic algorithms (PGA). The good performance and time complexity are confirmed by theoretical study and by simulations on BCH(63,30,14) codes over both AWGN and flat Rayleigh fading channels. The simulation results show that the coding gain between parallel and single genetic algorithm is about 0.7 dB at BER = 10﹣5 with only 4 processors. 展开更多
关键词 CHANNEL Coding Linear Block Codes META-HEURISTICS PARALLEL Genetic ALGORITHMS PARALLEL Decoding ALGORITHMS Time Complexity Flat FADING CHANNEL AWGN
下载PDF
RETRACTED: <i>Improved Polar Decoder Utilizing Neural Network in Fast Simplified Successive-Cancellation Decoding</i>
19
作者 Jiaxin Fang Chunwu Liu 《Optics and Photonics Journal》 2020年第6期149-158,共12页
<div style="text-align:justify;"> <p style="text-align:justify;background:white;"> <span style="font-size:10.0pt;font-family:" color:black;"="">This artic... <div style="text-align:justify;"> <p style="text-align:justify;background:white;"> <span style="font-size:10.0pt;font-family:" color:black;"="">This article has been retracted to straighten the academic record. In making this decision the Editorial Board follows COPE's </span><span><a href="http://publicationethics.org/files/retraction%20guidelines.pdf"><span style="font-size:10.0pt;font-family:;" "="">Retraction Guidelines</span></a></span><span style="font-size:10.0pt;font-family:" color:black;"="">. The aim is to promote the circulation of scientific research by offering an ideal research publication platform with due consideration of internationally accepted standards on publication ethics. The Editorial Board would like to extend its sincere apologies for any inconvenience this retraction may have caused.</span><span style="font-size:10.0pt;font-family:" color:black;"=""></span> </p> <p style="text-align:justify;background:white;"> <span style="font-size:10.0pt;font-family:" color:black;"="">Please see the </span><span><a href="https://www.scirp.org/journal/paperinformation.aspx?paperid=101825"><span style="font-size:10.0pt;font-family:;" "="">article page</span></a></span><span style="font-size:10.0pt;font-family:" color:black;"=""> for more details. </span><span><a href="https://www.scirp.org/pdf/opj_2020072814494052.pdf"><span style="font-size:10.0pt;font-family:;" "="">The full retraction notice</span></a></span><span style="font-size:10.0pt;font-family:" color:black;"=""> in PDF is preceding the original paper which is marked "RETRACTED". </span> </p> <br /> </div> 展开更多
关键词 Polar Codes Decoding Latency Fast Simplified Successive-Cancellation Decoding (Fast-SSC) Neural Network (NN)
下载PDF
A Video Captioning Method by Semantic Topic-Guided Generation
20
作者 Ou Ye Xinli Wei +2 位作者 Zhenhua Yu Yan Fu Ying Yang 《Computers, Materials & Continua》 SCIE EI 2024年第1期1071-1093,共23页
In the video captioning methods based on an encoder-decoder,limited visual features are extracted by an encoder,and a natural sentence of the video content is generated using a decoder.However,this kind ofmethod is de... In the video captioning methods based on an encoder-decoder,limited visual features are extracted by an encoder,and a natural sentence of the video content is generated using a decoder.However,this kind ofmethod is dependent on a single video input source and few visual labels,and there is a problem with semantic alignment between video contents and generated natural sentences,which are not suitable for accurately comprehending and describing the video contents.To address this issue,this paper proposes a video captioning method by semantic topic-guided generation.First,a 3D convolutional neural network is utilized to extract the spatiotemporal features of videos during the encoding.Then,the semantic topics of video data are extracted using the visual labels retrieved from similar video data.In the decoding,a decoder is constructed by combining a novel Enhance-TopK sampling algorithm with a Generative Pre-trained Transformer-2 deep neural network,which decreases the influence of“deviation”in the semantic mapping process between videos and texts by jointly decoding a baseline and semantic topics of video contents.During this process,the designed Enhance-TopK sampling algorithm can alleviate a long-tail problem by dynamically adjusting the probability distribution of the predicted words.Finally,the experiments are conducted on two publicly used Microsoft Research Video Description andMicrosoft Research-Video to Text datasets.The experimental results demonstrate that the proposed method outperforms several state-of-art approaches.Specifically,the performance indicators Bilingual Evaluation Understudy,Metric for Evaluation of Translation with Explicit Ordering,Recall Oriented Understudy for Gisting Evaluation-longest common subsequence,and Consensus-based Image Description Evaluation of the proposed method are improved by 1.2%,0.1%,0.3%,and 2.4% on the Microsoft Research Video Description dataset,and 0.1%,1.0%,0.1%,and 2.8% on the Microsoft Research-Video to Text dataset,respectively,compared with the existing video captioning methods.As a result,the proposed method can generate video captioning that is more closely aligned with human natural language expression habits. 展开更多
关键词 Video captioning encoder-decoder semantic topic jointly decoding Enhance-TopK sampling
下载PDF
上一页 1 2 11 下一页 到第
使用帮助 返回顶部