摘要
Discriminative Latent Model(DLM) is proposed for Multiword Expressions(MWEs) extraction in Chinese text to improve the performance of Machine Translation(MT) system such as Template Based MT(TBMT).For MT systems to become of further practical use,they need to be enhanced with MWEs processing capability.As our study towards this goal,we propose DLM,which is developed for sequence labeling task including hidden structures,to extract MWEs for MT systems.DLM combines the advantages of existing discriminative models,which can learn hidden structures in sequence labeling task.In our evaluations,DLM achieves precisions ranging up to 90.73% for some type of MWEs,which is higher than state-of-the-art discriminative models.Such results demonstrate that it is feasible to automatically identify many Chinese MWEs using our DLM tool.With MWEs processing model,BLEU score of MT system has also been increased by up to 0.3 in close test.
Discriminative Latent Model (DLM) is proposed for Multiword Expressions (MWEs) ex- traction in Chinese text to improve the performance of Machine Translation (MT) system such as Tem- plate Based MT (TBMT). For MT systems to be- come of further practical use, they need to be en- hanced with MWEs processing capability. As our study towards this goal, we propose DLM, which is developed for sequence labeling task including hid- den structures, to extract MWEs for MT systems. DLM combines the advantages of existing discrimi- native models, which can learn hidden structures in sequence labeling task. In our evaluations, DLM a- chieves precisions ranging up to 90.73% for some type of MWEs, which is higher than state-of-the-art discriminative models. Such results demonstrate that it is feasible to automatically identify many Chinese MWEs using our DLM tool. With MWEs processing model, BLEU score of MT system has also been in- creased by up to 0.3 in close test.
基金
supported by Liaoning Province Doctor Startup Fund under Grant No.20101021
the Fund of the State Ethic Affairs Commissions under Grant No.10DL08
AnHui Provincie Key Laboratory of Affective Computing and Advanced Intelligent Machine