Abstract
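To address the classification of high-dimensional data with a multimodal distribution structure, this paper proposes an infinite Max-Margin Linear Discriminant Projection (iMMLDP) model. Unlike existing global projection methods, the model combines a Dirichlet process with the Max-Margin Linear Discriminant Projection (MMLDP) model to partition the data into several local regions, and learns a max-margin linear discriminant projection classifier in each region. Combining the local classifiers yields a globally nonlinear projection and classification. The iMMLDP model uses a Bayesian framework to learn clustering, projection, and the classifiers jointly, which effectively uncovers the latent structure of the data; it can therefore classify nonlinearly separable data well, especially data with multimodal distributions. Thanks to the nonparametric Bayesian prior, the model selection problem, namely choosing the number of local regions, is avoided. Experiments on synthetic datasets, public datasets, and measured radar datasets verify the effectiveness of the proposed method.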
An infinite Max-Margin Linear Discriminant Projection (iMMLDP) model is developed for the classification of multimodally distributed high-dimensional data. Unlike global projection methods, iMMLDP divides the data into a set of local regions via a Dirichlet Process (DP) mixture model and simultaneously learns a Max-Margin Linear Discriminant Projection (MMLDP) classifier in each local region. Assembling these local classifiers yields a flexible nonlinear classifier. Within this Bayesian framework, iMMLDP combines dimensionality reduction, clustering, and supervised classification in a principled way, so the underlying structure of the data can be uncovered. As a result, the model can classify data with globally nonlinear structure, especially data with a multimodal distribution. With the help of the Bayesian nonparametric prior, the model selection problem (i.e., choosing the number of local regions) is avoided. The proposed model is evaluated on synthesized and real-world data, including multimodally distributed datasets and measured radar high range resolution profile (HRRP) data, to validate its efficiency and effectiveness.
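To illustrate why a Dirichlet process prior removes the need to fix the number of local regions in advance, the following minimal sketch samples cluster assignments from a Chinese restaurant process, a standard representation of the DP prior. It is not the paper's inference procedure, which additionally couples the clustering with the max-margin projection classifiers; the function name `crp_assignments` and the concentration value `alpha=1.0` are assumptions made for illustration only.

```python
import random


def crp_assignments(n_points, alpha, seed=0):
    """Sample cluster labels for n_points from a Chinese restaurant process.

    alpha is the DP concentration parameter (hypothetical value chosen by the
    caller); larger alpha tends to produce more clusters.
    """
    rng = random.Random(seed)
    labels = []
    counts = []  # counts[k] = number of points currently assigned to cluster k
    for i in range(n_points):
        # An existing cluster k is chosen with probability counts[k] / (i + alpha);
        # a new cluster is opened with probability alpha / (i + alpha).
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)  # open a new cluster (a new "local region")
        else:
            counts[k] += 1
        labels.append(k)
    return labels, counts


labels, counts = crp_assignments(n_points=200, alpha=1.0)
print("number of clusters:", len(counts))
print("cluster sizes:", counts)
```

The number of clusters is an outcome of sampling rather than an input, which is the property the abstract relies on when it says the model selection problem is avoided.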
Source
《电子与信息学报》
EI
CSCD
Peking University Core Journal (北大核心)
2017, No. 12, pp. 2795-2802 (8 pages)
Journal of Electronics & Information Technology
Funding
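National Science Fund for Distinguished Young Scholars (61525105)
National Natural Science Foundation of China (61201292, 61322103, 61372132)
Foundation for the Author of National Excellent Doctoral Dissertation of China (FANEDD-201156)
Natural Science Basic Research Plan of Shaanxi Province (2016JQ6048)
Aeronautical Science Foundation of China (20142081009)
Shanghai Aerospace Science and Technology Innovation Fund (SAST2015009)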
Fund of the Aviation Key Laboratory of Science and Technology on Avionics System RF Integrated Simulation