In commercial applications of phase Doppler anemometry (PDA), the effectiveness of non sphericity of particles is present and the response of PDA system deviates from the theoretical prediction. In this paper, the st...In commercial applications of phase Doppler anemometry (PDA), the effectiveness of non sphericity of particles is present and the response of PDA system deviates from the theoretical prediction. In this paper, the statistic characteristics of PDA signal related to irregular particles is analyzed and a method of statistic classification of irregular particles is proposed.It proves that the parameter of PDA signal for irregular particles is an unbiased estimation for spherical ones, the mean of the phase difference is in direct proportion to the mean diameter of particles and the standard deviation of the phase difference increases linearly with the standard deviation of irregular particles. As an application of the identification of irregular objects, fuzzy patterns and similarities of haemocytes are used to recognize and quantify cell samples.The statistic classification of particles is more significant in practice.展开更多
In order to classify the Intemet traffic of different Internet applications more quickly, two open Internet traffic traces, Auckland I1 and UNIBS traffic traces, are employed as study objects. Eight earliest packets w...In order to classify the Intemet traffic of different Internet applications more quickly, two open Internet traffic traces, Auckland I1 and UNIBS traffic traces, are employed as study objects. Eight earliest packets with non-zero flow payload sizes are selected and their payload sizes are used as the early-stage flow features. Such features can be easily and rapidly extracted at the early flow stage, which makes them outstanding. The behavior patterns of different Intemet applications are analyzed by visualizing the early-stage packet size values. Analysis results show that most Internet applications can reflect their own early packet size behavior patterns. Early packet sizes are assumed to carry enough information for effective traffic identification. Three classical machine learning classifiers, classifier, naive Bayesian trees, i. e., the naive Bayesian and the radial basis function neural networks, are used to validate the effectiveness of the proposed assumption. The experimental results show that the early stage packet sizes can be used as features for traffic identification.展开更多
Some dimensionality reduction (DR) approaches based on support vector machine (SVM) are proposed. But the acquirement of the projection matrix in these approaches only considers the between-class margin based on S...Some dimensionality reduction (DR) approaches based on support vector machine (SVM) are proposed. But the acquirement of the projection matrix in these approaches only considers the between-class margin based on SVM while ignoring the within-class information in data. This paper presents a new DR approach, call- ed the dimensionality reduction based on SVM and LDA (DRSL). DRSL considers the between-class margins from SVM and LDA, and the within-class compactness from LDA to obtain the projection matrix. As a result, DRSL can realize the combination of the between-class and within-class information and fit the between-class and within-class structures in data. Hence, the obtained projection matrix increases the generalization ability of subsequent classification techniques. Experiments applied to classification techniques show the effectiveness of the proposed method.展开更多
A novel regularization method -- discriminative regularization (DR)is presented. The method provides a general way to incorporate the prior knowledge for the classification. By introducing the prior information into...A novel regularization method -- discriminative regularization (DR)is presented. The method provides a general way to incorporate the prior knowledge for the classification. By introducing the prior information into the regularization term, DR is used to minimize the empirical loss between the desired and actual outputs, as well as maximize the inter-class separability and minimize the intra-class compactness in the output space simultane- ously. Furthermore, by embedding equality constraints in the formulation, the solution of DR can solve a set of linear equations. Classification experiments show the superiority of the proposed DR.展开更多
Noise-assisted multivariate empirical mode decomposition(NA-MEMD) is suitable to analyze multichannel electroencephalography(EEG) signals of non-stationarity and non-linearity natures due to the fact that it can provi...Noise-assisted multivariate empirical mode decomposition(NA-MEMD) is suitable to analyze multichannel electroencephalography(EEG) signals of non-stationarity and non-linearity natures due to the fact that it can provide a highly localized time-frequency representation.For a finite set of multivariate intrinsic mode functions(IMFs) decomposed by NA-MEMD,it still raises the question on how to identify IMFs that contain the information of inertest in an efficient way,and conventional approaches address it by use of prior knowledge.In this work,a novel identification method of relevant IMFs without prior information was proposed based on NA-MEMD and Jensen-Shannon distance(JSD) measure.A criterion of effective factor based on JSD was applied to select significant IMF scales.At each decomposition scale,three kinds of JSDs associated with the effective factor were evaluated:between IMF components from data and themselves,between IMF components from noise and themselves,and between IMF components from data and noise.The efficacy of the proposed method has been demonstrated by both computer simulations and motor imagery EEG data from BCI competition IV datasets.展开更多
Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonab...Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonable constraint to reduce the number of unknown parameters used to model a classifier.In this paper, we generalize the vector-based learning algorithm TWin Support Vector Machine(TWSVM) to the tensor-based method TWin Support Tensor Machines(TWSTM), which accepts general tensors as input.To examine the effectiveness of TWSTM, we implement the TWSTM method for Microcalcification Clusters(MCs) detection.In the tensor subspace domain, the MCs detection procedure is formulated as a supervised learning and classification problem, and TWSTM is used as a classifier to make decision for the presence of MCs or not.A large number of experiments were carried out to evaluate and compare the performance of the proposed MCs detection algorithm.By comparison with TWSVM, the tensor version reduces the overfitting problem.展开更多
Previously, a novel classifier called Kernel-based Nonlinear Discriminator (KND) was proposed to discriminate a pattern class from other classes by minimizing mean effect of the latter. To consider the effect of the t...Previously, a novel classifier called Kernel-based Nonlinear Discriminator (KND) was proposed to discriminate a pattern class from other classes by minimizing mean effect of the latter. To consider the effect of the target class, this paper introduces an oblique projection algorithm to determine the coefficients of a KND so that it is extended to a new version called extended KND (eKND). In eKND construction, the desired output vector of the target class is obliquely projected onto the relevant subspace along the subspace related to other classes. In addition, a simple technique is proposed to calculate the associated oblique projection operator. Experimental results on handwritten digit recognition show that the algorithm performes better than a KND classifier and some other commonly used classifiers.展开更多
By utilizing hyperbolic tangent function,we constructed a novel hyperbolic tangent loss function to reduce the influences of outliers on support vector machine(SVM)classification problem.The new loss function not only...By utilizing hyperbolic tangent function,we constructed a novel hyperbolic tangent loss function to reduce the influences of outliers on support vector machine(SVM)classification problem.The new loss function not only limits the maximal loss value of outliers but also is smooth.Hyperbolic tangent SVM(HTSVM)is then proposed based on the new loss function.The experimental results show that HTSVM reduces the effects of outliers and gives better generalization performance than the classical SVM on both artificial data and UCI data sets.Therefore,the proposed hyperbolic tangent loss function and HTSVM are both effective.展开更多
When using AdaBoost to select discriminant features from some feature space (e.g. Gabor feature space) for face recognition, cascade structure is usually adopted to leverage the asymmetry in the distribution of positi...When using AdaBoost to select discriminant features from some feature space (e.g. Gabor feature space) for face recognition, cascade structure is usually adopted to leverage the asymmetry in the distribution of positive and negative samples. Each node in the cascade structure is a classifier trained by AdaBoost with an asymmetric learning goal of high recognition rate but only moderate low false positive rate. One limitation of AdaBoost arises in the context of skewed example distribution and cascade classifiers: AdaBoost minimizes the classification error, which is not guaranteed to achieve the asymmetric node learning goal. In this paper, we propose to use the asymmetric AdaBoost (Asym- Boost) as a mechanism to address the asymmetric node learning goal. Moreover, the two parts of the selecting features and forming ensemble classifiers are decoupled, both of which occur simultaneously in AsymBoost and AdaBoost. Fisher Linear Discriminant Analysis (FLDA) is used on the selected fea- tures to learn a linear discriminant function that maximizes the separability of data among the different classes, which we think can improve the recognition performance. The proposed algorithm is dem- onstrated with face recognition using a Gabor based representation on the FERET database. Ex- perimental results show that the proposed algorithm yields better recognition performance than AdaBoost itself.展开更多
Based on pattern recognition theory and neural network technology, moving objects automatic detection and classification method integrating advanced wavelet analysis are discussed in detail. An algorithm of moving tar...Based on pattern recognition theory and neural network technology, moving objects automatic detection and classification method integrating advanced wavelet analysis are discussed in detail. An algorithm of moving targets pattern recognition on the combination of inter-frame difference and wavelet neural network is presented. The experimental results indicate that the designed BP wavelet network using this algorithm can recognize and classify moving targets rapidly and effectively.展开更多
Clinical data have shown that survival rates vary considerably among brain tumor patients,according to the type and grade of the tumor.Metabolite profiles of intact tumor tissues measured with high-resolution magic-an...Clinical data have shown that survival rates vary considerably among brain tumor patients,according to the type and grade of the tumor.Metabolite profiles of intact tumor tissues measured with high-resolution magic-angle spinning proton nuclear magnetic resonance spectroscopy (HRMAS 1H NMRS) can provide important information on tumor biology and metabolism.These metabolic fingerprints can then be used for tumor classification and grading,with great potential value for tumor diagnosis.We studied the metabolic characteristics of 30 neuroepithelial tumor biopsies,including two astrocytomas (grade I),12 astrocytomas (grade II),eight anaplastic astrocytomas (grade III),three glioblastomas (grade IV) and five medulloblastomas (grade IV) from 30 patients using HRMAS 1H NMRS.The results were correlated with pathological features using multivariate data analysis,including principal component analysis (PCA).There were significant differences in the levels of N-acetyl-aspartate (NAA),creatine,myo-inositol,glycine and lactate between tumors of different grades (P<0.05).There were also significant differences in the ratios of NAA/creatine,lactate/creatine,myo-inositol/creatine,glycine/creatine,scyllo-inositol/creatine and alanine/creatine (P<0.05).A soft independent modeling of class analogy model produced a predictive accuracy of 87% for high-grade (grade III-IV) brain tumors with a sensitivity of 87% and a specificity of 93%.HRMAS 1H NMR spectroscopy in conjunction with pattern recognition thus provides a potentially useful tool for the rapid and accurate classification of human brain tumor grades.展开更多
文摘In commercial applications of phase Doppler anemometry (PDA), the effectiveness of non sphericity of particles is present and the response of PDA system deviates from the theoretical prediction. In this paper, the statistic characteristics of PDA signal related to irregular particles is analyzed and a method of statistic classification of irregular particles is proposed.It proves that the parameter of PDA signal for irregular particles is an unbiased estimation for spherical ones, the mean of the phase difference is in direct proportion to the mean diameter of particles and the standard deviation of the phase difference increases linearly with the standard deviation of irregular particles. As an application of the identification of irregular objects, fuzzy patterns and similarities of haemocytes are used to recognize and quantify cell samples.The statistic classification of particles is more significant in practice.
基金The Program for New Century Excellent Talents in University(No.NCET-11-0565)the Fundamental Research Funds for the Central Universities(No.K13JB00160,2012JBZ010,2011JBM217)+2 种基金the Ph.D.Programs Foundation of Ministry of Education of China(No.20120009120010)the Program for Innovative Research Team in University of Ministry of Education of China(No.IRT201206)the Natural Science Foundation of Shandong Province(No.ZR2012FM010,ZR2011FZ001)
文摘In order to classify the Intemet traffic of different Internet applications more quickly, two open Internet traffic traces, Auckland I1 and UNIBS traffic traces, are employed as study objects. Eight earliest packets with non-zero flow payload sizes are selected and their payload sizes are used as the early-stage flow features. Such features can be easily and rapidly extracted at the early flow stage, which makes them outstanding. The behavior patterns of different Intemet applications are analyzed by visualizing the early-stage packet size values. Analysis results show that most Internet applications can reflect their own early packet size behavior patterns. Early packet sizes are assumed to carry enough information for effective traffic identification. Three classical machine learning classifiers, classifier, naive Bayesian trees, i. e., the naive Bayesian and the radial basis function neural networks, are used to validate the effectiveness of the proposed assumption. The experimental results show that the early stage packet sizes can be used as features for traffic identification.
文摘Some dimensionality reduction (DR) approaches based on support vector machine (SVM) are proposed. But the acquirement of the projection matrix in these approaches only considers the between-class margin based on SVM while ignoring the within-class information in data. This paper presents a new DR approach, call- ed the dimensionality reduction based on SVM and LDA (DRSL). DRSL considers the between-class margins from SVM and LDA, and the within-class compactness from LDA to obtain the projection matrix. As a result, DRSL can realize the combination of the between-class and within-class information and fit the between-class and within-class structures in data. Hence, the obtained projection matrix increases the generalization ability of subsequent classification techniques. Experiments applied to classification techniques show the effectiveness of the proposed method.
基金Supported by the National Natural Science Foundation of China(60773061)the Natural Science Foundation of Jiangsu Province(BK2008381)~~
文摘A novel regularization method -- discriminative regularization (DR)is presented. The method provides a general way to incorporate the prior knowledge for the classification. By introducing the prior information into the regularization term, DR is used to minimize the empirical loss between the desired and actual outputs, as well as maximize the inter-class separability and minimize the intra-class compactness in the output space simultane- ously. Furthermore, by embedding equality constraints in the formulation, the solution of DR can solve a set of linear equations. Classification experiments show the superiority of the proposed DR.
基金Projects(61201302,61372023,61671197)supported by the National Natural Science Foundation of ChinaProject(201308330297)supported by the State Scholarship Fund of ChinaProject(LY15F010009)supported by Zhejiang Provincial Natural Science Foundation,China
文摘Noise-assisted multivariate empirical mode decomposition(NA-MEMD) is suitable to analyze multichannel electroencephalography(EEG) signals of non-stationarity and non-linearity natures due to the fact that it can provide a highly localized time-frequency representation.For a finite set of multivariate intrinsic mode functions(IMFs) decomposed by NA-MEMD,it still raises the question on how to identify IMFs that contain the information of inertest in an efficient way,and conventional approaches address it by use of prior knowledge.In this work,a novel identification method of relevant IMFs without prior information was proposed based on NA-MEMD and Jensen-Shannon distance(JSD) measure.A criterion of effective factor based on JSD was applied to select significant IMF scales.At each decomposition scale,three kinds of JSDs associated with the effective factor were evaluated:between IMF components from data and themselves,between IMF components from noise and themselves,and between IMF components from data and noise.The efficacy of the proposed method has been demonstrated by both computer simulations and motor imagery EEG data from BCI competition IV datasets.
基金Supported by the National Natural Science Foundation of China (No. 60771068)the Natural Science Basic Research Plan in Shaanxi Province of China (No. 2007F248)
文摘Tensor representation is useful to reduce the overfitting problem in vector-based learning algorithm in pattern recognition.This is mainly because the structure information of objects in pattern analysis is a reasonable constraint to reduce the number of unknown parameters used to model a classifier.In this paper, we generalize the vector-based learning algorithm TWin Support Vector Machine(TWSVM) to the tensor-based method TWin Support Tensor Machines(TWSTM), which accepts general tensors as input.To examine the effectiveness of TWSTM, we implement the TWSTM method for Microcalcification Clusters(MCs) detection.In the tensor subspace domain, the MCs detection procedure is formulated as a supervised learning and classification problem, and TWSTM is used as a classifier to make decision for the presence of MCs or not.A large number of experiments were carried out to evaluate and compare the performance of the proposed MCs detection algorithm.By comparison with TWSVM, the tensor version reduces the overfitting problem.
基金Supported by the key project of Chinese Ministry of Education(No.1051150)
文摘Previously, a novel classifier called Kernel-based Nonlinear Discriminator (KND) was proposed to discriminate a pattern class from other classes by minimizing mean effect of the latter. To consider the effect of the target class, this paper introduces an oblique projection algorithm to determine the coefficients of a KND so that it is extended to a new version called extended KND (eKND). In eKND construction, the desired output vector of the target class is obliquely projected onto the relevant subspace along the subspace related to other classes. In addition, a simple technique is proposed to calculate the associated oblique projection operator. Experimental results on handwritten digit recognition show that the algorithm performes better than a KND classifier and some other commonly used classifiers.
基金National Natural Science Foundation of China(No.60705004)
文摘By utilizing hyperbolic tangent function,we constructed a novel hyperbolic tangent loss function to reduce the influences of outliers on support vector machine(SVM)classification problem.The new loss function not only limits the maximal loss value of outliers but also is smooth.Hyperbolic tangent SVM(HTSVM)is then proposed based on the new loss function.The experimental results show that HTSVM reduces the effects of outliers and gives better generalization performance than the classical SVM on both artificial data and UCI data sets.Therefore,the proposed hyperbolic tangent loss function and HTSVM are both effective.
基金Supported by the NSFC-RGC Joint Research Fund (No.60518002)Talent Promotion Foundation of Anhui Province (No.2004Z026)The Science Research Fund of MOE-Microsoft Key Laboratory of Multimedia Com-puting and Communication (No.05071811)
文摘When using AdaBoost to select discriminant features from some feature space (e.g. Gabor feature space) for face recognition, cascade structure is usually adopted to leverage the asymmetry in the distribution of positive and negative samples. Each node in the cascade structure is a classifier trained by AdaBoost with an asymmetric learning goal of high recognition rate but only moderate low false positive rate. One limitation of AdaBoost arises in the context of skewed example distribution and cascade classifiers: AdaBoost minimizes the classification error, which is not guaranteed to achieve the asymmetric node learning goal. In this paper, we propose to use the asymmetric AdaBoost (Asym- Boost) as a mechanism to address the asymmetric node learning goal. Moreover, the two parts of the selecting features and forming ensemble classifiers are decoupled, both of which occur simultaneously in AsymBoost and AdaBoost. Fisher Linear Discriminant Analysis (FLDA) is used on the selected fea- tures to learn a linear discriminant function that maximizes the separability of data among the different classes, which we think can improve the recognition performance. The proposed algorithm is dem- onstrated with face recognition using a Gabor based representation on the FERET database. Ex- perimental results show that the proposed algorithm yields better recognition performance than AdaBoost itself.
文摘Based on pattern recognition theory and neural network technology, moving objects automatic detection and classification method integrating advanced wavelet analysis are discussed in detail. An algorithm of moving targets pattern recognition on the combination of inter-frame difference and wavelet neural network is presented. The experimental results indicate that the designed BP wavelet network using this algorithm can recognize and classify moving targets rapidly and effectively.
基金supported by the National Natural Science Foundation of China (Grant Nos. 20573132 and 20575074)China Postdoctoral Science Foundation (Grant No. 20090450065)State Key Laboratory of Mag-netic Resonance and Atomic and Molecular Physics (Grant No. T152805)
文摘Clinical data have shown that survival rates vary considerably among brain tumor patients,according to the type and grade of the tumor.Metabolite profiles of intact tumor tissues measured with high-resolution magic-angle spinning proton nuclear magnetic resonance spectroscopy (HRMAS 1H NMRS) can provide important information on tumor biology and metabolism.These metabolic fingerprints can then be used for tumor classification and grading,with great potential value for tumor diagnosis.We studied the metabolic characteristics of 30 neuroepithelial tumor biopsies,including two astrocytomas (grade I),12 astrocytomas (grade II),eight anaplastic astrocytomas (grade III),three glioblastomas (grade IV) and five medulloblastomas (grade IV) from 30 patients using HRMAS 1H NMRS.The results were correlated with pathological features using multivariate data analysis,including principal component analysis (PCA).There were significant differences in the levels of N-acetyl-aspartate (NAA),creatine,myo-inositol,glycine and lactate between tumors of different grades (P<0.05).There were also significant differences in the ratios of NAA/creatine,lactate/creatine,myo-inositol/creatine,glycine/creatine,scyllo-inositol/creatine and alanine/creatine (P<0.05).A soft independent modeling of class analogy model produced a predictive accuracy of 87% for high-grade (grade III-IV) brain tumors with a sensitivity of 87% and a specificity of 93%.HRMAS 1H NMR spectroscopy in conjunction with pattern recognition thus provides a potentially useful tool for the rapid and accurate classification of human brain tumor grades.