期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Distributed and Weighted Extreme Learning Machine for Imbalanced Big Data Learning 被引量:10
1
作者 Zhiqiong Wang Junchang Xin +4 位作者 Hongxu Yang Shuo Tian Ge Yu Chenren Xu Yudong Yao 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第2期160-173,共14页
The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced ... The Extreme Learning Machine(ELM) and its variants are effective in many machine learning applications such as Imbalanced Learning(IL) or Big Data(BD) learning. However, they are unable to solve both imbalanced and large-volume data learning problems. This study addresses the IL problem in BD applications. The Distributed and Weighted ELM(DW-ELM) algorithm is proposed, which is based on the Map Reduce framework. To confirm the feasibility of parallel computation, first, the fact that matrix multiplication operators are decomposable is illustrated.Then, to further improve the computational efficiency, an Improved DW-ELM algorithm(IDW-ELM) is developed using only one Map Reduce job. The successful operations of the proposed DW-ELM and IDW-ELM algorithms are finally validated through experiments. 展开更多
关键词 weighted Extreme learning Machine(ELM) imbalanced big data MapReduce framework user-defined counter
原文传递
A Novel Method for Prediction of Protein Domain Using Distance-Based Maximal Entropy
2
作者 Shu-xue Zou Yan-xin Huang Yan Wang Chun-guang Zhou 《Journal of Bionic Engineering》 SCIE EI CSCD 2008年第3期215-223,共9页
Detecting the boundaries of protein domains is an important and challenging task in both experimental and computational structural biology. In this paper, a promising method for detecting the domain structure of a pro... Detecting the boundaries of protein domains is an important and challenging task in both experimental and computational structural biology. In this paper, a promising method for detecting the domain structure of a protein from sequence information alone is presented. The method is based on analyzing multiple sequence alignments derived from a database search. Multiple measures are defined to quantify the domain information content of each position along the sequence. Then they are combined into a single predictor using support vector machine. What is more important, the domain detection is first taken as an imbal- anced data learning problem. A novel undersampling method is proposed on distance-based maximal entropy in the feature space of Support Vector Machine (SVM). The overall precision is about 80%. Simulation results demonstrate that the method can help not only in predicting the complete 3D structure of a protein but also in the machine learning system on general im- balanced datasets. 展开更多
关键词 protein domain boundary SVM imbalanced data learning distance-based maximal entropy
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部