Objective To build a prostate cancer(PCa) risk prediction model based on common clinical indicators to provide a theoretical basis for the diagnosis and treatment of PCa and to evaluate the value of artificial intelli...Objective To build a prostate cancer(PCa) risk prediction model based on common clinical indicators to provide a theoretical basis for the diagnosis and treatment of PCa and to evaluate the value of artificial intelligence(AI) technology under healthcare data platforms.Methods After preprocessing of the data from Population Health Data Archive,smuothly clipped absolute deviation(SCAD) was used to select features.Random forest(RF),support vector machine(SVM),back propagation neural network(BP),and convolutional neural network(CNN) were used to predict the risk of PCa,among which BP and CNN were used on the enhanced data by SMOTE.The performances of models were compared using area under the curve(AUC) of the receiving operating characteristic curve.After the optimal model was selected,we used the Shiny to develop an online calculator for PCa risk prediction based on predictive indicators.Results Inorganic phosphorus,triglycerides,and calcium were closely related to PCa in addition to the volume of fragmented tissue and free prostate-specific antigen(PSA).Among the four models,RF had the best performance in predicting PCa(accuracy:96.80%;AUC:0.975,95% CI:0.964-0.986).Followed by BP(accuracy:85.36%;AUC:0.892,95% CI:0.849-0.934) and SVM(accuracy:82.67%;AUC:0.824,95% CI:0.805-0.844).CNN performed worse(accuracy:72.37%;AUC:0.724,95% CI:0.670-0.779).An online platform for PCa risk prediction was developed based on the RF model and the predictive indicators.Conclusions This study revealed the application value of traditional machine learning and deep learning models in disease risk prediction under healthcare data platform,proposed new ideas for PCa risk prediction in patients suspected for PCa and had undergone core needle biopsy.Besides,the online calculation may enhance the practicability of AI prediction technology and facilitate medical diagnosis.展开更多
文摘Objective To build a prostate cancer(PCa) risk prediction model based on common clinical indicators to provide a theoretical basis for the diagnosis and treatment of PCa and to evaluate the value of artificial intelligence(AI) technology under healthcare data platforms.Methods After preprocessing of the data from Population Health Data Archive,smuothly clipped absolute deviation(SCAD) was used to select features.Random forest(RF),support vector machine(SVM),back propagation neural network(BP),and convolutional neural network(CNN) were used to predict the risk of PCa,among which BP and CNN were used on the enhanced data by SMOTE.The performances of models were compared using area under the curve(AUC) of the receiving operating characteristic curve.After the optimal model was selected,we used the Shiny to develop an online calculator for PCa risk prediction based on predictive indicators.Results Inorganic phosphorus,triglycerides,and calcium were closely related to PCa in addition to the volume of fragmented tissue and free prostate-specific antigen(PSA).Among the four models,RF had the best performance in predicting PCa(accuracy:96.80%;AUC:0.975,95% CI:0.964-0.986).Followed by BP(accuracy:85.36%;AUC:0.892,95% CI:0.849-0.934) and SVM(accuracy:82.67%;AUC:0.824,95% CI:0.805-0.844).CNN performed worse(accuracy:72.37%;AUC:0.724,95% CI:0.670-0.779).An online platform for PCa risk prediction was developed based on the RF model and the predictive indicators.Conclusions This study revealed the application value of traditional machine learning and deep learning models in disease risk prediction under healthcare data platform,proposed new ideas for PCa risk prediction in patients suspected for PCa and had undergone core needle biopsy.Besides,the online calculation may enhance the practicability of AI prediction technology and facilitate medical diagnosis.