Abstract
Background: During the COVID-19 pandemic, clinical trial recruitment could not be carried out because of travel restrictions, transmission risks, and other factors, which stalled many ongoing or upcoming clinical trials.
Objective: To develop an intelligent screening tool, based on artificial intelligence technology, that rapidly prescreens potential patients for phase I solid tumor drug clinical trials.
Methods: A total of 429 screening process records were collected from 27 phase I solid tumor drug clinical trials at the First Affiliated Hospital of Bengbu Medical College from April 2018 to May 2021. Features of the experimental data were analyzed, and collinearity (principal component analysis) and strong correlation (χ² test) among features were eliminated. XGBoost, random forest, and naive Bayes were used to determine the importance weights of the features. Finally, prescreening models were constructed using classification machine learning algorithms, and the optimal model was selected.
Results: Among the 429 screening records, 33 were generated by repeated subject participation in different clinical trials; of the remaining 396 screening records, 246 (62.12%) were screened successfully. The gold standard for subject screening success was the final judgment made by the principal investigator (PI) based on the clinical trial protocol. A Venn diagram was used to identify the intersection of important features across the machine learning algorithms. Intersecting the top 15 feature variables of the different feature screening models yielded 9 common variables: age, sex, distance from residence to the central institution, tumor histology, tumor stage, tumorectomy, interval from diagnosis/surgery to screening, chemotherapy, and Eastern Cooperative Oncology Group (ECOG) score. To select the optimal subset, the 9 important feature variables were expanded to 12-feature and 15-feature subsets, and the performance of the different feature subsets under different machine learning models was validated. The results showed that optimal performance, accuracy, and practicability were achieved using XGBoost with the 12-feature subset. The final model could accurately predict screening success in both internal (AUC=0.895) and external (AUC=0.796) validation and has been transformed into a convenient tool to facilitate its application in clinical settings. Subjects whose predicted probability in the final model was greater than or equal to the threshold had a greater probability of being successfully screened.
Conclusion: Based on the optimal model, we created an online prediction calculator and visualization app, the Intelligent Screening Service Platform (ISSP), which can rapidly screen patients for phase I solid tumor drug clinical trials. The ISSP can effectively overcome the barriers of distance and time. On a mobile terminal, clinical trial projects can be matched with patients and the rapid screening of clinical trial subjects can be completed, so that more clinical trial subjects are obtained. As an auxiliary tool, the ISSP optimizes the screening process of clinical trials and provides more convenient services for clinical investigators and patients.
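The feature-intersection step described in the Results (taking each model's top-ranked features and keeping only those common to all models) can be illustrated with a minimal sketch. It assumes that per-model importance scores have already been computed by some means; the abstract does not say how they were derived, so the function names `top_k_features` and `common_top_features`, the feature keys, and all numeric scores below are hypothetical and are not the study's values.

```python
from typing import Dict, List, Set


def top_k_features(importances: Dict[str, float], k: int) -> List[str]:
    """Return the k feature names with the largest importance scores."""
    ranked = sorted(importances.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:k]]


def common_top_features(per_model: Dict[str, Dict[str, float]], k: int) -> Set[str]:
    """Intersect the top-k feature sets of every model (the core of the Venn diagram)."""
    top_sets = [set(top_k_features(scores, k)) for scores in per_model.values()]
    return set.intersection(*top_sets) if top_sets else set()


# Toy, made-up importance scores for illustration only (NOT the study's data).
toy_scores = {
    "xgboost": {"age": 0.30, "ecog_score": 0.25, "tumor_stage": 0.20, "sex": 0.05},
    "random_forest": {"age": 0.28, "tumor_stage": 0.27, "ecog_score": 0.10, "sex": 0.15},
    "naive_bayes": {"ecog_score": 0.40, "age": 0.35, "sex": 0.20, "tumor_stage": 0.05},
}

# The study intersected the top 15 features; k=2 is used here only because the toy data is small.
common = common_top_features(toy_scores, k=2)
print(sorted(common))
```

With real data, `k` would be 15 and each inner dictionary would hold one score per candidate feature; the resulting common set could then be extended with further ranked features to form larger subsets for comparison.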
Funding
Supported by the Science Key Project of Bengbu Medical College (No. 2022byzd068);
the University Synergy Innovation Program of Anhui Province (No. GXXT-2022-058).
The Anhui Provincial University Natural Science Key Project (No. 2022AH051458) provided us with language polishing.