Ayuda
Ir al contenido

Dialnet


Resumen de Predicting Positive Repeat Prostate Biopsy Outcomes: Comparison of Machine Learning Approaches to Identify Key Parameters and Optimal Algorithms

Xinru Zhang, Chao Feng, Xiao Bai, Xufeng Peng, Qian Guo, Lei Chen, Jingdong Xue

  • Background: Innovative strategies are necessary to enhance prostate cancer diagnosis whilst reducing unnecessary and invasive repeat biopsies. This study aimed to determine the significant parameters affecting repeat prostate biopsy outcomes and develop an optimal machine learning algorithm for predicting positive repeat prostate biopsy results.

    Methods: We analysed data from 174 men who underwent repeated prostate biopsies between January 2008 and December 2022. Systematic multiple-core, ultrasound-targeted prostate biopsies were performed, each two samples from prostatic transitional zone and peripheral zone were obtained bilaterally. Clinical characteristics were collected, including patients’ age, initial prostate volume, prostate-specific antigen (PSA) level, free PSA (fPSA)/PSA ratio, biopsy core numbers, pathological result; The time interval between first and latest prostate biopsy; Latest PSA level, fPSA/PSA ratio, biopsy core numbers; And final pathological diagnosis. Six feature selection methods, namely, variable ranking, correlation matrix, random forest regression, recursive feature elimination, cross-validation and forward selection, were employed to identify key influencing factors for repeat biopsy outcomes. Subsequently, the performance of seven machine learning algorithms, namely, multivariable logistic regression (LR), K-nearest neighbour search (KNN), support vector classification (SVC), decision tree (DT), random forest classifier (RF), naïve Bayes classifier (NBC) and gradient booster tree (GB), was assessed based on accuracy, misclassification, recall, specificity, precision and receiver operating characteristic (ROC) area under the curve (AUC). About 70% of patients were used as the training dataset, meanwhile remaining 30% as validation dataset.

    Results: 52 were ultimately diagnosed with prostate cancer following the final pathological examination. The remaining 122 patients were negative. Amongst six feature selection methods, the variable ranking emerged as the most effective method for identifying the essential factors influencing repeat biopsy results. Amongst the machine learning algorithms, SVC demonstrated superior accuracy (0.7365), low recall rate (0.2500) and low misclassification rate (0.2093) for both patients with cancer and healthy individuals. Meanwhile, the ROC curve of SVC showed a relatively high AUC (0.6871).

    Conclusions: We developed an SVC-based machine learning algorithm for predicting positive repeat prostate biopsy results. Our analysis revealed that initial and latest prostate volumes, initial and latest PSA levels, latest fPSA/PSA ratio and age are significant factors for this model.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus