2025 Proffered Presentations
S042: USE OF MACHINE-LEARNING TO PREDICT GROSS TOTAL RESECTION OF NON-FUNCTIONING PITUITARY ADENOMAS: A SINGLE INSTITUTION SERIES
Benjamin B Fixman, BS; Ishan Shah, BS; Apurva Prasad, BA; Gage A Guerra, BS; Jeffrey J Feng, MD; Robert G Briggs, MD; Gabriel Zada, MD, MS; University of Southern California
Pituitary adenomas are a common intracranial neoplasm for which surgical resection remains the definitive treatment. Though the decision to operate is multifactorial, the likelihood of GTR is a particularly notable consideration. The rapid advancement of machine learning (ML) presents an opportunity to determine the likelihood of GTR based on pre-operative characteristics and aid in preoperative planning for these patients.
A prospectively maintained database of 219 endoscopic operations completed by one surgeon at our institution between 2013-2023 was first analyzed with a logistic regression model for statistical inference of potentially predictive features. 4 features, including gender, prior operation, tumor diameter, and Knosp score were selected for model training and prediction. Next, the dataset was stratified by outcome and split 80:20 (training/validation:testing). Various models, including Logistic Regression, K-Nearest Neighbors, Random Forest, and Gradient Boosting were trained, 5-fold cross-validated, and hyperparameter tuned to optimize classification accuracy using mlr3 in R. Finally, the optimal model was trained on the full training/validation dataset and used to make predictions on the test set.
Gradient-boosting, which employs successive iterations of decision trees trained on the residuals of previous trees, proved optimal at minimizing cross-validation error (0.27). On an independent test set, this model achieved an error: 0.23, sensitivity: 0.83, specificity: 0.64, AUC: 0.76, PPV: 0.83, NPV: 0.64.
The use of ML to pre-operatively predict probability of GTR may help surgeons and patients make informed decisions regarding risk and benefit of surgical resection. Our models performed well on an independent test set, achieving 77% accuracy. The PPV and NPV highlight their utility.
Gender | Reoperation | Diameter | Knosp Score | GTR | Patients | |
---|---|---|---|---|---|---|
N (%) [SD] | M: 113 (51.6) | No: 175 (79.9) | Median: 25mm [9.93] |
0: 33 (15.1) Low:109 (49.8) High: 77 (35.2) |
Yes: 147 (67.1) | 219 |
Variable | Coefficient | P Value |
---|---|---|
Gender | -0.59 | 0.098 |
Reoperation | -0.34 | 0.40 |
Maximal Tumor Diameter | -0.071 | 0.0010 |
Knosp (Low) | 0.58 | 0.27 |
Knosp (High) | -1.03 | 0.061 |
Model | Error | Sensitivity | Specificity | AUC |
---|---|---|---|---|
Logistic Regression | 0.27 | 0.89 | 0.43 | 0.78 |
K Nearest Neighbors | 0.29 | 0.90 | 0.34 | 0.74 |
Random Forest | 0.30 | 0.89 | 0.32 | 0.75 |
Gradient Boosting | 0.27 | 0.87 | 0.49 | 0.76 |
Model | Error | Sensitivity | Specificity | AUC | PPV | NPV |
---|---|---|---|---|---|---|
Gradient Boosting | 0.23 | 0.83 | 0.64 | 0.76 | 0.83 | 0.64 |