Development and validation of supervised machine learning multivariable prediction models for the diagnosis of Pneumocystis jirovecii pneumonia using nasopharyngeal swab PCR in adults in a low-HIV prevalence setting.

Chew R., Woods ML., Paterson DL.

BackgroundThe global burden of the opportunistic fungal disease Pneumocystis jirovecii pneumonia (PJP) remains substantial. Polymerase chain reaction (PCR) on nasopharyngeal swabs (NPS) has high specificity and may be a viable alternative to the gold standard diagnostic of PCR on invasively collected lower respiratory tract specimens, but has low sensitivity. Sensitivity may be improved by incorporating NPS PCR results into machine learning models.MethodsThree supervised multivariable diagnostic models (random forest, logistic regression and extreme gradient boosting) were constructed and validated using a 111-person Australian dataset. The predictors were age, gender, immunosuppression type and NPS PCR result. Model performance metrics such as accuracy, sensitivity, specificity and predictive values were compared to select the best-performing model.ResultsThe logistic regression model performed best, with 80% accuracy, improving sensitivity to 86% and maintaining acceptable specificity of 70%. Using this model, positive and negative NPS PCR results indicated post-test probabilities of 84% (likely PJP) and 26% (unlikely PJP), respectively.ConclusionsThe logistic regression model should be externally validated in a wider range of settings. As the predictors are simple, routinely collected patient variables, this model may represent a diagnostic advance suitable for settings where collection of lower respiratory tract specimens is difficult but PCR is available.

DOI

10.1093/inthealth/ihae052

Type

Journal article

Publication Date

2025-09-01T00:00:00+00:00

Volume

17

Pages

804 - 808

Total pages

4

Addresses

Mathematical and Economic Modelling Department, Mahidol Oxford Tropical Medicine Research Unit, c/o Faculty of Tropical Medicine, Mahidol University, 3rd floor, 60th Anniversary Chalermprakiat Building, 420/6 Ratchawithi Road, Ratchathewi, Bangkok 10400, Thailand.

Keywords

Nasopharynx, Humans, Pneumocystis carinii, Pneumonia, Pneumocystis, HIV Infections, Prevalence, Logistic Models, Sensitivity and Specificity, Polymerase Chain Reaction, Adult, Aged, Middle Aged, Australia, Female, Male, Supervised Machine Learning

Permalink More information Close