Skip to main content

Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models

Abstract

Background

Postoperative pulmonary infection (POI) is strongly associated with a poor prognosis and has a high incidence in elderly patients undergoing major surgery. Machine learning (ML) algorithms are increasingly being used in medicine, but the predictive role of logistic regression (LR) and ML algorithms for POI in high-risk populations remains unclear.

Methods

We conducted a retrospective cohort study of older adults undergoing major surgery over a period of six years. The included patients were randomly divided into training and validation sets at a ratio of 7:3. The features selected by the least absolute shrinkage and selection operator regression algorithm were used as the input variables of the ML and LR models. The random forest of multiple interpretable methods was used to interpret the ML models.

Results

Of the 9481 older adults in our study, 951 developed POI. Among the different algorithms, LR performed the best with an AUC of 0.80, whereas the decision tree performed the worst with an AUC of 0.75. Furthermore, the LR model outperformed the other ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%). Despite employing four interpretable methods for RF analysis, there existed a certain degree of inconsistency in the results. Finally, to facilitate clinical application, we established a web-friendly version of the nomogram based on the LR algorithm; In addition, patients were divided into three significantly distinct risk intervals in predicting POI.

Conclusions

Compared with popular ML algorithms, LR was more effective at predicting POI in older patients undergoing major surgery. The constructed nomogram could identify high-risk elderly patients and facilitate perioperative management planning.

Trial registration

The study was retrospectively registered (NCT06491459).

Key Summary Point

• Postoperative pulmonary infection (POI) significantly impacts the prognosis of elderly patients undergoing major surgery, and identifying its risk factors can enhance treatment options.

• This study developed a POI prediction model using logistic regression (LR) and machine learning (ML) algorithms, with LR demonstrating the best performance (AUC of 0.80). Additionally, four methods were employed to explain the feature importance in the ML models.

• The nomogram generated from the LR model offers clinicians a practical tool for identifying high-risk elderly patients and optimizing perioperative management strategies.

Peer Review reports

Introduction

Postoperative pulmonary infection (POI) is frequently linked to poor prognoses, prolonged hospital stays, and increased healthcare costs [1,2,3]. The incidence is between 0.9% and 39%, depending on the population and definition of the study [4, 5]. Due to the loss of physiological organ reserves and the increased colonization of gram-negative bacteria in the upper respiratory tract, older people are more likely to develop POI [6]. In a study of cancer surgery, the hazard ratio for the one-year cumulative prevalence of POI in patients over 65 years of age was as high as 3.7-fold [7]. Owing to the characteristics of major surgery, such as long operation time and repeated ischemia and reperfusion, it is easy to cause changes in the immune cells of the body and produce systemic inflammatory reactions, resulting in lung function damage [3, 8]. Major surgery is associated with poor outcomes, and with the aging of the population, the proportion of elderly patients is gradually growing [9,10,11].

Furthermore, previous studies have shown that changes in circulating inflammatory cells (such as lymphocyte, neutrophil, and monocyte counts) are associated with postoperative complications [12,13,14]. Circulating inflammatory cell counts are readily available markers that may provide valuable information for predicting the risk of postoperative complications in elderly patients. Although some studies have explored the risk factors for POI [15,16,17], few studies with large samples focused on POI after major surgery in older adults.

Machine learning (ML) is becoming increasingly prevalent in the medical field, especially for predicting complications [18]. ML is a branch of artificial intelligence that builds models using vast amounts of medical data, leveraging algorithms that do not assume linearity in the relationships between variables. ML has the advantage of revealing deeper correlations in data and is robust to noisy information [19]. ML parses medical information and can be used to build disease occurrence, diagnosis, and prognosis assessment models to aid clinical decision-making [20, 21]. Leading ML algorithms, such as decision tree (DT), random forest (RF), support vector machine (SVM), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and multilayer perceptron (MLP), represent some of the most widely used models in the field [22]. However, some studies have shown that logistic regression (LR) models do not perform worse than ML models and even outperform ML in predicting complications [23,24,25]. ML faces limitations, including the need for large training datasets and poor interpretability, as the influence of individual variables on model outcomes is often unclear [26]. To our knowledge, although there are studies involving different algorithms, few have investigated the performance of different algorithms in predicting POI in older adults undergoing major surgery.

This study aimed to identify the predictors of POI for perioperative management and to compare the predictive performance of the ML and LR models for POI after major surgery in elderly patients. Furthermore, various interpretable methods were employed to explore the variable importance of ML models.

Materials and methods

We obtained all data from the electronic database system for elderly patients who underwent surgery at the Wuhan Union Hospital from January 2014 to December 2019. In retrospective enrollment, the inclusion criteria were (1) age ≥ 65 years and (2) mechanical ventilation during major surgery. The exclusion criteria were: (1) preoperative tracheal intubation, (2) preoperative pneumonia, (3) organ transplantation, and (4) missing data. According to the literature [27, 28], major surgery is any procedure involving general anesthesia for non-percutaneous, non-endoscopic, and invasive surgery. This study adhered to the principles outlined in the Declaration of Helsinki and was approved by the Ethics Committee of our institution (Ethics Number: 2021 − 0986).

Data collection

We collected appropriate clinical variables based on the literature and clinical experience. Demographic data included age, male sex, body mass index (BMI), history of smoking and alcohol consumption, cancer, chronic obstructive pulmonary disease (COPD), hypertension, coronary artery disease, and diabetes, and laboratory test data included hemoglobin, platelet count, albumin, leukocyte count, lymphocyte count, neutrophil count, monocyte count, blood urea nitrogen (BUN), creatinine, total bilirubin (TBIL), alanine aminotransferase (ALT), and aspartate aminotransferase (AST). Surgery-related factors included type of surgery, emergency surgery, surgical incision, operation time, red blood cell transfusion (RBC transfusion), and colloidal infusion. To ensure the reliability of the data, they were reviewed and reconfirmed by two researchers, and contradictory data were resolved by the research team after discussion.

Primary outcome

The primary outcome was POI incidence during hospitalization. It is defined as new or progressive radiological infiltration accompanied by at least two symptoms: antibiotic treatment, body temperature above 38 °C, increased or decreased white blood cell count, and/or purulent airway secretions [29, 30].

Data preprocessing

Figure 1 shows the detailed process of the data preprocessing. We randomly divided the patients into training and validation sets in a ratio of 7:3. To reduce feature dimensionality and mitigate overfitting, we employed the least absolute shrinkage and selection operator (LASSO) regression algorithm to identify relevant risk factors. In the LASSO regression analysis, selected variables were incorporated into the model, and the regularization parameter λ was determined using the minimum standard error method (1-SE rule) through 10-fold cross-validation. A correlation heat map and variance inflation factor (VIF) were used to access multicollinearity between variables.

Fig. 1
figure 1

The general workflow of this study

Due to the imbalanced distribution of POI in the dataset, random undersampling (RUS) was applied to balance the samples. To prevent data leakage, RUS was performed only on the training set. Dummy coding was applied for unordered multi-categorical variables. For continuous data, we applied standardization to produce data with a mean of 0 and a standard deviation of 1.

Data modeling

The data modelling process is illustrated in Fig. 1. Two distinct classes of modeling algorithms were considered for the building: the traditional LR and ML algorithms. To obtain more convincing results, we selected the following six popular ML algorithms: DT, RF, SVM, GBDT, XGBoost, and MLP.

To optimize model performance, we tuned hyperparameters using grid search with five-fold cross-validation (GridSearchCV). GridSearchCV evaluates all possible parameter combinations through cross-validation and selects the best set based on performance scores. For cross-validation, the training set was split into five equal parts. In each iteration, one part served as the validation set, while the other four were used for training. This process was repeated five times, and the model’s performance was optimized based on the area under the receiver operating characteristic curve (AUC).

The best parameters were applied to the validation set. Model performance was evaluated using AUC and the Brier score, where the Brier score ranges from 0 to 1 (lower values indicate better calibration). Additional metrics, including accuracy, sensitivity, specificity, precision and F1 score were used to compare model performance. The cut-off threshold was determined using clinical experience or the Youden index.

The feature importance attribute with the model, local interpretable model-agnostic explanations (LIME), permutation feature importance, and the Shapley additive explanation (SHAP) method were selected for feature importance analysis. A nomogram was constructed using the LR algorithm. The study population was divided into three groups based on the nomogram and clinical practice. The three groups differed significantly in the predicted risk of POI: low-risk (< 10%), intermediate-risk (10-30%), and high-risk (> 30%).

Data-analysis

Continuous variables are presented as means ± standard deviations or medians (interquartile ranges, IQR), using independent samples t-tests and non-parametric tests, respectively. Categorical variables were expressed as frequencies and percentages and compared using the chi-squared test or Fisher’s exact test.

A P-value < 0.05 (two-sided) was considered statistically significant. The XGBoost algorithm is constructed using the XGBoost package. Other ML algorithms were built using the scikit-learn package. Construction of the ML models was completed using Python (version 3.8.8). The remaining data analyses were performed using R software (version 4.1.1).

Results

Baseline characteristics

This study assessed the eligibility of older adults undergoing major surgery at our hospital over a six-year period. Based on the inclusion and exclusion criteria, 9481 patients were analyzed. The clinical characteristics of the patients are summarized in Table 1. A total of 951 patients developed POI, accounting for 10.03% of the population, while 8530 did not. The median age of the patients with POI was 69 years, and 53.78% were male. Compared to non-POI patients, patients with POI were more male, had more comorbidities, lower hemoglobin and albumin levels, higher levels of inflammatory factors, and longer operation time. In addition, the length of postoperative hospital stay and total length of hospitalization were significantly higher in patients who developed POI (16 vs. 10, P < 0.001; 25 vs. 16, P < 0.001). There were no significant differences between the two cohorts (training and validation sets) for any of the features collected, including the incidence of POI (10.03% vs. 10.02%, P = 1).

Table 1 Baseline characteristics of the cohort

Feature selection

Based on the LASSO regression analysis of the training set, five variables were entered into the final model (Fig. 2). These five variables were COPD, neutrophil count, albumin level, surgical incision, and operation time, which were independently associated with POI (P < 0.05). To explore multicollinearity between the independent variables, we calculated Spearman’s correlation coefficients and visualized them in a heatmap (Fig. 3). The heatmap shows low correlations between the variables. Furthermore, all variance inflation factor values were below 2, confirming minimal multicollinearity. The hyperparameters of the ML models are detailed in Table S1.

Fig. 2
figure 2

LASSO regression plot. (A) Plot of LASSO coefficient paths. (B) 10-fold cross-validation curve for the penalty term. LASSO, least absolute shrinkage and selection operator

Fig. 3
figure 3

The correlation heat map of variables in the final model

Algorithm performance

The LR model achieved the highest AUC of 0.80 on the validation set, while the DT model had the lowest AUC of 0.75 (Fig. 4A). Furthermore, the LR model’s AUC was 0.81 (95% CI: 0.795–0.837) in the training set (Fig. 4B). The results showed that the AUC of LR significantly improved when compared to DT, RF, SVM, GBDT, and XGBoost (DT vs. LR: ∆AUC = -0.045, P < 0.001; RF vs. LR: ∆AUC = -0.014, P = 0.014; GBDT vs. LR: ∆AUC = -0.02, P = 0.005; XGBoost vs. LR: ∆AUC = 0.014, P = 0.032; SVM vs. LR: ∆AUC = 0.029, P = 0.001). Although there was a difference in AUC between LR and MLP, it did not reach statistical significance (MLP vs. LR: ∆AUC = -0.002, P = 0.56) (Table S2). The Brier scores of the seven algorithms ranged from 0.08 to 0.22, indicating good calibration (LR: 0.081; DT: 0.197; RF: 0.187; XGBoost: 0.186; GBDT: 0.189; SVM: 0.219; MLP: 0.178) (Figure S1). The low Brier score of LR reflects its superior reliability in probability prediction. Calibration curve further confirmed that the LR model outperformed other algorithms in calibration (Figure S1). Additionally, the LR model outperformed the other ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%) (Table S2).

Fig. 4
figure 4

ROC curves of different algorithms. (A) ROC curves of seven models on the validation set. (B) ROC curves of the LR model on the training and validation sets. ROC, receiver operating characteristic; LR, logistic regression; DT, decision tree; RF, random forest; GBDT, gradient boosting decision tree; XGBoost, extreme gradient boosting; MLP, multilayer perceptron; SVM, support vector machine

Algorithm interpretability and application

The four interpretable methods of the RF model showed that operation time, open thoracotomy, albumin level, and neutrophil count were important features (Fig. 5). The rankings of the top five variables in the feature importance and permutation importance methods were generally consistent. However, there were some differences in the rankings between the LIME importance and SHAP methods. Although the top three variables were generally related to surgery, there may be slight variations. In addition, the ranking of COPD fluctuated significantly across various methods.

Fig. 5
figure 5

Global feature importance of the RF model using four interpretability methods, ranked in descending order. (A) Ranking of feature importance indicated by Feature Importance plot. The matrix plot depicts displays the mean importance of each feature based on the reduction in impurity or information gain. (B) Ranking of feature importance indicated by Permutation Importance plot. The plot showing the mean decrease in model performance when each feature is randomly permuted. (C) Ranking of feature importance indicated by SHAP summary plot. The mean SHAP values for each feature represent their contribution to the model’s output. (D) Ranking of feature importance indicated by LIME plot. The mean LIME values approximate the model’s behavior locally using an interpretable surrogate model. RF, random forest; SHAP, Shapley additive explanation; LIME: local interpretable model-agnostic explanations

Based on AUC, specificity, accuracy, precision, F1 score, Brier score, and calibration curve evaluation, we ultimately selected the LR model as the primary model and proceeded with its visualization. A web-based version of the nomogram (Fig. 6) is available at https://liujie123.shinyapps.io/dynnomapp/. The proportion of patients stratified by the nomogram was similar in both cohorts, with 71.84%, 19.92%, and 8.24% of patients in the low-, medium-, and high-risk groups (development cohort) and 72.61%, 19.94%, and 7.45% (validation cohort), respectively (Fig. 7). Compared to the low-risk group, the incidence of POI was 4.62-fold and 12.98-fold higher in the medium-risk and high-risk groups of the development cohort, respectively (Table 2).

Fig. 6
figure 6

The nomogram based on the logistic regression algorithm. (A) Building a nomogram in the training cohort by incorporating five parametric parameters; (B) Online web server interface for the nomogram

Fig. 7
figure 7

The incidence of POI by risk group in the development cohort and validation cohort. POI, postoperative pulmonary infection

Table 2 Odds for POI by risk group in the development cohort and validation cohort

Discussion

In this study on POI in older adults undergoing major surgery, we compared the predictive role of multiple models. The results showed that compared with the ML algorithms, the classical LR algorithm had a better prediction effect on POI after major surgery in older patients. We also developed a visual nomogram and performed risk stratification using the LR algorithm. Based on the preoperative and intraoperative variables, the model could help physicians assess the risk of POI earlier and formulate relevant strategies. Our study included a large sample size and identified simple predictors. The incidence of POI was 10.03%, consistent with previous literature [31, 32].

Consistent with previous studies, the MLP and LR algorithms performed better [23]. The AUC of the above two models reached more than 0.79, accompanied by high accuracy and specificity, reflecting strong discriminatory ability. Despite progress in ML interpretability, challenges remain due to differing perspectives on explanations. The consistency and variability among the four interpretability methods provided a comprehensive understanding of feature importance. The consistent recognition of features such as operation time, open thoracotomy, albumin level, and neutrophil count underscored their significance in the decision-making process, offering guidance for further analysis and interpretation. However, the variability in feature rankings across methods also reflects challenges in ML development. As ML models grow more complex, explaining their internal mechanisms becomes increasingly difficult [33]. The differences in feature importance rankings stem from the underlying principles of each method. SHAP values are based on game theory, providing a comprehensive explanation of feature contributions in the context of interactions between features [34]. Permutation Importance evaluates feature importance by measuring performance changes when a feature is perturbed, focusing on individual contributions without considering interactions [35]. The Feature Importance method, based on impurity reduction in tree-based models, may overestimate the importance of high-cardinality features. LIME offers local explanations by approximating the model’s behavior around a specific instance but may not fully capture global feature contributions across the entire dataset [36]. These varying approaches can lead to discrepancies in feature importance rankings, reflecting their distinct mechanisms for calculating feature contributions. The interactions between features can lead to different ranking results across methods. We chose LR as the primary model due to its intuitive interpretation of regression coefficients and lower risk of overfitting compared to complex models. Its simplicity and transparency make it suitable for clinical practice, where clear explanations are essential.

In our study, COPD, neutrophil count, albumin level, surgical incision, and operation time were independent risk factors associated with POI. Most studies have confirmed that COPD, operation time, and surgical incision are independently associated with POI [16, 17]. Serum albumin levels reflect the patient’s nutritional status and are considered negative for acute-phase protein, which is closely associated with increased postoperative complications and mortality [37]. Low albumin levels may impair immune function and increase infection risk. Moreover, decreased albumin levels may contribute to increased capillary permeability into the extravascular space, causing pleural effusion and pulmonary edema, and increasing the risk of POI [38]. Neutrophils reflect the inflammatory state during disease progression. Neutrophil recruitment and activation are vital for immune defense during pathogenic infections [39].

Some preventive measures can be taken for controllable risk factors in the final model, such as preoperative use of bronchodilators combined with glucocorticoids to treat COPD [40], increasing albumin levels, and improving malnutrition [41]. The LR-based nomogram may aid in risk assessment, stratification, and identifying high-risk patients.

In our study, some factors reported in other studies were not identified as independent risk factors, such as RBC transfusion and urea nitrogen levels [42, 43]. This reflects the great variability of the elderly with major surgery from other study populations, and demonstrates the importance of constructing a POI model for the elderly undergoing major surgery. Additionally, these studies mainly used the LR algorithm and did not explore the predictive performance of the various ML algorithms.

Compared with previous studies, our study had the following strengths: first, the research population was an elderly group with a high incidence; second, the type of surgery was major surgery, better targeting high-risk groups; third, in addition to using traditional algorithms, we explored model performance using popular ML algorithms; fourth, we applied interpretability techniques to ML algorithms; fifthly, based on the nomogram and predicted probabilities, we stratified the population for better use in clinical practice. In future work, integrating our model into electronic health records or clinical decision support systems could provide clinicians with real-time predictions during preoperative assessments, thereby aiding personalized interventions and optimizing perioperative management.

Our study had some limitations. First, as a single-center retrospective study, it may have underestimated the incidence of POI. However, the large sample size and incidence rate align with literature reports. Second, the model lacks external validation; therefore, further external validation across multiple centers is needed in the future to assess its generalizability. Third, some parameters related to mechanical ventilation were not collected. However, our model had good discriminative ability. Fourth, the use of LASSO regression for feature selection presents certain limitations. LASSO tends to select a small number of features and, when features are highly correlated, may randomly select one, overlooking others. This could lead to a reduction in model performance, as it may fail to capture all potential variable interactions. Fifth, although the LR model performed well in overall predictions, ML models may exhibit superior predictive power in cases with more features and more complex relationships. Future research will incorporate additional features to further explore the performance of various algorithms.

Conclusions

In our study, the incidence of POI after major surgery in the elderly patients was 10.03%, which was significantly associated with the length of postoperative hospitalization. This study identified five simple and easy-to-collect independent risk factors for POI in elderly patients undergoing major surgery and compared the effectiveness of different algorithms for predicting POI. Furthermore, we constructed a visualized nomogram of POI and performed risk stratification to guide the perioperative clinical management of elderly patients undergoing major surgery.

Data availability

The raw datasets used in the current study can be obtained from the corresponding author upon reasonable request.

Abbreviations

POI:

Postoperative pulmonary infection

ML:

Machine learning

LR:

Logistic regression

AUC:

Area under the receiver operating characteristic curve

DT:

Decision tree

RF:

Random forest

SVM:

Support vector machine

GBDT:

Gradient boosting decision tree

XGBoost:

Extreme gradient boosting

MLP:

Multilayer perceptron

BMI:

Body mass index

COPD:

Chronic obstructive pulmonary disease

BUN:

Blood urea nitrogen

TBIL:

Total bilirubin

ALT:

Alanine aminotransferase

AST:

Aspartate aminotransferase

RBC:

Red blood cell

LASSO:

Least absolute shrinkage and selection operator

VIF:

Variance inflation factor

RUS:

Random undersampling

GridSearchCV:

Grid search with cross-validation

LIME:

Local interpretable model-agnostic explanations

SHAP:

Shapley additive explanation

IQR:

Interquartile range

References

  1. Papazian L, Klompas M, Luyt CE. Ventilator-associated pneumonia in adults: a narrative review. Intensive Care Med. 2020;46(5):888–906.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Hui S, Fowler AJ, Cashmore RMJ, Fisher TJ, Schlautmann J, Body S, Lan-Pak-Kee V, Webb M, Kyriakides M, Ng JY, et al. Routine postoperative noninvasive respiratory support and pneumonia after elective surgery: a systematic review and meta-analysis of randomised trials. Br J Anaesth. 2022;128(2):363–74.

    Article  PubMed  Google Scholar 

  3. Stotts C, Corrales-Medina VF, Rayner KJ. Pneumonia-Induced inflammation, resolution and cardiovascular disease: causes, consequences and clinical opportunities. Circ Res. 2023;132(6):751–74.

    Article  CAS  PubMed  Google Scholar 

  4. Liang S, Zhang X, Hu Y, Yang J, Li K. Association between perioperative chlorhexidine oral care and postoperative pneumonia in non-cardiac surgical patients: A systematic review and meta-analysis. Surgery. 2021;170(5):1418–31.

    Article  PubMed  Google Scholar 

  5. Redelmeier DA, McAlister FA, Kandel CE, Lu H, Daneman N. Postoperative pneumonia in elderly patients receiving acid suppressants: a retrospective cohort analysis. BMJ. 2010;340:c2608.

    Article  PubMed  Google Scholar 

  6. Henig O, Kaye KS. Bacterial pneumonia in older adults. Infect Dis Clin North Am. 2017;31(4):689–713.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Jung J, Moon SM, Jang HC, Kang CI, Jun JB, Cho YK, Kang SJ, Seo BJ, Kim YJ, Park SB, et al. Incidence and risk factors of postoperative pneumonia following cancer surgery in adult patients with selected solid cancer: results of Cancer POP study. Cancer Med. 2018;7(1):261–9.

    Article  PubMed  Google Scholar 

  8. Naito H, Nojima T, Fujisaki N, Tsukahara K, Yamamoto H, Yamada T, Aokage T, Yumoto T, Osako T, Nakao A. Therapeutic strategies for ischemia reperfusion injury in emergency medicine. Acute Med Surg. 2020;7(1):e501.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Partridge L, Deelen J, Slagboom PE. Facing up to the global challenges of ageing. Nature. 2018;561(7721):45–56.

    Article  CAS  PubMed  Google Scholar 

  10. Becher RD, Vander Wyk B, Leo-Summers L, Desai MM, Gill TM. The incidence and cumulative risk of major surgery in older persons in the united States. Ann Surg. 2023;277(1):87–92.

    Article  PubMed  Google Scholar 

  11. Weiser TG, Haynes AB, Molina G, Lipsitz SR, Esquivel MM, Uribe-Leitz T, Fu R, Azad T, Chao TE, Berry WR, et al. Estimate of the global volume of surgery in 2012: an assessment supporting improved health outcomes. Lancet. 2015;385(Suppl 2):S11.

    Article  PubMed  Google Scholar 

  12. Mungan İ, Dicle ÇB, Bektaş Ş, Sarı S, Yamanyar S, Çavuş M, Turan S, Bostancı EB. Does the preoperative platelet-to-lymphocyte ratio and neutrophil-to-lymphocyte ratio predict morbidity after gastrectomy for gastric cancer? Mil Med Res. 2020;7(1):9.

    CAS  PubMed  PubMed Central  Google Scholar 

  13. Shi H, Sang H, Zhang Z, Chen B, Li L, Liu F, Xia W, Zhou Y, Liu K, Li X, et al. Neutrophil-to-Lymphocyte ratio is associated with clinical outcomes in patients treated with mechanical thrombectomy for posterior circulation large vessel occlusion. World Neurosurg. 2024;193:628–35.

    Article  PubMed  Google Scholar 

  14. Herrero-Cervera A, Soehnlein O, Kenne E. Neutrophils in chronic inflammatory diseases. Cell Mol Immunol. 2022;19(2):177–91.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Ahn J, Chang JS, Kim JW. Postoperative pneumonia and aspiration pneumonia following elderly hip fractures. J Nutr Health Aging. 2022;26(7):732–8.

    Article  CAS  PubMed  Google Scholar 

  16. Gao YC, Zhang YW, Shi L, Gao W, Li YJ, Chen H, Rui YF. What are risk factors of postoperative pneumonia in geriatric individuals after hip fracture surgery: A systematic review and Meta-Analysis. Orthop Surg. 2023;15(1):38–52.

    Article  PubMed  Google Scholar 

  17. Russotto V, Sabaté S, Canet J. Development of a prediction model for postoperative pneumonia: A multicentre prospective observational study. Eur J Anaesthesiol. 2019;36(2):93–104.

    Article  PubMed  Google Scholar 

  18. Alowais SA, Alghamdi SS, Alsuhebany N, Alqahtani T, Alshaya AI, Almohareb SN, Aldairem A, Alrashed M, Bin Saleh K, Badreldin HA, et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Med Educ. 2023;23(1):689.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Artzi NS, Shilo S, Hadar E, Rossman H, Barbash-Hazan S, Ben-Haroush A, Balicer RD, Feldman B, Wiznitzer A, Segal E. Prediction of gestational diabetes based on nationwide electronic health records. Nat Med. 2020;26(1):71–6.

    Article  CAS  PubMed  Google Scholar 

  20. Xue B, Li D, Lu C, King CR, Wildes T, Avidan MS, Kannampallil T, Abraham J. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. 2021;4(3):e212240.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Nishi H, Oishi N, Ishii A, Ono I, Ogura T, Sunohara T, Chihara H, Fukumitsu R, Okawa M, Yamana N, et al. Predicting clinical outcomes of large vessel occlusion before mechanical thrombectomy using machine learning. Stroke. 2019;50(9):2379–88.

    Article  PubMed  Google Scholar 

  22. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE Rev Biomed Eng. 2021;14:116–26.

    Article  PubMed  Google Scholar 

  23. Nusinovici S, Tham YC, Chak Yan MY, Wei Ting DS, Li J, Sabanayagam C, Wong TY, Cheng CY. Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol. 2020;122:56–69.

    Article  PubMed  Google Scholar 

  24. Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis. Int J Med Inf. 2021;151:104484.

    Article  Google Scholar 

  25. Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110:12–22.

    Article  PubMed  Google Scholar 

  26. Cabitza F, Rasoini R, Gensini GF. Unintended consequences of machine learning in medicine. JAMA. 2017;318(6):517–8.

    Article  PubMed  Google Scholar 

  27. Schwarze ML, Barnato AE, Rathouz PJ, Zhao Q, Neuman HB, Winslow ER, Kennedy GD, Hu YY, Dodgion CM, Kwok AC, et al. Development of a list of high-risk operations for patients 65 years and older. JAMA Surg. 2015;150(4):325–31.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Gill TM, Murphy TE, Gahbauer EA, Leo-Summers L, Becher RD. Geriatric vulnerability and the burden of disability after major surgery. J Am Geriatr Soc. 2022;70(5):1471–80.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Bluth T, Serpa Neto A, Schultz MJ, Pelosi P, Gama de Abreu M, Bluth T, Bobek I, Canet JC, Cinnella G, de Baerdemaeker L, et al. Effect of intraoperative high positive End-Expiratory pressure (PEEP) with recruitment maneuvers vs low PEEP on postoperative pulmonary complications in obese patients: A randomized clinical trial. JAMA. 2019;321(23):2292–305.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Schuijt MTU, Hol L, Nijbroek SG, Ahuja S, van Meenen D, Mazzinari G, Hemmes S, Bluth T, Ball L, Gama-de Abreu M, et al. Associations of dynamic driving pressure and mechanical power with postoperative pulmonary complications-posthoc analysis of two randomised clinical trials in open abdominal surgery. EClinicalMedicine. 2022;47:101397.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Gupta H, Gupta PK, Schuller D, Fang X, Miller WJ, Modrykamien A, Wichman TO, Morrow LE. Development and validation of a risk calculator for predicting postoperative pneumonia. Mayo Clin Proc. 2013;88(11):1241–9.

    Article  PubMed  Google Scholar 

  32. El Solh AA, Bhora M, Pineda L, Dhillon R. Nosocomial pneumonia in elderly patients following cardiac surgery. Respir Med. 2006;100(4):729–36.

    Article  PubMed  Google Scholar 

  33. Hassija V, Chamola V, Mahapatra A, Singal A, Goel D, Huang K, Scardapane S, Spinelli I, Mahmud M, Hussain A. Interpreting Black-Box models: A review on explainable artificial intelligence. Cogn Comput. 2024;16(1):45–74.

    Article  Google Scholar 

  34. Li X, Zhao Y, Zhang D, Kuang L, Huang H, Chen W, Fu X, Wu Y, Li T, Zhang J, et al. Development of an interpretable machine learning model associated with heavy metals’ exposure to identify coronary heart disease among US adults via SHAP: findings of the US NHANES from 2003 to 2018. Chemosphere. 2023;311(Pt 1):137039.

    Article  CAS  PubMed  Google Scholar 

  35. Li W, Huang G, Tang N, Lu P, Jiang L, Lv J, Qin Y, Lin Y, Xu F, Lei D. Effects of heavy metal exposure on hypertension: A machine learning modeling approach. Chemosphere. 2023;337:139435.

    Article  CAS  PubMed  Google Scholar 

  36. Ribeiro MT, Singh S, Guestrin C. Why Should I Trust You? Explaining the Predictions of Any Classifier. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco, California, USA. Association for Computing Machinery 2016: 1135–1144.

  37. Eckart A, Struja T, Kutz A, Baumgartner A, Baumgartner T, Zurfluh S, Neeser O, Huber A, Stanga Z, Mueller B, et al. Relationship of nutritional status, inflammation, and serum albumin levels during acute illness: A prospective study. Am J Med. 2020;133(6):713–e722717.

    Article  CAS  PubMed  Google Scholar 

  38. Hübner M, Mantziari S, Demartines N, Pralong F, Coti-Bertrand P, Schäfer M. Postoperative Albumin Drop Is a Marker for Surgical Stress and a Predictor for Clinical Outcome: A Pilot Study. Gastroenterol Res Pract 2016, 2016:8743187.

  39. Allie SR, Randall TD. Pulmonary immunity to viruses. Clin Sci (Lond). 2017;131(14):1737–62.

    Article  CAS  PubMed  Google Scholar 

  40. Agusti A, Fabbri LM, Singh D, Vestbo J, Celli B, Franssen FME, Rabe KF, Papi A. Inhaled corticosteroids in COPD: friend or foe? Eur Respir J 2018, 52(6).

  41. Soeters PB, Wolfe RR, Shenkin A. Hypoalbuminemia: pathogenesis and clinical significance. JPEN J Parenter Enter Nutr. 2019;43(2):181–93.

    Article  CAS  Google Scholar 

  42. Russotto V, Sabaté S, Canet J, Network, ftPgtESoACT. Development of a prediction model for postoperative pneumonia: A multicentre prospective observational study. Eur J Anaesthesiol| EJA. 2019;36(2):93–104.

    Article  PubMed  Google Scholar 

  43. Arozullah AM, Khuri SF, Henderson WG, Daley J. Development and validation of a multifactorial risk index for predicting postoperative pneumonia after major noncardiac surgery. Ann Intern Med. 2001;135(10):847–57.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank all the participants.

Funding

This study was supported by the National Key Research and Development Program of China (Grant No. 2018YFC2001903), National Natural Science Foundation of China (Grant No. 81873952), and National Natural Science Foundation of China (Grant No. 81901948).

Author information

Authors and Affiliations

Authors

Contributions

JL and XL designed and analyzed the study. YW, ZX, YH, LC and YL conducted the data collection. YB, YF, GL and WX wrote the manuscript. QW edited the manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Qingping Wu.

Ethics declarations

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Wuhan Union Hospital (Approval No.2021 − 0986). To protect patient privacy, all personally identifiable information was anonymized. Due to the lack of identifiable information and retrospective nature of the study, informed consent was obtained.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Li, X., Wang, Y. et al. Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models. BMC Pulm Med 25, 128 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12890-025-03582-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12890-025-03582-4

Keywords