Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models

Liu, Jie; Li, Xia; Wang, Yanting; Xu, Zhenzhen; Lv, Yong; He, Yuyao; Chen, Lu; Feng, Yiqi; Liu, Guoyang; Bai, Yunxiao; Xie, Wanli; Wu, Qingping

doi:10.1186/s12890-025-03582-4

Research
Open access
Published: 19 March 2025

BMC Pulmonary Medicine volume 25, Article number: 128 (2025)

Abstract

Background

Postoperative pulmonary infection (POI) is strongly associated with a poor prognosis and has a high incidence in elderly patients undergoing major surgery. Machine learning (ML) algorithms are increasingly being used in medicine, but the predictive role of logistic regression (LR) and ML algorithms for POI in high-risk populations remains unclear.

Methods

We conducted a retrospective cohort study of older adults undergoing major surgery over a period of six years. The included patients were randomly divided into training and validation sets at a ratio of 7:3. The features selected by the least absolute shrinkage and selection operator regression algorithm were used as the input variables of the ML and LR models. Multiple interpretability methods were applied to the random forest (RF) model to interpret the ML models.

Results

Of the 9481 older adults in our study, 951 developed POI. Among the different algorithms, LR performed the best with an AUC of 0.80, whereas the decision tree performed the worst with an AUC of 0.75. Furthermore, the LR model outperformed the ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%). Although four interpretability methods were applied to the RF model, their results were not fully consistent. Finally, to facilitate clinical application, we established a web-based version of the nomogram based on the LR algorithm. In addition, patients were divided into three significantly distinct risk intervals for predicting POI.

Conclusions

Compared with popular ML algorithms, LR was more effective at predicting POI in older patients undergoing major surgery. The constructed nomogram could identify high-risk elderly patients and facilitate perioperative management planning.

Trial registration

The study was retrospectively registered (NCT06491459).

Key Summary Points

• Postoperative pulmonary infection (POI) significantly impacts the prognosis of elderly patients undergoing major surgery, and identifying its risk factors can enhance treatment options.

• This study developed a POI prediction model using logistic regression (LR) and machine learning (ML) algorithms, with LR demonstrating the best performance (AUC of 0.80). Additionally, four methods were employed to explain the feature importance in the ML models.

• The nomogram generated from the LR model offers clinicians a practical tool for identifying high-risk elderly patients and optimizing perioperative management strategies.

Introduction

Postoperative pulmonary infection (POI) is frequently linked to poor prognoses, prolonged hospital stays, and increased healthcare costs [1,2,3]. Its reported incidence ranges from 0.9% to 39%, depending on the study population and the definition used [4, 5]. Because of diminished physiological organ reserve and increased colonization of the upper respiratory tract by gram-negative bacteria, older people are more likely to develop POI [6]. In a study of cancer surgery, the hazard ratio for the one-year cumulative prevalence of POI in patients over 65 years of age was as high as 3.7 [7]. Major surgery is characterized by long operation times and repeated ischemia and reperfusion, which can alter the body's immune cells and trigger systemic inflammatory reactions, resulting in lung function damage [3, 8]. Major surgery is associated with poor outcomes, and as the population ages, the proportion of elderly patients is gradually growing [9,10,11].

Furthermore, previous studies have shown that changes in circulating inflammatory cells (such as lymphocyte, neutrophil, and monocyte counts) are associated with postoperative complications [12,13,14]. Circulating inflammatory cell counts are readily available markers that may provide valuable information for predicting the risk of postoperative complications in elderly patients. Although some studies have explored the risk factors for POI [15,16,17], few studies with large samples focused on POI after major surgery in older adults.

Machine learning (ML) is becoming increasingly prevalent in the medical field, especially for predicting complications [18]. ML is a branch of artificial intelligence that builds models using vast amounts of medical data, leveraging algorithms that do not assume linearity in the relationships between variables. ML has the advantage of revealing deeper correlations in data and is robust to noisy information [19]. ML parses medical information and can be used to build disease occurrence, diagnosis, and prognosis assessment models to aid clinical decision-making [20, 21]. Leading ML algorithms, such as decision tree (DT), random forest (RF), support vector machine (SVM), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and multilayer perceptron (MLP), represent some of the most widely used models in the field [22]. However, some studies have shown that logistic regression (LR) models do not perform worse than ML models and even outperform ML in predicting complications [23,24,25]. ML faces limitations, including the need for large training datasets and poor interpretability, as the influence of individual variables on model outcomes is often unclear [26]. To our knowledge, although there are studies involving different algorithms, few have investigated the performance of different algorithms in predicting POI in older adults undergoing major surgery.

This study aimed to identify the predictors of POI for perioperative management and to compare the predictive performance of the ML and LR models for POI after major surgery in elderly patients. Furthermore, various interpretable methods were employed to explore the variable importance of ML models.

Materials and methods

We obtained all data from the electronic database system for elderly patients who underwent surgery at Wuhan Union Hospital from January 2014 to December 2019. For this retrospective enrollment, the inclusion criteria were (1) age ≥ 65 years and (2) mechanical ventilation during major surgery. The exclusion criteria were (1) preoperative tracheal intubation, (2) preoperative pneumonia, (3) organ transplantation, and (4) missing data. According to the literature [27, 28], major surgery is any invasive, non-percutaneous, non-endoscopic procedure performed under general anesthesia. This study adhered to the principles outlined in the Declaration of Helsinki and was approved by the Ethics Committee of our institution (Ethics Number: 2021-0986).

Data collection

We collected appropriate clinical variables based on the literature and clinical experience. Demographic data included age, male sex, body mass index (BMI), history of smoking and alcohol consumption, cancer, chronic obstructive pulmonary disease (COPD), hypertension, coronary artery disease, and diabetes. Laboratory test data included hemoglobin, platelet count, albumin, leukocyte count, lymphocyte count, neutrophil count, monocyte count, blood urea nitrogen (BUN), creatinine, total bilirubin (TBIL), alanine aminotransferase (ALT), and aspartate aminotransferase (AST). Surgery-related factors included type of surgery, emergency surgery, surgical incision, operation time, red blood cell transfusion (RBC transfusion), and colloidal infusion. To ensure reliability, the data were reviewed and reconfirmed by two researchers, and discrepancies were resolved by the research team after discussion.

Primary outcome

The primary outcome was POI incidence during hospitalization. POI was defined as new or progressive radiological infiltration accompanied by at least two of the following: antibiotic treatment, body temperature above 38 °C, increased or decreased white blood cell count, and/or purulent airway secretions [29, 30].

Data preprocessing

Figure 1 shows the detailed data preprocessing workflow. We randomly divided the patients into training and validation sets at a ratio of 7:3. To reduce feature dimensionality and mitigate overfitting, we employed the least absolute shrinkage and selection operator (LASSO) regression algorithm to identify relevant risk factors. The regularization parameter λ was determined using the one-standard-error rule (1-SE rule) through 10-fold cross-validation, and the variables selected by LASSO were incorporated into the model. A correlation heat map and the variance inflation factor (VIF) were used to assess multicollinearity between variables.
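
The 1-SE rule mentioned above can be illustrated with a short sketch. It is not the authors' code: the penalty values and per-fold errors below are hypothetical, three folds are used instead of ten for brevity, and the function name `one_se_lambda` is ours. The rule picks the largest penalty whose mean cross-validated error lies within one standard error of the minimum.

```python
from math import sqrt
from statistics import mean, stdev


def one_se_lambda(lambdas, fold_errors):
    """Return (lambda_min, lambda_1se) from per-fold cross-validation errors.

    lambdas     -- candidate penalty strengths
    fold_errors -- fold_errors[i] holds the per-fold errors for lambdas[i]
    """
    means = [mean(errs) for errs in fold_errors]
    # Standard error of the mean error across folds, per candidate.
    ses = [stdev(errs) / sqrt(len(errs)) for errs in fold_errors]
    best = min(range(len(lambdas)), key=lambda i: means[i])
    threshold = means[best] + ses[best]
    # Most regularizing lambda whose mean error is within one SE of the minimum.
    eligible = [lam for lam, m in zip(lambdas, means) if m <= threshold]
    return lambdas[best], max(eligible)


# Hypothetical cross-validation errors (illustration only, not study data).
lambdas = [0.001, 0.01, 0.05, 0.2]
fold_errors = [
    [0.60, 0.62, 0.61],
    [0.56, 0.62, 0.59],
    [0.59, 0.61, 0.60],
    [0.70, 0.72, 0.71],
]
lambda_min, lambda_1se = one_se_lambda(lambdas, fold_errors)
```

Here the error is minimized at λ = 0.01, but λ = 0.05 is still within one standard error, so the 1-SE rule prefers it and yields a sparser model.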

Due to the imbalanced distribution of POI in the dataset, random undersampling (RUS) was applied to balance the samples. To prevent data leakage, RUS was performed only on the training set. Dummy coding was applied for unordered multi-categorical variables. For continuous data, we applied standardization to produce data with a mean of 0 and a standard deviation of 1.
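
A minimal sketch of these two steps follows, under stated assumptions: the data are hypothetical, the helper names are ours, and standardization is fitted on the undersampled training set (the text does not specify the order). The validation rows are never resampled and are scaled with training statistics only.

```python
import random
from statistics import mean, pstdev


def random_undersample(rows, labels, seed=0):
    """Downsample the majority class to the size of the minority class."""
    rng = random.Random(seed)
    pos = [i for i, y in enumerate(labels) if y == 1]
    neg = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = (pos, neg) if len(pos) <= len(neg) else (neg, pos)
    keep = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in keep], [labels[i] for i in keep]


def standardize(train_values, other_values):
    """Scale both lists with the mean and SD estimated on the training values."""
    mu, sd = mean(train_values), pstdev(train_values)  # population SD
    scale = lambda values: [(v - mu) / sd for v in values]
    return scale(train_values), scale(other_values)


# Hypothetical training data: one continuous feature, about 10% positives.
train_x = [100 + 5 * i for i in range(20)]
train_y = [1 if i in (3, 17) else 0 for i in range(20)]
valid_x = [120, 160, 210]  # validation rows are left untouched by RUS

bal_x, bal_y = random_undersample(train_x, train_y)
bal_z, valid_z = standardize(bal_x, valid_x)
```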

Data modeling

The data modeling process is illustrated in Fig. 1. Two distinct classes of modeling algorithms were considered for model building: the traditional LR algorithm and ML algorithms. To obtain more convincing results, we selected the following six popular ML algorithms: DT, RF, SVM, GBDT, XGBoost, and MLP.

To optimize model performance, we tuned hyperparameters using grid search with five-fold cross-validation (GridSearchCV). GridSearchCV evaluates all possible parameter combinations through cross-validation and selects the best set based on performance scores. For cross-validation, the training set was split into five equal parts. In each iteration, one part served as the validation set, while the other four were used for training. This process was repeated five times, and the model’s performance was optimized based on the area under the receiver operating characteristic curve (AUC).
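
The tuning procedure can be sketched with scikit-learn's `GridSearchCV`. The synthetic data and the parameter grid below are assumptions made for illustration; they are not the study's data or grids.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the training set (the study data are not public).
X, y = make_classification(n_samples=400, n_features=5, n_informative=3,
                           n_redundant=0, random_state=0)

# Hypothetical grid, chosen only to keep the example small.
param_grid = {"max_depth": [2, 3, 5], "min_samples_leaf": [5, 20]}

search = GridSearchCV(
    estimator=DecisionTreeClassifier(random_state=0),
    param_grid=param_grid,
    scoring="roc_auc",  # select the combination with the highest mean AUC
    cv=5,               # five folds; stratified by default for classifiers
)
search.fit(X, y)

best_params = search.best_params_
best_auc = search.best_score_  # mean cross-validated AUC of the best combination
```

With three depths and two leaf sizes, six combinations are each fitted five times, and the estimator is refitted on the full training set with the winning combination.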

The models with the best hyperparameters were then evaluated on the validation set. Model performance was evaluated using AUC and the Brier score, which ranges from 0 to 1 (lower values indicate better calibration). Additional metrics, including accuracy, sensitivity, specificity, precision, and F1 score, were used to compare model performance. The cut-off threshold was determined using clinical experience or the Youden index.
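
For readers unfamiliar with these metrics, the following sketch computes the Brier score, the confusion-matrix metrics, and a Youden-index threshold on a small hypothetical set of predictions. The function names and data are ours, and the code assumes both outcome classes are present.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def classification_metrics(y_true, y_prob, threshold):
    """Metrics when probabilities >= threshold are classified as positive."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        predicted = p >= threshold
        if predicted and y == 1:
            tp += 1
        elif predicted:
            fp += 1
        elif y == 1:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if precision + sensitivity else 0.0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
    }


def youden_threshold(y_true, y_prob):
    """Threshold that maximizes J = sensitivity + specificity - 1."""
    def j_statistic(t):
        m = classification_metrics(y_true, y_prob, t)
        return m["sensitivity"] + m["specificity"] - 1
    return max(sorted(set(y_prob)), key=j_statistic)


# Hypothetical outcomes and predicted probabilities (illustration only).
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
y_prob = [0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.70, 0.90]

cutoff = youden_threshold(y_true, y_prob)
metrics = classification_metrics(y_true, y_prob, cutoff)
brier = brier_score(y_true, y_prob)
```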

Four methods were selected for feature importance analysis: the model's built-in feature importance attribute, local interpretable model-agnostic explanations (LIME), permutation feature importance, and the Shapley additive explanation (SHAP) method. A nomogram was constructed using the LR algorithm. The study population was divided into three groups based on the nomogram and clinical practice. The three groups differed significantly in the predicted risk of POI: low-risk (< 10%), intermediate-risk (10–30%), and high-risk (> 30%).
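
The three risk intervals translate directly into a small helper. This is a sketch: the function name and the example probabilities are hypothetical, and the boundary values 10% and 30% are assigned to the intermediate group, following the intervals as stated above.

```python
from collections import Counter


def risk_group(probability):
    """Map a predicted POI probability to the stated risk intervals."""
    if not 0.0 <= probability <= 1.0:
        raise ValueError("probability must lie in [0, 1]")
    if probability < 0.10:
        return "low"
    if probability <= 0.30:
        return "intermediate"
    return "high"


# Hypothetical predicted probabilities for six patients.
predicted = [0.03, 0.08, 0.10, 0.22, 0.30, 0.45]
group_counts = Counter(risk_group(p) for p in predicted)
```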

Data analysis

Continuous variables were presented as means ± standard deviations or medians (interquartile ranges, IQR) and compared using independent samples t-tests or non-parametric tests, respectively. Categorical variables were expressed as frequencies and percentages and compared using the chi-squared test or Fisher’s exact test.

A P-value < 0.05 (two-sided) was considered statistically significant. The XGBoost algorithm was built using the XGBoost package, and the other ML algorithms were built using the scikit-learn package. The ML models were constructed in Python (version 3.8.8). The remaining data analyses were performed using R software (version 4.1.1).

Results

Baseline characteristics

This study assessed the eligibility of older adults undergoing major surgery at our hospital over a six-year period. Based on the inclusion and exclusion criteria, 9481 patients were analyzed. The clinical characteristics of the patients are summarized in Table 1. A total of 951 patients developed POI, accounting for 10.03% of the population, while 8530 did not. The median age of the patients with POI was 69 years, and 53.78% were male. Compared to non-POI patients, patients with POI were more often male and had more comorbidities, lower hemoglobin and albumin levels, higher levels of inflammatory factors, and longer operation times. In addition, the length of postoperative hospital stay and total length of hospitalization were significantly longer in patients who developed POI (16 vs. 10, P < 0.001; 25 vs. 16, P < 0.001). There were no significant differences between the two cohorts (training and validation sets) for any of the features collected, including the incidence of POI (10.03% vs. 10.02%, P = 1).

Table 1 Baseline characteristics of the cohort

Feature selection

Based on the LASSO regression analysis of the training set, five variables were entered into the final model (Fig. 2). These five variables were COPD, neutrophil count, albumin level, surgical incision, and operation time, which were independently associated with POI (P < 0.05). To explore multicollinearity between the independent variables, we calculated Spearman’s correlation coefficients and visualized them in a heatmap (Fig. 3). The heatmap shows low correlations between the variables. Furthermore, all variance inflation factor values were below 2, confirming minimal multicollinearity. The hyperparameters of the ML models are detailed in Table S1.
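
Spearman's correlation coefficient is the Pearson correlation of the ranks, with tied values sharing their average rank. The sketch below shows that computation in plain Python on hypothetical values; it is not the analysis code used for Fig. 3.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """1-based ranks; tied values receive the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical values: a monotonic but non-linear relationship.
rho_monotonic = spearman([1, 2, 3, 4, 5], [1, 4, 9, 16, 25])
rho_reversed = spearman([1, 2, 3], [3, 2, 1])
```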

Algorithm performance

The LR model achieved the highest AUC of 0.80 on the validation set, while the DT model had the lowest AUC of 0.75 (Fig. 4A). In the training set, the LR model’s AUC was 0.81 (95% CI: 0.795–0.837) (Fig. 4B). The AUC of LR was significantly higher than those of DT, RF, SVM, GBDT, and XGBoost (DT vs. LR: ∆AUC = -0.045, P < 0.001; RF vs. LR: ∆AUC = -0.014, P = 0.014; GBDT vs. LR: ∆AUC = -0.02, P = 0.005; XGBoost vs. LR: ∆AUC = 0.014, P = 0.032; SVM vs. LR: ∆AUC = 0.029, P = 0.001). Although there was a difference in AUC between LR and MLP, it did not reach statistical significance (MLP vs. LR: ∆AUC = -0.002, P = 0.56) (Table S2). The Brier scores of the seven algorithms ranged from 0.08 to 0.22, indicating good calibration (LR: 0.081; DT: 0.197; RF: 0.187; XGBoost: 0.186; GBDT: 0.189; SVM: 0.219; MLP: 0.178) (Figure S1). The low Brier score of LR reflects its superior reliability in probability prediction. The calibration curves further confirmed that the LR model outperformed the other algorithms in calibration (Figure S1). Additionally, the LR model outperformed the ML models in terms of accuracy (88.22%), specificity (90.29%), precision (44.42%), and F1 score (54.25%) (Table S2).
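
As a reminder of what the AUC and ∆AUC values measure, the sketch below computes the AUC as the probability that a randomly chosen POI case receives a higher score than a randomly chosen non-case (ties count one half), and then takes the difference between two hypothetical models. The data are invented, and the significance test behind the reported P-values is not reproduced here.

```python
def auc(y_true, y_score):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical outcomes and scores from two models (illustration only).
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
scores_a = [0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.70, 0.90]
scores_b = [0.20, 0.10, 0.30, 0.60, 0.25, 0.40, 0.70, 0.80]

auc_a = auc(y_true, scores_a)
auc_b = auc(y_true, scores_b)
delta_auc = auc_b - auc_a  # negative: model B discriminates worse than model A
```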

Algorithm interpretability and application

The four interpretability methods applied to the RF model showed that operation time, open thoracotomy, albumin level, and neutrophil count were important features (Fig. 5). The rankings of the top five variables in the feature importance and permutation importance methods were generally consistent. However, there were some differences in the rankings between the LIME importance and SHAP methods. Although the top three variables were generally related to surgery, their exact order varied slightly between methods. In addition, the ranking of COPD fluctuated considerably across methods.
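
The idea behind permutation importance can be shown with a toy example. This is a sketch, not the RF analysis above: the "model" is a hypothetical rule that uses only the first feature, accuracy stands in for the performance metric, and the helper names are ours.

```python
import random


def accuracy(model, X, y):
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)


def permutation_importance(model, X, y, n_repeats=20, seed=0):
    """Mean drop in accuracy after shuffling one feature column at a time."""
    rng = random.Random(seed)
    baseline = accuracy(model, X, y)
    importances = []
    for j in range(len(X[0])):
        drops = []
        for _ in range(n_repeats):
            column = [row[j] for row in X]
            rng.shuffle(column)
            # Rebuild the rows with feature j replaced by its shuffled values.
            X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, column)]
            drops.append(baseline - accuracy(model, X_perm, y))
        importances.append(sum(drops) / n_repeats)
    return importances


def toy_model(row):
    """Hypothetical classifier: depends on feature 0 and ignores feature 1."""
    return 1 if row[0] > 0.5 else 0


X = [[i / 10, (i * 7 % 10) / 10] for i in range(10)]
y = [toy_model(row) for row in X]
importances = permutation_importance(toy_model, X, y)
```

Shuffling the ignored feature never changes a prediction, so its importance is exactly zero, whereas shuffling the feature the model relies on lowers accuracy.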

Based on AUC, specificity, accuracy, precision, F1 score, Brier score, and calibration curve evaluation, we ultimately selected the LR model as the primary model and proceeded with its visualization. A web-based version of the nomogram (Fig. 6) is available at https://liujie123.shinyapps.io/dynnomapp/. The proportion of patients stratified by the nomogram was similar in both cohorts, with 71.84%, 19.92%, and 8.24% of patients in the low-, medium-, and high-risk groups (development cohort) and 72.61%, 19.94%, and 7.45% (validation cohort), respectively (Fig. 7). Compared to the low-risk group, the incidence of POI was 4.62-fold and 12.98-fold higher in the medium-risk and high-risk groups of the development cohort, respectively (Table 2).
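
A nomogram is a graphical rescaling of the LR linear predictor. The sketch below shows one common construction, in which the predictor with the largest effect span is mapped to 0 to 100 points. The intercept, coefficients, predictor names, and value ranges are hypothetical and are not the fitted values of the published model.

```python
from math import exp

# Hypothetical logistic regression parameters (illustration only).
INTERCEPT = -3.0
PREDICTORS = {
    # name: (coefficient, minimum value, maximum value)
    "operation_time_h": (0.30, 0.0, 10.0),
    "albumin": (-0.50, 2.0, 5.0),
    "copd": (0.80, 0.0, 1.0),
}


def reference(beta, lo, hi):
    """Value at which a predictor contributes zero points."""
    return lo if beta >= 0 else hi


# The predictor with the largest effect span defines the 0-100 point scale.
MAX_SPAN = max(abs(b) * (hi - lo) for b, lo, hi in PREDICTORS.values())


def points(name, value):
    beta, lo, hi = PREDICTORS[name]
    return 100.0 * beta * (value - reference(beta, lo, hi)) / MAX_SPAN


def probability_from_points(total_points):
    """Convert total points back to the linear predictor, then to a probability."""
    base = INTERCEPT + sum(b * reference(b, lo, hi)
                           for b, lo, hi in PREDICTORS.values())
    return 1.0 / (1.0 + exp(-(base + total_points * MAX_SPAN / 100.0)))


# Hypothetical patient.
patient = {"operation_time_h": 5.0, "albumin": 3.0, "copd": 1.0}
total = sum(points(name, value) for name, value in patient.items())
prob_from_nomogram = probability_from_points(total)

# Direct evaluation of the same logistic model, for comparison.
linear = INTERCEPT + sum(PREDICTORS[n][0] * v for n, v in patient.items())
prob_direct = 1.0 / (1.0 + exp(-linear))
```

Because the points are a linear rescaling of each term, the probability read from the total points equals the probability computed directly from the model.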

Table 2 Odds for POI by risk group in the development cohort and validation cohort

Discussion

In this study on POI in older adults undergoing major surgery, we compared the predictive role of multiple models. The results showed that compared with the ML algorithms, the classical LR algorithm had a better prediction effect on POI after major surgery in older patients. We also developed a visual nomogram and performed risk stratification using the LR algorithm. Based on the preoperative and intraoperative variables, the model could help physicians assess the risk of POI earlier and formulate relevant strategies. Our study included a large sample size and identified simple predictors. The incidence of POI was 10.03%, consistent with previous literature [31, 32].

Consistent with previous studies, the MLP and LR algorithms performed best [23]. Both models reached an AUC above 0.79, accompanied by high accuracy and specificity, reflecting strong discriminatory ability. Despite progress in ML interpretability, challenges remain due to differing perspectives on explanations. The consistency and variability among the four interpretability methods provided a comprehensive understanding of feature importance. The consistent recognition of features such as operation time, open thoracotomy, albumin level, and neutrophil count underscored their significance in the decision-making process, offering guidance for further analysis and interpretation. However, the variability in feature rankings across methods also reflects challenges in ML development. As ML models grow more complex, explaining their internal mechanisms becomes increasingly difficult [33].

The differences in feature importance rankings stem from the underlying principles of each method. SHAP values are based on game theory, providing a comprehensive explanation of feature contributions in the context of interactions between features [34]. Permutation importance evaluates feature importance by measuring performance changes when a feature is perturbed, focusing on individual contributions without considering interactions [35]. The built-in feature importance method, based on impurity reduction in tree-based models, may overestimate the importance of high-cardinality features. LIME offers local explanations by approximating the model’s behavior around a specific instance but may not fully capture global feature contributions across the entire dataset [36]. These distinct mechanisms, together with interactions between features, can lead to discrepancies in feature importance rankings across methods.

We chose LR as the primary model due to its intuitive interpretation of regression coefficients and lower risk of overfitting compared to complex models. Its simplicity and transparency make it suitable for clinical practice, where clear explanations are essential.
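
The game-theoretic idea behind SHAP discussed above can be made concrete with an exact Shapley computation on a tiny model. This sketch enumerates every feature ordering, which is feasible only for a handful of features; it is not how the SHAP package computes its estimates, and the toy model and baseline are hypothetical.

```python
from itertools import permutations
from math import factorial


def shapley_values(f, x, baseline):
    """Exact Shapley values: average marginal contribution over all orderings.

    Features not yet 'revealed' in an ordering are held at their baseline value.
    """
    n = len(x)
    phi = [0.0] * n
    for order in permutations(range(n)):
        current = list(baseline)
        previous = f(current)
        for i in order:
            current[i] = x[i]
            value = f(current)
            phi[i] += value - previous
            previous = value
    return [total / factorial(n) for total in phi]


def toy_model(z):
    """Hypothetical model with an interaction between features 1 and 2."""
    return 2.0 * z[0] + z[1] * z[2]


x = [1.0, 1.0, 1.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_model, x, baseline)
```

The interaction term is shared equally between the two interacting features, and the contributions sum to the difference between the prediction and the baseline prediction, which is the additivity property that SHAP relies on.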

In our study, COPD, neutrophil count, albumin level, surgical incision, and operation time were independent risk factors associated with POI. Most studies have confirmed that COPD, operation time, and surgical incision are independently associated with POI [16, 17]. Serum albumin reflects the patient’s nutritional status and is considered a negative acute-phase protein; low levels are closely associated with increased postoperative complications and mortality [37]. Low albumin levels may impair immune function and increase infection risk. Moreover, decreased albumin levels may contribute to fluid leakage from the capillaries into the extravascular space, causing pleural effusion and pulmonary edema and increasing the risk of POI [38]. Neutrophils reflect the inflammatory state during disease progression. Neutrophil recruitment and activation are vital for immune defense during pathogenic infections [39].

Some preventive measures can be taken for controllable risk factors in the final model, such as preoperative use of bronchodilators combined with glucocorticoids to treat COPD [40], increasing albumin levels, and improving malnutrition [41]. The LR-based nomogram may aid in risk assessment, stratification, and identifying high-risk patients.

In our study, some factors reported in other studies, such as RBC transfusion and urea nitrogen levels, were not identified as independent risk factors [42, 43]. This likely reflects how much elderly patients undergoing major surgery differ from other study populations, and it demonstrates the importance of constructing a POI model specifically for this group. Additionally, these studies mainly used the LR algorithm and did not explore the predictive performance of the various ML algorithms.

Compared with previous studies, our study had the following strengths: first, the research population was an elderly group with a high incidence of POI; second, the type of surgery was major surgery, better targeting high-risk groups; third, in addition to using traditional algorithms, we explored model performance using popular ML algorithms; fourth, we applied interpretability techniques to ML algorithms; fifth, based on the nomogram and predicted probabilities, we stratified the population for better use in clinical practice. In future work, integrating our model into electronic health records or clinical decision support systems could provide clinicians with real-time predictions during preoperative assessments, thereby aiding personalized interventions and optimizing perioperative management.

Our study had some limitations. First, as a single-center retrospective study, it may have underestimated the incidence of POI. However, the sample size was large and the incidence rate aligns with literature reports. Second, the model lacks external validation; therefore, external validation across multiple centers is needed in the future to assess its generalizability. Third, some parameters related to mechanical ventilation were not collected. However, our model had good discriminative ability. Fourth, the use of LASSO regression for feature selection presents certain limitations. LASSO tends to select a small number of features and, when features are highly correlated, may arbitrarily select one and overlook the others. This could reduce model performance, as the model may fail to capture all potential variable interactions. Fifth, although the LR model performed well in overall predictions, ML models may exhibit superior predictive power in cases with more features and more complex relationships. Future research will incorporate additional features to further explore the performance of various algorithms.

Conclusions

In our study, the incidence of POI after major surgery in elderly patients was 10.03%, and POI was significantly associated with a longer postoperative hospital stay. This study identified five simple and easy-to-collect independent risk factors for POI in elderly patients undergoing major surgery and compared the effectiveness of different algorithms for predicting POI. Furthermore, we constructed a visualized nomogram of POI and performed risk stratification to guide the perioperative clinical management of elderly patients undergoing major surgery.

Data availability

The raw datasets used in the current study can be obtained from the corresponding author upon reasonable request.

Abbreviations

POI: Postoperative pulmonary infection
ML: Machine learning
LR: Logistic regression
AUC: Area under the receiver operating characteristic curve
DT: Decision tree
RF: Random forest
SVM: Support vector machine
GBDT: Gradient boosting decision tree
XGBoost: Extreme gradient boosting
MLP: Multilayer perceptron
BMI: Body mass index
COPD: Chronic obstructive pulmonary disease
BUN: Blood urea nitrogen
TBIL: Total bilirubin
ALT: Alanine aminotransferase
AST: Aspartate aminotransferase
RBC: Red blood cell
LASSO: Least absolute shrinkage and selection operator
VIF: Variance inflation factor
RUS: Random undersampling
GridSearchCV: Grid search with cross-validation
LIME: Local interpretable model-agnostic explanations
SHAP: Shapley additive explanation
IQR: Interquartile range

References

Papazian L, Klompas M, Luyt CE. Ventilator-associated pneumonia in adults: a narrative review. Intensive Care Med. 2020;46(5):888–906.
Hui S, Fowler AJ, Cashmore RMJ, Fisher TJ, Schlautmann J, Body S, Lan-Pak-Kee V, Webb M, Kyriakides M, Ng JY, et al. Routine postoperative noninvasive respiratory support and pneumonia after elective surgery: a systematic review and meta-analysis of randomised trials. Br J Anaesth. 2022;128(2):363–74.
Stotts C, Corrales-Medina VF, Rayner KJ. Pneumonia-Induced inflammation, resolution and cardiovascular disease: causes, consequences and clinical opportunities. Circ Res. 2023;132(6):751–74.
Liang S, Zhang X, Hu Y, Yang J, Li K. Association between perioperative chlorhexidine oral care and postoperative pneumonia in non-cardiac surgical patients: A systematic review and meta-analysis. Surgery. 2021;170(5):1418–31.
Redelmeier DA, McAlister FA, Kandel CE, Lu H, Daneman N. Postoperative pneumonia in elderly patients receiving acid suppressants: a retrospective cohort analysis. BMJ. 2010;340:c2608.
Henig O, Kaye KS. Bacterial pneumonia in older adults. Infect Dis Clin North Am. 2017;31(4):689–713.
Jung J, Moon SM, Jang HC, Kang CI, Jun JB, Cho YK, Kang SJ, Seo BJ, Kim YJ, Park SB, et al. Incidence and risk factors of postoperative pneumonia following cancer surgery in adult patients with selected solid cancer: results of Cancer POP study. Cancer Med. 2018;7(1):261–9.
Naito H, Nojima T, Fujisaki N, Tsukahara K, Yamamoto H, Yamada T, Aokage T, Yumoto T, Osako T, Nakao A. Therapeutic strategies for ischemia reperfusion injury in emergency medicine. Acute Med Surg. 2020;7(1):e501.
Partridge L, Deelen J, Slagboom PE. Facing up to the global challenges of ageing. Nature. 2018;561(7721):45–56.
Becher RD, Vander Wyk B, Leo-Summers L, Desai MM, Gill TM. The incidence and cumulative risk of major surgery in older persons in the United States. Ann Surg. 2023;277(1):87–92.
Weiser TG, Haynes AB, Molina G, Lipsitz SR, Esquivel MM, Uribe-Leitz T, Fu R, Azad T, Chao TE, Berry WR, et al. Estimate of the global volume of surgery in 2012: an assessment supporting improved health outcomes. Lancet. 2015;385(Suppl 2):S11.
Mungan İ, Dicle ÇB, Bektaş Ş, Sarı S, Yamanyar S, Çavuş M, Turan S, Bostancı EB. Does the preoperative platelet-to-lymphocyte ratio and neutrophil-to-lymphocyte ratio predict morbidity after gastrectomy for gastric cancer? Mil Med Res. 2020;7(1):9.
Shi H, Sang H, Zhang Z, Chen B, Li L, Liu F, Xia W, Zhou Y, Liu K, Li X, et al. Neutrophil-to-Lymphocyte ratio is associated with clinical outcomes in patients treated with mechanical thrombectomy for posterior circulation large vessel occlusion. World Neurosurg. 2024;193:628–35.
Herrero-Cervera A, Soehnlein O, Kenne E. Neutrophils in chronic inflammatory diseases. Cell Mol Immunol. 2022;19(2):177–91.
Ahn J, Chang JS, Kim JW. Postoperative pneumonia and aspiration pneumonia following elderly hip fractures. J Nutr Health Aging. 2022;26(7):732–8.
Gao YC, Zhang YW, Shi L, Gao W, Li YJ, Chen H, Rui YF. What are risk factors of postoperative pneumonia in geriatric individuals after hip fracture surgery: A systematic review and Meta-Analysis. Orthop Surg. 2023;15(1):38–52.
Russotto V, Sabaté S, Canet J. Development of a prediction model for postoperative pneumonia: A multicentre prospective observational study. Eur J Anaesthesiol. 2019;36(2):93–104.
Alowais SA, Alghamdi SS, Alsuhebany N, Alqahtani T, Alshaya AI, Almohareb SN, Aldairem A, Alrashed M, Bin Saleh K, Badreldin HA, et al. Revolutionizing healthcare: the role of artificial intelligence in clinical practice. BMC Med Educ. 2023;23(1):689.
Artzi NS, Shilo S, Hadar E, Rossman H, Barbash-Hazan S, Ben-Haroush A, Balicer RD, Feldman B, Wiznitzer A, Segal E. Prediction of gestational diabetes based on nationwide electronic health records. Nat Med. 2020;26(1):71–6.
Xue B, Li D, Lu C, King CR, Wildes T, Avidan MS, Kannampallil T, Abraham J. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. 2021;4(3):e212240.
Nishi H, Oishi N, Ishii A, Ono I, Ogura T, Sunohara T, Chihara H, Fukumitsu R, Okawa M, Yamana N, et al. Predicting clinical outcomes of large vessel occlusion before mechanical thrombectomy using machine learning. Stroke. 2019;50(9):2379–88.
Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE Rev Biomed Eng. 2021;14:116–26.
Nusinovici S, Tham YC, Chak Yan MY, Wei Ting DS, Li J, Sabanayagam C, Wong TY, Cheng CY. Logistic regression was as good as machine learning for predicting major chronic diseases. J Clin Epidemiol. 2020;122:56–69.
Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis. Int J Med Inf. 2021;151:104484.
Christodoulou E, Ma J, Collins GS, Steyerberg EW, Verbakel JY, Van Calster B. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J Clin Epidemiol. 2019;110:12–22.
Cabitza F, Rasoini R, Gensini GF. Unintended consequences of machine learning in medicine. JAMA. 2017;318(6):517–8.
Schwarze ML, Barnato AE, Rathouz PJ, Zhao Q, Neuman HB, Winslow ER, Kennedy GD, Hu YY, Dodgion CM, Kwok AC, et al. Development of a list of high-risk operations for patients 65 years and older. JAMA Surg. 2015;150(4):325–31.
Gill TM, Murphy TE, Gahbauer EA, Leo-Summers L, Becher RD. Geriatric vulnerability and the burden of disability after major surgery. J Am Geriatr Soc. 2022;70(5):1471–80.
Bluth T, Serpa Neto A, Schultz MJ, Pelosi P, Gama de Abreu M, Bluth T, Bobek I, Canet JC, Cinnella G, de Baerdemaeker L, et al. Effect of intraoperative high positive End-Expiratory pressure (PEEP) with recruitment maneuvers vs low PEEP on postoperative pulmonary complications in obese patients: A randomized clinical trial. JAMA. 2019;321(23):2292–305.
Schuijt MTU, Hol L, Nijbroek SG, Ahuja S, van Meenen D, Mazzinari G, Hemmes S, Bluth T, Ball L, Gama-de Abreu M, et al. Associations of dynamic driving pressure and mechanical power with postoperative pulmonary complications-posthoc analysis of two randomised clinical trials in open abdominal surgery. EClinicalMedicine. 2022;47:101397.
Gupta H, Gupta PK, Schuller D, Fang X, Miller WJ, Modrykamien A, Wichman TO, Morrow LE. Development and validation of a risk calculator for predicting postoperative pneumonia. Mayo Clin Proc. 2013;88(11):1241–9.
El Solh AA, Bhora M, Pineda L, Dhillon R. Nosocomial pneumonia in elderly patients following cardiac surgery. Respir Med. 2006;100(4):729–36.
Hassija V, Chamola V, Mahapatra A, Singal A, Goel D, Huang K, Scardapane S, Spinelli I, Mahmud M, Hussain A. Interpreting Black-Box models: A review on explainable artificial intelligence. Cogn Comput. 2024;16(1):45–74.
Li X, Zhao Y, Zhang D, Kuang L, Huang H, Chen W, Fu X, Wu Y, Li T, Zhang J, et al. Development of an interpretable machine learning model associated with heavy metals’ exposure to identify coronary heart disease among US adults via SHAP: findings of the US NHANES from 2003 to 2018. Chemosphere. 2023;311(Pt 1):137039.
Li W, Huang G, Tang N, Lu P, Jiang L, Lv J, Qin Y, Lin Y, Xu F, Lei D. Effects of heavy metal exposure on hypertension: A machine learning modeling approach. Chemosphere. 2023;337:139435.
Ribeiro MT, Singh S, Guestrin C. Why Should I Trust You? Explaining the Predictions of Any Classifier. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; San Francisco, California, USA. Association for Computing Machinery 2016: 1135–1144.
Eckart A, Struja T, Kutz A, Baumgartner A, Baumgartner T, Zurfluh S, Neeser O, Huber A, Stanga Z, Mueller B, et al. Relationship of nutritional status, inflammation, and serum albumin levels during acute illness: A prospective study. Am J Med. 2020;133(6):713–722.e7.
Hübner M, Mantziari S, Demartines N, Pralong F, Coti-Bertrand P, Schäfer M. Postoperative albumin drop is a marker for surgical stress and a predictor for clinical outcome: A pilot study. Gastroenterol Res Pract. 2016;2016:8743187.
Allie SR, Randall TD. Pulmonary immunity to viruses. Clin Sci (Lond). 2017;131(14):1737–62.
Agusti A, Fabbri LM, Singh D, Vestbo J, Celli B, Franssen FME, Rabe KF, Papi A. Inhaled corticosteroids in COPD: friend or foe? Eur Respir J. 2018;52(6).
Soeters PB, Wolfe RR, Shenkin A. Hypoalbuminemia: pathogenesis and clinical significance. JPEN J Parenter Enter Nutr. 2019;43(2):181–93.
Russotto V, Sabaté S, Canet J, for the PERISCOPE group of the European Society of Anaesthesiology Clinical Trial Network. Development of a prediction model for postoperative pneumonia: A multicentre prospective observational study. Eur J Anaesthesiol. 2019;36(2):93–104.
Arozullah AM, Khuri SF, Henderson WG, Daley J. Development and validation of a multifactorial risk index for predicting postoperative pneumonia after major noncardiac surgery. Ann Intern Med. 2001;135(10):847–57.


Acknowledgements

We would like to thank all the participants.

Funding

This study was supported by the National Key Research and Development Program of China (Grant No. 2018YFC2001903), National Natural Science Foundation of China (Grant No. 81873952), and National Natural Science Foundation of China (Grant No. 81901948).

Author information

Jie Liu and Xia Li contributed equally to this work.

Authors and Affiliations

Department of Anesthesiology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, People’s Republic of China
Jie Liu, Xia Li, Yanting Wang, Zhenzhen Xu, Yong Lv, Yuyao He, Lu Chen, Yiqi Feng, Guoyang Liu, Yunxiao Bai, Wanli Xie & Qingping Wu
Department of Anesthesiology, The Second Affiliated Hospital of Chongqing Medical University, Chongqing, China
Jie Liu

Contributions

JL and XL designed and analyzed the study. YW, ZX, YH, LC and YL conducted the data collection. YB, YF, GL and WX wrote the manuscript. QW edited the manuscript. All authors reviewed and approved the final version of the manuscript.

Corresponding author

Correspondence to Qingping Wu.

Ethics declarations

Ethics approval and consent to participate

The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Wuhan Union Hospital (Approval No. 2021-0986). To protect patient privacy, all personally identifiable information was anonymized. Due to the lack of identifiable information and retrospective nature of the study, informed consent was obtained.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

About this article

Cite this article

Liu, J., Li, X., Wang, Y. et al. Predicting postoperative pulmonary infection in elderly patients undergoing major surgery: a study based on logistic regression and machine learning models. BMC Pulm Med 25, 128 (2025). https://doi.org/10.1186/s12890-025-03582-4

Received: 20 June 2024
Accepted: 05 March 2025
Published: 19 March 2025
DOI: https://doi.org/10.1186/s12890-025-03582-4
