Skip to main content

The performance of EuroSCORE II in CABG patients in relation to sex, age, and surgical risk: a nationwide study in 14,118 patients

Abstract

Background

To determine the discriminative accuracy and calibration of EuroSCORE II in relation to age, sex, and surgical risk in a large nationwide coronary artery bypass grafting (CABG) cohort.

Methods

All 14,118 patients undergoing isolated CABG in Sweden during 2012–2017 were included. Individual patient data were taken from the SWEDEHEART registry. Patients were divided by age (< 60, 60–69, 70–79, ≥ 80 years), sex, and surgical risk (low: EuroSCORE < 4%, intermediate: 4–8%, high: > 8%). Discriminative accuracy was determined by the area under the receiver operating characteristic curve (AUC) and calibration by the observed/estimated (O/E) mortality ratio at 30 days.

Results

AUC and O/E ratio were 0.82 (95% CI 0.79–0.85) and 0.58 (0.50–0.66) overall, 0.82 (0.79–0.86) and 0.57 (0.48–0.66) in men, and 0.79 (0.73–0.85) and 0.60 (0.47–0.75) in women. Regarding age, discriminative accuracy was highest in patients aged 60–69 years (AUC: 0.86 [0.80–0.93]) but was satisfactory in all groups (AUC: 0.74–0.80). O/E ratio varied from 0.26 for patients > 60 years to 0.90 for patients > 80 years. Regarding surgical risk, AUC and O/E ratio were 0.63 (0.44–0.83) and 0.18 (0.09–0.30) in low-risk patients, 0.60 (0.55–0.66) and 0.57 (0.46–0.68) in intermediate-risk patients, and 0.78 (0.73–0.83) and 0.78 (0.64–0.92) in high-risk patients.

Conclusions

EuroSCORE II had good discriminative accuracy independently of sex and age, but markedly overestimated mortality risk, especially in younger patients. Accuracy and calibration were better in high-risk patients than in low-risk and intermediate-risk patients.

Peer Review reports

Introduction

Several risk stratification models based on patient characteristics, comorbidities, and type of surgical procedure have been developed to estimate the mortality risk after cardiac surgery [1]. The European System for Cardiac Operative Risk Evaluation (EuroSCORE), first introduced in 1999, was designed to improve patient selection and became widely adopted [2]. However, as perioperative and postoperative care improved, the discriminative accuracy and calibration of EuroSCORE I decreased. A new version, EuroSCORE II, which outperforms EuroSCORE I for risk stratification, was therefore introduced in 2011 [3]. Today, EuroSCORE II and the Society of Thoracic Surgery Predicted Risk of Mortality (STS-PROM) are the most widely recognized and utilized risk stratification tools [4, 5]. EuroSCORE II and STS-PROM have comparable discriminative accuracy and calibration regarding in-hospital and 30-day mortality in coronary artery bypass grafting (CABG) and in aortic valve replacement (AVR) patients [5,6,7,8].

Previous analyses of cardiac surgery risk scores, including EuroSCORE II, have noted that the scores overestimate the risk of death after CABG in octogenarians [9,10,11,12,13,14] and in high-risk patients [15, 16]. However, most of these studies were performed in single-centre cohorts of limited size, and large contemporary population-based studies are lacking. In the present study, we hypothesized on the basis of these previous studies that EuroSCORE II would perform less well in octogenarians and in high-risk patients. To test this hypothesis, we assessed the predictive accuracy and calibration of EuroSCORE II in different age groups, in men and women, and in patients with low, intermediate, and high surgical risk, using a large nationwide cohort of CABG patients.

Material and methods

Study population

All consecutive patients > 18 years of age who underwent first-time isolated CABG in Sweden between 1 January 2012 and 30 November 2017 were identified in the Swedish Cardiac Surgery Registry [17], which is part of the Swedish Web System for Enhancement and Development of Evidence-Based Care in Heart Disease Evaluated According to Recommended Therapies registry (SWEDEHEART) [18]. All patients were followed up until death, emigration, or 31 December 2017, whichever occurred first. The study cohort were divided into groups based on age at the time of CABG (< 60, 60–69, 70–79, and ≥ 80 years), sex, and risk group according to the EuroSCORE II surgical risk, with low risk defined as EuroSCORE < 4%, intermediate risk as 4–8%, and high risk as > 8%.

Data sources

Individual patient data from two nationwide registries were merged on the basis of the personal identification number which all Swedish residents are given at birth or shortly after immigration [19]. Operative details and patient characteristics including EuroSCORE II were extracted from the Swedish Heart Surgery Registry, which prospectively collects detailed information, including risk stratification, on all cardiac surgery patients and operations performed in Sweden since 1992 and has a coverage of 98–99% [17]. Mortality was extracted from the Cause of Death register, which has collected information on date and cause of death based on ICD codes since 1961 [20].

EuroSCORE II

EuroSCORE II estimates the 30-day mortality risk after cardiac surgery, expressed as a percentage. The variables included in EuroSCORE II are age, sex, presence of renal impairment, extracardiac arteriopathy, poor mobility, previous cardiac surgery, chronic pulmonary disease, active endocarditis, critical preoperative state, insulin-treated diabetes mellitus, New York Heart Association (NYHA) class of heart failure, unstable angina defined as Canadian Cardiology Society (CCS) class 4 angina, left ventricular function (LVEF; > 50%, 30–50%, 20–30%, < 20%), recent myocardial infarction (within 90 days), pulmonary hypertension, urgency of the procedure, weight of the intervention, and surgery on the thoracic aorta [3].

Outcome

The outcome was all-cause mortality defined as any death occurring between the start of surgery and 30 days after isolated CABG. The expected 30-day mortality, based on the calculated EuroSCORE II for each patient, was compared with the observed 30-day mortality.

Statistical analysis

Continuous variables were described as means and standard deviations and categorical variables as numbers and percentages. The discriminative accuracy was calculated with c-statistics [21] from a logistic regression and reported as the area under the receiver operating characteristic curve (AUC) with 95% confidence intervals (CIs), both for all patients and stratified by age group, risk group, and sex. Receiver operating characteristic curves and AUC were used to analyse the sensitivity and specificity of expected versus observed mortality within 30 days after surgery. The observed 30-day all-cause mortality was compared with the expected 30-day mortality, based on the calculated EuroSCORE II for each patient. The comparison was achieved by calculating the ratio of observed versus estimated mortality (O/E ratio) for all patients and for the respective groups. In addition, 95% CIs were constructed for the ratios with the bootstrap percentile method using 1000 bootstrapped samples. All tests were two-sided and conducted at the 5% significance level. All statistical analyses were performed using version 9.4 of SAS (Cary, NC).

Results

General

The study population consisted of 14,118 consecutive CABG patients. Their mean age was 68.5 years, and 18.3% were women. Baseline characteristics for patients are presented according to age in Table 1, and according to sex and surgical risk in Additional file 1: Tables S1 and S2. The proportions of comorbidities increased by age group except for diabetes, which was less common in the more elderly patients (Table 1). The proportions of men in each EuroSCORE II surgical risk score category were: low risk 43.6%, intermediate risk 49.5%, and high risk 6.8%. The corresponding proportions for women were 19.9%, 63.7%, and 16.4%, respectively (Additional file 1: Table S1). Baseline characteristics by risk group are given in Additional file 1: Table S2.

Table 1 Baseline demographics and EuroSCORE II variables among the CABG patients, overall and by age group

Mortality

Overall, the actual 30-day mortality was 1.5% for all patients, 1.3% in men, and 2.3% in women (Table 1). The 30-day mortality increased with age (< 60 years: 0.4%, 60–69 years: 0.8%, 70–79 years: 1.8%, > 80 years: 4.1%; Table 1) and with risk score (low: 0.2%, intermediate: 1.3%, high: 8.0%; Supplementary Table 2).

Performance of the EuroSCORE II model in CABG patients by age group

The overall discriminative accuracy of EuroSCORE II in the study population was good (AUC: 0.82; 95% CI 0.79–0.85; Fig. 1A). The accuracy of the model was acceptable in all age groups. The highest accuracy was observed in patients aged 60–69 years (AUC: 0.86, 95% CI 0.80–0.93], followed by those aged 70–79 years (AUC: 0.74, 95% CI 0.68–0.79) and > 80 years (AUC: 0.74, 95% CI 0.66–0.81; Fig. 1B).

Fig. 1
figure 1

Panel A Average area under the receiver operating characteristic curve (AUC) for EuroSCORE II in the total group; Panel B AUC for EuroSCORE II by age group; Panel C AUC for EuroSCORE II by surgical risk; Panel D AUC for Euroscore II by sex

Figure 2A shows the calibration of the EuroSCORE II model in CABG patients by age. The O/E ratio for all ages was 0.58 (95% CI 0.50–0.66). The EuroSCORE II model overestimated the mortality risk in all age groups. The calibration was poorest in patients younger than 60 years, and improved with increasing age (< 60 years: O/E = 0.26 [95% CI 0.11–0.45], 60–69 years: 0.43 [0.31–0.56], 70–79 years: 0.61 [0.49–0.73], ≥ 80 years: 0.90 [0.67–1.13]).

Fig. 2
figure 2

Panel A Observed and expected (O/E) 30-day mortality by age group in the total group; Panel B O/E 30-day mortality by surgical risk group; Panel C O/E 30-day mortality by age group among men; Panel D O/E 30-day mortality by age group among women

Performance of the EuroSCORE II model in CABG patients by sex

The discriminative accuracy of the EuroSCORE II model was acceptable in both men (AUC: 0.82, 95% CI 0.79–0.86) and women (AUC: 0.79, 95% CI 0.73–0.85) (Fig. 1D). Figures 2C and D show the calibration of the EuroSCORE II model in men versus women. The O/E ratio was 0.57 (95% CI 0.48–0.66) for men and 0.60 (95% CI 0.47–0.75) for women. Among men, the best calibration was observed in patients aged > 80 years (O/E: 0.99, 95% CI 0.72–1.27) and the lowest in patients aged < 60 years (O/E: 0.18, 95% CI 0.04–0.36). Among women, the best calibration was again observed among patients aged > 80 years (O/E: 0.69, 95% CI 0.35–1.09), while the lowest calibration was in aged 60–69 years (O/E: 0.51, 95% CI 0.24–0.82).

Performance of the EuroSCORE II model in CABG patients by surgical risk group

The discriminative accuracy of EuroSCORE II was best among patients with high surgical risk (AUC: 0.78, 95% CI 0.73–0.83; Fig. 1C), and was lower among patients with intermediate surgical risk (AUC: 0.60, 95% CI 0.55–0.66) and low surgical risk (AUC: 0.63, 95% CI 0.44–0.83). Patients in the high-risk group had the highest O/E ratio (0.78, 95% CI 0.64–0.92), followed by patients with intermediate risk (O/E: 0.57, 95% CI 0.46–0.68) and low risk (O/E: 0.18, 95% CI 0.09–0.30) (Fig. 2B).

Discussion

In this population-based study, we investigated the discrimination accuracy and calibration of the EuroSCORE II risk stratification tool in a large nationwide cohort of CABG patients. The main findings were as follows. Firstly, EuroSCORE II had good discriminative accuracy independently of sex and age, but markedly overestimated the mortality risk, especially in younger patients. Secondly, the discriminative accuracy and calibration were better in high-risk patients than in low-risk and intermediate-risk patients.

Risk stratification tools are used to determine the mortality risk in individual patients, but can also be used to facilitate operation program planning by optimizing patient mix, for quality assessment, and in benchmarking for comparisons between centres and surgeons. To achieve this, the tool needs to have high discriminative accuracy. The present study showed that EuroSCORE II had good discriminative accuracy when applied to a nationwide CABG cohort, and that the accuracy was mainly independent of age and sex. The overall AUC in the present study (0.82) was comparable to the accuracy achieved in the original validation data set of EuroSCORE II [3]. The acceptable overall discriminative accuracy of EuroSCORE II has been confirmed in several studies in different cardiac surgery populations as well as in meta-analyses [6, 7], showing an AUC of 0.77–0.81. The present study showed that the best discriminatory accuracy was detected in patients aged 60–69 years, and that this accuracy decreased somewhat with increasing age. These results are in accordance with those of Poullis et al., who suggested that the EuroSCORE II tool should be used with caution in patients > 70 years old [12]. The present finding of highest accuracy in patients aged 60–70 years can likely be explained by overrepresentation of patients of this age in the original EuroSCORE II dataset that was used to develop the score [3].

Besides the good discriminatory accuracy of EuroSCORE II, the results from the present study showed a marked overestimation of mortality in our CABG population, with an overall O/E ratio of 0.57. In comparison, a meta-analysis based on 22 studies in 145,592 mixed cardiac surgery patients reported an O/E ratio of 1.02 [6], while a large study in 16,096 CABG patients found an O/E ratio of 0.72 [16]. The present study does not give any clear explanation for the lower observed mortality in our study, though it may be at least partly due to improved intraoperative and postoperative care in this more contemporary study population. Nevertheless, the results imply that EuroSCORE II needs to be calibrated for different populations and/or procedures.

We observed the lowest O/E ratio in younger patients, with a value of 0.26 for patients < 60 years and 0.42 for patients aged 60–69 years, while the ratio was 0.89 in patients ≥ 80 years. This was a surprising result, given that some smaller studies have indicated that EuroSCORE II overestimates mortality in octogenarians [9, 10, 13]. Hence, our hypothesis that EuroSCORE II would perform less well in octogenarians could not be confirmed, since the discrimination accuracy was only somewhat lower in older patients and the calibration was better. We also hypothesized that EuroSCORE II would perform less well in patients with high surgical risk. This hypothesis was based on a study by Howell et al. [15] which showed low discriminative accuracy in high-risk patients (AUC: 0.65), and another study by Osnabrugge et al. [16] demonstrating a low O/E ratio (0.51) in high-risk aortic valve replacement patients. The results of the present study did not support our hypothesis, since both the discrimination accuracy and the calibration were better in high-risk than in low-risk and intermediate-risk patients.

The present study has both strengths and limitations. Strengths include the large nationwide study cohort, which is by far the largest yet used to examine the performance of EuroSCORE in relation to all three of age, sex, and surgical risk. Limitations include the definition of high, intermediate, and low surgical risk, which was adapted from the EACTS/ESC guideline definition in aortic valve replacement patients [22]. A consensus definition in CABG patients is lacking.

Conclusion

EuroSCORE II showed a satisfactory discriminative accuracy when applied in a large cohort of CABG patients. However, it markedly overestimated the mortality risk in this study cohort, especially in younger patients. This poor calibration strongly suggests that it is necessary to calibrate EuroSCORE II for different study populations.

Availability of data and materials

The data underlying this article will be shared on reasonable request to the corresponding author.

Abbreviations

AUC:

Area under the receiver operating characteristic curve

BMI:

Body mass index

CABG:

Coronary artery bypass grafting

CCS:

Canadian cardiovascular society functional classification of angina

CI:

Confidence interval

EuroSCORE:

The European system for cardiac operative risk evaluation

LVEF:

Left ventricular function

NYHA:

New York heart association class of heart failure.

O/E:

Observed/expected ratio

SWEDEHEART:

Swedish web system for enhancement and development of evidence-based care in heart disease evaluated according to recommended therapies

References

  1. Nilsson J, Algotsson L, Höglund P, Lührs C, Brandt J. Comparison of 19 preoperative risk stratification models in open-heart surgery. Eur Heart J. 2006;27:867–74.

    Article  Google Scholar 

  2. Nashef SA, Roques F, Michel P, Gauducheau E, Lemeshow S, Salamon R. European system for cardiac operative risk evaluation (EuroSCORE). Eur J Cardiothorac Surg. 1999;16:9–13.

    Article  CAS  Google Scholar 

  3. Nashef SA, Roques F, Sharples LD, Nilsson J, Smith C, Goldstone AR, et al. EuroSCORE II. Eur J Cardiothorac Surg. 2012;41:734–44.

    Article  Google Scholar 

  4. Nashef SA, Roques F, Hammill BG, Peterson ED, Michel P, Grover FL, et al. Validation of European system for cardiac operative risk evaluation (EuroSCORE) in North American cardiac surgery. Eur J Cardiothorac Surg. 2002;22:101–5.

    Article  Google Scholar 

  5. Ad N, Holmes SD, Patel J, Pritchard G, Shuman DJ, Halpin L. Comparison of EuroSCORE II, original EuroSCORE, and the Society of Thoracic Surgeons risk score in cardiac surgery patients. Ann Thorac Surg. 2016;102:573–9.

    Article  Google Scholar 

  6. Guida P, Mastro F, Scrascia G, Whitlock R, Paparella D. Performance of the European system for cardiac operative risk evaluation II: a meta-analysis of 22 studies involving 145,592 cardiac surgery procedures. J Thorac Cardiovasc Surg. 2014;148:3049–57.

    Article  Google Scholar 

  7. Sullivan GP, Wallach JD, Ioannidis JP. Meta-analysis comparing established risk prediction models (EuroSCORE II, STS score, and ACEF score) for perioperative mortality during cardiac surgery. Am J Cardiol. 2016;118:1574–82.

    Article  Google Scholar 

  8. Sinha S, Dimagli A, Dixon L, Gaudino M, Caputo M, Vohra HA, et al. Systematic review and meta-analysis of mortality risk prediction models in adult cardiac surgery. Interact Cardiovasc Thorac Surg. 2021;33:673–86.

    Article  Google Scholar 

  9. Luc JGY, Graham MM, Norris CM, Al Shouli S, Nijjar YS, Meyer SR. Predicting operative mortality in octogenarians for isolated coronary artery bypass grafting surgery: a retrospective study. BMC Cardiovasc Disord. 2017;17:275.

    Article  Google Scholar 

  10. Provenchère S, Chevalier A, Ghodbane W, Bouleti C, Montravers P, Longrois D, et al. Is the EuroSCORE II reliable to estimate operative mortality among octogenarians? PLoS ONE. 2017;16: e0187056.

    Article  Google Scholar 

  11. Nezic D, Spasic T, Micovic S, Kosevic D, Petrovic I, Lausevic-Vuk L, et al. Consecutive observational study to validate EuroSCORE II performances on a single-center contemporary cardiac surgical cohort. J Cardiothorac Vasc Anesth. 2016;30:345–51.

    Article  Google Scholar 

  12. Poullis M, Pullan M, Chalmers J, Mediratta N. The validity of the original EuroSCORE and EuroSCORE II in patients over the age of seventy. Interact Cardiovasc Thorac Surg. 2015;20:172–7.

    Article  Google Scholar 

  13. Pietrzyk E, Michta K, Gorczyca-Michta I, Wożakowska-Kapłon B. Coronary artery bypass grafting in patients over 80 years of age: a single-centre experience. Kardiol Pol. 2014;72:598–603.

    Article  Google Scholar 

  14. Hogervorst EK, Rosseel PMJ, van de Watering LMG, Brand A, Bentala M, van der Meer BJM, et al. Prospective validation of the EuroSCORE II risk model in a single Dutch cardiac surgery centre. Neth Heart J. 2018;26:540–51.

    Article  CAS  Google Scholar 

  15. Howell NJ, Head SJ, Freemantle N, van der Meulen TA, Senanayake E, Menon A, et al. The new EuroSCORE II does not improve prediction of mortality in high-risk patients undergoing cardiac surgery: a collaborative analysis of two European centres. Eur J Cardiothorac Surg. 2013;44:1006–11.

    Article  Google Scholar 

  16. Osnabrugge RL, Speir AM, Head SJ, Fonner CE, Fonner EA, Kappetein P, et al. Performance of EuroSCORE II in a large US database: implications for transcatheter aortic valve implantation. Eur J Cardiothorac Surg. 2014;46:400–8.

    Article  Google Scholar 

  17. Vikholm P, Ivert T, Nilsson J, Holmgren A, Freter W, Ternstrom L, et al. Validity of the Swedish cardiac surgery Registry. Interact Cardiovasc Thorac Surg. 2018;27:67–74.

    Article  Google Scholar 

  18. Jernberg T, Attebring MF, Hambraeus K, Ivert T, James S, Jeppsson A, et al. The Swedish web-system for enhancement and development of evidence-based care in heart disease evaluated according to recommended therapies (SWEDEHEART). Heart. 2010;96:1617–21.

    Article  Google Scholar 

  19. Ludvigsson JF, Andersson E, Ekbom A, Feychting M, Kim J-L, Reuterwall C, et al. External review and validation of the Swedish national inpatient register. BMC Public Health. 2011;9:450.

    Article  Google Scholar 

  20. Brooke H, Talbäck M, Hörnblad J, Johansson L, Ludvigsson JF, Druid H, et al. The Swedish cause of death register. Eur J Epidemiol. 2017;32:765–73.

    Article  CAS  Google Scholar 

  21. Caetano SJ, Sonpavde G, Pond GR. C-statistic: a brief explanation of its construction, interpretation, and limitations. Eur J Cancer. 2018;90:130–2.

    Article  CAS  Google Scholar 

  22. Baumgartner H, Falk V, Bax JJ, De Bonis M, Hamm C, Holm PJ, et al., ESC Scientific Document Group. 2017 ESC/EACTS Guidelines for the management of valvular heart disease. Eur Heart J 2017; 21: 2739–91

Download references

Acknowledgements

The authors thank Christopher Backstrom for the statistical analyses.

Funding

Open access funding provided by University of Gothenburg. The study was supported by the Swedish state under the ALF agreement between the Swedish government and the county councils concerning economic support of research and education of doctors (Grant No. ALFGBG-942665 to SN), the Local Research and Development Council Skaraborg (VGFOUSKB-964094 to MK), Skaraborg Hospital Research Fund (VGFOUSKAS-936367 to MK, VGFOUSKAS-963405 to MK), and Västra Götaland Regional Research Fund (VGFOUREG-940375 to MK).

Author information

Authors and Affiliations

Authors

Contributions

MK was involved in the study design, retrieval of data, analysis of data, and writing of the report. SN was involved in the study design, retrieval of data, analysis of data, and writing of the report. MS was involved in the analysis of data and writing of the report. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Martin Karlsson.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Regional Ethics Board of Gothenburg (approval number: 139-16), which waived the need for individual patient consent due to the retrospective registry-based study design.

Competing interests

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Tables S1 and S2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Silverborn, M., Nielsen, S. & Karlsson, M. The performance of EuroSCORE II in CABG patients in relation to sex, age, and surgical risk: a nationwide study in 14,118 patients. J Cardiothorac Surg 18, 40 (2023). https://doi.org/10.1186/s13019-023-02141-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13019-023-02141-4

Keywords