Development and external validation of a multivariable nomogram for predicting severe immune checkpoint inhibitor-associated myocarditis in advanced lung cancer
Original Article

Development and external validation of a multivariable nomogram for predicting severe immune checkpoint inhibitor-associated myocarditis in advanced lung cancer

Tao Luan1,2,3,4,5#, Kangjing Ma1#, Shuaiying Wang2#, Qingqing Yang1, Hongbin Ma1, Wei Liu1, Junyi Mo1, Can Huang1, Jiawen Chen1, Xinqing Lin1, Zekun Chenli1, Zijun Rong1, Baodan Yu1, Shanshan Li1, Yang Luan3, Qun Lv6, Hanxiu Lu3, Gang Yang3, Yunhui Zhang3, Leyang Yao7, Yijie Fang1, Xinfei He1, Zhenyi Lin1, Panxiao Shen1, Suneng Fu1,4, Xiaowu Tan1,8, Xiaohong Xie1, Limin Wang5, Chengzhi Zhou1

1Department of, The First Affiliated Hospital of Guangzhou Medical University, National Center for Respiratory Medicine, National Clinical Research Center for Respiratory Disease, State Key Laboratory of Respiratory Disease, Guangzhou Institute of Respiratory Health, Guangzhou, China; 2Anesthesia Surgery Center, Zhejiang Cancer Hospital, Hangzhou, China; 3Faculty of Life Science and Technology, Kunming University of Science and Technology, The Affiliated Hospital of Kunming University of Science and Technology, Kunming, China; 4Guangzhou National Laboratory, Guangzhou International Bio Island, Guangzhou, China; 5Department of Pulmonary and Critical Care Medicine, Affiliated Hangzhou First People Hospital, School of Medicine, Westlake University, Hangzhou, China; 6Department of, Hangzhou Second People’s Hospital, Affiliated Hospital of Hangzhou Normal University, Hangzhou, China; 7Department of, Jiexi County Second People’s Hospital, Jieyang, China; 8Department of Pulmonary and Critical Care Medicine, the Second Affiliated Hospital, University of South China, Hengyang, China

Contributions: (I) Conception and design: X Xie, L Wang, C Zhou; (II) Administrative support: C Zhou; (III) Provision of study materials or patients: T Luan, S Wang, Q Yang, X Lin, B Yu, Y Luan, Q Lv, H Lu, G Yang, Y Zhang, L Yao, P Shen, S Fu, X Tan; (IV) Collection and assembly of data: K Ma, H Ma, W Liu, J Mo, C Huang, J Chen, Z Chenli, Z Rong, S Li, H Lu, G Yang, Y Fang, X He, Z Lin; (V) Data analysis and interpretation: T Luan, K Ma, S Wang; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

#These authors contributed equally to this work as co-first authors.

Correspondence to: Xiaohong Xie. Department of, The First Affiliated Hospital of Guangzhou Medical University, National Center for Respiratory Medicine, National Clinical Research Center for Respiratory Disease, State Key Laboratory of Respiratory Disease, Guangzhou Institute of Respiratory Health, Guangzhou 510000, China. Email: gyxxh@126.com; Limin Wang. Department of Pulmonary and Critical Care Medicine, Affiliated Hangzhou First People Hospital, School of Medicine, Westlake University, Hangzhou 310022, China. Email: lemonwlm@163.com; Chengzhi Zhou. Department of, The First Affiliated Hospital of Guangzhou Medical University, National Center for Respiratory Medicine, National Clinical Research Center for Respiratory Disease, State Key Laboratory of Respiratory Disease, Guangzhou Institute of Respiratory Health, Guangzhou 510000, China. Email: doctorzcz@163.com.

Background: Immune checkpoint inhibitors (ICIs) have improved outcomes in advanced lung cancer, but immune checkpoint inhibitor-associated myocarditis (ICIM), particularly severe ICIM, remains a rare yet potentially fulminant and fatal immune-related adverse event (irAE). We aimed to develop and externally validate a clinically accessible risk-prediction model based on routine baseline cardiac assessments to enable early identification and risk stratification of patients at high risk of severe ICIM around ICI initiation.

Methods: This multicenter retrospective study included a training set, an internal validation set, and an external validation set. Patients with early-stage lung cancer, no ICI exposure, incomplete baseline cardiac evaluation [electrocardiogram (ECG)/echocardiography], missing predictor data, or insufficient follow-up were excluded; pre-existing cardiovascular comorbidities were not exclusionary and were recorded as baseline characteristics. The primary endpoint was severe ICIM [Common Terminology Criteria for Adverse Events (CTCAE) v5.0 grade ≥3]. Candidate predictors [high-sensitivity troponin, B-type natriuretic peptide (BNP), ECG, and echocardiography] were obtained from baseline/routine assessments at ICI initiation, prior to any ICIM event. Model Discrimination was assessed using receiver operating characteristic curve/area under the curve (ROC/AUC) with calibration plots and decision curve analysis (DCA) evaluating calibration and clinical utility.

Results: A total of 834 patients were enrolled, with an overall mean age of 60 years [standard deviation (SD) =14]; by cohort, mean ages were approximately 59 years (training set), 61 years (internal validation set), and 61 (external validation set) years. Males accounted for 79.5% of the total cohort, with a slightly higher proportion in the internal validation set (83.2%) compared to the training (75.5%) and external validation (81.4%) sets. The prevalence of smoking was 61.0% and comparable across cohorts; 52.3% had comorbidities. Adenocarcinoma was the most common subtype (39.8%), followed by squamous cell carcinoma (35.4%), with similar distributions across cohorts. Based on these cohorts, four readily available predictors (troponin, BNP, ECG, echocardiography) were retained to construct the severe ICIM model and nomogram. The AUCs were 0.926 [95% confidence interval (CI): 0.880–0.971] in the training set, 0.885 (95% CI: 0.742–1.000) in the internal validation set, and 0.949 (95% CI: 0.884–1.000) in the external validation set; calibration plots and DCA demonstrated favorable agreement between predicted and actual outcomes, as well as significant net benefit across clinically relevant thresholds.

Conclusions: This four-predictor severe ICIM risk model and nomogram, developed from multicenter real-world data and externally validated, supports early risk stratification and monitoring strategies for advanced lung cancer patients receiving ICIs. Prospective studies are warranted to confirm its generalizability and optimize implementation.

Keywords: Lung cancer; immunotherapy-induced myocarditis; risk-scoring model


Submitted Nov 18, 2025. Accepted for publication Feb 28, 2026. Published online May 26, 2026.

doi: 10.21037/tlcr-2025-aw-1252


Highlight box

Key findings

• We developed and externally validated a four-factor nomogram to predict severe immune checkpoint inhibitor-associated myocarditis (ICIM) in advanced lung cancer.

• High-sensitivity troponin, B-type natriuretic peptide, electrocardiography, and echocardiography were retained as practical predictors, and the model showed good discrimination and calibration across training, internal validation, and external validation cohorts.

What is known and what is new?

• Immune checkpoint inhibitors (ICIs) have improved outcomes in advanced lung cancer, but ICIM remains a rare and potentially fatal immune-related adverse event. Most existing tools are used after myocarditis is suspected or diagnosed.

• This study provides a clinically accessible baseline risk-prediction model based on routine cardiac assessments at ICI initiation, with multicenter development and external validation in real-world lung cancer cohorts.

What is the implication, and what should change now?

• Routine baseline cardiac assessment may help identify patients at increased risk of severe ICIM before or at the start of immunotherapy.

• This model may support risk-stratified monitoring, earlier multidisciplinary evaluation, and more practical cardio-oncology surveillance in routine care.


Introduction

In recent years, immunotherapy has emerged as a promising treatment approach for various types of cancer, including advanced lung cancer. The use of immune checkpoint inhibitors (ICIs), such as programmed cell death protein 1 (PD-1) inhibitors, has shown significant efficacy in improving the outcomes of patients with advanced non-small cell lung cancer (NSCLC) (1,2). However, along with their therapeutic benefits, ICIs have been associated with immune-related adverse events (irAEs) that can affect different organs, including the heart (3).

One of the potentially life-threatening irAEs associated with immunotherapy is immune myocarditis, an inflammatory condition affecting the heart muscle (4,5). Immune myocarditis can presented with a wide range of symptoms, including chest pain, dyspnea, palpitations, and heart failure, and may lead to severe cardiac dysfunction and even death if not promptly recognized and managed (6,7). Given the increasing use of ICIs in the treatment of lung cancer patients, the risk of developing immune myocarditis has become a topic of significant clinical relevance (8).

Several studies have investigated the incidence, risk factors, clinical presentation, and management of immune myocarditis in cancer patients receiving immunotherapy (9-11). However, although registry-based and multicenter studies have proposed clinical risk stratification approaches incorporating troponin elevation, ECG/conduction abnormalities, and ventricular dysfunction, most tools are applied at the time of suspected or established ICIM and have limited validation in Asian lung cancer real-world cohorts (10). Therefore, a pragmatic baseline risk-prediction tool that can be applied around ICI initiation—distinct from diagnostic evaluation for active myocarditis—may help triage monitoring intensity and enable earlier recognition of high-risk patients in routine practice (12).

Baseline abnormalities in cardiac biomarkers and routine tests may also have clinical plausibility for risk stratification (13). Pre-treatment troponin or BNP elevation and ECG/echocardiographic abnormalities can reflect subclinical myocardial injury or reduced cardiac reserve rather than autoimmune susceptibility per se, potentially lowering the threshold for clinically overt myocarditis once immune activation occurs during ICI therapy. In line with cardio-oncology guidance recommending baseline ECG and cardiac biomarkers before ICI initiation, we aimed to develop and externally validate a parsimonious model based on routinely available assessments (troponin, BNP, ECG, and echocardiography) to predict subsequent ICIM risk in advanced lung cancer. We present this article in accordance with the TRIPOD reporting checklist (available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-2025-aw-1252/rc).


Methods

Patient data

This retrospective analysis included 834 patients with advanced lung cancer who received ICIs at The First Affiliated Hospital of Guangzhou Medical University, Zhejiang Cancer Hospital, and The Affiliated Hospital of Kunming University of Science and Technology between January 2018 and June 2024 (Figure 1). Data from the Guangzhou center were randomly split at a 7:3 ratio into a training cohort (n=306) and an internal test cohort (n=131); the remaining 397 patients constituted the external validation cohort (Figure 1). Given the retrospective design and the rarity of severe ICIM, the study size was determined pragmatically by including all consecutive eligible patients during the prespecified period; therefore, no a priori sample size calculation was performed (14).

Figure 1 Flowchart of patient selection and Risk-Scoring model development. Flow diagram of patient selection and cohort splitting. From all ICI-treated patients, exclusions yielded 834 eligible cases; these were partitioned into a development cohort (patients without severe ICIM, CTCAE grade 0–2, n=403; severe ICIM, CTCAE v5.0 grade ≥3, n=34) and an external validation cohort (patients without severe ICIM, CTCAE grade 0–2, n=366; severe ICIM, CTCAE v5.0 grade ≥3, n=31). The development set was used to build the logistic prediction model; the external set was used for out-of-sample validation. CTCAE, Common Terminology Criteria for Adverse Events; ICI, immune checkpoint inhibitor; ICIM, immune checkpoint inhibitor-associated myocarditis.

Inclusion criteria

  • Patients with advanced stage lung cancer receiving immunotherapy;
  • Patients with available data on immunotherapy treatment;
  • Patients who have undergone cardiac evaluation with electrocardiogram (ECG) and echocardiogram;
  • Patients with pathology confirmation of lung cancer;
  • Patients with complete data on gender, age, tumor-node-metastasis (TNM) staging, and smoking status;
  • Patients with follow-up data on the occurrence of immune myocarditis.

Exclusion criteria

  • Patients with early stage lung cancer;
  • Patients who have not received immunotherapy treatment;
  • Patients with incomplete data on cardiac evaluation;
  • Patients without confirmed pathology of lung cancer;
  • Patients with missing data on predictive factors;
  • Patients lost to follow-up for monitoring immune myocarditis occurrence.

Pre-existing cardiovascular comorbidities were not used as exclusion criteria and were recorded as baseline characteristics. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. This study was approved by the Ethics Committee of The First Affiliated Hospital of Guangzhou Medical University (approval No. KX-2023-009-01 approval date: 2023-09-01). Because this was a retrospective analysis of de-identified data, the requirement for written informed consent was waived by the ethics committee. All participating hospitals were informed and agreed to the study.

Immune checkpoint inhibitor-associated myocarditis (ICIM) diagnosis and grading

ICIM was adjudicated by two experienced physicians. After excluding alternative etiologies, diagnosis integrated medical history and clinical symptoms, the temporal relationship between ICI exposure and symptom onset, dynamic changes in troponin and other myocardial injury biomarkers, imaging abnormalities, cardiac magnetic resonance (CMR), and endomyocardial biopsy when indicated (15). For prediction modeling, candidate predictors were restricted to baseline/routine cardiac assessments at ICI initiation prior to any ICIM event; abnormalities recorded during suspected myocarditis were used only for endpoint adjudication and CTCAE grading. Toxicity grading followed CTCAE v5.0 (16), with ICIM classified as mild (grades 1–2) or severe (grade ≥3) (17). For model development and validation, the primary endpoint was severe ICIM (CTCAE v5.0 grade ≥3); patients without severe ICIM (no ICIM or mild ICIM, CTCAE grade 0–2) constituted the reference group (18).

Data collection

Demographic and clinical variables were collected, including age, sex, smoking status, comorbidities, lung cancer subtype, clinical stage, and treatment regimen. Candidate predictors were obtained from baseline/routine assessments at ICI initiation, prior to ICIM occurrence, including high-sensitivity troponin, BNP, ECG findings, and echocardiography results.

For laboratory predictors, results were recorded as normal versus abnormal based on the assay-specific upper reference limit (99th percentile/ULN) used at each participating center and operationalized using the local laboratory flag (normal vs abnormal) to accommodate inter-center assay heterogeneity. Index-center reference ranges for selected laboratory tests (CK: 50–310 U/L; BNP: <100 pg/mL) are provided in the Supplementary Materials.

ECG was categorized as abnormal if the baseline report documented any rhythm or conduction abnormality or repolarization change (e.g., ST-T changes, atrioventricular/bundle-branch block, atrial/ventricular arrhythmias); otherwise, it was categorized as normal (17). Echocardiography was categorized as abnormal if baseline transthoracic echocardiography documented ventricular systolic dysfunction [e.g., reduced (LVEF)], regional wall-motion abnormality, clinically relevant pericardial effusion, or other structural/functional abnormalities; otherwise, it was categorized as normal (17).

Outcome data were collected according to the adjudication and grading process described above.

Statistical analysis

Data from the Guangzhou center were randomly divided into a training cohort and internal validation cohorts at a 7:3 ratio for model development and internal evaluation. Non-normal distributed data were summarized as median (interquartile ranges). Categorical variables were analyzed by Chi-squared or Fisher’s exact test, and continuous variables by Student’s t-test or rank-sum test.

In the training cohort, least absolute shrinkage and selection operator (LASSO)-logistic regression was used to select candidate predictors derived from baseline assessments at ICI initiation and to develop a multivariable logistic model, which was visualized as a nomogram. Model performance was assessed by discrimination (ROC/AUC), calibration (calibration curves with Brier score), and clinical utility [decision curve analysis (DCA) across clinically plausible threshold probabilities] (19).

For clinical classification, an operating probability cutoff was prespecified in the training cohort and applied unchanged to the internal test and external validation cohorts; however, threshold-based classification metrics [sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV)] were not reported in this manuscript and are planned for prospective validation.

Two-sided P<0.05 was considered statistically significant. Analyses were performed using R (version 4.2.2) together with MSTATA (www.mstata.com).

Model development and validation

Predictors selected by LASSO-logistic regression in the training cohort were used to construct the prediction nomogram and multivariable logistic model which was then evaluated in the internal test and external validation cohorts by ROC/AUC, calibration, and DCA. Point assignment for the nomogram/risk score was derived from model coefficients, and the total score mapped to the predicted probability of severe ICIM. Model building and visualization were implemented in R 4.2.2 and MSTATA.


Results

Patient characteristics

In this study, we analyzed the baseline demographic and clinical characteristics of a total of 834 individuals across different cohorts. The mean age of the overall cohort was 60 years, with a standard deviation of 14. When stratified by cohort, the mean age was similar across the training cohort (59 years) and the external test cohort (61 years), while the internal test cohort had a slightly higher mean age of 61 years. In terms of gender distribution, 79.5% of the overall cohort was male, with a slightly higher proportion in the internal test cohort (83.2%) compared to the training cohort (75.5%) and external test cohort (81.4%). The prevalence of smoking was found in 61.0% of the overall cohort, with a similar distribution across the training, internal test, and external test cohorts. When considering comorbidities, 52.3% of the overall cohort had comorbid conditions, with comparable rates among the different cohorts. Regarding lung cancer types, adenocarcinoma was the most common subtype (39.8%), followed by squamous cell carcinoma (35.4%), with similar distribution across the cohorts (Table 1).

Table 1

Patient demographics and baseline characteristics

Characteristic Cohort P value
Overall (N=834) Training cohort (N=306) Internal test cohort (N=131) External test cohort (N=397)
Age (years) 60±14 59±14 61±14 61±13 0.17
Gender 0.08
   Male 663 (79.5) 231 (75.5) 109 (83.2) 323 (81.4)
   Female 171 (20.5) 75 (24.5) 22 (16.8) 74 (18.6)
Smoking 0.18
   No 325 (39.0) 130 (42.5) 53 (40.5) 142 (35.8)
   Yes 509 (61.0) 176 (57.5) 78 (59.5) 255 (64.2)
Comorbidity 0.60
   No 398 (47.7) 139 (45.4) 64 (48.9) 195 (49.1)
   Yes 436 (52.3) 167 (54.6) 67 (51.1) 202 (50.9)
Lung cancer type 0.92
   Adenocarcinoma 332 (39.8) 124 (40.5) 50 (38.2) 158 (39.8)
   Others 207 (24.8) 71 (23.2) 33 (25.2) 103 (25.9)
   Squamous cell carcinoma 295 (35.4) 111 (36.3) 48 (36.6) 136 (34.3)
Clinical stage 0.02
   III C 153 (18.3) 71 (23.2) 20 (15.3) 62 (15.6)
   IV 681 (81.7) 235 (76.8) 111 (84.7) 335 (84.4)
Treatment 0.19
   First line 179 (21.5) 59 (19.3) 24 (18.3) 96 (24.2)
   Non-first line 655 (78.5) 247 (80.7) 107 (81.7) 301 (75.8)
Heart failure 0.81
   No 741 (88.8) 273 (89.2) 118 (90.1) 350 (88.2)
   Yes 93 (11.2) 33 (10.8) 13 (9.9) 47 (11.8)
Troponin <0.001
   Normal 610 (73.1) 203 (66.3) 83 (63.4) 324 (81.6)
   Abnormal 224 (26.9) 103 (33.7) 48 (36.6) 73 (18.4)
Creatine kinase <0.001
   Normal 587 (70.4) 167 (54.6) 75 (57.3) 345 (86.9)
   Abnormal 247 (29.6) 139 (45.4) 56 (42.7) 52 (13.1)
BNP <0.001
   Normal 654 (78.4) 202 (66.0) 96 (73.3) 356 (89.7)
   Abnormal 180 (21.6) 104 (34.0) 35 (26.7) 41 (10.3)
ECG 0.52
   Normal 551 (66.1) 203 (66.3) 81 (61.8) 267 (67.3)
   Abnormal 283 (33.9) 103 (33.7) 50 (38.2) 130 (32.7)
Echocardiography 0.94
   Normal 582 (69.8) 213 (69.6) 90 (68.7) 279 (70.3)
   Abnormal 252 (30.2) 93 (30.4) 41 (31.3) 118 (29.7)
IL-1β () 11.3±4.9 11.3±4.9 12.0±4.6 11.1±5.0 0.26
IL-6 () 30±14 30±14 31±14 30±14 0.74
IL-8 () 36±19 36±19 36±21 37±19 0.81

Data are presented as mean ± standard deviation or n (%). , one-way ANOVA; Pearson’s Chi-squared test. ANOVA, analysis of variance; BNP, B-type natriuretic peptide; ECG, electrocardiogram; IL, interleukin.

Clinical staging revealed a significant difference between cohorts, with 81.7% of the overall cohort classified as stage IV. Treatment patterns varied across the different cohorts, with 78.5% of individuals did not receive first-line treatment. The prevalence of heart failure was 11.2% in the overall cohort, with consistent rates in the training, internal test, and external test cohorts. Biomarker analysis indicated abnormal levels of troponin, creatine kinase, BNP, and IL-8 in a subset of individuals, with significant differences noted in their distribution among the cohorts. Cardiac parameters assessed by ECG and echocardiography showed variations in abnormalities among the cohorts, with no distinct patterns observed. Levels of inflammatory markers IL-1β and IL-6 did not exhibit significant differences among the cohorts.

Univariate analysis of influencing factors for severe ICIM

Univariate analysis of factors influencing severe ICIM revealed significant associations (P<0.05) with the following variables: troponin, B-type natriuretic peptide (BNP), and echocardiography, totaling 3 factors (Table 2).

Table 2

Patient demographics and baseline characteristics

Characteristics Training cohort Internal test cohort External test cohort
No (N=278) Yes (N=28) P value No (N=125) Yes (N=6) P value No (N=366) Yes (N=31) P value
Age (years) 59±14 60±14 0.77 61±14 57±12 0.53 61±13 62±12 0.81
Gender 0.60 0.59 0.92
   Male 211 [76] 20 [71] 103 [82] 6 [100] 298 [81] 25 [81]
   Female 67 [24] 8 [29] 22 [18] 0 [0] 68 [19] 6 [19]
Smoking 0.041 0.22 0.007
   No 113 [41] 17 [61] 49 [39] 4 [67] 124 [34] 18 [58]
   Yes 165 [59] 11 [39] 76 [61] 2 [33] 242 [66] 13 [42]
Comorbidity 0.61 >0.99 0.77
   No 125 [45] 14 [50] 61 [49] 3 [50] 179 [49] 16 [52]
   Yes 153 [55] 14 [50] 64 [51] 3 [50] 187 [51] 15 [48]
Lung cancer type 0.054 >0.99 0.36
   Adenocarcinoma 116 [42] 8 [29] 47 [38] 3 [50] 142 [39] 16 [52]
   Others 67 [24] 4 [14] 32 [26] 1 [17] 96 [26] 7 [23]
   Squamous cell carcinoma 95 [34] 16 [57] 46 [37] 2 [33] 128 [35] 8 [26]
Clinical stage 0.01 0.23 0.30
   III C 59 [21] 12 [43] 18 [14] 2 [33] 55 [15] 7 [23]
   IV 219 [79] 16 [57] 107 [86] 4 [67] 311 [85] 24 [77]
Treatment 0.42 >0.99 0.51
   First line 52 [19] 7 [25] 23 [18] 1 [17] 87 [24] 9 [29]
   Non-first line 226 [81] 21 [75] 102 [82] 5 [83] 279 [76] 22 [71]
Heart failure 0.75 >0.99 >0.99
   No 247 [89] 26 [93] 112 [90] 6 [100] 322 [88] 28 [90]
   Yes 31 [11] 2 [7] 13 [10] 0 [0] 44 [12] 3 [10]
Troponin <0.001 0.002 <0.001
   Normal 195 [70] 8 [29] 83 [66] 0 [0] 310 [85] 14 [45]
   Abnormal 83 [30] 20 [71] 42 [34] 6 [100] 56 [15] 17 [55]
Creatine kinase <0.001 0.08 <0.001
   Normal 163 [59] 4 [14] 74 [59] 1 [17] 340 [93] 5 [16]
   Abnormal 115 [41] 24 [86] 51 [41] 5 [83] 26 [7] 26 [84]
BNP <0.001 0.005 <0.001
   Normal 200 [72] 2 [7] 95 [76] 1 [17] 353 [96] 3 [10]
   Abnormal 78 [28] 26 [93] 30 [24] 5 [83] 13 [4] 28 [90]
ECG <0.001 0.20 <0.001
   Normal 199 [72] 4 [14] 79 [63] 2 [33] 261 [71] 6 [19]
   Abnormal 79 [28] 24 [86] 46 [37] 4 [67] 105 [29] 25 [81]
Echocardiography <0.001 0.01 <0.001
   Normal 210 [76] 3 [11] 89 [71] 1 [17] 276 [75] 3 [10]
   Abnormal 68 [24] 25 [89] 36 [29] 5 [83] 90 [25] 28 [90]
IL-1β () 11.1±4.9 13.1±4.5 0.04 11.8±4.5 15.7±4.0 0.06 11.1±5.1 11.8±4.0 0.38
IL-6 () 30±14 31±14 0.71 30±14 34±7 0.29 30±14 25±15 0.12
IL-8 () 36±19 34±15 0.49 36±21 26±24 0.35 37±19 38±17 0.72

Data are presented as mean ± standard deviation or n [%]. , Welch two sample t-test; Pearson’s Chi-squared test; Fisher’s exact test. , Welch two sample t-test; Fisher’s exact test. BNP, B-type natriuretic peptide; ECG, electrocardiogram; IL, interleukin.

Lasso regression was employed to further analyze the aforementioned factors. Considering the importance of electrocardiography in the diagnosis of myocarditis, the model identified the following as the best matching factors: troponin, BNP, echocardiography, and electrocardiography, totaling 4 factors (Figure 2A,2B).

Figure 2 LASSO-logistic predictor selection. (A) Coefficient trajectories for candidate variables across log(λ); numbers on top indicate the count of non-zero coefficients. (B) Ten-fold cross-validated binomial deviance versus log(λ); vertical dotted lines mark λ_min and λ_1se. The parsimonious λ was chosen to construct the prediction model. LASSO, least absolute shrinkage and selection operator.

Multivariate analysis of factors influencing severe ICIM

Multivariate logistic regression analysis identified the following as independent risk factors for severe ICIM: troponin [odds ratio (OR) =3.88, 95% confidence interval (CI): 1.41–10.66, P=0.008], BNP (OR =14.97, 95% CI: 3.29–68.21, P<0.001), ECG (OR =2.90, 95% CI: 0.75–11.23, P=0.12), and echocardiography (OR =7.86, 95% CI: 1.87–33.02, P=0.005) (Figure 3A). Despite the borderline statistics, ECG was retained in the final nomogram because of its clinical salience, high accessibility, and consistent directional signal in prior cardio-oncology literature, thereby adding pragmatic triage value within the multivariable framework. The accuracy and discriminative power of these four factors were validated using receiver operating characteristic (ROC) curves (Figure 3B).

Figure 3 Predictive contribution and single-marker discrimination. (A) Multivariable logistic coefficients retained after LASSO for troponin, BNP, ECG, and echocardiography (abnormal vs. normal). (B) ROC curves of each single marker for predicting severe ICIM with AUCs shown on the plot. AUC, area under the curve; BNP, B-type natriuretic peptide; CI, confidence interval; ECG, electrocardiogram; ICIM, immune checkpoint inhibitor-associated myocarditis; LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic.

Construction of a clinical nomogram model for severe ICIM prediction

Based on the results of multivariate logistic regression, a nomogram for predicting severe ICIM outcomes was constructed. In personalized medicine, troponin, BNP, echocardiography, and ECG can be used to determine the corresponding scores on the nomogram. By summing the scores of each variable, the nomogram enables the estimation of severe ICIM probability in patients receiving ICI treatment (Figure 4).

Figure 4 Nomogram of the multivariable logistic prediction model for severe ICIM. Points are assigned for troponin, BNP, ECG, and echocardiography (abnormal vs. normal). The total points map to the linear predictor and the predicted probability of severe ICIM at baseline. BNP, B-type natriuretic peptide; ECG, electrocardiogram; ICIM, immune checkpoint inhibitor-associated myocarditis.

Validation of the accuracy and discriminative power of the severe ICIM nomogram model

Random sampling was used, with a ratio of 7:3 between the training set (n=306) and the internal validation set (n=131). Additionally, 397 cases of clinical data from Zhejiang Cancer Hospital and The Affiliated Hospital of Kunming University of Science and Technology were used as the external validation set. ROC curves and calibration curves were employed to validate the accuracy and discriminative power of the nomogram. Results showed that the area under the curve (AUC) was 0.926 (95% CI: 0.880–0.971) for the training set, 0.885 (95% CI: 0.742–1.000) for the internal validation set, and 0.949 (95% CI: 0.884–1.000) for the external validation set (Figure 5A). Meanwhile, the calibration curves of the training set, internal validation set, and external validation set exhibited good agreement (Figure 5B-5D). The corresponding Brier scores were 5.0 (95% CI: 3.4–6.7) in the training set, 4.8 (95% CI: 2.5–7.0) in the internal validation set, and 3.3 (95% CI: 2.0–4.6) in the external validation set (Figure 5B-5D). This indicates that the predictive model has strong discriminative power.

Figure 5 Model performance across cohorts. (A) ROC curves for the development (training), internal test, and external validation cohorts with corresponding AUCs. (B-D) Calibration plots for the same cohorts; grey dashed line indicates perfect calibration. Brier scores are displayed on each panel. AUC, area under the curve; CI, confidence interval; ROC, receiver operating characteristic.

Evaluation of the clinical utility of the severe ICIM nomogram model

Clinical DCA was used to analyze the clinical utility of this model. Results showed maximum benefit in the training set (threshold probability: 0.00–0.62), internal validation set, and external validation set (threshold probability: 0.16–0.96) within these threshold probability ranges. DCA showed that the nomogram yielded a positive net benefit over the ‘treat-all’ and ‘treat-none’ strategies across clinically relevant threshold probabilities, supporting potential clinical utility and informing threshold selection in practice (Figure 6A-6C).

Figure 6 Clinical utility of the model by decision curve analysis. (A-C) Net benefit across threshold probabilities in the development, internal test, and external validation cohorts, comparing the prediction model (red) against “treat-all” and “treat-none” strategies.

Discussion

Key findings

In a cohort of advanced lung cancer patients treated with ICIs, we developed and validated a risk-prediction model for severe ICIM. The model integrates four readily obtainable clinical variables—high-sensitivity troponin, BNP, ECG, and transthoracic echocardiography—and provides individualized risk stratification via a nomogram. Across the training and external validation cohorts, the model demonstrated robust discrimination (AUC >0.80) and good calibration, indicating stable predictive performance. High-risk patients exhibited a significantly higher incidence of severe ICIM than low-risk patients; baseline troponin elevation was most strongly associated, with BNP, ECG abnormalities and echocardiographic findings providing complementary predictive value in risk stratification. Notably, ECG abnormality showed an OR of 2.90 (95% CI: 0.75–11.23; P=0.12). Although not statistically significant, its direction was consistent and clinically meaningful, and we therefore retained ECG in the nomogram to reflect the value of electrical changes in risk identification (16,20). These observations align with prior evidence that nearly all patients with severe ICIM present with elevated troponin (~94%) and the majority with new ECG abnormalities (~88%) (4).

Comparison with existing literature

Our findings are highly concordant with contemporary guidelines and large cohorts. The 2022 ESC cardio-oncology guidelines recommend routine ECG and troponin monitoring before and early during ICI therapy, with echocardiography for higher-risk patients (17,21). We confirm that baseline troponin abnormalities predict subsequent severe ICIM, consistent with clinical observations. Although ECG did not reach statistical significance in our dataset, its effect direction mirrors published data: multicenter reports indicate that ~90% of severe ICIM cases develop new ECG changes, including ST-T abnormalities, conduction blocks, and arrhythmias (4), underscoring ECG’s role as a pragmatic “sentinel” sign. Given its high accessibility and sensitivity to electrical disturbances, we judged ECG to have clinical utility within a multivariable framework despite borderline statistics. We also observed higher severe ICIM risk among older and male patients, in line with systematic evidence (4,22). By contrast, BNP contributed less to prediction—likely reflecting limited specificity and susceptibility to comorbidities—whereas troponin, a more specific marker of myocyte injury, remained central (18,19,23). Unlike diagnostic studies emphasizing CMR or endomyocardial biopsy, our approach prioritizes low-barrier markers for early risk triage, which enhances real-world scalability.

Pathophysiological rationale

Severe ICIM arises when blockade of PD-1/PD-L1 or CTLA-4 pathways disrupts immune tolerance and triggers T-cell-mediated myocardial inflammation (22,24-27) (4,20-22). Myocyte injury leads to early troponin release, while conduction system involvement causes arrhythmias and conduction delays—mechanistically consistent with our ECG signal despite limited power for statistical significance (23,24). With disease progression, ventricular dysfunction emerges, BNP levels rises, and echocardiography may reveal regional wall-motion abnormalities (28). Importantly, approximately half of severe ICIM patients maintain preserved ejection fraction early in the course (4), indicating that imaging derangements often lag behind biomarker and electrical changes (29). The model’s architecture—“troponin → ECG → echocardiography” aligns with a pathophysiological continuum—thus confers clinical interpretability.

Clinical implications

Although severe ICIM is uncommon (<1%) (4,11), its carries substantial mortality (30–50%) (24) and poses major threats to treatment continuity (28). Our model supports early identification of high-risk patients during ICI initiation, enabling intensified monitoring, multidisciplinary co-management, and timely initiation of immunosuppression when appropriate (25,30). In practice, ECG abnormalities—even with modest standalone effect—add triage value as part of a composite tool, especially in resource-constrained settings (24,26). Because the model relies on routine tests rather than repeated CMR or prophylactic biopsy, it offers favorable accessibility and cost-effectiveness (27,30), positioning it as a clinical “gatekeeper” for real-world deployment.

Strengths and limitations

Strengths include a prespecified multicenter development-validation workflow (stratified internal split plus an independent external cohort) that supports transportability; TRIPOD-aligned, transparent reporting; and exclusive reliance on routine, low-cost cardio-oncology tests (ECG, echocardiography, hs-troponin/BNP) with formal calibration and decision-curve analysis, using thresholds fixed in training and applied unchanged to validation—facilitating bedside uptake.

Limitations merit caution. (I) Retrospective design with potential residual confounding and documentation bias. (II) Severe ICIM is a rare outcome, and the limited number of severe events implies a lower events-per-predictor ratio (EPV) increasing uncertainty and potential overfitting (29), therefore, discrimination estimates (including high AUCs) should be interpreted cautiously under marked class imbalance. (III) Heterogeneous diagnostics and case ascertainment across centers (e.g., variation in CMR/biopsy availability) may contribute to outcome misclassification. (IV) Predictors were restricted to baseline assessments at ICI initiation and did not model longitudinal trajectories (e.g., dynamic troponin/ECG changes). (V) ECG abnormality showed a nonsignificant association but was retained for clinical interpretability within a multivariable framework. (VI) For rare events, ROC may overstate clinical performance; calibration drift may occur at extreme risk deciles. (VII) External validity focused on advanced lung cancer cohorts; prospective, registry-embedded impact evaluation remains needed. (VIII) We did not perform a temporally separated validation: splitting by calendar periods would further reduce events per period and could yield unstable estimates; robust temporal validation would require harmonized timestamps across centers (ICI initiation date, baseline cardiac assessment date, follow-up window, and adjudication/diagnosis date), which were not consistently available without substantial re-curation. (IX) Threshold-based classification metrics (sensitivity, specificity, PPV/NPV) and calibration intercept/slope were not reported; future prospective validation will prespecify operating points and provide these metrics to support clinical implementation (31).

Future directions

Built from widely available clinical indicators, this nomogram offers an immediately implementable way to flag patients at heightened risk of severe ICIM and to guide monitoring intensity. To translate it responsibly into practice, actions should focus on:

  • Incorporating longitudinal trajectories to enable dynamic risk prediction and to verify the time-dependent contribution of ECG and other variables;
  • Broadening external validation across tumor types and diverse ICI regimens;

Integrating the tool with electronic health records (EHRs) for real-time risk scoring and automated alerts (17,32).

With continued optimization and prospective multicenter validation, the model could anchor standardized cardio-oncology safety workflows while preserving immunotherapy delivery.


Conclusions

We developed and externally validated a simple four-predictor model—high-sensitivity troponin, BNP, ECG, and echocardiography—to estimate the risk of severe ICI-related myocarditis (severe ICIM) in advanced lung cancer. The nomogram showed strong discrimination and good calibration across internal and external cohorts, supporting pre-ICI triage and risk-stratified monitoring using readily available tests. Prospective multicenter studies and EHR integration are warranted to confirm generalizability and facilitate clinical implementation.


Acknowledgments

None.


Footnote

Reporting Checklist: The authors have completed the TRIPOD reporting checklist. Available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-2025-aw-1252/rc

Data Sharing Statement: Available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-2025-aw-1252/dss

Peer Review File: Available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-2025-aw-1252/prf

Funding: This work was supported by National Natural Science Foundation of China (No. 82570007), Development and Industrialization of a Robotic Bronchoscopy Platform (2025B1111010001), Plan on enhancing scientific research in GMU (GMUCR2024-01011), Major Project of Guangzhou National Laboratory (GZNL2023A02011), Study of the Carcinogenic Mechanisms and Molecular Characteristics in Primary Pulmonary Lymphoepithelioma-like Carcinom (2024B03J0862), and the Construction Fund of Key Medical Disciplines of Hangzhou (2025HZGF06).

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at https://tlcr.amegroups.com/article/view/10.21037/tlcr-2025-aw-1252/coif). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. This study was approved by the Ethics Committee of the First Affiliated Hospital of Guangzhou Medical University (Approval number: KX-2023-009-01 approval date: 2023-09-01). Because this was a retrospective analysis of de-identified data, the requirement for written informed consent was waived by the ethics committee. All participating hospitals were informed and agreed to the study.

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.


References

  1. Brahmer J, Reckamp KL, Baas P, et al. Nivolumab versus Docetaxel in Advanced Squamous-Cell Non-Small-Cell Lung Cancer. N Engl J Med 2015;373:123-35. [Crossref] [PubMed]
  2. Garon EB, Hellmann MD, Rizvi NA, et al. Five-Year Overall Survival for Patients With Advanced Non‒Small-Cell Lung Cancer Treated With Pembrolizumab: Results From the Phase I KEYNOTE-001 Study. J Clin Oncol 2019;37:2518-27. [Crossref] [PubMed]
  3. Yin Q, Wu L, Han L, et al. Immune-related adverse events of immune checkpoint inhibitors: a review. Front Immunol 2023;14:1167975. [Crossref] [PubMed]
  4. Mahmood SS, Fradley MG, Cohen JV, et al. Myocarditis in Patients Treated With Immune Checkpoint Inhibitors. J Am Coll Cardiol 2018;71:1755-64. [Crossref] [PubMed]
  5. Zhan L, Qin Z, Yang L, et al. Immune-related myocarditis characterized by malignant arrhythmia caused by conduction system involvement: case report and literature review. Front Oncol 2025;15:1589990. [Crossref] [PubMed]
  6. Moslehi JJ, Salem JE, Sosman JA, et al. Increased reporting of fatal immune checkpoint inhibitor-associated myocarditis. Lancet 2018;391:933. [Crossref] [PubMed]
  7. Moslehi J, Lichtman AH, Sharpe AH, et al. Immune checkpoint inhibitor-associated myocarditis: manifestations and mechanisms. J Clin Invest 2021;131:e145186. [Crossref] [PubMed]
  8. Munir AZ, Gutierrez A, Qin J, et al. Immune-checkpoint inhibitor-mediated myocarditis: CTLA4, PD1 and LAG3 in the heart. Nat Rev Cancer 2024;24:540-53. [Crossref] [PubMed]
  9. Salem JE, Allenbach Y, Vozy A, et al. Abatacept for Severe Immune Checkpoint Inhibitor-Associated Myocarditis. N Engl J Med 2019;380:2377-9. [Crossref] [PubMed]
  10. Laenens D, Yu Y, Santens B, et al. Incidence of Cardiovascular Events in Patients Treated With Immune Checkpoint Inhibitors. J Clin Oncol 2022;40:3430-8. [Crossref] [PubMed]
  11. Nielsen DL, Juhl CB, Nielsen OH, et al. Immune Checkpoint Inhibitor-Induced Cardiotoxicity: A Systematic Review and Meta-Analysis. JAMA Oncol 2024;10:1390-9. [Crossref] [PubMed]
  12. Mahmood SS, Fradley MG, Cohen JV, et al. Myocarditis in Patients Treated With Immune Checkpoint Inhibitors. J Am Coll Cardiol 2018;71:A699.
  13. Rothman AMK, Vachiery JL, Howard LS, et al. Intravascular Ultrasound Pulmonary Artery Denervation to Treat Pulmonary Arterial Hypertension (TROPHY1): Multicenter, Early Feasibility Study. JACC Cardiovasc Interv 2020;13:989-99. [Crossref] [PubMed]
  14. Riley RD, Ensor J, Snell KIE, et al. Calculating the sample size required for developing a clinical prediction model. BMJ 2020;368:m441. [Crossref] [PubMed]
  15. Thuny F, Bonaca MP, Cautela J. What Is the Evidence of the Diagnostic Criteria and Screening of Immune Checkpoint Inhibitor-Induced Myocarditis? JACC CardioOncol 2022;4:624-8. [Crossref] [PubMed]
  16. Freites-Martinez A, Santana N, Arias-Santiago S, et al. Using the Common Terminology Criteria for Adverse Events (CTCAE - Version 5.0) to Evaluate the Severity of Adverse Events of Anticancer Therapies. Actas Dermosifiliogr (Engl Ed) 2021;112:90-2. [Crossref] [PubMed]
  17. Brahmer JR, Abu-Sbeih H, Ascierto PA, et al. Society for Immunotherapy of Cancer (SITC) clinical practice guideline on immune checkpoint inhibitor-related adverse events. J Immunother Cancer 2021;9:e002435. [Crossref] [PubMed]
  18. Thygesen K, Alpert JS, Jaffe AS, et al. Fourth Universal Definition of Myocardial Infarction (2018). J Am Coll Cardiol 2018;72:2231-64. [Crossref] [PubMed]
  19. Debray TPA, Collins GS, Riley RD, et al. Transparent reporting of multivariable prediction models developed or validated using clustered data (TRIPOD-Cluster): explanation and elaboration. BMJ 2023;380:e071058. [Crossref] [PubMed]
  20. Zlotoff DA, Hassan MZO, Zafar A, et al. Electrocardiographic features of immune checkpoint inhibitor associated myocarditis. J Immunother Cancer 2021;9:e002007. [Crossref] [PubMed]
  21. Lyon AR, López-Fernández T, Couch LS, et al. 2022 ESC Guidelines on cardio-oncology developed in collaboration with the European Hematology Association (EHA), the European Society for Therapeutic Radiology and Oncology (ESTRO) and the International Cardio-Oncology Society (IC-OS). Eur Heart J 2022;43:4229-361. [Crossref] [PubMed]
  22. Mauriello A, Correra A, Maratea AC, et al. Immune Checkpoint Inhibitor-Associated Myocarditis: Risk, Diagnosis, and Clinical Impact. J Clin Med 2026;15:814. [Crossref] [PubMed]
  23. Murtagh G, deFilippi C, Zhao Q, et al. Circulating biomarkers in the diagnosis and prognosis of immune checkpoint inhibitor-related myocarditis: time for a risk-based approach. Front Cardiovasc Med 2024;11:1350585. [Crossref] [PubMed]
  24. Feng S, Zhao B, Sha S, et al. Overview of immune checkpoint inhibitor associated myocarditis mechanisms diagnostics and treatment. Front Immunol 2025;16:1677984. [Crossref] [PubMed]
  25. Ma P, Liu J, Qin J, et al. Expansion of Pathogenic Cardiac Macrophages in Immune Checkpoint Inhibitor Myocarditis. Circulation 2024;149:48-66. [Crossref] [PubMed]
  26. Won T, Kalinoski HM, Wood MK, et al. Cardiac myosin-specific autoimmune T cells contribute to immune-checkpoint-inhibitor-associated myocarditis. Cell Rep 2022;41:111611. [Crossref] [PubMed]
  27. Axelrod ML, Meijers WC, Screever EM, et al. T cells specific for α-myosin drive immunotherapy-related myocarditis. Nature 2022;611:818-26. [Crossref] [PubMed]
  28. Talukder S, Ghose A, Chakraborty T, et al. Evolving cardiac biomarkers for immune checkpoint inhibitor related myocarditis in cancer patients. Int J Cardiol Heart Vasc 2023;49:101278. [Crossref] [PubMed]
  29. Chen YC, Dolladille C, Rao A, et al. Immune Checkpoint Inhibitor Myocarditis and Left Ventricular Systolic Dysfunction. JACC CardioOncol 2025;7:234-48. [Crossref] [PubMed]
  30. Badaan S, Abbass A, Tov S, et al. How should immune checkpoint inhibitor myocarditis be treated? Cardiooncology 2025;11:108. [Crossref] [PubMed]
  31. Van Calster B, McLernon DJ, van Smeden M, et al. Calibration: the Achilles heel of predictive analytics. BMC Med 2019;17:230. [Crossref] [PubMed]
  32. Chandra A, Philips ST, Pandey A, et al. Electronic Health Records-Based Cardio-Oncology Registry for Care Gap Identification and Pragmatic Research: Procedure and Observational Study. JMIR Cardio 2021;5:e22296. [Crossref] [PubMed]
Cite this article as: Luan T, Ma K, Wang S, Yang Q, Ma H, Liu W, Mo J, Huang C, Chen J, Lin X, Chenli Z, Rong Z, Yu B, Li S, Luan Y, Lv Q, Lu H, Yang G, Zhang Y, Yao L, Fang Y, He X, Lin Z, Shen P, Fu S, Tan X, Xie X, Wang L, Zhou C. Development and external validation of a multivariable nomogram for predicting severe immune checkpoint inhibitor-associated myocarditis in advanced lung cancer. Transl Lung Cancer Res 2026;15(5):118. doi: 10.21037/tlcr-2025-aw-1252

Download Citation