Introduction

Neoadjuvant treatment is highly recommended for patients with human epidermal growth factor receptor 2-positive (HER2+) breast cancer (BC), given its well-established benefits in improving surgery outcome and survival1,2. Pathological complete response (pCR) is a standard endpoint for measuring the short-term outcome of neoadjuvant treatment, while the long-term endpoints are often survival outcomes3. Approximately 50–70% of HER2 + BC cases also express hormone receptors (HR), which have a complex crosstalk with HER24,5. The HR-positive (HR+)/HER2 + population faces higher treatment resistance and a less favorable prognosis6. Achieving pCR in neoadjuvant therapy is more challenging in the HER2+/HR + population when compared with the HER2+/HR-negative (HR-) population7. This highlights an unmet clinical need to determine the optimal neoadjuvant strategy for HR+/HER2 + BC. Commonly used regimens and drugs are summarized in Supplementary Table S1.

Targeted therapy for HER2 + BC has advanced significantly over the past decades. Trastuzumab combined with pertuzumab is the current standard of care for primary HER2 + BC in the neoadjuvant setting1,2. Besides, novel HER2-targeting therapies showed promising results in the neoadjuvant setting, such as antibody-drug conjugate trastuzumab emtansine (T-DM1)8,9 or trastuzumab deruxtecan (T-DXd)10 and tyrosine kinase inhibitor (TKI) neratinib11 or pyrotinib12. These trials confirmed that the HR + population achieved lower pCR rates than the HR- population. Notably, no head-to-head comparison of all regimens has been conducted in the HR+/HER2 + population, raising the question of whether the HR + population might benefit from different HER2-targeting regimens compared to the overall HER2 + population.

Chemotherapy exhibits reduced benefits in the HR + BC population compared to HR- BC13. Therefore, there is a growing trend to minimize chemotherapy to balance risks and benefits14. Conventionally, anthracyclines were frequently used in chemotherapy regimens. However, recent studies have shown that the cardiotoxicity of anthracyclines may be superimposed with antibodies15, and omitting the use of anthracyclines does not influence pCR or survival outcomes in patients with HER2 + BC16. The TRYPHAENA trial added carboplatin to anthracycline-free neoadjuvant chemotherapy, which yielded similar pCR and event-free survival (EFS) between both groups17, establishing the role of carboplatin in treating HER2 + BC. Therefore, there are clinical considerations regarding whether anthracyclines could be omitted and whether carboplatin should be added to neoadjuvant chemotherapy regimens for HR+/HER2 + BC treatment.

Due to the complex interaction between the downstream pathways of HR and HER2, endocrine therapy (ET) is recommended for HR+/HER2 + BC treatment18,19. Dual pathway blockade has demonstrated considerable efficacy in advanced BC20,21,22. Therefore, neoadjuvant ET has been well studied in the HR+/HER2 + population and yielded positive outcomes23,24,25,26. Nonetheless, debates persist regarding the clinical applications of neoadjuvant ET, including its optimal timing, potential long-term impact on prognosis, the suitability of pCR as an efficacy endpoint, and its potential to replace chemotherapy.

Previous network meta-analyses have primarily focused on the overall HER2 + BC population, with limited exploration of the HR+/HER2 + subgroup.16,27,28,29,30, While a few studies have evaluated pCR rates across regimens for HR+/HER2 + BC, these analyses often lacked comprehensive consideration of long-term outcomes16,30,31.

Leveraging newly released data from recent trials, such as WGS-TP-II32 and PHEDRA12, this study aims to demonstrate the optimal HER2-targeting regimen for HR+/HER2 + BC by integrating both short-term and long-term outcomes. Additionally, we aimed to explore the roles of carboplatin, anthracycline, and ET in the neoadjuvant period, providing updated insights into the management of this challenging subgroup.

Materials and methods

Search strategy and selection criteria

This study is a systematic review with network meta-analysis. Our review protocol was registered on the PROSPERO website (CRD42023385644), and this systematic review is reported following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines and its extension for network meta-analysis33,34,35.

Literature was retrieved from Medline, EMBASE, Cochrane Library, and Web of Science with no restrictions on publication year or language. Detailed strategies used in each database are provided in Supplementary Table S2–S5. Search terms were built based on population, intervention, and outcome of interest. References of formerly published trials and reviews were also checked for missing literature. Unpublished trials and gray literature were not included.

The predefined inclusion and exclusion criteria are as follows: Neoadjuvant clinical trials were eligible if they met all the following criteria: (1) included patients with HR+/HER2 + BC, (2) compared any neoadjuvant regimens that included HER2-targeting therapies, and (3) reported any pCR or EFS (or disease-free survival) outcome. Trials were excluded if they (1) were bioequivalence studies, (2) did not report the outcome of patients with HR+/HER2 + BC, and (3) compared solely on variations in dose or sequence between treatment regimens. The literature retrieved was selected by two reviewers independently.

Interventions

Interventions were merged according to HER2-targeting therapies of the neoadjuvant regimens, and interventions of interest included six types as follows:

  1. (1)

    T-DM1-based regimens (T-DM1), which include T-DM1 alone, T-DM1 + pertuzumab, or T-DM1 + lapatinib, either with or without chemotherapy or ET.

  2. (2)

    trastuzumab and pertuzumab with chemotherapy (HP + C).

  3. (3)

    trastuzumab and TKI with chemotherapy (H + TKI + C). TKI includes lapatinib, neratinib, and pyrotinib.

  4. (4)

    trastuzumab with chemotherapy (H + C).

  5. (5)

    lapatinib with chemotherapy (L + C).

  6. (6)

    any chemotherapy with no HER2-targeting regimen (C).

Outcomes

The primary outcome of this study was the number of patients achieving pCR, defined as the absence of residual invasive cancer in the breast and no pathological involvement of axillary lymph nodes (ypT0/Tis ypN0). Other definitions of pCR were also accepted as (1) the absence of residual invasive and in situ cancer in the breast and no pathological involvement of axillary lymph nodes (ypT0 ypN0) or (2) the absence of residual invasive cancer in the breast (ypT0/Tis) since some trials did not report ypT0/Tis ypN0, and sensitivity analysis was conducted to assess the potential bias involved by this. The secondary outcome was EFS, defined as the time from randomization to an event which may include disease progression, discontinuation of the treatment for any reason, or death.

Data analysis

All data were independently extracted by two reviewers. Data were extracted on summary estimates, including trial characteristics (trial name, register number, starting year, country, and total participants), patient characteristics (age, tumor stage, tumor size, menopausal status, and HR status), intervention details (regimen and total duration), and interested outcomes. For pCR, the number of patients and the pCR definitions in the corresponding trial were extracted. For EFS, three-year or five-year EFS rates, hazard ratios (HRs), and the 95% confidence interval (CI) between treatment arms were extracted.

All studies included were assessed for the risk of bias with the Cochrane risk-of-bias tool (RoB2)36 by two reviewers independently. Five domains listed below were checked and reported as low risk, some concern, or high risk for each trial.

  1. (1)

    Bias arising from the randomization process, assessing the generation and concealment of allocation sequence, and the differences between intervention groups.

  2. (2)

    Bias due to deviations from intended interventions, assessing the blinding of participants and personnel, and deviations from the intended interventions.

  3. (3)

    Bias due to missing outcome data, assessing the availability of outcome data.

  4. (4)

    Bias in the measurement of the outcome, assessing the appropriateness and blinding of outcome assessment.

  5. (5)

    Bias in the selection of the reported result.

All statistical analyses were conducted based on the HR+/HER2 + BC population. For each comparison of regimens reported in more than one trial, we performed a direct meta-analysis with odds ratios (ORs) of pCR for pooling effect sizes. A common effect model was used, and ORs were reported with 95% CIs, where a random effects model was also conducted for sensitivity analysis. All tests were two-sided and statistical significance was set at p < 0·05. Cochran’s Q test was performed to assess the statistical heterogeneity. For comparison groups with I2 statistic > 25% or p-value of the Q test < 0·1, sensitivity analysis was conducted to identify the source of heterogeneity by deleting each trial. Peter’s test was conducted to assess publication bias in each direct comparison, and results were presented by funnel plots37.

For indirect comparisons, we conducted a network meta-analysis with the common effect model in a Bayesian framework. Four Markov chains were run for 50,000 simulation iterations after 10,000 tuning iterations. ORs with 95% CIs for pCR analysis and HRs with 95% CIs for EFS analysis were reported. A node-splitting analysis was performed to determine the inconsistency between direct and indirect comparisons38, and a comparison-adjusted funnel plot was constructed to evaluate the small-study effects across the network39. Sensitivity analysis was performed by reevaluating the results while excluding trials with a high risk of overall bias, using other TKIs than lapatinib, or reporting pCR outcomes in different definitions. The ranking probability was evaluated for each treatment regimen and summarized with the surface under the cumulative ranking curve (SUCRA). A clustered SUCRA ranking plot considering both pCR and EFS was also presented40.

To further analyze the efficacy of chemotherapy and ET regimens, direct comparisons were performed between anthracycline-containing and non-anthracycline therapy, carboplatin-containing and non-carboplatin therapy, as well as regimens with or without ET. Heterogeneity was assessed with Cochran’s Q test, and subgroup analysis was performed to identify its source.

All statistical analyses were conducted in R version 4.3.1. Plots in risk of bias analysis were made with the ‘robvis’ package. Direct meta-analysis was performed with ‘meta’ package. Bayesian network meta-analysis was performed with ‘gemtc’ and ‘rjags’ package.

Results

Literature retrieval and study characteristics

2844 articles were retrieved from four databases, of which 58 articles8,9,11,12,17,25,26,32,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90, including 28 trials, were eligible and were extracted for accessible data (Fig. 1; Table 1). 20 trials with 2809 patients with HR+/HER2 + BC were included in the primary pCR analysis comparing HER2-targeting regimens. Additionally, five trials with 877 patients were assessed for ET efficacy, six trials with 818 patients for carboplatin, and four trials with 482 patients for anthracycline.

The duration of neoadjuvant treatment ranged from 12 to 30 weeks, due to differences in cycle numbers and length. Among all HER2 + cases included, 63·0% (4347/6904) were HR+. All trials reported the number of patients with pCR in the HR + subgroup, while six of them provided EFS results for the HR + subgroup.

The risk of bias assessment identified one trial with a high overall risk of bias due to the missing randomization process. Most trials (21/28) had a low overall risk of bias, while six trials had some concerns about the risk of bias (Supplementary Fig. S1, Supplementary Table S6).

Fig. 1
figure 1

PRISMA flow chart of literature retrieval and selecting.

Table 1 Patient characteristics of the include trials.

Pairwise meta-analysis

From trials that reported pCR in the HR+/HER2 + population, six direct comparisons were made: (1) T-DM1 vs. HP + C (2) HP + C vs. H + C (3) H + TKI + C vs. H + C (4) H + TKI + C vs. L + C (5) L + C vs. H + C (6) C vs. H + C. HP + C demonstrated a significantly higher pCR rate compared to H + C, with an OR (95% CI) of 1·77 (1·05–2·98). H + TKI + C achieved a higher pCR rate than both H + C and L + C with ORs (95% CI) of 1·61 (1·22–2·11) and 2·29 (1·63–3·21), respectively. Both L + C and C were associated with significantly lower pCR than H + C, with ORs of 0·64 (95% CI 0·48–0·85) and 0·39 (95% CI 0·23–0·66), respectively (Fig. 2). Sensitivity analyses with random effects models all resulted similarly (Supplementary Fig. S2, Supplementary Table S7).

When comparing T-DM1 vs. HP + C, a high I2 statistic was observed (I2 = 45%, p = 0·12) after the heterogeneity test. Sensitivity analysis showed that the lowest I2 statistic (I2 = 15%, Supplementary Fig. S3) was achieved when the TEAL trial was excluded, with no impact on the outcome of this comparison (OR 0·93, 95%CI 0·66–1·31). No evidence of publication bias was observed after Peter’s test, and all funnel plots were approximately symmetrical (Supplementary Fig. S4).

Fig. 2
figure 2

Forest plots of direct comparisons for pCR. (a) T-DM1 vs. HP + C; (b) HP + C vs. H + C; (c) H + TKI + C vs. H + C; (d) H + TKI + C vs. L + C; (e) L + C vs. H + C; (f) C vs. H + C. T-DM1 = trastuzumab emtansine based regimens; HP + C = trastuzumab and pertuzumab with chemotherapy; H + TKI + C = trastuzumab and TKI with chemotherapy; H + C = trastuzumab with chemotherapy; L + C = lapatinib with chemotherapy; C = chemotherapy with no HER2-targeting regimen. The squares and the lines crossing the square stand for OR and its 95%CI. The diamonds stand for the estimated pooled OR and its 95% CI. Heterogeneity was tested with Cochran’s Q test, and all statistical tests were two-sided.

Network meta-analysis

The network comparing pCR included 20 trials with 15 comparisons, and the network comparing EFS included five trials with ten comparisons (Fig. 3). ORs of pCR and HRs of EFS were listed in Table 2 with their 95% CI. We observed statistically significant differences in 11 comparisons for pCR and no statistically significant difference for EFS across different treatment arms.

The node-splitting model was used for the consistency test, and no significant inconsistency was found (Supplementary Fig. S5). The adjusted funnel plot was symmetrical, indicating no significant small-study effect (Supplementary Fig. S6). Sensitivity analysis was performed by excluding trials with a high risk of overall bias, using other TKIs instead of lapatinib, or reporting pCR in different definitions, and most results remained stable (Supplementary Table S8–S10).

Table 2 League table of odds ratio for pCR and hazard ratio for EFS.
Fig. 3
figure 3

Network plots for pathological complete response (pCR) and event free survival (EFS). (a) pCR, and (b) EFS. Each line represents a direct comparison, and its width is weighted by the number of trials reporting this comparison, which is presented next to it. T-DM1 = trastuzumab emtansine based regimens; HP + C = trastuzumab and pertuzumab with chemotherapy; H + TKI + C = trastuzumab and TKI with chemotherapy; H + C = trastuzumab with chemotherapy; L + C = lapatinib with chemotherapy; C = chemotherapy with no HER2-targeting regimen.

Fig. 4
figure 4

Two-dimensional ranking plot of SUCRA value for highest probability of pCR rate and lowest risk of EFS events. T-DM1 = trastuzumab emtansine based regimens; HP + C = trastuzumab and pertuzumab with chemotherapy; H + TKI + C = trastuzumab and TKI with chemotherapy; H + C = trastuzumab with chemotherapy; L + C = lapatinib with chemotherapy; C = chemotherapy with no HER2-targeting regimen.

Ranking of regimens

SUCRA values were calculated and indicated that T-DM1 (SUCRA 0.86) had the highest probability of achieving pCR. HP + C (SUCRA 0.82) closely followed in second place, and H + TKI + C (SUCRA 0.71) ranked third. However, when considering EFS, HP + C (SUCRA 0.69) emerged as the top regimen, followed by H + TKI + C (SUCRA 0.63).

A clustered ranking plot was generated by combining the values of SUCRA from both pCR (on the y-axis) and EFS (on the x-axis) (Fig. 4). HP + C emerged as the optimal choice (top right) considering both short-term clinical response and long-term survival outcome. Detailed ranking probabilities are shown in Supplementary Fig. S7.

Chemotherapy and endocrine therapy analysis

Pairwise meta-analyses were performed to further analyze the efficacy of different chemotherapy and ET with the same HER2-targeting regimen. We conducted three direct meta-analyses comparing pCR rates separately between treatments with or without anthracycline (OR 0·74, 95% CI 0·51–1·07), carboplatin (OR 1·27, 95% CI 0·95–1·96), and ET (OR 0·69, 95% CI 0·52–0·92) (Supplementary Fig. S8 A–C). When comparing ET, significant heterogeneity was observed (I2 = 81%, p < 0·01). Therefore, we conducted a subgroup analysis, which indicated that the subgroup where chemotherapy was replaced with ET still presented heterogeneity (I2 = 65%, p = 0·09) (Supplementary Fig. S8 D). This heterogeneity could be attributed to variations in neoadjuvant treatment duration. For example, the ET group in the PHERGain trial received 24 weeks of neoadjuvant therapy, while the chemotherapy group in the PHERGain trial and both groups in the WSG-TP-II trial received 12 weeks. Thus, the random effects model was used to compare results with the common effect model (Supplementary Fig. S8 E). For trials incorporating ET alongside other treatments, higher pCR was observed with no statistical significance in the ET group (OR 1·18, 95% CI 0·80–1·73). However, for trials in which chemotherapy was replaced with ET, the pooled result showed a significantly lower pCR rate in ET in both common effect model (OR 0·34, 95% CI 0·22–0·53) and random effects model (OR 0·35, 95% CI 0·16–0·75).

Discussion

Our findings suggest that the HP + C regimen is the optimal neoadjuvant strategy for HR+/HER2 + BC when considering both pCR and EFS. The anthracycline-free regimen achieved a numerically higher pCR than the anthracycline-containing regimen, and the inclusion of carboplatin also increased the possibility of achieving pCR, although not reaching statistical significance. The efficacy of neoadjuvant ET remains inconclusive.

Both direct and indirect comparisons indicate that the T-DM1-based or dual HER2-targeting regimens are better than single HER2-targeting regimens in improving pCR rates. Though no significant differences were observed in pCR and EFS when comparing T-DM1, HP + C, and H + TKI + C regimens, HP + C had the highest probability of ranking first when considering both pCR and EFS. T-DM1 showed certain advantages in ranking when comparing pCR but ranked lower when comparing EFS. Although the result of the EFS analysis may not be robust enough due to insufficient data, we found that both the KRISTINE9 and I-SPY II8 trials reported higher risks of EFS events in the T-DM1 group, suggesting T-DM1 may not be the preferred choice for long-term survival benefits. Moreover, results from the KRISTINE9 and KAITLIN91 trials demonstrated that T-DM1 combined with pertuzumab did not outperform HP + C in either neoadjuvant or adjuvant settings. Thus, further studies are necessary to investigate short-term and long-term benefits of T-DM1 and other antibody-drug conjugates. Based on the existing evidence, the H + TKI + C regimen does not demonstrate significant advantages over the HP + C regimen. Considering that TKIs have a significantly higher incidence of grade 3 or higher adverse events (AEs) in the overall HER2 + population, and there is no evidence suggesting that the risk of AEs is associated with HR status, we could not recommend H + TKI + C over HP + C at present. We noticed that pyrotinib, a new TKI drug, achieved a relatively higher pCR rate12. New TKI drugs with improved clinical benefits and safety profiles may hold promise in the future. We conclude that HP + C remains the current optimal neoadjuvant strategy for HR+/HER2 + BC.

The optimal chemotherapy regimen to combine with targeted therapy is controversial. An EBCTCG meta-analysis in the overall BC population demonstrated that adding anthracycline to a taxane regimen yields better outcomes, and higher cumulative doses of anthracycline and taxane are associated with greater benefits92. A phase II trial presented at ASCO 2023 showed that albumin-bound paclitaxel combined with trastuzumab and pertuzumab, followed by anthracycline-based regimens, achieved a higher pCR in the HER2+/HR- BC subgroup (81·8%) compared to the HER2+/HR + subgroup (31·6%)93. The TRYPHAENA trial reported similar long-term outcomes between groups with or without anthracycline17, and the TRAIN-2 trial found that the neoadjuvant treatment of the HR + subgroup favored the non-anthracycline strategy numerically87. Our results support the omitting of anthracycline in the neoadjuvant chemotherapy for HR+/HER2 + BC, as we observed higher pCR in anthracycline-free regimens in all included trials, although not statistically significant in the pooled result. Furthermore, the addition of carboplatin to the neoadjuvant chemotherapy regimen appears to increase the likelihood of achieving pCR in this population. Further trials are needed to support this inference since the CI still includes 1.

ET is recommended for HER2+/HR + BC treatment. However, it remains unclear if ET should be initiated in the neoadjuvant period. Our analysis identified significant heterogeneity among trial results, likely due to varied strategies in incorporating ET, including adding ET to standard treatments or replacing chemotherapy with ET. The NSABP B52 trial reported an insignificant improvement in pCR rate for the group adding ET to the HP + C regimen94. Our subgroup analysis demonstrated no heterogeneity among the results of studies with ET added to standard treatments, all of which observed higher pCR in the ET group, although the pooled result showed no statistical significance. On the other hand, in trials where neoadjuvant chemotherapy was replaced with ET, relatively low pCR rates were reported. For example, the single-arm trials PAMELA and TBCRC006 reported pCR rates of 18% and 21%, respectively, when ET was combined with trastuzumab and lapatinib without chemotherapy95,96. Similar pCR were reported in the PerELISA and PHERGain trials when combining ET with trastuzumab and pertuzumab (21% and 35%, respectively)23,26. In our study, only two included trials replaced chemotherapy with ET, both reporting lower pCR in the ET group compared to the chemotherapy group.

For the sole goal of reducing tumor size in neoadjuvant therapy, replacing neoadjuvant chemotherapy with ET may not be optimal, given ET’s lack of a direct inhibitory effect on cancer cell growth. However, neoadjuvant treatment is not limited to local control but is focused on prognosis and guiding postoperative treatment. In this case, even if neoadjuvant ET may not significantly improve the pCR rate, long-term outcomes should be considered when evaluating its benefits. Notably, the three-year follow-up of PHERGain showed that the invasive disease-free survival rate was highest in patients who achieved pCR with ET and without chemotherapy97. This indicates that pCR might not be the optimal endpoint for evaluating responses to neoadjuvant ET, and future trials should explore alternative indicators such as preoperative endocrine prognostic index and endocrine sensitive disease rate for their clinical significance.

Patients with HER2+/HR + BC exhibit a lower likelihood of achieving pCR and face poorer prognoses than those with HER2+/HR- BC6,7. Our study provided a precise perspective and a focused and detailed analysis on the HR + subgroup of HER2 + BC, addressing an unmet need in this challenging population. Notably, this is the first study to compare EFS across different neoadjuvant regimens specifically for HR+/HER2 + BC, offering insights into both short-term (pCR) and long-term (EFS) outcomes. Furthermore, our work is pioneering in evaluating the role of ET in the neoadjuvant setting for this subgroup.

Some unavoidable limitations should be noted. First, the network of EFS analysis included only five trials, which may limit the robustness of the findings. Second, safety analysis was not conducted due to insufficient data in the HR+/HER2 + population. However, we anticipate a similar safety profile as the overall HER2 + population, which has been extensively studied. Notably, TKI agents are generally associated with more serious AEs than antibodies16,27,29. Third, this analysis did not account for variations in TNM staging, the duration of neoadjuvant treatment, or subsequent adjuvant therapy across the included trials. These differences could contribute to heterogeneity in patient cohorts, treatments, and endpoint evaluations.

In conclusion, despite the result being insignificant in EFS comparisons, the HP + C regimen emerged as the optimal neoadjuvant treatment for HR+/HER2 + BC when considering both pCR and EFS outcomes. Additionally, anthracycline-free carboplatin-containing chemotherapy may be a viable combination treatment, although the result did not reach statistical significance. Further studies on neoadjuvant ET are essential to explore appropriate endpoints for evaluating its efficacy and to provide deeper insights into its role in the treatment of HR+/HER2 + BC.