Introduction

With the introduction of combination therapy consisting of all-trans retinoic acid (ATRA) and arsenic trioxide (ATO), initial chemotherapy has become more effective for the treatment of acute promyelocytic leukemia (APL)1,2. In Japan, ATO became available for insurance coverage in December 2004. Combination therapy with ATRA and ATO is now the standard of care for low- to intermediate-risk APL. Despite the different origins of therapy-related APL (t-APL) and de novo APL, studies have shown that the prognoses of these two types of APL are similar when treated with a combination of ATRA and ATO3. Currently, treatment options for patients with relapsing or t-APL include ATO-based regimens, which have shown promising results with complete remission (CR) rates of 80–90% and 5-year overall survival (OS) rates of 50–70%. However, patients who fail to respond to ATO or experience multiple relapses may require allogeneic hematopoietic cell transplantation (HCT). The prognosis for these patients remains challenging, with 5-year OS rates ranging from 30 to 50% after allogeneic HCT4.

Given the rarity of t-APL, this study takes an exploratory approach to comparing outcomes between patients with t-APL and de novo APL undergoing allogeneic HCT. The study aims to provide valuable insights despite the inherent limitations of such a comparison and to offer essential insights into the prognosis and appropriate treatment strategies for these patient groups before insurance coverage for CPX-3515. CPX-351 is a liposomal formulation that combines two chemotherapy drugs, cytarabine and daunorubicin, in a fixed 5:1 molar ratio, and it has been shown to be effective for therapy-related leukemia.

Patients and methods

Study design and definitions

This retrospective study involved > 300 Japanese transplant centers in the Transplant Registry Unified Management Program database, which contains physician-reviewed data with informed consent and yearly follow-ups6,7. All methods in this study were performed in accordance with relevant guidelines and regulations. The study protocol was approved by the Data Management Committee of the Japanese Society for Hematopoietic Cell Transplantation and the Ethics Review Board of St. Mary’s Hospital (approval number: 24-1102). This observational study was conducted using data from the Transplant Registry Uniform Management Program database, and all participants provided informed consent.

In total, 144 patients aged > 15 years who received their first allogeneic HCT for t-APL (n = 20) and de novo APL (n = 124) were eligible for the study. All patients included in this study had APL with confirmed t(15;17) translocation and/or PML::RARA fusion. The presence of PML::RARA was verified in all cases using a combination of cytogenetic analysis, fluorescence in situ hybridization, and/or reverse transcription polymerase chain reaction. Any patients lacking confirmed PML::RARA fusion were excluded from the study to ensure a homogeneous patient population with classical molecular characteristics of APL. The selection criteria were APL according to the World Health Organization classification8, one allogeneic HCT between January 2008 and December 2020, no or one autologous HCT before the first allogeneic HCT, and availability of sufficient follow-up data including neutrophil engraftment and acute graft-versus-host disease (GVHD). We divided the patients into two groups based on a diagnosis of t-APL or de novo APL. Cytogenetic risk was diagnosed by multiple experts9. The preparative regimens were classified as myeloablative conditioning or reduced-intensity conditioning, based on the Center for International Blood and Marrow Transplant Research functional criteria10. Neutrophil engraftment was defined as the time required to reach an absolute neutrophil count of ≥ 500/μL for 3 consecutive days. Primary graft failure was defined as failure of neutrophil engraftment by day + 35. Incidences of grade II–IV acute GVHD and the presence or absence of chronic GVHD were defined as reported previously11,12. Patients were considered evaluable for chronic GVHD if they had engraftment. The percentages of patients who relapsed or were refractory to allogeneic HCT were determined (relapse). The survival and OS rates were calculated. GVHD-free, relapse-free survival (GRFS) was defined as being alive without grade III–IV acute GVHD, moderate to severe chronic GVHD requiring systemic immunosuppressive therapy, relapse, or death13. Chronic GVHD-free, relapse-free survival (CRFS) was defined as the absence of chronic GVHD requiring systemic immunosuppressive therapy, relapse, or death14. All data were censored at the date of the last reported follow-up. Analyzed outcomes included transplant-related mortality (TRM) (defined as death occurring while a patient is in continuous remission after HCT, encompassing all causes of non-leukemia death following the transplant procedure), disease-free survival (DFS), and OS.

Statistical analysis

The frequencies and descriptive statistics of patient-, disease-, and transplantation-related variables in the two groups were calculated. The distributions of categorical and continuous variables in the two groups were compared using Pearson’s chi-squared test or the Mann–Whitney U-test. The probabilities of neutrophil engraftment, acute and chronic GVHD, TRM, and relapse were calculated using cumulative incidence curves to accommodate competing risks. The 95% confidence intervals (CIs) for all probabilities and P-values for pairwise comparisons were derived from pointwise estimates and calculated using standard techniques. To accommodate competing risks, cumulative incidence curves for TRM and relapse were estimated, and their differences were compared using Gray’s test15. Competing risk events for TRM and relapse were relapse and death, respectively. The Fine and Gray model was used for each group to evaluate the effects of patient-, disease-, and transplantation-related variables on TRM16. Baseline patient variables for analysis included age at HCT (age of ≥ 55 years), sex, Karnofsky performance status (KPS of < 80)17 at HCT, disease status at HCT (CR vs. non-CR), prior autologous HCT, and HCT-specific comorbidity index (HCT-CI of > 2)18.

To evaluate survival, Kaplan–Meier curves generated for the two groups were compared using the log-rank test, and univariate and multivariate Cox regression analyses were performed. The Cox proportional-hazards model included patient- and transplantation-related variables. A two-sided P-value of < 0.05 was considered statistically significant. All statistical analyses were performed using EZR (Saitama Medical Center, Saitama, Japan; http://www.jichi.ac.jp/saitama-sct/SaitamaHP.files/statmedEN.html)19, a graphical user interface for R (The R Foundation for Statistical Computing; http://www.r-project.org) that extends the functionality of R Commander by adding statistical functions. Analyses were conducted using R version 4.3.1 and R Commander version 2.9-1.

Results

Patient characteristics

Table 1 shows the patient-related variables of the two groups of patients. Among the 20 patients with t-APL, the most frequent primary malignancy was breast cancer (n = 11), followed by gastric cancer (n = 4), cervical cancer (n = 1), germinal cell tumor (n = 1), peripheral T-cell lymphoma (n = 1), soft tissue sarcoma (n = 1), and tongue cancer (n = 1). Chemotherapy alone was the most frequent treatment for primary malignancies (n = 10), followed by chemotherapy plus radiotherapy (n = 6). The percentages of patients who were male (P = 0.001), had a higher KPS at HCT (P = 0.009), and had a lower HCT-CI at HCT were lower in the t-APL group than in the de novo APL group.

Table 1 Comparison of patients with t-APL vs. de novo APL.

The data suggest a trend in which male patients with both t-APL and de novo APL have a higher frequency of additional chromosomal abnormalities than female patients; however, these differences did not reach statistical significance. Specifically, in the t-APL cohort, 40% (2 of 5) of male patients had additional chromosomal abnormalities compared with 20% (3 of 15) of female patients (P = 0.560). In the de novo APL cohort, 22% (18 of 81) of male patients exhibited these abnormalities compared with 9% (4 of 43) of female patients (P = 0.087). The lack of statistical significance suggests that these observed differences may be due to chance rather than a true biological or clinical difference between the sexes.

There were no statistically significant differences between the t-APL and de novo APL groups regarding either transplant-related characteristics (Table 2) or outcomes (Table 3).

Table 2 Comparison of transplant-related characteristics of patients with t-APL vs. de novo APL.
Table 3 Comparison of transplant-related outcomes of patients with t-APL vs. de novo APL.

The median follow-up period for patients diagnosed with t-APL was 114 months (range, 16–173 months). For patients with de novo APL, the median follow-up was 87 months (range, 3–167 months).

Overall, these findings comprehensively reflect the long-term outcomes associated with both t-APL and de novo APL.

Infectious complications

Two of 20 patients (10%) with t-APL were positive for hepatitis C virus and human immunodeficiency virus at the time of allogeneic HCT. However, these viral infections had no impact on the outcome of allogeneic HCT for these patients. The incidence of infections after allogeneic HCT was not significantly different between the t-APL and de novo APL groups (Supplementary Table 1).

TRM and relapse

There were no significant differences in the cumulative incidences of TRM between the t-APL and de novo APL groups (Supplementary Fig. 1A). The Fine and Gray model showed that male sex (hazard ratio [HR], 3.389) was significantly associated with TRM (Table 4).

Table 4 Multivariate analysis of TRM, relapse or progression, and DFS/GRFS/CRFS/OS of patients with t-APL vs. de novo APL.

There was no significant difference in the cumulative incidence of relapse between the t-APL and de novo APL groups (Supplementary Fig. 1B). The Fine and Gray model showed that non-CR at HCT (HR, 1.806) and prior autologous HCT (HR, 1.878) were significantly associated with relapse.

DFS, GRFS, CRFS, and OS

There were no significant differences in DFS, GRFS, CRFS, or OS between the t-APL and de novo APL groups (Supplementary Fig. 1C–E and Fig. 1). The 1- and 3-year OS rates were 80.0% and 53.8% (95% CI 55.1–92.0% and 29.9–72.8%) in the t-APL group and 65.1% and 52.4% (95% CI 56.0–72.8% and 43.1–60.8%) in the de novo APL group, respectively. Patients without CR at HCT had significantly worse OS than those with CR at HCT in the t-APL and de novo APL groups (P = 0.004) (Fig. 1). The 1- and 3-year OS rates were 71.1% and 61.1% (95% CI 60.7–79.1% and 50.3–70.2%) in patients with CR and 60.0% and 36.5% (95% CI 45.1–72.0% and 23.2–49.9%) in patients without CR, respectively. The 3-year OS rates in the t-APL and de novo APL group were 70.0% and 60.1% (95% CI 37.9–89.2% and 48.7–69.7%) in patients with CR and 37.5% and 36.5% (95% CI 10.0–65.9% and 21.8–51.3%) in those without CR, respectively.

Fig. 1
figure 1

Kaplan–Meier estimates of OS. Comparisons are shown between the t-APL and de novo APL groups and between patients with and without CR at HCT. There were no significant differences in OS between the t-APL and de novo APL groups, but patients with CR at HCT had significantly better OS than those without CR at HCT (P = 0.004). OS, overall survival; APL, acute promyelocytic leukemia; CR, complete remission; HCT, hematopoietic cell transplantation.

Multivariate Cox regression analysis identified male sex as an independent predictor of worse DFS (HR, 1.942), GRFS (HR, 1.693), CRFS (HR, 1.771), and OS (HR, 2.089) (Table 4). A KPS of < 80 at HCT was identified as an independent predictor of worse DFS (HR, 1.986) and OS (HR, 1.972). CR at HCT was an independent predictor of worse DFS (HR, 1.799), GRFS (HR, 1.583), and OS (HR, 1.753). Prior autologous HCT was identified as an independent predictor of worse DFS (HR, 1.622) and CRFS (HR, 1.753).

Discussion

The present study found no significant differences in DFS, GRFS, CRFS, or OS between the t-APL and de novo APL groups. However, as previously reported20, prior autologous HCT was identified as an independent predictor of worse relapses, DFS, and CRFS, and patients without CR at HCT had significantly worse relapses, DFS, GRFS, and OS than did those with CR at HCT.

The lack of statistically significant differences suggests that patients with t-APL, despite having therapy-related disease, did not fare worse in transplant outcomes compared with those with de novo APL. This information could be clinically relevant because it might indicate that transplantation is an equally viable treatment option for both groups of patients. However, it is essential to note that the absence of statistical significance does not necessarily mean the groups are identical. The study may have lacked sufficient statistical power to identify small differences that might exist. A large cohort study showed that although t-APL generally has a better prognosis than other therapy-related acute myeloid leukemias21, its outcomes may be slightly less favorable than those of de novo APL. However, with modern treatment approaches, particularly ATRA- and ATO-based regimens, the outcomes of patients with t-APL have substantially improved and may be approaching those of patients with de novo APL22. t-APL may lead to residual organ damage or decreased bone marrow function due to prior treatment, all of which might affect the prognosis. Some differences in the mutational landscape exist between t-APL and de novo APL, but these differences do not appear to be as clinically significant as once thought. The prognosis for both forms of the disease depends primarily on the specific mutations present, especially those influencing the treatment response, rather than whether the leukemia is therapy-related or de novo in origin. However, there is no significant difference in response to ATRA or ATO between t-APL and de novo APL4. Similar OS was observed between patients with t-APL and de novo APL despite the fact that patients with t-APL were more likely to have higher comorbidity indices and potentially harbor mutations associated with a poor prognosis. This difference between the patient groups might be attributed to several factors. First, the effectiveness of modern APL-specific therapies, such as ATRA and ATO, may mitigate some of the adverse prognostic factors associated with t-APL. Second, the graft-versus-leukemia effect of allogeneic HCT may overcome the potential resistance mechanisms in t-APL cells. Finally, improvements in the supportive care and management of transplant-related complications may reduce the impact of comorbidities on transplant outcomes. However, further research is needed to fully elucidate the biological and clinical factors contributing to these comparable outcomes.

Notably, the same treatment approach for de novo APL is recommended for t-APL, and a good prognosis can be expected with appropriate treatment. Whereas earlier studies reported relapse rates of up to 20% in patients with high-risk APL20, more recent data from the ATRA and ATO era show lower relapse rates. For example, the APL0406 trial reported a 2-year cumulative relapse incidence of only 1.9% in patients treated with ATRA plus ATO23. This marked reduction in relapse rates has prompted a reevaluation of the role of HCT in APL treatment. Although HCT was once considered the standard approach for patients in second remission, emerging evidence suggests that transplantation may be deferred in patients who achieve a second molecular CR with ATO-based salvage therapy23,24. Nevertheless, HCT remains a critical option for patients with persistent molecular disease or multiple relapses. A meta-analysis of autologous and allogeneic HCT for recurrent APL published recently found that TRM is a problem with allogeneic HCT and that autologous and allogeneic HCT have similar recurrence rates (24% vs. 23%)25. However, the benefits of HCT for t-APL have not been thoroughly studied, and its effectiveness remains unclear. In our study, TRM after allogeneic HCT for t-APL was similar to that for de novo APL, and TRM for t-APL with a high HCT-CI was not different from that for de novo APL. These findings suggest that although t-APL is a therapy-related leukemia—typically associated with a worse prognosis in other subtypes of acute myeloid leukemia—it responds similarly to treatment and demonstrates comparable outcomes to de novo APL, even in the setting of allogeneic HCT.

Our study showed that male sex was independently associated with worse TRM, DFS, GRFS, CRFS, and OS in patients with APL undergoing allogeneic HCT. This sex-based disparity in APL outcomes may be attributed to several factors:

  • Biological differences: Sex hormones influence immune function and hematopoiesis. Estrogen, for example, has been shown to protect hematopoietic stem cells and enhance immune responses26. However, there are no reports of estrogen being associated with treatment efficacy or complications in the context of allogeneic HCT.

  • Genetic factors: The National Institutes of Health Consensus Development Project on Criteria for Clinical Trials in Chronic GVHD12 suggests that men may have a higher prevalence of additional chromosomal abnormalities, potentially contributing to more aggressive disease.

  • Pharmacokinetics: Differences in drug metabolism between sexes can impact treatment efficacy and toxicity27. For example, male patients may exhibit higher clearance rates for certain chemotherapeutic agents, potentially resulting in suboptimal drug exposure28. However, no data specifically link these agents to treatment efficacy or complications in the context of allogeneic HCT.

  • Transplant-related factors: Sex-mismatched donor–recipient pairs, particularly female donors to male recipients, have been associated with an increased risk of GVHD and mortality following allogeneic HCT. However, this risk generally ranges from approximately 20% to 25%29.

Further research is needed to elucidate the precise mechanisms underlying the observed sex-based disparity in APL outcomes and to develop targeted interventions to improve outcomes for male patients. Understanding these factors might lead to more personalized treatment approaches and potentially improve outcomes for male patients with APL.

Our study had several important limitations, most notably its retrospective, registry-based design and the small sample size of patients with t-APL. The limited number of patients with t-APL (n = 20) compared with de novo APL (n = 124) may have reduced our ability to detect significant differences between the groups and limited the generalizability of our findings. Additionally, the inherent differences in patient characteristics and treatment histories between patients with t-APL and de novo APL pose challenges for direct comparison. Despite these limitations, this exploratory analysis provides valuable insights into the potential efficacy of allogeneic HCT for both patient groups, especially given the rarity of t-APL. However, potential selection bias cannot be ruled out because of the lack of proper randomization, and we lacked data for HCT-ineligible patients. Moreover, the analysis did not include details on various conditioning regimens and post-transplant therapies, which may have influenced the outcomes. Our findings provide a comprehensive overview of the long-term outcomes associated with both t-APL and de novo APL within the study’s timeframe. However, the relatively short follow-up period restricted our ability to fully assess long-term TRM and complications. These limitations highlight the need for larger, prospective studies with longer follow-up periods to validate our findings and provide more robust evidence to guide clinical decision-making for the treatment of patients with t-APL and de novo APL.

In conclusion, our data suggest that the results of allogeneic HCT for patients with t-APL were comparable to those for patients with de novo APL. However, patients without CR require an additional range of treatment strategies and interventions aimed at optimizing outcomes before and after receiving allogeneic HCT containing CPX-351. This might be effective against therapy-related leukemia, although its effect on t-APL is unknown. An optimal therapy for relapsed or refractory APL has not been established, and further advances continue to be made.