Introduction

B cell-derived malignancies are typically accompanied by immune dysfunction and are associated with an increased risk of infections [1,2,3,4]. Upon diagnosis, the already impaired immune function is subject to further decline, which can be ascribed to the progression of lymphoma-associated immunodeficiency that is further aggravated by treatment once initiated. Therefore, it comes as no surprise that infections are among the main determinants of outcomes in patients with B cell-derived malignancies, generally conferring highly increased mortality in this patient population [2, 5,6,7].

In a recent study of chronic lymphocytic leukemia (CLL), we observed that individuals diagnosed with CLL demonstrated an increased use of antimicrobials, and thus risk of infections, relative to healthy controls starting more than 5 years before diagnosis [8]. Furthermore, approaching diagnosis, the risk of infections increased markedly, likely reflecting a sudden worsening of immune function. This suggested two immunological phenomena at play in these patients. First, a prolonged period of modestly increased risk of infections, and subsequently, the ostensible emergence of a malignancy resulting in more profound immune disruption.

In the present investigation, we continued this line of inquiry, extending our analyzes to include other B cell-derived malignancies and exploring the clinical significance of prediagnostic infections. In general, B cell-derived malignancies share a variety of immunological characteristics, such as leukopenia, T-cell exhaustion, and B cell dysfunction that, in part, contribute to the patients’ increased risk of infections [7, 9, 10]. As such, it stands to reason that patients with other B cell-derived malignancies than CLL would also demonstrate a sudden increase in the risk of infections shortly before diagnosis, possibly preceded by patterns indicating prolonged pre-diagnostic periods of immune dysfunction. Moreover, we previously argued that pre-diagnostic infections occurring shortly before diagnosis play, for all practical purposes, a similar causal role in terms of mortality as infections occurring between diagnosis and the start of treatment, as the precise timing of the lymphoma diagnosis can be construed as somewhat arbitrary [11]. We thus further expanded the analysis to include an investigation of the prognostic significance of pre-diagnostic infections in the context of other established predictors of survival.

Using data from Danish population-wide registers, we hypothesized that, like CLL, diffuse large B cell lymphoma (DLBCL), multiple myeloma (MM), follicular lymphoma (FL), marginal zone lymphoma (MZL), and lymphoplasmacytic lymphoma (LPL) would also demonstrate varying degrees of immunodeficiency, and thus higher rates of infections prior to diagnosis. Subsequently, for each of these malignancies, we assessed the clinical impact of recent pre-diagnostic infections on overall survival (OS) from the time of diagnosis in the context of other established international indices (IPIs).

Methods

Cohorts

Our investigation made use of two similar, but separate, data sources resulting in two cohorts: a population cohort and a clinical cohort, with overlapping time periods and patients. Cohort selection criteria are described in detail in the supplementary information (Supplementary Fig. 1, Supplementary Methods). Briefly, the population cohort was assembled using nationwide population- and cancer registers and allowed for matched case-control analyzes of pre-diagnostic use of antimicrobials [12, 13]. We identified all patients aged 18 and older diagnosed with a B cell-derived malignancy between June 1996 and Oct 2019 in the Danish Cancer Register. Each patient was matched with a minimum of five and up to 15 controls from the general population, matching on birth year and sex, such that the control was ensured to be alive and to not have been diagnosed with the same hematological malignancy at the time of diagnosis of their assigned patient. On the other hand, the clinical cohort was assembled from nationwide clinical quality registers providing information on IPIs and finer details on prognostic factors such as immunoglobulin heavy-chain variable (IGHV) status in CLL and cell of origin (COO) status in DLBCL. This enabled corresponding analyzes of pre-diagnostic use of antimicrobials stratified by prognostic factors. The clinical cohort was also used to investigate the impact of pre-diagnostic antimicrobials on OS after diagnosis in the context of known prognostic factors, specifically IPIs [14,15,16,17,18,19]. Notably, the clinical cohort comprised only patients and no population controls.

Throughout, each B cell-derived malignancy subtype was addressed in a separate analysis, subtypes being DLBCL, MM, FL, MZL (mæucosa-associated lymphoid tissue [MALT] lymphoma and nodal MZL [nMZL]), and LPL including Waldenström’s macroglobulinemia. Subtypes were classified using ICD-10 and ICD-O-3 codes (see supplementary – Cancer type groupings).

Antimicrobial prescriptions

Data on antimicrobial prescriptions were retrieved from the Danish National Prescription Register from June 1996 through Oct 2019 and from Jan 2004 through Dec 2020 for the population cohort and clinical cohort, respectively (see Supplementary Methods) [20]. In Denmark, antimicrobials are not available over-the-counter. Even so, prescription data do not include antimicrobials administered in-hospital. We divided antimicrobial prescriptions into subgroups based on anatomical therapeutic chemical (ATC) codes as previously published [8]. Multiple antimicrobial prescriptions filled within 14 days of one-another were considered a single infectious event. This design was implemented across ATC-subgroups for the main analyzes and per ATC-subgroup for supplementary analyzes.

Statistical Analysis

Prediagnostic antimicrobial prescriptions

We followed all individuals in the population cohort from June 1996 or the date of birth, whichever came later, until the diagnosis of B cell-derived malignancy. Controls were followed until the time of the diagnosis of their assigned case (i.e. pseudodiagnosis). Using Cox regression, we estimated hazard ratios (HR) for antimicrobial prescription in one-year intervals, comparing subjects who would later be diagnosed with CLL, DLBCL, MM, FL, MZL, and LPL, to their matched controls. In this analysis, we used age as the underlying time and stratified baseline hazards such that each case and their set of matched controls constituted a stratum. The regression was further adjusted for geographical region (time-dependent) as the strata were not defined to match on region, only on birth year and sex.

The clinical cohort was used for corresponding analyzes of the risk of prediagnostic antimicrobial use stratified by prognostic or morphological markers, i.e., disease-specific IPI, or IGHV- or COO status in individuals later developing CLL or DLBCL, respectively. In addition, these analyzes were adjusted for age, sex, calendar period at diagnosis, and geographical region at diagnosis.

Survival after diagnosis

The clinical cohort was used for Cox-regression to estimate HRs for all-cause mortality among patients, comparing subjects according to the number of antimicrobial prescriptions filled (0 vs 1 vs 2 vs >2) within one year prior to their diagnosis - henceforth referred to as recent infections. Survival analyzes were additionally adjusted for sex, age at diagnosis, calendar period at diagnosis, geographical region at diagnosis, and the corresponding IPI. In sensitivity analyzes, we examined whether the prognostic significance of prediagnostic antimicrobial use systematically varied according to sex or if the individual was diagnosed before or after age 50 years, which did not seem to be the case (results not shown).

Statistical analyzes were performed using R version 4.0.3 [21]. Register data were obtained through Danish Health Data Authority [22]. The study was approved by SSI QC and Compliance (j.nr. 21/00805) (population cohort) and National Ethics Committee 1804410 with Data Protection Agency P-2020-561 (clinical cohort).

Results

The population cohort included 30,389 individuals diagnosed with a B cell-derived malignancy with a total observed time at risk for antimicrobial prescriptions of 355,372 person-years occurring before diagnosis, accumulating 213,649 antimicrobial prescriptions in that period. Controls comprised 435,846 individuals with a total observed time at risk of 5,238,485 person-years occurring before pseudodiagnosis, accumulating 2,748,300 antimicrobial prescriptions (Table 1).

Table 1 Descriptive characteristics of the population cohort

The clinical cohort included 18,560 subjects in total. Pre-diagnostic time at risk for antimicrobials totaled 174,752 person-years, during which 107,268 prescriptions of antimicrobials were filled (Supplementary Fig. 1). Post-diagnostic follow-up totaled 81,624 person-years with a median of 10.2 years (interquartile range: 6.4–13.2), during which 6697 (36.1%) subjects died (Table 2).

Table 2 Descriptive characteristics of the clinical cohort.

Prediagnostic use of antimicrobials

For each separate disease entity, patients demonstrated an increased risk of antimicrobial prescriptions in the year preceding diagnosis relative to their matched controls. Except for FL, we observed that the diagnosis of any B cell-derived malignancy was preceded by several years of increased use of antimicrobials as compared with matched controls. However, the timing and the offset of the accelerating rate of pre-diagnostic use of antimicrobials varied somewhat according to the specific disease entity (Fig. 1).

Fig. 1: Patterns of antimicrobial use preceding diagnosis of B cell derived malignancy.
figure 1

a chronic lymphocytic leukemia (CLL), b diffuse large B cell lymphoma (DLBCL), c multiple myeloma (MM), d follicular lymphoma (FL), f marginal zone lymphoma (MZL), and lymphoplasmacytic lymphoma (LPL) including Waldenström macroglobulinaemia. Shown are hazard ratios with 95% confidence intervals for the use of any type of antimicrobial in cases relative to matched controls at distinct periods prior to diagnosis.

Among patients diagnosed with FL, the use of antimicrobials overlapped with that of their matched controls throughout all periods of observation until the year before diagnosis (Fig. 1d). By contrast, patients who developed CLL, MM, MZL, and LPL demonstrated a pre-diagnostic antimicrobial use that exceeded that of their matched controls at least a decade prior to diagnosis. This difference was noted to be increasing as the diagnosis approached, with the trend of increasing use accelerating at least five years prior to diagnosis (Fig. 1a, c, e, and f). Much like the other diseases, DLBCL demonstrated a marked increase in the use of antimicrobials in the last year prior to diagnosis, however, DLBCL was preceded by two decades of stable and elevated antimicrobial use as compared to their matched controls (Fig. 1b).

Subgroup analyzes of ATC groups were conducted using the population cohort (Supplementary Figs. 27). Penicillins were the most frequently used antimicrobials across all malignancies, followed by macrolides (Table 1).

Investigating the HRs of antimicrobial prescriptions for each ATC-subgroup as a function of time until diagnosis, we next highlighted some differences in trends between malignancies. For instance, future CLL patients demonstrated significantly increased HRs of macrolide prescriptions and antiviral prescriptions for more than a decade prior to diagnosis as compared to their controls (Supplementary Fig. 2). Future DLBCL patients demonstrated a consistent and stable increased risk for penicillin prescriptions starting more than a decade prior to their diagnosis. Likewise, the pattern of macrolide prescriptions in future DLBCL patients was similar to that of future CLL patients, with the HRs for macrolide prescriptions seeing a steady increase leading up to diagnosis (Supplementary Fig. 3). The same held true for future MM, MZL, and LPL patients (Supplementary Figs. 4, 6, and 7). For antimicrobials other than penicillin, the HRs for which were nominally elevated, we noted no significant trends in HRs for antimicrobial prescriptions for future FL patients when stratifying by antimicrobial subgroup (Supplementary Fig. 5).

Analyzes of the pre-diagnostic antimicrobial use stratified by IPI, IGHV status, and COO status did not demonstrate easily interpreted differences between strata (Supplementary Figs. 810).

Survival after diagnosis

Across all examined malignancies, we demonstrate shorter OS from the time of diagnosis with an increasing number of infections in the year preceding diagnosis (Fig. 2; p < 0.001). After adjusting for age at diagnosis, sex, calendar period of diagnosis, geographical region at diagnosis, and disease-specific IPI, having filled more than two antimicrobial prescriptions within one year before diagnosis continued to associate with significantly shorter OS as compared to having filled none for all malignancies examined (Fig. 3), except in the case of LPL (Fig. 3f). The association between antimicrobial prescriptions and OS was most pronounced in CLL, DLBCL, MZL, and LPL. (Fig. 2) At the same time, the disease-specific prognostic scores maintained their independent prognostic value for all examined malignancies. (Fig. 3, Supplementary Table 3).

Fig. 2: Prognostic implications of antimicrobial use preceding diagnosis B cell-derived malignancy.
figure 2

Shown are Kaplan–Meier curves for all-cause mortality after diagnosis, stratified by the total number of antimicrobial prescriptions within one year prior to diagnosis.

Fig. 3: Effects of prognostic factors on all-cause mortality after diagnosis.
figure 3

Shown are hazard ratios obtained from separate Cox-regressions adjusted for age, calendar period and geographical region at diagnosis, with time since diagnosis as the underlying time scale.  Analyses were stratified by disease type: a chronic lymphocytic leukemia (CLL), b diffuse large B cell lymphoma (DLBCL), (c) multiple myeloma (MM), (d follicular lymphoma (FL), e marginal zone lymphoma (MZL), and (f) lymphoplasmacytic lymphoma including Waldenström macrogobulinaemia (LPL). * indicates p < 0.05. 

Discussion

We used antimicrobial prescriptions as a proxy for infections and compared individuals who would later develop B cell-derived malignancies to matched controls for extensive periods of time before diagnosis. In doing so, we showed that except for FL the risk of infections had been increased several years prior to diagnosis of all the examined malignancies. We also showed that for all disease types, the risk of infections increased markedly in the year just prior to diagnosis, likely a consequence of undiagnosed malignancy. We next compared overall survival among subjects with cancer stratified by the number of recent infections. In survival analyzes using comprehensive prognostic information, we demonstrated that treatment for more than two recent infections was associated with shorter OS, independent of age, sex, calendar period, geographical region, and IPI.

Prediagnostic infections

While ours is not the first investigation to report an increased risk of infections before diagnosis of B cell-derived malignancy [8, 23, 24], we were able to expand on previous observations with detailed analyzes of temporal variations within and between the different malignancies. Disregarding the last year before diagnosis, three different patterns of pre-diagnostic infections could be discerned for the examined B cell-derived malignancies, suggestive of different pathways of pathogenesis. The first pattern was observed in individuals who were later diagnosed with CLL, MM, LPL, and MZL, such that the risk of infections had gradually increased over the span of a decade or more (Fig. 1). This pattern indicated a continuous deterioration of immune competence in conjunction with – and possibly because of – the development of these malignancies. For CLL, MM, and LPL this assumption aligns with previous observations as disease precursors such as monoclonal B-lymphocytosis and monoclonal gammopathy of undetermined significance (both IgM and non-IgM) can be demonstrated decades prior to overt malignancy have been associated with an increased risk of infections [9, 25,26,27]. As previously discussed for CLL [8], we cannot from these analyzes determine if recurrent infections play a causal role in the development of these malignancies. The increased risk of prediagnostic infections could be causally unrelated to the pathogenesis and merely reflect an underlying immune deficiency, or the infections could promote the development of indolent precursors and/or to the precursors’ continued transformation into malignant cells. In the latter case, the recurrent infections and the induced immune dysfunction would act in synergy to accelerate pathogenesis beyond their individual effects.

MZL has been associated with specific chronic infections and with autoimmune conditions. The underlying biological mechanism behind these associations is one involving persistent antigenic drive resulting in poly-, oligo- and ultimately monoclonal B cell expansion [28]. Interpreting the observed prolonged period of increased use of antimicrobials before diagnosis as being due to underlying immune dysfunction is compatible with this understanding of MZL pathogenesis. Meanwhile, we speculate that the accelerating use of antimicrobials starting around 5 years prior to diagnosis may be an early manifestation of lymphoma-induced immune dysfunction [29].

A second pattern of pre-diagnostic infections was observed in individuals who developed DLBCL. This group of patients demonstrated a 10% increased risk of infections for up to 15 years prior to the typical accelerating rate of antimicrobial use heralding diagnosis of all examined lymphomas (Fig. 1b). Profound immunodeficiencies as seen with, e.g., AIDS or organ transplantation as well as immunologic conditions like autoimmune diseases have both been linked with the risk of DLBCL [30,31,32,33,34]. Also, causal associations have been established between certain infections and DLBCL, e.g., with EBV especially in the setting of overt immune deficiency [1, 35].

Again, we cannot from our investigation determine the precise mechanisms underlying the observed pattern of pre-diagnostic infections, i.e., whether and to what extent immunodeficiency facilitates infections’ direct and indirect contributions to lymphomagenesis. We note, however, that in a previous investigation elevated levels of serological markers of immune activation could be demonstrated more than 7.5 years before DLBCL diagnosis [29]. We therefore speculate that the observed association between infections and DLBCL reflects the scenario when immunodeficiency predisposes to DLBCL via failure to eradicate malignant cells, failure to curb oncogenic infections, and through continuous immune stimulation caused by recurrent infections.

The third pattern of pre-diagnostic infections was observed for individuals who developed FL who, unlike patients with any of the other studied B cell derived malignancies, had similar use of antimicrobials as their controls until shortly before the diagnosis of malignancy. FL etiology remains poorly understood and the few risk factors for FL that have been suggested would account only for a small proportion of cases [36]. Our study indicates that neither risk factors for FL nor FL precursor states appear to manifest as an increased risk of infections and that FL stands out from other B cell malignancies insofar as immune deficiency is not a hallmark of FL risk. Previous studies have also found that immune deficiency does not inherently elevate the risk of developing FL, corroborating our results [37].

We hypothesized that aggressive disease, reflected by a high-risk IPI score and non-GCB- and unmutated IGHV status in DLBCL and CLL, respectively, would be indicative of rapid tumor growth and subsequently lower historical prediagnostic rates of use of antimicrobials (i.e. length time bias) [38, 39]. However, with the exception of CLL-IPI high and very high risk compared to CLL-IPI low risk, we did not observe any difference in the prediagnostic risk of infections according to IPI, COO- or IGHV status (Supplementary Figs. 810). These analyzes were statistically well-powered and suggest a similar increased risk of prediagnostic infections in individuals who are later diagnosed with aggressive and more indolent phenotype CLL and DLBCL.

To summarize, dependent on disease subtype, individuals diagnosed with B cell-derived malignancies demonstrated an increased risk of infections up to 15 years prior to diagnosis, which accelerated markedly the year just prior to diagnosis. Different patterns of antimicrobial use, superimposed, suggest distinct immunological phenomena at play, one prolonged and one acute.

Examination of the biological mechanisms underlying the former phenomenon may shed further light on lymphoma pathogenesis and pave the way for future preventative strategies; the latter may have more immediate clinical implications for prognostication, discussed forthwith.

Survival after diagnosis

B cell-derived malignancies are subject to significant diagnostic delay, with some studies reporting a delay of three to five months even for symptomatic patients [40]. Undiagnosed lymphoma is most likely also the explanation for the sudden increased risk of infections in the year prior to diagnosis. As infections after diagnosis of B cell-derived malignancy are associated with increased mortality [41, 42], we expected patients with multiple recent infections prior to diagnosis to also demonstrate shorter OS after diagnosis compared to patients without. Indeed, univariate analyzes demonstrated that having more than two recent infections associated with shorter OS after diagnosis (Fig. 2), and remained predictive of significantly shorter OS in analyzes adjusted for other studied prognostic factors (Fig. 3). The effect of recent infections on mortality was particularly pronounced in patients with DLBCL and CLL, and to a lesser extent in FL, MZL, and LPL, whereas recent infections barely affected survival after MM diagnosis. Interpreting the number of recent infections as a proxy for level of immunodeficiency suggests that the mechanisms are a combination of increased rates of infectious death and treatment constraints.

Although a leading cause of death in patients with B cell-derived malignancies, neither infections nor measures of immunodeficiency are included in current IPIs and staging systems [14,15,16,17,18,19]. If our findings are validated in an independent cohort, we suggest that more than two recent infections prior to diagnosis of B cell-derived malignancy are indicative of marked immunodeficiency and could be considered for inclusion in revised IPIs to refine gold standard prognostication.

Strengths and Limitations

We performed a population-based epidemiological investigation of 30 389 and 18 560 patients in the population cohort and the clinical cohort, respectively, with near-complete information on prescriptions and follow-up. For the population cohort, we were able to match each patient with a minimum of five and up to 15 controls. All survival analyzes were performed using a clinical cohort with information on prognostic factors such as IPI available for all patients [43,44,45,46]. Further, all patients in the clinical cohort had at least one full year of recorded prescription history prior to diagnosis.

Information on specific pathogens, such as H. pylori, implicated in the lymphomagenesis of mucosa-associated lymphoid tissue (MALT) lymphomas was not available [1]. Interestingly, the use of both penicillin and macrolides were increased years prior to MZL diagnosis and demonstrated the highest absolute risk of macrolide use across all malignancies investigated, suggesting that eradication therapy could indeed interfere with our observations (Supplementary Fig. 6) [47]. Our findings regarding antiviral treatment should be interpreted with caution. On the one hand, most infectious viral syndromes are treated only symptomatically, while on the other hand they may be more commonly treated in individuals at risk of the infection to run a severe course, e.g., because of impaired immune function (Supplementary Figs. 27 ‘Antivirals’). These mechanisms are likely to both influence the observed associations. Further, B-symptoms related to malignancy may commonly be mistaken for routine infections [48], and some newly diagnosed lymphoma and MM patients demonstrate elevated C-reactive protein levels [49, 50]. Thus some antimicrobial prescriptions may represent early malignancy rather than true infections. Likewise, others have suggested that the risk of being diagnosed with a B cell-derived malignancy is presumably higher among heavy antimicrobial users during routine follow-up and visits related to comorbidities [8]. When analyzing pre-diagnostic antimicrobial use, we did not adjust for comorbidities as the diagnosis of a (future) malignancy was implemented as an exposure, and subsequently used to assess differences in rates of antimicrobial use in the years preceding (pseudo)diagnosis between future patients and controls. Comorbidities, in this context, would therefore serve as a mediator of the effect of future malignancy on preceding rates of antimicrobial use. Therefore, any adjustment for comorbidities would serve to obfuscate the differences in rates of antimicrobial use observed among future patients and controls. In a similar vein, regarding analyzes of survival after diagnosis, we have previously shown that comorbidities in patients with CLL are associated with a higher risk of dying from infections [5]. We did not adjust for comorbidities in our analyzes of survival after diagnosis, as these were designed only to shed light on the independent effects of recent infections when combined with established prognostic factors such as IPIs, age, and sex. Specific comorbidities could be subject for further investigation in the context of infections and B cell-derived malignancies [51]. Lastly, the patients in the population and clinical cohorts overlap to a large degree, but due to mandatory pseudonymization being carried out by two separate institutions, we could not determine to what extent.

Conclusion

The risk of infections was significantly increased up to 15 years prior to diagnosis of B cell-derived malignancies and accelerating just prior to diagnosis, and only in the case of FL was the former pattern not observed. Interpreting the findings as resulting from two superimposed patterns of antimicrobial use, one prolonged and one acute aligns with previous observations of long-lasting immunodeficiency preceding sudden cancer emergence. Antimicrobial use and disease aggressiveness as determined by established IPIs were not mutually associated. Further examination of the biological mechanisms underlying these phenomena may shed further light on lymphoma pathogenesis and pave the way for future preventative strategies. Patients treated for more than two recent infections before diagnosis of B cell-derived malignancy demonstrated shorter OS even after adjusting for other known prognostic factors. Thus, the number of recent infections may constitute an independent prognostic tool and could be considered in combination with current IPIs and staging systems to predict patient survival more accurately.