Introduction

Testosterone (T) levels have previously been suggested to be associated with premature death, muscle strength, and body fatness [1,2,3]. However, the reported effects of T deficiency or low levels of T (total T ≤ 300 ng/dL or 3.0 ng/mL) [4] on cardiovascular disease (CVD) remain inconsistent and controversial [5]. Similar inconsistency and controversy have been reported for low levels of calculated free T [6, 7].

Studies have suggested that as many as 38.7% of men in the United States (US) over 45 years old demonstrate low T or T deficiency [8,9,10], with close to 2.4 million men (aged 40–69 years) having T deficiency [11]. Of greater concern, it has been projected that by 2025 ~6.5 million US men (aged 30–80 years) will develop T deficiency, partly due to the aging of the population and the obesity epidemic [2, 9, 12]. A recent report indicates that 25% of men aged >65 years have low total T, but at least 50% of them have low levels when free T is used as the diagnostic criterion, suggesting that free T may be a better test for the diagnosis of T deficiency/hypogonadism [13]. Yet, this contention remains debatable [14].

Previous studies have provided valuable insight, but they included small samples of minority populations, which limits their generalizability to non-Hispanic (NH)-Black and Mexican American men [15, 16]. The latter homogenous group constitutes more than 60% of the Hispanic population in the US [17]. Previous studies have demonstrated racial and ethnic differences (NH-White, NH-Black and Mexican American) in total T levels among adult and adolescent men [18], and in the association of total T with CVD [19].

Therefore, the objectives of this investigation are to determine the association of total serum T and calculated free T with CVD and its specific disease outcomes (myocardial infarction [MI], heart failure [HF], coronary artery disease [CAD]), and to assess whether these associations vary among a US nationally representative sample of NH-White, NH-Black, and Mexican American men in the NHANES waves (full sample [1988–1991, 1999–2004, and 2011–2016] and subset sample [1988–1991, 1999–2004, and 2013–2016]). We hypothesize that these associations will vary by race and ethnicity.

Methods

Study population

The National Health and Nutrition Examination Survey (NHANES) is a program of the National Center for Health Statistics (NCHS), Centers for Disease Control and Prevention (CDC), that investigates the health of adults and children in the US [20]. NHANES is a cross-sectional survey that uses a multistage, stratified and clustered probability sampling strategy in which Hispanics (Mexican Americans), NH-Blacks, and the elderly are oversampled to ensure adequate sample size and to represent the total US civilian, non-institutionalized population [21]. Information about the survey design, data collection and methodology is available on the NHANES website (https://wwwn.cdc.gov/nchs/nhanes/Default.aspx; accessed Jan. 2020).

This investigation included men in the 1988–1991 (Phase 1), 1999–2004, and 2011–2016 NHANES cycles. Sex steroid hormones were measured from stored surplus serum samples by the study investigators in 7058 males aged ≥20 y. These participants were selected as a random sample from the morning examination sessions of each cycle to reduce extraneous variation due to diurnal production of hormones.

Participants with a history of prostate cancer were excluded because their treatments may affect sex steroid hormone levels. Further exclusion criteria were age younger than 20 y, missing covariate information, missing sex hormone measurements, and extreme hormone measurements, leaving a final full sample of 7058 males. The NHANES 2011–2012 wave did not measure estradiol and SHBG; therefore, a subset sample of the 1988–1991, 1999–2004, and 2013–2016 waves was developed to adjust for estradiol and SHBG (n = 5139).

Assessment of testosterone, estradiol and SHBG

Information on the blood draw, processing, storage and shipping methods has been published elsewhere [21]. NHANES 1988–1991 and 1999–2004 measured total T, estradiol, and SHBG using electrochemiluminescence immunoassays on the 2010 Elecsys system (Roche Diagnostics, Laval, QC, Canada; and Roche Diagnostics, Indianapolis, IN, USA). The lower limits of detection of the assays were 3 nmol/L for SHBG, 5 pg/mL for estradiol, and 2 ng/dL for T. Duplicates (n = 21) were assayed for quality control purposes; coefficients of variation were 4.8% for testosterone, 21.4% for estradiol, and 5.6% for SHBG. NHANES 2011–2016 measured total T and estradiol by LC-MS/MS: the analytes were isolated from 100 μL of serum by two serial liquid–liquid extraction steps and quantified with [13C] stable isotope–labeled T as the internal standard. The lower limit of detection was 0.3 ng/dL. Sex hormone binding globulin (SHBG) was measured based on the reaction of SHBG with immunoantibodies and chemiluminescence measurement of the reaction products, which are subjected to a magnetic field.

Total T ≤ 3.0 ng/mL was defined as low T or T deficiency [4]. Total T was also categorized into quintiles (Q) to compare the prevalence of CVD between Q5 and Q1 of total T under the hypothesis that high levels of T are a potential risk factor for CVD. Calculated free T was obtained with published formulas using the total T, estradiol, SHBG, and serum albumin data collected in NHANES [22, 23]. Free T ≤ 0.065 ng/mL was considered low per the expert opinion noted in the American Urological Association White Paper by Paduch et al. [24]. Calculated free T was also categorized into quintiles to compare the prevalence of CVD between Q5 and Q1 of calculated free T.
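The cut-offs and the quintile grouping described above can be illustrated with a minimal Python sketch. It is not the authors' SAS code: the function names and the sample values are hypothetical, and the quintile cut points here are unweighted sample quantiles, whereas the published analysis may have derived them differently (for example, with survey weights).

```python
from bisect import bisect_left
from statistics import quantiles

LOW_TOTAL_T_NG_DL = 300.0  # total T <= 300 ng/dL (3.0 ng/mL), as stated in the text
LOW_FREE_T_NG_ML = 0.065   # calculated free T <= 0.065 ng/mL, as stated in the text


def ng_dl_to_ng_ml(value_ng_dl):
    """Convert ng/dL to ng/mL (1 dL = 100 mL)."""
    return value_ng_dl / 100.0


def is_low_total_t(total_t_ng_dl):
    """Flag low T / T deficiency using the total T cut-off."""
    return total_t_ng_dl <= LOW_TOTAL_T_NG_DL


def is_low_free_t(free_t_ng_ml):
    """Flag low calculated free T using the free T cut-off."""
    return free_t_ng_ml <= LOW_FREE_T_NG_ML


def assign_quintiles(values):
    """Label each value 1..5 (Q1..Q5) using unweighted sample cut points."""
    cuts = quantiles(values, n=5)  # four cut points separating five groups
    return [bisect_left(cuts, v) + 1 for v in values]


# Hypothetical total T values in ng/dL, for illustration only.
total_t = [180.0, 250.0, 300.0, 340.0, 410.0, 455.0, 520.0, 600.0, 710.0, 850.0]
low_flags = [is_low_total_t(v) for v in total_t]
quintile_labels = assign_quintiles(total_t)
```

In this sketch a value exactly at a cut point falls into the lower group, which mirrors the "less than or equal to" convention used for the clinical thresholds.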

Assessment of cardiovascular disease (CVD)

In this study, we defined CVD as any reported diagnosis of HF, coronary artery disease (CAD), MI, or stroke. NHANES participants were asked the following structured questions: “Has a doctor or other health professional ever told you that you had congestive HF?,” “Has a doctor or other health professional ever told you that you had CAD?,” “Has a doctor or other health professional ever told you that you had a heart attack (or MI)?,” or “Has a doctor or other health professional ever told you that you had a stroke?” Participants who answered “yes” to any of these questions were classified as having CVD.
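As a minimal sketch, the composite CVD definition can be expressed as a single "any yes" rule over the four questionnaire items. The field names below are hypothetical (they are not NHANES variable names), and treating missing or non-"yes" answers as negative is an assumption of this sketch, not something stated in the text.

```python
# Hypothetical keys for the four self-reported diagnoses described above.
CVD_ITEMS = ("heart_failure", "coronary_artery_disease", "heart_attack", "stroke")


def has_cvd(responses):
    """Return True if any of the four items was answered 'yes'.

    `responses` maps item name -> 'yes' / 'no' / None. Anything other than
    'yes' (including a missing answer) counts as negative in this sketch.
    """
    return any(responses.get(item) == "yes" for item in CVD_ITEMS)


participant_a = {"heart_failure": "no", "heart_attack": "yes", "stroke": "no"}
participant_b = {"heart_failure": "no", "coronary_artery_disease": "no"}
```

Here `has_cvd(participant_a)` is true because of the reported heart attack, while `has_cvd(participant_b)` is false.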

Assessment of covariates

Age, cigarette smoking, race/ethnicity, alcohol consumption, education, diabetes and physical activity during the past 30 days were self-reported during the NHANES interviews. Glucose was measured in NHANES using the glucose hexokinase method with a Hitachi Model 704 multichannel analyzer (Boehringer Mannheim Diagnostics, Indianapolis, IN). Body mass index (BMI) was calculated as weight in kilograms divided by height in meters squared. Overall obesity was defined as BMI ≥ 30 kg/m2. Individuals were classified as having diabetes if their fasting plasma glucose levels were ≥126 mg/dl, or if they responded positively to questions about medication treatment or being “told by a doctor you have diabetes or sugar diabetes.” Three readings of systolic and diastolic blood pressure were obtained from participants who attended the mobile examination center. We used the average of those three measurements, with ≥140/90 mmHg indicating high blood pressure. We also considered the current use of antihypertensive medication or being “told by a doctor you have hypertension” as an indication of high blood pressure (hypertension). Serum total cholesterol was measured enzymatically [25], and serum lipid measurement was performed according to the criteria of CDC’s Lipid Standardization Program [26]. Details related to the laboratory procedures have been published previously [21]. Code is available upon request from the corresponding author.
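The derived covariates above follow simple rules that can be sketched in Python. The function and argument names are hypothetical, and reading "≥140/90 mmHg" as "mean systolic ≥140 or mean diastolic ≥90" is an assumption of this sketch; the text does not spell out how the two components are combined.

```python
from statistics import mean


def body_mass_index(weight_kg, height_m):
    """BMI = weight (kg) divided by height (m) squared."""
    return weight_kg / height_m ** 2


def is_obese(bmi):
    """Overall obesity: BMI >= 30 kg/m2."""
    return bmi >= 30.0


def has_diabetes(fasting_glucose_mg_dl, on_medication, told_by_doctor):
    """Diabetes: fasting glucose >= 126 mg/dL, medication use, or a reported diagnosis."""
    high_glucose = fasting_glucose_mg_dl is not None and fasting_glucose_mg_dl >= 126
    return high_glucose or on_medication or told_by_doctor


def has_hypertension(systolic, diastolic, on_medication, told_by_doctor):
    """Hypertension from the mean of the readings, medication use, or a reported diagnosis.

    Assumption: either component at or above its threshold counts as elevated.
    """
    elevated = mean(systolic) >= 140 or mean(diastolic) >= 90
    return elevated or on_medication or told_by_doctor


# Hypothetical participant, for illustration only.
bmi_example = body_mass_index(95.0, 1.75)
hypertensive_example = has_hypertension([138, 142, 141], [85, 88, 86], False, False)
```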

Statistical analysis

Sampling weights were applied to account for selection probabilities, over-sampling, non-response, and differences between the sample and the total US population. Geometric means and 95% confidence intervals (CIs) for total T, calculated free T, estradiol and SHBG concentrations were estimated in the full and subset samples. For this analysis, total T concentrations were transformed using the natural logarithm because they were right skewed. The information on the definition has been published previously [21]. In brief, for descriptive analysis, we compared the distribution of lifestyle and sociodemographic factors by full and subset samples using t-tests for means of continuous variables and chi-squared tests for categorical factors (Table 1 and Supplemental Table 1).
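A weighted geometric mean is the exponential of the weighted mean of the log-transformed values, which is why the log transformation and the sampling weights appear together here. The sketch below shows only the point estimate; the function name and the numbers are hypothetical, and the 95% CIs reported in the paper would additionally require design-based variance estimation (strata and sampling units), which is not shown.

```python
import math


def weighted_geometric_mean(values, weights):
    """exp( sum(w_i * ln(x_i)) / sum(w_i) ) for strictly positive values."""
    total_weight = sum(weights)
    mean_log = sum(w * math.log(v) for v, w in zip(values, weights)) / total_weight
    return math.exp(mean_log)


# Hypothetical total T values (ng/dL) and sampling weights, for illustration only.
example_values = [100.0, 1600.0]
example_weights = [3.0, 1.0]
example_gm = weighted_geometric_mean(example_values, example_weights)
```

Because the averaging happens on the log scale, a few very high concentrations pull the geometric mean up far less than they would pull an arithmetic mean, which suits right-skewed hormone data.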

Table 1 Selected characteristics of the U.S. population of adult men 20 y and older in the full and subset samples in NHANES 1988–1991, 1999–2004, 2011–2016.

We used weighted logistic regression models to estimate multivariable-adjusted odds ratios and 95% CIs for prevalent CVD and specific outcomes (HF, CAD and MI) associated independently with total T and calculated free T. Weighted multivariable-adjusted analyses were performed in two sets of models, namely, the full sample [1988–1991, 1999–2004, and 2011–2016] and the subset sample [1988–1991, 1999–2004, and 2013–2016]. The full sample models were adjusted for race and ethnicity, age, smoking status, education, history of hypertension, physical activity, alcohol consumption, BMI, diabetes, and total cholesterol. In parallel, the subset sample models were adjusted for the same risk factors plus estradiol and SHBG [27], which were not included in NHANES 2011–2012. To test for a linear trend across categories of total T and calculated free T, we modeled the categories as continuous variables using the median for each category.
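To make the modeling steps concrete, the following Python sketch fits a weighted logistic regression with an intercept and a single predictor by Newton-Raphson, and shows how category medians can be used as the trend score. It is deliberately simplified and is not the authors' SAS analysis: it is unadjusted, it ignores the survey design when it comes to variance, it does not guard against a constant predictor, and all names and data are hypothetical.

```python
import math
from statistics import median


def category_median_scores(values, categories):
    """Replace each value by the median of its category (the trend score)."""
    by_category = {}
    for value, category in zip(values, categories):
        by_category.setdefault(category, []).append(value)
    medians = {c: median(vs) for c, vs in by_category.items()}
    return [medians[c] for c in categories]


def fit_weighted_logistic(x, y, w, iterations=25):
    """Return (intercept, slope) maximizing the weighted log-likelihood."""
    b0, b1 = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, yi, wi in zip(x, y, w):
            p = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))
            residual = wi * (yi - p)       # weighted score contribution
            info = wi * p * (1.0 - p)      # weighted information contribution
            g0 += residual
            g1 += residual * xi
            h00 += info
            h01 += info * xi
            h11 += info * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: beta <- beta + H^{-1} g, with the 2x2 inverse written out.
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


# Hypothetical data: exposure (1 = low T), outcome (1 = CVD), sampling weights.
exposed = [1, 1, 1, 1, 0, 0, 0, 0]
cvd = [1, 1, 0, 0, 1, 0, 0, 0]
weights = [2.0, 1.0, 1.5, 1.5, 1.0, 2.0, 1.0, 3.0]
intercept, slope = fit_weighted_logistic(exposed, cvd, weights)
odds_ratio = math.exp(slope)
```

With a single binary predictor the fitted odds ratio should agree with the odds ratio from the weighted 2 × 2 table, which gives a quick sanity check on the fit. For the trend test, the predictor would instead be the output of `category_median_scores`, and the slope would be tested against zero.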

Stratified and weighted multivariable-adjusted analyses were conducted by race and ethnicity (NH-White, NH-Black, and Mexican American) because this factor has been observed to modify T levels [18]. All p values were two-sided; alpha = 0.05 was considered the cut-off for statistical significance. Multiplicative interaction terms were incorporated into the models and tested using the Wald test. All statistical analyses were performed using SAS (SAS Institute v.9.4, Cary, NC).
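For a single model coefficient, the Wald test and the corresponding 95% CI for the odds ratio can be sketched as follows. This uses a standard normal reference distribution and a one-coefficient test; survey procedures may instead use design-based degrees of freedom, and an interaction with a three-level factor would need a joint test, so the functions and inputs here are illustrative assumptions only.

```python
import math


def wald_test(estimate, std_error):
    """Return (z, two-sided p) for H0: coefficient = 0, normal approximation."""
    z = estimate / std_error
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # equals 2 * (1 - Phi(|z|))
    return z, p_value


def odds_ratio_ci(estimate, std_error, z_crit=1.959964):
    """Return (OR, lower, upper) from a log-odds estimate and its standard error."""
    return (
        math.exp(estimate),
        math.exp(estimate - z_crit * std_error),
        math.exp(estimate + z_crit * std_error),
    )


# Hypothetical log-odds coefficient and standard error, for illustration only.
z_stat, p_value = wald_test(0.45, 0.20)
or_point, or_lower, or_upper = odds_ratio_ci(0.45, 0.20)
is_significant = p_value < 0.05
```

A CI for the odds ratio that excludes 1 corresponds to a two-sided Wald p value below 0.05 under the same approximation.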

Results

Within the full and subset samples, we found 7058 and 5139 men, respectively. In the full sample, 3723 men were NH-White (78.97%), 1870 were NH-Black (11.01%), and 1465 were Mexican American (10.02%), with similar percentages in the subset sample (Table 1). Mean age was 48.75 years in the full sample and 48.87 years in the subset sample. In both full and subset samples, men had higher education (>12 years-high school, >30% some college), were overweight/obese (mean BMI > 28 kg/m2), were never smokers (>47%), had prevalent diabetes (>12%) and hypertension (>43%), were physically active (>55%), and had moderate levels of alcohol consumption (mean >13 g) and total cholesterol (mean >191 mg/dL). Similar differences between the full and subset samples were observed when selected characteristics were stratified by CVD status (Supplemental Table 1).

In the full sample, only low T was associated with an increased prevalence of CVD (OR = 1.26, 95% CI, 1.02–1.57) after adjusting for CVD risk factors (Table 2). In the subset sample, after adjusting for the same CVD risk factors plus estradiol and SHBG levels, low T (OR = 1.57, 95% CI, 1.17–2.11), quintiles of total T (Q1 vs Q5, OR = 2.25, 95% CI, 1.01–5.01, Ptrend = 0.02), low calculated free T (OR = 1.53, 95% CI, 1.10–2.17) and quintiles of calculated free T (Q1 vs Q5, OR = 1.59, 95% CI, 0.68–3.72, Ptrend = 0.03) were associated with an increased prevalence of CVD (Table 2).

Table 2 Multivariable associations of total and calculated free testosterone (T) with cardiovascular diseases in the full and subset samples in NHANES III Phase I (1988–1991), 1999–2004, and 2011–2016.

In both full and subset samples, low T was significantly associated with an increased prevalence of MI (OR = 1.40, 95% CI, 1.03–1.89, and OR = 1.72, 95% CI, 1.08–2.75, respectively) (Table 3). In both full and subset samples, low T was associated with an increased prevalence of HF (OR = 1.51, 95% CI, 1.09–2.10, and OR = 1.74, 95% CI, 1.08–2.85, respectively). Similar inverse associations with HF were found with quintiles of total T in the full (Q1 vs Q5, OR = 1.92, 95% CI, 1.09–3.51, Ptrend = 0.03) and subset samples (Q1 vs Q5, OR = 3.34, 95% CI, 1.22–9.13, Ptrend = 0.004). For HF and CAD, only in the subset sample did we find that a continuous increment of total T was associated with a reduced prevalence of these diseases (Table 3).

Table 3 CVD specific analysis: Multivariable associations of total and calculated free testosterone with myocardial infarction, heart failure, and coronary heart disease in the full and subset samples in NHANES III Phase I (1988–1991), 1999–2004, 2011–2016.

Among NH-White men, both full and subset samples showed that only low T was significantly associated with an increased prevalence of CVD (OR = 1.34, 95% CI, 1.04–1.73, and OR = 1.73, 95% CI, 1.26–2.38, respectively) (Table 4). Only in the subset sample were low calculated free T (OR = 1.67, 95% CI, 1.10–2.49) and quintiles of calculated free T (Q1 vs Q5, OR = 1.80, 95% CI, 0.66–4.95, Ptrend = 0.03) associated with an increased prevalence of CVD. Among Mexican American men, only in the subset sample did we find that continuous increments of total T (OR = 0.71, 95% CI, 0.51–0.97, Ptrend = 0.03) and calculated free T (OR = 0.73, 95% CI, 0.58–0.92, Ptrend = 0.01) were associated with a reduced prevalence of CVD. Low calculated free T (OR = 2.55, 95% CI, 1.35–4.81) and quintiles of calculated free T (Q1 vs Q5, OR = 1.57, 95% CI, 0.27–8.97, Ptrend = 0.03) were associated with an increased prevalence of CVD. In general, among NH-Black men, there were no significant associations (Table 4).

Table 4 Race and Ethnicity: Multivariable associations of total and calculated free testosterone with cardiovascular diseases in the full and subset samples in NHANES III Phase I (1988–1991), 1999–2004, and 2011–2016.

Discussion

To our knowledge, the novelty of this study is the quantification of the associations of total T and calculated free T with CVD, and its specific disease outcomes (MI, HF, CAD), among a US nationally representative sample of NH-White, NH-Black, and Mexican American men. Our findings showed that low T or T deficiency, low calculated free T, total T (Q1 vs Q5), and calculated free T (Q1 vs Q5) were associated with an increased prevalence of CVD after adjusting for CVD-risk factors plus estradiol and SHBG (subset sample). Similarly, low T was associated with an increased prevalence of MI and HF, and the continuous increment of total T was associated with a decreased prevalence of CAD and HF. In general, the direction and significance of these associations were consistent among NH-White and Mexican American men, but not Black men.

Three meta-analyses of observational studies published in 2011 reported that low levels of total T were associated with an increased incidence of CVD and CVD mortality [28,29,30], whereas other studies did not [31,32,33]. Subsequently, a larger 2018 meta-analysis of observational studies confirmed these inverse associations [34]. Yet, none of these meta-analyses conducted specific analyses among NH-Black and Mexican American men. Our findings in the full sample of the overall population (NH-White, NH-Black and Mexican American) are consistent with the results of these meta-analyses with respect to the association between low levels of T and higher risk of CVD. Furthermore, the largest studies included in the 2018 meta-analysis [34] were conducted mainly among NH-White men (between 2084 and 3637 participants included in the independent studies) [7, 35,36,37,38,39,40]. In our study, the largest racial group was NH-White men (n = 3723), and our findings in this group were similar to those reported by the previous meta-analyses [28,29,30, 34].

Differences in the levels of total T among adult and adolescent NH-White, NH-Black and Mexican American men have been noted previously; Mexican American adult and adolescent men had higher levels of total T than their NH-White and NH-Black counterparts [18, 41]. These previous findings may provide insight into our observations, as we only found significant associations between low levels of T and CVD among Mexican American and NH-White men.

Similar to the previous studies of total T, low free T has been linked with an increased risk of CVD and CVD mortality [7, 30]. However, these previous free T studies did not conduct specific analyses on NH-Black and Hispanic men. In a meta-analysis of 7 prospective studies, low free T was associated with an increased risk of CVD among healthy, middle-aged and older men [30]. Our findings are consistent with these previous studies [30], as we found that low calculated free T was associated with an increased prevalence of CVD in the overall population (n = 5139) and among NH-White men (n = 2688, which included middle-aged and older adults). In our study, a similar significant association was observed among Mexican Americans (middle-aged and older men, n = 1465).

There is a considerable debate regarding whether calculated free T is a stronger indicator of T deficiency/hypogonadism diagnosis [13, 24] than total T particularly in view of recent reports of stronger associations of low free T (compared with low total T) with CVD mortality [7] and prostate cancer [42]. However, our findings with low T and low calculated free T among NH-White and Mexican American men do not support that contention.

What remains to be explained is the inconsistency of the associations between T deficiency and CVD among Mexican American and NH-Black men after taking comorbidities into account. These groups have the highest prevalence of diabetes, obesity, and metabolic syndrome [19, 43], which are considered among the strongest risk factors for T deficiency and CVD [2, 19, 44]. These inconsistent associations may suggest that other biological pathways (e.g., the inflammation pathway [45, 46]) may influence the interplay between T deficiency and CVD by race/ethnicity.

Our study has strengths. NHANES includes a nationally representative sample of the civilian non-institutionalized US population; therefore, the findings of this study can be generalized to the US population. Furthermore, NHANES adheres to a rigorous protocol of quality control procedures for the collection of the outcomes of interest, exposures and potential confounding factors analyzed and adjusted for in this study. In this study, we mutually adjusted for total T, SHBG, and estradiol. The scope of this investigation is not to demonstrate whether calculated free T is better than total T or vice versa, but rather to follow current guidelines from several societies suggesting the use of free T as a confirmatory marker in cases of borderline low total T [14].

Despite these strengths, the current study has limitations that may influence interpretation of the results. First, we conducted a cross-sectional study, which precludes the investigation of a temporal relationship between T and CVD. Furthermore, we relied on a single measurement of sex steroid hormones. This limits our ability to make a strong causal inference between T levels and CVD outcomes. It is possible that, at least in some men, CVD and associated pharmacotherapy resulted in decreased T levels. In addition, due to limited sample size and power, we could not conduct stroke-specific analyses, as the validity of the models was questionable. Second, the storage of serum or plasma in collection tubes after the centrifugation procedure can influence the measured total T, and the ethylenediaminetetraacetic acid in collection tubes can influence SHBG; subsequently, these two can affect the calculation of free T [24]. Estradiol and SHBG were not available in all the NHANES cycles (2011–2012); therefore, two different datasets were created, the full (1988–1991, 1999–2004, and 2011–2016) and subset samples (1988–1991, 1999–2004, and 2013–2016). The use of these two samples will allow comparison with future studies where estradiol and SHBG may or may not be available. Third, the later NHANES waves (2011–2016) measured total T with HPLC tandem mass spectrometry, but the previous ones (1988–1991, 1999–2004) used electrochemiluminescence immunoassays. Fourth, the cut-off points used to define T deficiency and low calculated free T may not fully or precisely capture their biological effect. Fifth, NHANES had no information related to testosterone replacement therapy, which could have influenced our findings; others have suggested caution about the use of testosterone therapy in the prevention of CVD [47]. Sixth, although we adjusted for several CVD risk factors, there remains the possibility of residual confounding.
Finally, future studies with available data should explore in depth the role of population genomics and how genomic variation could influence the association between testosterone and CVD in different racial and ethnic groups [48, 49] that could provide insight about our findings with Mexican American men.

In summary, the results of this study confirm and elaborate on the observation that men with low levels of total T and calculated free T had an increased prevalence of CVD. Similarly, T deficiency was associated with an increased prevalence of MI and HF, and the continuous increment of total T was associated with a decreased prevalence of CAD and HF. In general, the direction and significance of these associations were consistent among NH-White and Mexican American men, but not Black men. Future studies with prospective designs and larger sample sizes of NH-Black and Mexican American men are required to confirm these findings.