A phenome-wide comparative analysis of genetic discordance between obesity and type 2 diabetes

Coral, Daniel E.; Fernandez-Tajes, Juan; Tsereteli, Neli; Pomares-Millan, Hugo; Fitipaldi, Hugo; Mutie, Pascal M.; Atabaki-Pasdar, Naeimeh; Kalamajski, Sebastian; Poveda, Alaitz; Miller-Fleming, Tyne W.; Zhong, Xue; Giordano, Giuseppe N.; Pearson, Ewan R.; Cox, Nancy J.; Franks, Paul W.

doi:10.1038/s42255-022-00731-5

Download PDF

Article
Open access
Published: 26 January 2023

A phenome-wide comparative analysis of genetic discordance between obesity and type 2 diabetes

Nature Metabolism volume 5, pages 237–247 (2023)Cite this article

10k Accesses
17 Citations
47 Altmetric
Metrics details

Subjects

Abstract

Obesity and type 2 diabetes are causally related, yet there is considerable heterogeneity in the consequences of both conditions and the mechanisms of action are poorly defined. Here we show a genetic-driven approach defining two obesity profiles that convey highly concordant and discordant diabetogenic effects. We annotate and then compare association signals for these profiles across clinical and molecular phenotypic layers. Key differences are identified in a wide range of traits, including cardiovascular mortality, fat distribution, liver metabolism, blood pressure, specific lipid fractions and blood levels of proteins involved in extracellular matrix remodelling. We find marginal differences in abundance of Bacteroidetes and Firmicutes bacteria in the gut. Instrumental analyses reveal prominent causal roles for waist-to-hip ratio, blood pressure and cholesterol content of high-density lipoprotein particles in the development of diabetes in obesity. We prioritize 17 genes from the discordant signature that convey protection against type 2 diabetes in obesity, which may represent logical targets for precision medicine approaches.

Genome-wide association studies

Article 26 August 2021

Divergent age-associated and metabolism-associated gut microbiome signatures modulate cardiovascular disease risk

Article 06 June 2024

Identifying proteomic risk factors for cancer using prospective and exome analyses of 1463 circulating proteins and risk of 19 cancers in the UK Biobank

Article Open access 15 May 2024

Main

Cardiometabolic diseases are the leading cause of death globally, with obesity and type 2 diabetes mellitus (T2D) accounting for a large proportion of this burden¹. The prevalences of obesity and T2D have risen sharply over the past decades worldwide², corresponding with a shift to sedentary lifestyles and poor diet³. Even though obesity and T2D often coincide, their relationship is complex and remains incompletely understood. Indeed, while more than 80% of people with T2D also have obesity, 10–30% of people with obesity appear metabolically healthy^4,5,6. Conversely, metabolic abnormalities occur in ~30% of normal-weight individuals^7,8,9. Likewise, despite weight loss improving glycaemic control in people with T2D¹⁰, when T2D occurs in people with normal weight, mortality rates are higher than those in people with overweight or obesity^11,12. Here, we refer to these divergent features as ‘discordant diabesity’. We focus on this unusual phenotype because it helps leverage the independent roles of excess adiposity and T2D in life-threatening disease.

To some extent, this discordance can be attributed to the imprecision with which body mass index (BMI), the conventional metric used to define obesity, characterizes adiposity^13,14. For instance, even when BMI is comparable, lean and fat mass distributions often vary from one person to the next¹⁵. Genetics has helped provide pathophysiological explanations for discordant diabesity, whereby, collectively, common variants affecting adipose distribution mimic monogenic syndromes such as familial lipodystrophies^{16,17,18,19,20,21}. Expanding our knowledge of the phenotypic signature of discordant diabesity using the quantitative framework of genetics may help elucidate the mechanisms by which the broader health consequences of excess adiposity varies from one person to the next.

Here, we characterize genetically determined discordant diabesity through a comparative analysis with its concordant counterpart (that is, where higher genetic risk of obesity and T2D coincide). We used a range of machine learning methods to undertake phenome-wide scans to identify traits other than T2D that distinctively characterize these profiles. We concluded by undertaking robust causal inference analyses to determine the causal relationships underlying discordant diabesity with other features of health and disease.

Results

Assembly of concordant and discordant diabesity profiles

An analysis flowchart is presented in Extended Data Fig. 1a. We first identified genetic instruments for BMI²² and T2D²³ by cross-referencing publicly accessible genome-wide association study (GWAS) summary statistics, finding 67 relatively independent single nucleotide polymorphisms (SNPs) strongly associated with both conditions (P < 5 × 10⁻⁸). After alignment to the BMI-increasing allele, these variants were labelled as ‘concordant’ (48 SNPs) or ‘discordant’ (19 SNPs) according to the positive or negative sign of their coefficients for T2D, respectively (Extended Data Fig. 1b and Supplementary Table 1; replication shown in Supplementary Table 2). Visual inspection of correlation patterns between BMI and T2D signals at each locus was undertaken using regional association plots (Supplementary Figs. 1 and 2).

Phenome-wide scans

Among the clinical phenotypes, we found that concordant and discordant diabesity profiles differed predominantly in cardiometabolic features including high-density lipoprotein (HDL) cholesterol, waist-to-hip ratio (WHR), waist circumference, and blood pressure (Fig. 1 and Supplementary Table 3). Generally, the discordant profile was associated with a favourable phenotypic signature compared to the concordant profile. For example, systolic blood pressure (SBP) was lower in the discordant compared to the concordant profile (SBP: β_C = 0.002 s.d. units per allele (95% confidence interval (CI): −0.001, 0.004), β_D = −0.008 s.d. units per allele (95% CI: −0.012, −0.004), pδ = 1.39 × 10⁻⁴). We also found differences in risk of coronary heart disease (CHD) and stroke, which were lower in the discordant compared to the concordant profile (for example, CHD: odds ratio (OR)_c = 1.01 per allele (95% CI: 1.01, 1.02), OR_D = 0.98 per allele (95% CI: 0.97, 0.99), pδ = 1.3 × 10⁻⁶). The levels of biomarkers of liver function such as gamma-glutamyl transferase (GGT) and alanine aminotransferase (ALT) enzymes were lower in the discordant relative to the concordant profile (for example, ALT: β_C = 0.008 s.d. units per allele (95% CI: 0.006, 0.011), β_D = −0.011 (95% CI: −0.019, −0.003), pδ = 2.07 × 10⁻⁶). SHBG, a protein also produced in the liver, was higher in the discordant as opposed to the concordant profile (β_C = −0.008 s.d. units per allele (95% CI: −0.012, −0.004), β_D = 0.013 s.d. units per allele (95% CI: 0.007, 0.019), pδ = 1.94 × 10⁻⁸). Additionally, the discordant profile was associated with higher mean corpuscular volume (β_C = −0.002 s.d. units per allele (95% CI: −0.005, 0), β_D = 0.006 s.d. units per allele (95% CI: 0.002, 0.01), pδ = 8.76 × 10⁻⁴) and lower levels of urate (β_C = 0.007 s.d. units per allele (95% CI: 0.004, 0.01), β_D = −0.005 s.d. units per allele (95% CI: −0.01, −0.001), pδ = 3 × 10⁻⁶) compared to the concordant profile. The odds of receiving treatment with alendronate was higher in the discordant than in the concordant profile, a drug indicated for osteoporosis (OR_C = 0.99 per allele (95% CI: 0.99, 0.99), OR_D = 1.001 per allele (95% CI: 1.001, 1.001), pδ = 3.26 × 10⁻⁶).

**Fig. 1: Summary-based comparison of concordant and discordant profiles.**

Profile decomposition

Further exploration of the molecular features of the discordant and concordant profiles revealed that some variants used to characterize these profiles deviated from the overall pattern of trait association for their respective SNP set. Using single-linkage clustering on the SNP–trait matrix (Extended Data Fig. 1c), we identified two outliers in the concordant profile, one near GCKR, associated with higher SHBG and lower liver enzymes (SHBG: 0.07 (95% CI: 0.07, 0.08), P = 7.5 × 10⁻¹⁹⁹) and a second near TOMM40 associated with higher HDL (0.07 s.d. units per allele (95% CI: 0.06, 0.08), P = 3.7 × 10⁻¹⁰⁷). In the discordant profile, the last variant to be aggregated to the clustering tree (that is, the SNP most distal from the other SNPs within its set) is located near SLC2A2 and, in contrast to the overall discordant estimates, was associated with higher levels of AST and GGT (GGT: 0.02 s.d. units per allele (95% CI: 0.02, 0.03), P = 2.7 × 10⁻²²).

External validation in BioVU

We sought replication of the discoveries outlined above in an independent European-ancestry cohort from BioVU, a de-identified collection of electronic health records and a linked biobank including inpatient and outpatient data from Vanderbilt University Medical Center (VUMC), a tertiary-care centre in Nashville, Tennessee, USA^24,25,26. We constructed separate genetic risk score (GRS) coefficients for concordant and discordant profiles and assessed their association with multiple phenotypes (Fig. 2 and Supplementary Table 4). We first confirmed that the concordant and discordant GRSs were associated with higher obesity risk, respectively (OR_C = 1.03 per allele (95% CI: 1.03, 1.04), OR_D = 1.02 per allele (95% CI: 1.01, 1.02), pδ = 1.6 × 10⁻³) and that the concordant and discordant profiles were positively and negatively associated with diabetes diagnosis, respectively (OR_C = 1.03 per allele, (95% CI: 1.02, 1.04), OR_D = 0.95 per allele (95% CI: 0.94, 0.96), pδ = 3.2 × 10⁻⁴⁹). Both scores were associated with increased odds of bariatric surgery (OR_C = 1.05 per allele (95% CI: 1.03, 1.06), OR_D = 1.03 per allele (95% CI: 1.008, 1.06), pδ = 0.24). We found divergent associations in multiple diseases directly related to the main traits (for example, essential hypertension (HT): OR_C = 1.014 per allele (95% CI: 1.009, 1.019), OR_D = 0.99 per allele (95% CI: 0.98, 0.99), pδ = 1.2 × 10⁻⁶). We also observed differences for other disease outcomes such as chronic kidney disease (OR_C = 1.02 per allele (95% CI: 1.01, 1.02), OR_D = 0.98 per allele (95% CI: 0.97, 0.99), pδ = 2.9 × 10⁻⁶) and osteoarthrosis (OR_C = 1.01 per allele (95% CI: 1, 1.01), OR_D = 1.02 per allele (95% CI: 1.01, 1.03), pδ = 0.012). Because both scores were also strongly associated with type 1 diabetes (T1D; OR_C = 1.05, (95% CI: 1.03, 1.05), OR_D = 0.96, (95% CI: 0.94, 0.97), pδ = 4.8 × 10⁻¹⁷), we repeated the analyses excluding individuals with T1D. This attenuated the differences between concordant and discordant profiles for a small subset of traits including diabetic retinopathy and end-stage chronic kidney disease.

**Fig. 2: Comparison of concordant and discordant profiles in BioVU.**

We also assessed the association of each GRS to multiple laboratory measurements in individuals of European (n > 68,000) and African American (n > 14,000) descent (Supplementary Table 5). The value per individual was computed as the median value over all measurements after a quality-control pipeline described in detail elsewhere²⁷. Significant differences were found for several glycaemic traits consistent with the diabetes risk profiles (for example, HbA1c: s.d. difference per concordant allele: 0.05 (95% CI: 0.04, 0.06), s.d. difference per discordant allele: −0.06 (95% CI: −0.08, −0.05), pδ = 1.5 × 10⁻³⁹). We confirmed the difference between the two profiles in HDL and observed differences in the other two main lipid fractions (for example, triglycerides: s.d. difference per concordant allele: 0.02 (95% CI: 0.01, 0.03), s.d. difference per discordant allele: −0.04 (95% CI: −0.05, −0.03), pδ = 1.5 × 10⁻¹⁴). The findings for red blood cell phenotypes were also replicated, and additional differences were found in leucocyte count, urea, creatinine, phosphate, C-reactive protein and parathyroid hormone (PTH), all of which were higher in carriers of concordant SNPs. Of the liver enzymes, only ALT values were available for comparison, whose levels were weakly associated with the concordant but not the discordant GRS (s.d. difference per concordant allele: 0.14 (95% CI: 0.02, 0.26), s.d. difference per discordant allele: 0.06 (95% CI: −0.08, 0.2), pδ = 0.2). In individuals of African American descent, significant differences were found in HbA1c, glucose and urea levels in urine.

Differences in mortality in UK Biobank

We examined the relationship of GRSs to mortality owing to cardiovascular events in >337,000 participants of European descent from the UK Biobank (mean follow-up of 11.8 years). Around 35,000 deaths were reported, of which approximately 20% were related to cardiovascular events. The concordant GRS was associated with higher mortality (hazard ratio (HR) per allele: 1.01 (95% CI: 1.01, 1.02)), whereas the discordant GRS was not (HR per allele: 0.99 (95% CI: 0.98, 1.01), pδ = 0.02). However, when assessing each SNP separately, we observed that the concordant variant near TOMM40 was associated with lower incidence of cardiovascular mortality (HR per allele: 0.85 (95% CI: 0.81, 0.90), P = 4.54 × 10⁻⁹ and Supplementary Table 6).

Differences in serum metabolites

Of the metabolites available, those related to lipid subfractions were the strongest discriminators of concordant and discordant profiles (Fig. 3 and Supplementary Table 7). Discordant diabesity was associated with higher cholesterol in lipoprotein particles of all densities, while lower triglyceride content in lipoprotein particles of low densities, as opposed to concordant diabesity (for example, free cholesterol in HDL: β_C = −0.008 s.d. units per allele (95% CI: −0.01, −0.005), β_D = 0.008 s.d. units per allele (95% CI: 0.004, 0.013), pδ = 3.09 × 10⁻¹⁰). Discordant diabesity also correlated with lower levels of branched-chain amino acids and aromatic amino acids, whereas in concordant diabesity they tended to be higher (total concentration of branched-chain amino acids: β_C = 0.004 s.d. units per allele (95% CI: 0.002, 0.008), β_D = −0.008 s.d. units per allele (95% CI: −0.012, −0.003), pδ = 1.46 × 10⁻⁶).

**Fig. 3: Comparison of concordant and discordant profiles in molecular phenotypes.**

Differences in gut microbiota

There were no differences between pooled concordant and discordant estimates for bacterial abundance in the gut that were statistically significant after false discovery rate (FDR) correction. Across ten taxa, several were nominally associated (P < 0.05) within either the concordant or the discordant profiles (Fig. 3 and Supplementary Table 8). Four of these belonged to the phylum Bacteroidetes (family Bacteroidaceae and geni Bacteroides, Parabacteroides and Butyricimonas), all of which were less abundant in discordant relative to concordant diabesity (for example, family Bacteroidaceae: β_C = 0.005 s.d. units per allele (95% CI: 0.001, 0.008), β_D = −0.004 s.d. units per allele (95% CI: −0.004, −0.01), pδ = 0.004). The remaining taxa belonged to the phylum Firmicutes, most of them members of the obligately anaerobic class Clostridia, which tended to be more abundant in the discordant profile compared to the concordant profile (for example, genus Subdoligranulum: β_C = −0.003 s.d. units per allele (95% CI: −0.006, 0.001), β_D = 0.006 s.d. units per allele (95% CI: 0.007, 0.011), pδ = 0.006). The family Lactobacillaceae was also lower in the discordant compared to the concordant profile (β_C = 0.006 s.d. units per allele (95% CI: 0.001, 0.01), β_D = −0.006 s.d. units per allele (95% CI: −0.014, 0.003), pδ = 0.02).

Differences in serum protein levels

We found a significant difference between concordant and discordant estimates after FDR correction in a single protein: heparan sulfate 6-O-sulfotransferase 2 (HS6ST2), which was higher in discordant relative to concordant diabesity (β_C = −0.01 s.d. units per allele (95% CI: −0.017, 0), β_D = 0.03 s.d. units per allele (95% CI: 0.02, 0.04), pδ = 7.52 × 10⁻⁷; Fig. 3). These analyses may be underpowered given that the effect of variants in trans is likely to be weaker than that of those in the gene encoding the protein. Thus, we also searched for strong cis effects (P < 5 × 10⁻⁸) in the discordant profile. We found one association between the discordant variant near PPARG and metalloproteinase inhibitor 4 (TIMP4; β = −0.28 s.d. units per allele (95% CI: −0.35, −0.2), P = 5 × 10⁻¹⁴).

Functional annotation using DEPICT

We used the Data-driven Expression Prioritized Integration for Complex Traits (DEPICT)²⁸ tool to compare the enrichment for tissues and biological pathways in each profile (Supplementary Figs. 3–5). The most notable difference was the significant enrichment (P < 0.05) for adipose tissue in the discordant profile, which was not found in the concordant profile. We also found significant enrichment for adrenal glands, ileum and kidney in the discordant but not in the concordant profile. Conversely, there was significant enrichment for endocrine tissue and retina in the concordant but not the discordant profile. Tissues for which there was significant enrichment in both profiles included pancreas and myometrium.

Gene expression and splicing in discordant diabesity

We found 506 genes whose expression/splicing was significantly influenced by concordant SNPs and 76 which were influenced by discordant SNPs across multiple tissues in GTEx²⁹. In eQTLGen³⁰, we found significant associations of concordant SNPs with expression of 493 genes. Discordant SNPs were associated with 94 genes. Around 46% of all the associations found in GTEx were replicated in eQTLGen (47% of the genes associated with concordant SNPs; 39% of the genes associated with discordant SNPs).

To identify genes most likely involved in the molecular mechanisms leading to discordant diabesity, we chose genetic instruments for the 76 genes whose expression was influenced by discordant SNPs in the corresponding tissues in GTEx, and for the 94 genes in eQTLGen. A prerequisite for these instruments was that they are strongly associated with BMI (P < 5 × 10⁻⁸). We followed the SMR & HEIDI approach³¹, which utilizes the strongest instrument for gene expression/splicing within the cis region of the corresponding gene (±500 Mb from the transcription start site) to calculate an estimate of the pleiotropic association across gene expression, BMI and T2D risk. This approach then determines if the association found reflects true pleiotropy rather than mere linkage by testing for heterogeneity of the estimates of SNPs in linkage disequilibrium (LD) with the lead SNP. We found 17 genes with robust expression signals for obesity and T2D whose directions of effect were in contrast (FDR-corrected P < 5%, p_HEIDI > 0.01; Fig. 4 and Supplementary Table 9). To locate the most likely tissue of action for these genes, we followed a scoring procedure³² through which a tissue specificity score is derived for each gene. This is calculated as (i) the proportion of median expression (in transcripts per million) across specific tissue types catalogued in GTEx and (ii) evidence of promoter/enhancer histone marks surrounding the genetic instruments, derived from multiple cell lines classified anatomically by the RoadMap Epigenomics Project³³ that could be mapped to tissue samples in GTEx. For each gene, we sorted tissues where we found pleiotropic links according to its specificity score and presence of promoter/enhancer signals for the genetic instrument. This allowed us to prioritize potential main action sites of relevance to discordant diabesity, for example, LYPLAL1 in adipose tissue and JAZF1 in vasculature and pancreas, while confirming the widespread effects of SLC22A3 across multiple organs.

**Fig. 4: Genes with likely discordant pleiotropic effects on BMI and T2D.**

Discordant diabesity genes as therapeutic targets

We performed a lookup of the genes identified previously in the comprehensive public access databases DGIdb³⁴ and PHAROS³⁵. Notably, there was evidence of interaction between three of the genes with strong pleiotropic associations (SLC2A2, SLC22A3 and KCNJ11) and metformin in both databases. SLC22A3 interacted with various quinoline derivatives (decynium-22, disprocynium-24, found in both databases), SarCNU (an antineoplastic drug in phase 2 clinical trials), derivatives of the alpha blocker phenoxybenzamine, corticosterone and colchicine. There is also evidence of potent inhibition of GLUT2, the protein product of SLC2A2, by a specific class of pyrazolopyrimidines. SLC38A11, MAU2 and FBXO46 are classified in the ‘Tdark’ level of target development in the PHAROS database, composed of understudied targets, while the remaining genes fall under the ‘Tbio’ level, which includes targets with no known interactions yet satisfying other conditions, such as having functional annotations based on experimental evidence, repeated mentions in publications indexed in PubMed, and 50 or more available commercial antibodies.

Instrumental variable analyses

To quantify the potential impact of traits that emerged from the previous steps on offsetting the diabetogenic effect of obesity, we derived genetic instruments for each of these traits using SNPs that were also robustly associated with BMI (P < 5 × 10⁻⁸) and decomposed these instruments into two groups based on their direction of effect on the trait of interest after alignment to the BMI-increasing allele. We then constructed two GRSs, one for each group of variants, and calculated the T2D risk conferred by each GRS using summary data from the DIAGRAM consortium; we focused on GRSs that confer protection from T2D.

From the clinical phenotypes, the GRSs that conveyed higher BMI but lower WHR and SBP were significantly associated with lower T2D risk (Extended Data Fig. 2a and Supplementary Table 10). For example, the estimate for the GRS associated with higher BMI but lower WHR had an OR of 0.96 per allele (95% CI: 0.94–0.98, P = 6.71 × 10⁻⁵). Some traits in the clinical phenotypes required instruments to be in cis with the gene encoding the corresponding protein (for example, SHBG), to prevent confounding due to pleiotropy. We found two such instruments for ApoA1 and SHBG, respectively, which were not associated with T2D risk (P = 0.17 and 0.84, respectively; Supplementary Table 11) despite their strong association with higher BMI. From the analysis of metabolites, we found two GRS coefficients associated with higher BMI and lower T2D risk. The strongest protection was found for the GRS conferring higher total concentration of lipoprotein particles (OR: 0.98, 95% CI: 0.96, 0.99, P = 0.006; Supplementary Table 12), consistent with our findings in the phenome scans.

To test for the potential causal effect on diabesity discordance of HS6ST2 and TIMP4, the two proteins identified in the previous analysis, we searched for valid instruments (P value for both protein levels and BMI < 5 × 10⁻⁸) in the cis region of the corresponding genes. We could only derive a valid instrument for TIMP4. Using the SMR & HEIDI method, we found a significant pleiotropic effect (P = 3.8 × 10⁻⁷, p_HEIDI = 0.4; Extended Data Fig. 2b). However, we noted that the lead instrument and its closest proxies were located within PPARG, which is proximal to TIMP4.

No instruments for the microbial taxa where we found nominally significant differences reached the significance threshold required for BMI. Extending the exploration to other taxa revealed a single significant association (P < 5 × 10⁻⁸) of the A allele of rs1530559 (a variant within the lactase persistence haploblock³⁶ in LD with the lactase functional variant rs4988235 (r² = 0.4)) with higher BMI and lower abundance of the order Bifidobacteriales. This variant was not associated with T2D risk (P = 0.76).

Individuals within the top decile for each profile

To determine the relevance of concordant and discordant profiles in people with obesity (≥30 kg/m²), we focused on this subgroup in UK Biobank who localized to the top decile of one of the two profile GRSs³⁷. Consistent with a binomial distribution, 18% of individuals with obesity were present in the two groups of extreme GRSs. The health characteristics of these individuals differed from all others with obesity (Supplementary Table 13) in several ways: for example, HbA1c levels in individuals with obesity and the extreme concordant GRS were higher compared to all other individuals with obesity (Kruskal–Wallis P = 5.94 × 10⁻¹²). Conversely, individuals with obesity and an extreme discordant GRS had significantly lower HbA1c compared to the rest of individuals with obesity (P = 2.71 × 10⁻⁴²). Persons with obesity at both extreme GRSs are also distinguished from the rest by the main clinical features identified in our previous analyses such as SBP, HDL and ALT. WHR did not adequately separate the concordant or discordant extreme GRS from the wider group of people with obesity. However, because the initial phenome scan revealed a gender-specific difference in WHR between concordant and discordant profiles, (Extended Data Fig. 1c), we performed an additional analysis for WHR stratified by sex, where we found that women with extreme discordant GRS had significantly lower WHR compared to other women with obesity (P = 6.05 × 10⁻¹⁰).

Comparison with previous studies of discordant diabesity

We compared our results to those obtained in three previous investigations of discordant variants. For instance, Mahajan et al.²³ calculated the change in estimates of SNPs associated with T2D before and after adjustment for BMI. They found 15 loci where signals were enhanced after adjustment, which was attributed to discordance. Consistently, the SNP effects in the diabesity discordant profile derived here were enhanced, while those of the concordant profile were attenuated after adjustment (as described in ref. ³⁸; Supplementary Table 14). The change in SNP effect estimates was consistently associated with SNP effects on BMI (R² = 0.8; Supplementary Fig. 6). However, we observed that for four of the 19 SNPs (20%) from the discordant set near PPARG, JAZF1, KCNJ11 and LYPLAL1, the change in SNP effect estimates was less than predicted. These discordant variants are those most likely to directly alter the relationship between BMI and T2D. This is consistent with the known effect of PPARG on adipocyte differentiation, and with our findings linking adipose tissue-specific gene expression at the LYPLAL1 locus with higher BMI but lower T2D risk. Similarly, we found that KCNJ11 and JAZF1 had discordant effects on BMI and T2D, which is related to tissue-specific expression in heart and arteries; variants at both loci are known to influence insulin secretion.

We also sought replication of a finding from Pigeyre et al.²⁷ linking discordance to levels of the protein IGFBP-3 in blood. We were not able to replicate this finding (Supplementary Table 15), possibly due to differences in the characteristics of the cohorts where this relationship was found. For our analysis, we used summary data from the INTERVAL study³⁹, which includes predominantly healthy blood donors of European ancestry. In contrast, Pigeyre et al. used data from the ORIGIN trial, a cohort composed of individuals of European (47%) and Latin American (53%) ancestries, enriched for T2D cases (>80% had a prior diagnosis).

Finally, we searched for the SNPs comprising the concordant and discordant profiles described above in the cluster analysis of discordant SNPs performed by Huang et al.²¹. Fourteen of the 19 discordant SNPs identified in our analysis (78%) are among or in LD with the 62 SNPs identified by Huang et al (r² > 0.1 within a 1-Mb window, as specified in the publication; Supplementary Table 16). Two of the subclusters described by the authors were significantly overrepresented by these 14 SNPs: 5 (ARAP1, ADCY5, PPARG, TCF7L2, KCNJ11-NCR3LG1) were in the subcluster characterized mainly by higher BMI and lower fasting glucose and risk of T2D (enrichment P = 1.6 × 10⁻³) and 4 (GRB14, LYPLAL1, ADAMTS9 and VEGFA) in the subcluster that conveyed an apparent protective effect on multiple cardiometabolic traits via peripheral adipose distribution (higher BMI and body fat percentage, and lower WHR; enrichment P = 0.04). Four concordant variants (at GCKR, TOMM40, AKAP6 and PPP1R3B-TNKS-MSRA) were also among the 62 SNPs described by Huang et al. As opposed to other variants in the concordant set, the variant in AKAP6 was associated with lower SBP (in ICBP GWAS: β = −0.25 mm Hg (95% CI: −0.38, −0.12), P = 1.25 × 10⁻⁴) and the variant near PPP1R3B-TNKS-MSRA was associated with higher HDL (β = 0.02 s.d. units (95% CI: 0.012, 0.027), P = 1.72 × 10⁻⁶). As we found and discussed in our analyses, TOMM40 and GCKR deviate from the concordant set owing to their favourable associations with lipids and liver enzymes that resemble the discordant set, a pattern that was also reported by Huang et al.

Discussion

Obesity conveys heterogenous effects in cardiometabolic health, making disease prevention and management challenging. Here we used genetics to deconstruct the obesity phenotype into concordant and discordant diabesity, with strikingly different health characteristics beyond diabetes and obesity. Through transcriptomic, metabolomic and metagenomic analyses, we identified biomarkers that shed light on mechanisms of action and may aid risk stratification. Further analyses identified potential targets for drug development and drug repurposing.

Obesity and T2D often coalesce, owing largely to the mediating effect of peripheral insulin resistance caused by excess adiposity. The trait discordances described here reflect mechanisms involved in uncoupling obesity risk from T2D risk, thereby exposing diabetes-independent pathways through which obesity affects disease risk, for example, through adipose distribution. It is likely that both a higher capacity to expand adipose tissue in the gluteo-femoral compartment^40,41 and lower abdominal region around organs such as the liver, which might underlie the difference seen in biomarkers of liver failure⁴², play important and independent roles in genetically determined diabesity discordance.

Another key phenotypic distinction between concordant and discordant profiles concerns blood pressure. Although T2D often causes vascular dysfunction, changes in the vascular bed may also precede metabolic perturbations through nutrient and hormonal flux^43,44, affecting pancreas, muscle and adipose tissue⁴⁵. For instance, capillary recruitment and permeability are key determinants of whole-body glucose uptake and glycaemic variation⁴⁶.

Our findings relating to lipid metabolites support the use of more refined profiling of lipid subfractions to help determine risk in people with obesity. The cholesterol content of HDL particles and BCAA levels appear especially informative biomarkers⁴⁷, possibly because they enhance glucose homeostasis in obesity by improving cross-talk between peripheral tissue and the liver⁴⁸.

Despite the contrasting health consequences of the two diabesity profiles, bariatric surgery was equally likely, which may predispose one group to health benefits following surgery, whereas the other may not benefit in this way.

We found a significant difference between concordant and discordant profiles in levels of HS6ST2, a protein expressed in brain, kidney and ovaries, which in animal knock-out models shows a strong association with increased body weight and insulin resistance, possibly owing to enhanced adipocyte differentiation^49,50. We found only one robust pleiotropic effect for discordant diabesity at TIMP4, which is proximal to PPARG, the likely causal gene. Moreover, PPARG activator medication inhibits matrix metalloproteinases^51,52. TIMP4 has been associated with adipogenesis, possibly through its effect on the adipose tissue extracellular matrix in obesity⁵³.

The colocalization analyses underscore the importance of tissue pleiotropy and tissue cross-talk in the molecular mechanisms of diabesity discordance. This is especially evident for SLC22A3, but also for other potential targets such as LYPLAL1, whose differential expression in both adipose tissue and adrenal glands appears linked to discordant diabesity. Moreover, three of the genes with pleiotropic links to T2D risk (SLC2A2, SLC22A3 and KCNJ11) interact with metformin. This suggests a potential effect of metformin in shifting individuals with obesity from a concordant to a discordant phenotype. SLC2A2 encodes GLUT2, which is part of the glucose sensor apparatus in pancreas and liver and is involved in intestinal glucose absorption in the gut⁵⁴. Variants in this gene have been associated with preference for sugary foods⁵⁵ and modified response to metformin^56,57. SLC22A3 encodes OCT3, a protein widely expressed across tissues that aids adipocyte beiging⁵⁸ and perivascular adipose tissue remodelling⁵⁹. KCNJ11 encodes the Kir6.2 subunit of the ATP-sensitive potassium channel. As this is the target of sulphonylureas, this group of drugs may also harbor potential candidates for the phenotype shift to discordance in diabesity. The other ligands identified in the lookup may also constitute potential therapeutic agents to prevent cardiometabolic complications in obesity. For the rest of the genes, especially those in the ‘Tdark’ level in PHAROS, follow-up functional experiments in the tissues indicated by the lead genetic instruments and its corresponding epigenetic annotations are warranted.

Certain SNPs deviate from the overall association pattern of the profile within which they reside. In the concordant profile, the BMI-increasing allele of the variant near TOMM40 increases T2D risk but, unlike other SNPs in the same profile, is associated with a better lipid profile and lower cardiovascular disease mortality. This gene and others in its proximity (APOC1 and APOE) have been consistently implicated in lipid metabolism⁶⁰. In the discordant profile, the variant SLC2A2 conveys protection against T2D risk despite being associated with heavier weight and higher blood pressure, and worse liver function and dyslipidaemia. The opposite pattern was observed in the concordant variant in GCKR, which encodes a regulatory protein that inhibits glucokinase. This reflects disparate phenotypic effects of modulating the glucose sensor apparatus at different levels⁵⁴. Deeper characterization of these mechanisms can further improve obesity stratification.

Although no statistically robust differences were observed in gut microbiota between the two diabesity profiles, possibly owing to low statistical power, nominal differences emerged in taxa belonging to the Bacteroidetes and Firmicutes phyla, which together constitute 90% of the human intestinal flora⁶¹. Our results indicate higher Firmicutes and lower Bacteroidetes abundance in discordant diabesity, which may result in enhanced production of short-chain fatty acid species such as butyrate, which is involved in glucose-lowering and anti-inflammatory mechanisms⁶².

Previous strategies to characterize the discordance between BMI and metabolic risk have been based on predefined sets of phenotypes traditionally linked with metabolic status^19,21. Our phenome-wide approach consisted of leveraging the wealth of genetic associations harvested to date to dissect the phenotypic structure relevant for discordant diabesity, having three main advantages: (1) variables defining the differential phenotypic structure of each profile are selected a in data-driven manner across many phenotypic layers; (2) leveraging genetic data across multiple datasets enhances power and minimizes cohort-specific biases that would be anticipated if analyses were performed in a single cohort; and (3) although concordant and discordant diabesity profiles may be driven by molecular mechanisms that are independent of DNA variation, using germline DNA variants helps mitigate reverse causality and other sources of confounding that hamper the interpretation of associations for most other types of biological variation and phenotypes. An example of this is the analysis of epigenetic factors, which has led to identification of obesity sub-phenotypes even in the context of genetic homogeneity, as found in monozygotic twins that are discordant for adiposity traits⁶³. However, these findings might be driven by variations in environmental exposures and behaviours that exist within and between twin pairs, as well as confounded by factors such as age, which differed between twin pairs in the reported analyses.

In conclusion, obesity profiles with either diabetogenic or antidiabetogenic proclivities reveal distinctive aetiological subtypes, with key differences in fat distribution, blood pressure and cholesterol content in HDL particles. We identified 1 protein (TIMP4) and 17 genes potentially involved in the molecular mechanisms leading to diabesity discordance, involving pleiotropic effects across multiple tissues.

Methods

Study populations

BioVU

Collection of electronic health records in BioVU was established in 1990 and includes data on billing codes from the International Classification of Diseases, 9th and 10th editions (ICD-9 and ICD-10). Disease phenotypes (‘phecodes’) are derived from these billing codes as described previously²⁴ and case, control and exclusion criteria are defined. Two codes on different visit days were required to instantiate a case for each phecode. The biobank was launched in 2007 and comprises excess blood samples that their donors had consented for use in biomedical research. Details of programme operations, ethical considerations, continuing oversight and patient engagement are published elsewhere²⁵. DNA samples were analysed using genome-wide genotyping platforms including Illumina multi-ethnic genotyping array. After quality assessment, the genotype data were then imputed to the Haplotype Reference Consortium reference panel at the Michigan imputation server. Populations of African American and European descent were identified by projecting individuals onto the major principal-component space derived from 1000 Genomes reference panel.

UK Biobank

The UK Biobank is an ongoing prospective study of approximately 500,000 adults. Initial enrolment took place from 2006 to 2010 and included individuals aged 40–69 years across the United Kingdom⁶⁴. It has collected comprehensive genetic and phenotypic information, biochemical assays and longitudinal health outcomes through health records, such as hospitalization and mortality. The genotypes were assayed using the UK Biobank Lung Exome Variant Evaluation and the Applied Biosystems UK Biobank Axiom Array, and imputed to the Haplotype Reference Consortium panel. Population structure was also assessed using principal-component analysis. We excluded individuals with inconsistency between their reported and genetic sex, had sex chromosome aneuploidy or were outliers for heterozygosity or missingness. Only individuals who were included in the calculation of genetic principal components were included, which ensures minimal genetic kinship with other participants.

Single-nucleotide polymorphism selection to construct concordant and discordant genetic profiles

We cross-referenced the largest GWAS for BMI and T2D and extracted common biallelic SNPs (minor allele frequency (MAF) > 1%). Insertions, deletions and potentially ambiguous palindromic SNPs (A/T or C/G alleles with MAF > 30%) were excluded. Because both GWAS were conducted predominantly in populations of European descent, we used 1000 Genomes EUR reference panel for clumping (r²< 0.01 over a 500-kb window) to identify nearly independent SNPs that were strongly associated with both conditions (P < 5 × 10⁻⁸). The directions of the effect of these SNPs on T2D were consistent in a second independent set of GWAS summary statistics extracted from the FinnGen database⁶⁵ (Supplementary Table 2).

Phenome-wide scans

We extracted association data for concordant and discordant SNPs from a variety of sources. From the curated repository Open GWAS⁶⁶, which we queried using the ‘ieugwasr’ package in R, we gathered data for >3,500 traits derived from UK Biobank and other consortia for a variety of traits; these traits are termed ‘clinical phenotypes’. In cases where the effect of a SNP on a trait was not found, we looked for the effect of the nearest proxy SNP up to an r² of 0.5 over a 500-kb window. We only kept estimates obtained from European ancestry populations in order to be consistent with the GWAS used to identify the genetic profiles. To prevent inclusion of inflated signals due to low sample size, we only included studies of more than 500 individuals; for binary traits, we required at least 25 minor alleles in the smallest group⁶⁷. We calculated z-scores by dividing the β coefficients by their corresponding standard errors, and then we computed standardized effect sizes as a function of MAF and sample size n using equations (1) and (2) (ref. ³¹):

$$\mathrm{SE} = \frac{1}{{\sqrt {2 \times \mathrm{MAF} \times \left( {1 - \mathrm{MAF}} \right) \times (n + z^2)} }}$$

(1)

$$\beta = z \times \mathrm{SE}$$

(2)

We aligned all the estimates from these scans to the BMI-increasing alleles, so that they represent phenotypic variations associated with higher BMI in both profiles.

We also obtained data for 657 blood metabolites^68,69 and 3,282 proteins in plasma³⁹. Associations with several bacterial taxa in the gut were obtained from the MiBioGen consortium⁷⁰. Association with expression and splicing of nearby genes in multiple tissue samples and in whole blood were obtained from data generated by GTEx and eQTLGen consortia, respectively.

Profile comparison

We then compared the effects of discordant versus concordant SNPs for every trait in two stages: we first obtained the combined effect of concordant (β_C) and discordant (β_D) SNPs separately using a random-effects meta-analysis with the Paule–Mandel estimator of between-SNP variance τ² (refs. ⁷¹^,72). We then calculated their difference δ = |β_C − β_D| and computed its standard error as $\mathrm{SE}_\delta ^2 = \mathrm{SE}_C^2 + \mathrm{SE}_D^2$⁷³. We excluded from these analyses T2D traits (Supplementary Table 17). Traits in which any of the combined estimates β_C or β_D and δ were statistically significant after 5% FDR correction were taken forward to stage two, where we converted the effect estimates for each SNP and the selected traits to z-scores and placed them in a SNP–trait matrix, with SNPs coded as ‘0’ if concordant and ‘1’ if discordant. We then trained several Random Forest classifiers (1,000 iterations) to this matrix, which attempted to classify SNPs in their correct category, and used the Boruta algorithm⁷⁴ to identify which traits were relevant to distinguish discordant from concordant SNPs. Briefly, this algorithm creates randomly shuffled copies of all traits in the SNP–trait matrix, and then evaluates for each trait if its contribution to the accuracy of decision trees in the Random Forest is higher than its corresponding random set.

Genetic risk score analyses

Concordant and discordant GRSs for an individual i were calculated as:

$$\mathrm{PRS}_{Pi} = \mathop {\sum }\limits_{j \in P}^{M_P} G_{ij}$$

(3)

Where P is the set of M_P SNPs belonging to the concordant or discordant profiles and G_ij is the genotype for SNP j in individual i. In BioVU, association analyses were carried out for each GRS using R package ‘PheWAS’ (v0.99.5-5)²⁴. We kept phecodes with at least 200 cases⁶⁷ and identified those associated with either of the GRS coefficients and a significant difference between the estimated effects after a 5% FDR correction.

In the UK Biobank, we examined the relationship of GRSs to mortality due to cardiovascular events in individuals followed up to the latest censor date (30th September 2021) using Cox regression. Primary cause of death was ascertained using ICD-10 codes reported in death certificates (Supplementary Table 18). All association models were adjusted for age, sex and first ten genetic principal components.

SMR & HEIDI

The SMR method consists of identifying for a protein or gene the strongest association signal, which is used as a genetic instrument to test for its pleiotropic effect on an outcome. The HEIDI method consists of calculating the heterogeneity in the estimates of SNPs in LD with the lead SNP used in SMR. A higher p_HEIDI value means heterogeneity is less likely, which supports true pleiotropy across the gene/protein and outcome signal, while a lower p_HEIDI value means there is heterogeneity in the estimates, and the SMR signal is probably due to linkage. We consider an association to be true pleiotropy if p_HEIDI > 0.01 (ref. ⁷⁵). We retained signals where we found evidence of true pleiotropy for both BMI and T2D.

Scoring method using epigenetic annotation

The scoring method to identify the most likely tissue of action assumes that if a genetic instrument for the expression of a gene in a certain tissue where it is highly expressed (that is, high tissue specificity) is in or close (in LD) to a promoter/enhancer region in the same tissue, and this genetic instrument is also associated with an outcome, then it is likely that the pleiotropic association on the outcome is due to perturbation of gene activity in that tissue. Promoter/enhancer signals were obtained by querying the RoadMap Epigenomics Project through the ‘haploR’ package in R.

Genes as therapeutic targets

The lookups in DGIdb and PHAROS were performed using the web-based tool. DGIdb assigns an interaction score to the drug–gene interactions, which is the result of combining publication count, source count, relative drug specificity and relative gene specificity. The PHAROS database classifies targets into four ‘Target Development Levels’, according to the evidence of drug interactions available: ‘Tdark’ contains understudied targets, ‘Tbio’ contains highly studied targets but without interaction with compounds, ‘Tchem’ includes targets that bind to small molecules, and ‘Tclin’ interact with approved drugs.

All analyses were done using packages within the R environment (v4.1.2)⁷⁶.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The GWAS summary data analysed in this study are available from the GIANT (https://portals.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium) and DIAGRAM (https://diagram-consortium.org/) consortia websites, the Open GWAS database (https://gwas.mrcieu.ac.uk/), the GTEx consortium website (https://gtexportal.org/home/ and the MiBioGen repository (https://mibiogen.gcc.rug.nl/). UK Biobank data are available through a procedure described at http://www.ukbiobank.ac.uk/using-the-resource/. Individual-level genetic and clinical data from BioVU cannot be shared publicly due to patient confidentiality. However, summary statistics can be viewed in tabular form at https://phewascatalog.org/labwas/. The DGIdb and the PHAROS databases can be accessed online at https://www.dgidb.org/ and https://pharos.nih.gov/, respectively.

Code availability

The codes used for our analyses are available at https://github.com/danielcoral/DVA_codes/.

References

World Health Organization. Cardiovascular diseases (CVDs). Available from: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds). Accessed October 2022.
Magliano, D. J. et al. Trends in incidence of total or type 2 diabetes: systematic review. BMJ. 366, l5003 (2019).
Article PubMed PubMed Central Google Scholar
Blüher, M. Obesity: global epidemiology and pathogenesis. Nat. Rev. Endocrinol. 15, 288–298 (2019).
Article PubMed Google Scholar
van Vliet-Ostaptchouk, J. V. et al. The prevalence of metabolic syndrome and metabolically healthy obesity in Europe: a collaborative analysis of ten large cohort studies. BMC Endocr. Disord. 14, 9 (2014).
Article PubMed PubMed Central Google Scholar
Blüher, M. Metabolically healthy obesity. Endocr. Rev. 41, 405–420 (2020).
Article Google Scholar
Vecchié, A. et al. Obesity phenotypes and their paradoxical association with cardiovascular diseases. Eur. J. Intern Med 48, 6–17 (2018).
Article PubMed Google Scholar
Ding, C., Chan, Z. & Magkos, F. Lean, but not healthy: the ‘metabolically obese, normal-weight’ phenotype. Curr. Opin. Clin. Nutr. Metab. Care 19, 408–417 (2016).
Article CAS PubMed Google Scholar
Wang, B. et al. Prevalence of metabolically healthy obese and metabolically obese but normal weight in adults worldwide: a meta-analysis. Horm. Metab. Res 47, 839–845 (2015).
Article CAS PubMed Google Scholar
Loos, R. J. F. & Kilpeläinen, T. O. Genes that make you fat, but keep you healthy. J. Intern Med. 284, 450–463 (2018).
Article CAS PubMed PubMed Central Google Scholar
Schwartz, S. S. & Kohl, B. A. Glycemic control and weight reduction without causing hypoglycemia: the case for continued safe aggressive care of patients with type 2 diabetes mellitus and avoidance of therapeutic inertia. Mayo Clin. Proc. 85, S15–S26 (2010).
Article PubMed PubMed Central Google Scholar
Carnethon, M. R. et al. Association of weight status with mortality in adults with incident diabetes. JAMA 308, 581–590 (2012).
CAS PubMed PubMed Central Google Scholar
Badrick, E., Sperrin, M., Buchan, I. E. & Renehan, A. G. Obesity paradox and mortality in adults with and without incident type 2 diabetes: a matched population-level cohort study. BMJ Open Diabetes Res. Care 5, e000369 (2017).
Article PubMed PubMed Central Google Scholar
Shah, N. R. & Braverman, E. R. Measuring adiposity in patients: the utility of body mass index (BMI), percent body fat and leptin. PLoS ONE 7, e33308 (2012).
Article CAS PubMed PubMed Central Google Scholar
Müller, M. J. et al. The case of GWAS of obesity: does body weight control play by the rules? Int. J. Obes. 42, 1395–1405 (2018).
Article Google Scholar
Goossens, G. H. The metabolic phenotype in obesity: fat mass, body fat distribution, and adipose tissue function. Obes. Facts 10, 207–215 (2017).
Article CAS PubMed PubMed Central Google Scholar
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rask-Andersen, M., Karlsson, T., Ek, W. E. & Johansson, Å. Genome-wide association study of body fat distribution identifies adiposity loci and sex-specific genetic effects. Nat. Commun. 10, 339 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lotta, L. A. et al. Integrative genomic analysis implicates limited peripheral adipose storage capacity in the pathogenesis of human insulin resistance. Nat. Genet. 49, 17–26 (2017).
Article CAS PubMed Google Scholar
Ji, Y. et al. Genome-wide and abdominal MRI data provide evidence that a genetically determined favorable adiposity phenotype is characterized by lower ectopic liver fat and lower risk of type 2 diabetes, heart disease, and hypertension. Diabetes 68, 207–219 (2019).
Article CAS PubMed Google Scholar
Kilpeläinen, T. O. et al. Genetic variation near IRS1 associates with reduced adiposity and an impaired metabolic profile. Nat. Genet. 43, 753–760 (2011).
Article PubMed PubMed Central Google Scholar
Huang, L. O. et al. Genome-wide discovery of genetic loci that uncouple excess adiposity from its comorbidities. Nat. Metab. 3, 228–243 (2021).
Article CAS PubMed Google Scholar
Yengo, L. et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700,000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649 (2018).
Article CAS PubMed PubMed Central Google Scholar
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513 (2018).
Article CAS PubMed PubMed Central Google Scholar
Denny, J. C. et al. PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations. Bioinformatics 26, 1205–1210 (2010).
Article CAS PubMed PubMed Central Google Scholar
Roden, D. et al. Development of a large-scale de-identified DNA biobank to enable personalized medicine. Clin. Pharmacol. Ther. 84, 362–369 (2008).
Article CAS PubMed Google Scholar
Dennis, J. K. et al. Clinical laboratory test-wide association scan of polygenic scores identifies biomarkers of complex disease. Genome Med. 13, 6 (2021).
Article CAS PubMed PubMed Central Google Scholar
Pigeyre, M. et al. Identification of novel causal blood biomarkers linking metabolically favorable adiposity with type 2 diabetes risk. Diabetes Care 42, 1800–1808 (2019).
Article CAS PubMed Google Scholar
Pers, T. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Article CAS PubMed Google Scholar
The GTEX Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
Article PubMed Central Google Scholar
Võsa, U. et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat. Genet. 53, 1300–1310 (2021).
Article PubMed PubMed Central Google Scholar
Zhu, Z. et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 48, 481–487 (2016).
Article CAS PubMed Google Scholar
Torres, J. M. et al. A multi-omic integrative scheme characterizes tissues of action at loci associated with type 2 diabetes. Am. J. Hum. Genet. 107, 1011–1028 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kundaje, A. et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
Article CAS PubMed PubMed Central Google Scholar
Freshour, S. L. et al. Integration of the drug–gene interaction database (DGIdb 4.0) with open crowdsource efforts. Nucleic Acids Res. 49, D1144–D1151 (2021).
Article CAS PubMed Google Scholar
Sheils, T. K. et al. TCRD and Pharos 2021: mining the human proteome for disease biology. Nucleic Acids Res. 49, D1334–D1346 (2021).
Article CAS PubMed Google Scholar
Enattah, N. et al. Identification of a variant associated with adult-type hypolactasia. Nat. Genet. 30, 233–237 (2002).
Article CAS PubMed Google Scholar
Udler, M. S. et al. Type 2 diabetes genetic loci informed by multi-trait associations point to disease mechanisms and subtypes: a soft clustering analysis. PLoS Med. 15, e1002654 (2018).
Article PubMed PubMed Central Google Scholar
Aschard, H., Vilhjálmsson, B. J., Joshi, A. D., Price, A. L. & Kraft, P. Adjusting for heritable covariates can bias effect estimates in genome-wide association studies. Am. J. Hum. Genet. 96, 329–339 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73–79 (2018).
Article CAS PubMed PubMed Central Google Scholar
Emdin, C. A. et al. Genetic association of waist-to-hip ratio with cardiometabolic traits, type 2 diabetes, and coronary heart disease. JAMA 317, 626–634 (2017).
Article PubMed PubMed Central Google Scholar
Lotta, L. A. et al. Association of genetic variants related to gluteofemoral vs abdominal fat distribution with type 2 diabetes, coronary disease, and cardiovascular risk factors. JAMA 320, 2553–2563 (2018).
Article CAS PubMed PubMed Central Google Scholar
Silva, N. M. G. D. et al. Liver function and risk of type 2 diabetes: bidirectional Mendelian randomization study. Diabetes 68, 1681–1691 (2019).
Article PubMed PubMed Central Google Scholar
Karaca, Ü., Schram, M. T., Houben, A. J. H. M., Muris, D. M. J. & Stehouwer, C. D. A. Microvascular dysfunction as a link between obesity, insulin resistance and hypertension. Diabetes Res. Clin. Pract. 103, 382–387 (2014).
Article CAS PubMed Google Scholar
Stehouwer, C. D. A. Microvascular dysfunction and hyperglycemia: a vicious cycle with widespread consequences. Diabetes 67, 1729–1741 (2018).
Article CAS PubMed Google Scholar
Barrett, E. J. et al. The vascular actions of insulin control its delivery to muscle and regulate the rate-limiting step in skeletal muscle insulin action. Diabetologia 52, 752–764 (2009).
Article PubMed PubMed Central Google Scholar
Meijer, R. I. et al. Insulin-induced microvascular recruitment in skin and muscle are related and both are associated with whole-body glucose uptake. Microcirculation 19, 494–500 (2012).
Article CAS PubMed Google Scholar
Holeček, M. Branched-chain amino acids in health and disease: metabolism, alterations in blood plasma, and as supplements. Nutr. Metab. 15, 33 (2018).
Article Google Scholar
Guasch-Ferré, M. et al. Metabolomics in prediabetes and diabetes: a systematic review and meta-analysis. Diabetes Care 39, 833–846 (2016).
Article PubMed PubMed Central Google Scholar
Nagai, N. et al. Involvement of heparan sulfate 6-O-sulfation in the regulation of energy metabolism and the alteration of thyroid hormone levels in male mice. Glycobiology 23, 980–992 (2013).
Article CAS PubMed Google Scholar
Matsuzawa, T. et al. Heparan sulfate promotes differentiation of white adipocytes to maintain insulin sensitivity and glucose homeostasis. J. Biol. Chem. 297, 101006 (2021).
Sakamuri, S. S. V. P. et al. Absence of tissue inhibitor of metalloproteinase-4 (TIMP4) ameliorates high fat diet-induced obesity in mice due to defective lipid absorption. Sci. Rep. 7, 6210 (2017).
Article PubMed PubMed Central Google Scholar
Marx, N. et al. Antidiabetic PPAR gamma-activator rosiglitazone reduces MMP-9 serum levels in type 2 diabetic patients with coronary artery disease. Arterioscler Thromb. Vasc. Biol. 23, 283–288 (2003).
Article CAS PubMed Google Scholar
Maquoi, E., Munaut, C., Colige, A., Collen, D. & Lijnen, H. R. Modulation of adipose tissue expression of murine matrix metalloproteinases and their tissue inhibitors with obesity. Diabetes 51, 1093–1101 (2002).
Article CAS PubMed Google Scholar
Thorens, B. GLUT2, glucose sensing and glucose homeostasis. Diabetologia 58, 221–232 (2015).
Article CAS PubMed Google Scholar
Eny, K. M., Wolever, T. M. S., Fontaine-Bisson, B. & El-Sohemy, A. Genetic variant in the glucose transporter type 2 is associated with higher intakes of sugars in two distinct populations. Physiol. Genomics 33, 355–360 (2008).
Article CAS PubMed Google Scholar
Zhou, K. et al. Variation in the glucose transporter gene SLC2A2 is associated with glycemic response to metformin. Nat. Genet. 48, 1055–1059 (2016).
Article CAS PubMed PubMed Central Google Scholar
Rathmann, W. et al. A variant of the glucose transporter gene SLC2A2 modifies the glycaemic response to metformin therapy in recently diagnosed type 2 diabetes. Diabetologia 62, 286–291 (2019).
Article CAS PubMed Google Scholar
Song, W. et al. Organic cation transporter 3 (Oct3) is a distinct catecholamines clearance route in adipocytes mediating the beiging of white adipose tissue. PLoS Biol. 17, e2006571 (2019).
Article PubMed PubMed Central Google Scholar
Saxton, S. N. et al. Restoring perivascular adipose tissue function in obesity using exercise. Cardiovasc. Drugs Ther. 35, 1291–1304 (2021).
Article CAS PubMed PubMed Central Google Scholar
Middelberg, R. P. et al. Genetic variants in LPL, OASL and TOMM40/APOE-C1-C2-C4 genes are associated with multiple cardiovascular-related traits. BMC Med. Genet. 12, 123 (2011).
Article CAS PubMed PubMed Central Google Scholar
Qin, J. et al. A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464, 59–65 (2010).
Article CAS PubMed PubMed Central Google Scholar
den Besten, G. et al. The role of short-chain fatty acids in the interplay between diet, gut microbiota, and host energy metabolism. J. Lipid Res. 54, 2325–2340 (2013).
Yang, C. H. et al. Independent phenotypic plasticity axes define distinct obesity sub-types. Nat. Metab. 4, 1150–1165 (2022).
Article CAS PubMed PubMed Central Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kurki, M. I. et al. FinnGen: unique genetic insights from combining isolated population and national health register data. Preprint at https://www.medrxiv.org/content/10.1101/2022.03.03.22271360v1 (2022).
Elsworth B, et al. The MRC IEU OpenGWAS data infrastructure. Preprint at bioRxiv https://doi.org/10.1101/2020.08.10.244293 (2020).
Verma, A. et al. PheWAS and beyond: the landscape of associations with medical diagnoses and clinical measures across 38,662 individuals from Geisinger. Am. J. Hum. Genet 102, 592–608 (2018).
Article CAS PubMed PubMed Central Google Scholar
Shin, S.-Y. et al. An atlas of genetic influences on human blood metabolites. Nat. Genet. 46, 543–550 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kettunen, J. et al. Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA. Nat. Commun. 7, 11122 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kurilshikov, A. et al. Large-scale association analyses identify host factors influencing human gut microbiome composition. Nat. Genet. 53, 156–165 (2021).
Article CAS PubMed PubMed Central Google Scholar
Dastani, Z. et al. Novel loci for adiponectin levels and their influence on type 2 diabetes and metabolic traits: a multi-ethnic meta-analysis of 45,891 individuals. PLoS Genet. 8, e1002607 (2012).
Article CAS PubMed PubMed Central Google Scholar
Veroniki, A. A. et al. Methods to estimate the between-study variance and its uncertainty in meta-analysis. Res. Synth. Methods 7, 55–79 (2016).
Article PubMed Google Scholar
Borenstein, M., Hedges, L. V., Higgins, J. P. T. & Rothstein, H. R. Multiple outcomes or time-points within a study. In Introduction to Meta-Analysis, 225–238 (John Wiley & Sons, 2009).
Kursa, M. B. & Rudnicki, W. R. Feature selection with the Boruta package. J. Stat. Softw. 36, 1–13 (2010).
Article Google Scholar
Wu, Y. et al. Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits. Nat. Commun. 9, 918 (2018).
Article PubMed PubMed Central Google Scholar
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Accessed at https://www.R-project.org/ (2022).

Download references

Acknowledgements

This work was undertaken as part of the Stratification of Obesity Phenotypes to Optimize Future Therapy (SOPHIA) and the Novel Approach to Systematically Characterize Exercise- and Nutrient- responsive genes in Type 2 diabetes and cardiovascular disease (NASCENT) projects. SOPHIA has received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement no. 875534. This Joint Undertaking is supported by the European Union’s Horizon 2020 research and innovation programme and EFPIA and T1D Exchange, JDRF and Obesity Action Coalition. NASCENT was funded by a grant from the European Commission (ERC-2015-CoG–681742 NASCENT). Additonal funding was provided by the Swedish Research Council (Distinguished Young Researcher Award) and the Swedish Foundation for Strategic Research (LUDC-IRC, 15-0067). All awards to PWF.

Funding

Open access funding provided by Lund University.

Author information

Authors and Affiliations

Genetic and Molecular Epidemiology Unit, Lund University Diabetes Centre, Department of Clinical Science, Lund University, Skåne University Hospital, Malmö, Sweden
Daniel E. Coral, Juan Fernandez-Tajes, Neli Tsereteli, Hugo Pomares-Millan, Hugo Fitipaldi, Pascal M. Mutie, Naeimeh Atabaki-Pasdar, Sebastian Kalamajski, Alaitz Poveda, Giuseppe N. Giordano, Ewan R. Pearson & Paul W. Franks
Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, UK
Naeimeh Atabaki-Pasdar & Nancy J. Cox
Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Tyne W. Miller-Fleming & Xue Zhong
Population Health and Genomics, University of Dundee, Dundee, UK
Ewan R. Pearson
Harvard T.H. Chan School of Public Health, Boston, MA, USA
Paul W. Franks

Authors

Daniel E. Coral
View author publications
You can also search for this author in PubMed Google Scholar
Juan Fernandez-Tajes
View author publications
You can also search for this author in PubMed Google Scholar
Neli Tsereteli
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Pomares-Millan
View author publications
You can also search for this author in PubMed Google Scholar
Hugo Fitipaldi
View author publications
You can also search for this author in PubMed Google Scholar
Pascal M. Mutie
View author publications
You can also search for this author in PubMed Google Scholar
Naeimeh Atabaki-Pasdar
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Kalamajski
View author publications
You can also search for this author in PubMed Google Scholar
Alaitz Poveda
View author publications
You can also search for this author in PubMed Google Scholar
Tyne W. Miller-Fleming
View author publications
You can also search for this author in PubMed Google Scholar
Xue Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Giuseppe N. Giordano
View author publications
You can also search for this author in PubMed Google Scholar
Ewan R. Pearson
View author publications
You can also search for this author in PubMed Google Scholar
Nancy J. Cox
View author publications
You can also search for this author in PubMed Google Scholar
Paul W. Franks
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.W.F. and D.E.C. conceived and designed and analyses, and wrote the manuscript. D.E.C., T.W.M.-F. and X.Z. performed the analyses. All other co-authors contributed materials/analysis tools and reviewed the manuscript.

Corresponding authors

Correspondence to Daniel E. Coral or Paul W. Franks.

Ethics declarations

Ethics statements

Ethics approval for the UK Biobank was obtained from the North West Centre for Research Ethics Committee. Analysis of individual-level data from UK Biobank participants in Lund University was approved by the Swedish Ethical Review Authority (2021-0317). The BioVU project was approved by the VUMC Institutional Review Board. The analysis of individual-level data was performed in VUMC, and only summary results were shared with researchers at Lund University. Both studies conformed to the ethical principles for medical research involving human participants outlined in the Declaration of Helsinki. All participants provided written informed consent at enrolment.

Inclusion & ethics statement

All collaborators of this study have fulfilled the criteria for authorship required by Nature Portfolio journals have been included as authors, as their participation was essential for the design and implementation of the study. Roles and responsibilities were agreed among collaborators ahead of the research. This work includes findings that are locally relevant, which have been determined in collaboration with local partners. This research was not severely restricted or prohibited in the setting of the researchers, and does not result in stigmatization, incrimination, discrimination or personal risk to participants. Local and regional research relevant to our study was taken into account in citations.

Competing interests

P.W.F. has received research grants from numerous diabetes drug companies and fees as consultant from Novo Nordisk, Lilly and Zoe. He is currently the Head of the Department of Translational Medicine at the Novo Nordisk Foundation. All other authors declare non-competing interests.

Peer review

Peer review information

Nature Metabolism thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Isabella Samuelson, in collaboration with the Nature Metabolism team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Analysis flowchart and profile identification.

Panel A: Analysis flowchart. Panel B: BMI and T2D risk estimates of concordant and discordant SNPs after alignment to the BMI increasing allele. Panel C: summary-based concordant and discordant GRS coefficients (standard deviation units for continuous traits, log OR for binary traits). Traits shown have at least 1 estimate significant after 5% FDR correction and the difference between profiles is also significant after 5% FDR. Statistical tests were based on a Z-distribution and were two-sided. Bars show 95% confidence intervals. Sample sizes vary for every trait (> 100.000 for all traits). The heatmap shows the Z-scores of the SNPs in every trait, with the single-linkage tree at the bottom, separately for concordant and discordant SNPs.

Extended Data Fig. 2 Traits with potential causal effect on diabesity discordance.

Panel A: Traits where a difference was found in the comparison of profiles and one of the two direction-specific GRS associated with BMI was associated with lower risk of T2D (two-sided Z-statistic P < 0.05). To derive the GRS we used BMI data from the GIANT + UK Biobank meta-analysis (N = 681,275). WHR data came from the GIANT consortium (N = 212,244). SBP data came from the meta-analysis performed by the ICBP (N = 757,601). Metabolite data came from the UK Biobank (N = 115,078). Estimates represent T2D OR, bars represent 95% confidence intervals, which are derived from the DIAGRAM meta-analysis (N = 158,186). Panel C: Regional association plot showing the pleiotropic effect of genetic instruments for blood levels of TIMP4 protein and high BMI and lower T2D risk. Protein data was derived from the INTERVAL study (N = 3,301).

Supplementary information

Supplementary Information

Supplementary Figs. 1–6

Reporting Summary

Supplementary Tables 1–18

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Coral, D.E., Fernandez-Tajes, J., Tsereteli, N. et al. A phenome-wide comparative analysis of genetic discordance between obesity and type 2 diabetes. Nat Metab 5, 237–247 (2023). https://doi.org/10.1038/s42255-022-00731-5

Download citation

Received: 14 December 2021
Accepted: 20 December 2022
Published: 26 January 2023
Issue Date: February 2023
DOI: https://doi.org/10.1038/s42255-022-00731-5

This article is cited by

Harnessing the power of proteomics in precision diabetes medicine
- Nigel Kurgan
- Jeppe Kjærgaard Larsen
- Atul S. Deshmukh
Diabetologia (2024)
Illuminating the ‘healthy obese’ phenotype
- Mathias Rask-Andersen
- Åsa Johansson
Nature Metabolism (2023)
The DECON pilot project investigates predictive markers for successful bariatric surgery
- Gabriel Seifert
- Luca Fagnocchi
- J. Andrew Pospisilik
Scientific Reports (2023)
Precision nutrition for targeting pathophysiology of cardiometabolic phenotypes
- Christopher E. Shannon
- Méabh B. Ní Chathail
- Helen M. Roche
Reviews in Endocrine and Metabolic Disorders (2023)
Relevance of body composition in phenotyping the obesities
- Laura Salmón-Gómez
- Victoria Catalán
- Javier Gómez-Ambrosi
Reviews in Endocrine and Metabolic Disorders (2023)

Subjects

Abstract

Similar content being viewed by others

Main

Results

Assembly of concordant and discordant diabesity profiles

Phenome-wide scans

Profile decomposition

External validation in BioVU

Differences in mortality in UK Biobank

Differences in serum metabolites

Differences in gut microbiota

Differences in serum protein levels

Functional annotation using DEPICT

Gene expression and splicing in discordant diabesity

Discordant diabesity genes as therapeutic targets

Instrumental variable analyses

Individuals within the top decile for each profile

Comparison with previous studies of discordant diabesity

Discussion

Methods

Study populations

BioVU

UK Biobank

Single-nucleotide polymorphism selection to construct concordant and discordant genetic profiles

Phenome-wide scans

Profile comparison

Genetic risk score analyses

SMR & HEIDI

Scoring method using epigenetic annotation

Genes as therapeutic targets

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics statements

Inclusion & ethics statement

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links