Novel putative causal mutations associated with fat traits in Nellore cattle uncovered by eQTLs located in open chromatin regions

Garcia, Ingrid Soares; Silva-Vignato, Bárbara; Cesar, Aline Silva Mello; Petrini, Juliana; da Silva, Vinicius Henrique; Morosini, Natália Silva; Goes, Carolina Purcell; Afonso, Juliana; da Silva, Thaís Ribeiro; Lima, Beatriz Delcarme; Clemente, Luan Gaspar; Regitano, Luciana Correia de Almeida; Mourão, Gerson Barreto; Coutinho, Luiz Lehmann

doi:10.1038/s41598-024-60703-5

Article
Open access
Published: 02 May 2024

Scientific Reports volume 14, Article number: 10094 (2024)

Abstract

Intramuscular fat (IMF) and backfat thickness (BFT) are critical economic traits impacting meat quality. However, the genetic variants controlling these traits need to be better understood. To advance knowledge in this area, we integrated RNA-seq and single nucleotide polymorphisms (SNPs) identified in genomic and transcriptomic data to generate a linkage disequilibrium filtered panel of 553,581 variants. Expression quantitative trait loci (eQTL) analysis revealed 36,916 cis-eQTLs and 14,408 trans-eQTLs. Association analysis resulted in three eQTLs associated with BFT and 24 with IMF. Functional enrichment analysis of genes regulated by these 27 eQTLs revealed noteworthy pathways that can play a fundamental role in lipid metabolism and fat deposition, such as immune response, cytoskeleton remodeling, iron transport, and phospholipid metabolism. We next used ATAC-Seq assay to identify and overlap eQTL and open chromatin regions. Six eQTLs were in regulatory regions, four in predicted insulators and possible CCCTC-binding factor DNA binding sites, one in an active enhancer region, and the last in a low signal region. Our results provided novel insights into the transcriptional regulation of IMF and BFT, unraveling putative regulatory variants.

Introduction

Backfat thickness (BFT) and intramuscular fat (IMF) are significant economic traits for the beef industry, impacting carcass and meat quality. Associated with beef sensorial quality, IMF is positively correlated with meat tenderness and juiciness, affecting its flavor, thus resulting in consumer satisfaction and repurchase decision¹. BFT significantly influences carcass quality and yield, as an appropriate subcutaneous fat cover is essential for mitigating issues such as cold shortening and evaporative weight loss during cooling². There is also a positive correlation between BFT and IMF³, making them interesting traits to study simultaneously.

Large-scale RNA-seq and high-throughput genotyping technologies led to genome-wide identification of regulatory variants associated with gene expression, also known as expression quantitative trait loci (eQTL)^4,5. In eQTL analyses, the main goal is to associate variants located throughout the genome with expression levels of each gene, providing information about the genetic regulation of gene expression in a given tissue. This assists in revealing metabolic pathways, regulatory genes, biological processes, and genetic factors that may be affected by the eQTL, and can modulate a phenotype^6,7.

Although advances in high-throughput genotyping have increased eQTL discovery, its high cost remains a limiting factor. Genotype imputation is a powerful tool to reduce these costs: a large population of animals is genotyped with a lower-density panel, and the genotypes of untyped loci are imputed using information from a small reference population genotyped with a higher-density panel^8,9. This yields a larger dataset, increasing the chance of identifying the regulatory variants associated with gene expression. Another tool that can improve the fine-mapping of causal gene regulatory variants is the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq), as it can identify open chromatin regions^10,11.

We combined RNA-seq data and imputed genotypes to identify eQTLs. The significant eQTLs were associated with IMF and BFT to identify gene expression regulatory mutations that could modulate phenotypic variance. However, due to linkage disequilibrium, the eQTL may not be the causative variant, so we used ATAC-seq data to select eQTLs associated with the phenotypes that are in open chromatin regions. As a result, we identified six putative variants that modulate gene expression and are associated with IMF and BFT.

Results

eQTL analysis with the complete SNP dataset and SNP annotation

Aiming to expand our SNP database to detect regulatory variants, we combined genotypes from 778 progenies (BovineHD BeadChip 770k, Illumina), DNA-Sequencing from 26 Nellore sires of the population under study, and transcribed SNPs called from RNA-Sequence data from the Longissimus thoracis (LT) muscle of 192 animals within the progenies. This integration yielded an extensive imputed SNP panel, encompassing a total of 4,522,914 SNPs. Following stringent quality control measures (including minor allele frequency > 5% and call rate > 95%) and pruning (with an r² threshold of 0.8), we refined the SNP dataset for subsequent eQTL analysis, resulting in a set of 553,581 tag-SNPs.
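The pruning step can be illustrated with a minimal sketch. It assumes genotypes coded as 0/1/2 allele dosages and uses the squared correlation between dosage vectors as a stand-in for r²; the greedy all-pairs loop, the function names, and the toy SNPs are hypothetical and are not the authors' pipeline, which would normally rely on a windowed implementation in a dedicated tool.

```python
def r_squared(g1, g2):
    """Squared Pearson correlation between two dosage vectors (0/1/2)."""
    n = len(g1)
    m1 = sum(g1) / n
    m2 = sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    v1 = sum((a - m1) ** 2 for a in g1)
    v2 = sum((b - m2) ** 2 for b in g2)
    if v1 == 0 or v2 == 0:  # monomorphic SNP: correlation undefined
        return 0.0
    return cov * cov / (v1 * v2)


def prune(genotypes, threshold=0.8):
    """Greedy pruning: keep a SNP only if its r² with every SNP
    already kept does not exceed the threshold."""
    kept = []
    for snp_id, dosages in genotypes.items():
        if all(r_squared(dosages, genotypes[k]) <= threshold for k in kept):
            kept.append(snp_id)
    return kept


# Hypothetical toy genotypes for six animals.
genotypes = {
    "snp_a": [0, 1, 2, 0, 1, 2],
    "snp_b": [0, 1, 2, 0, 1, 2],  # identical to snp_a, so r² = 1
    "snp_c": [2, 0, 1, 1, 2, 0],  # r² with snp_a is 0.25
}
tag_snps = prune(genotypes)
print(tag_snps)
```

Here snp_b is dropped because it carries the same information as snp_a, which is the sense in which the remaining variants act as tag-SNPs.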

Before eQTL identification, population stratification was investigated by principal component analysis (PCA). Using 192 animals and the SNPs from the BovineHD BeadChip 770k panel, the first two principal components (explaining 10.64% and 6.87% of the variance for PC1 and PC2, respectively) were included as covariates in the eQTL analysis model, as we observed a sample grouping partially explained by sire (Supplementary Table 1 and Supplementary Fig. 1). Our eQTL analysis resulted in 51,324 eQTLs (FDR < 5%, considering that each time a SNP regulates a distinct gene, it qualifies as an eQTL). There were 36,916 cis-eQTLs (25,896 for a unique genomic position), and 14,408 trans-eQTLs (4685 for a unique genomic position) distributed along the 29 Bos taurus autosomal chromosomes (Fig. 1a). Of these, 3823 SNPs act as cis and trans-eQTLs (1950 for a unique genomic position). Of our 25,896 cis-eQTLs and 4685 trans-eQTLs, 2381 and 663, respectively, were novel SNPs absent from the public SNP database (dbSNP). As for the regulated genes, 5142 genes were affected by cis-eQTLs, and 4707 genes had their expression affected by trans-eQTLs. A quantile–quantile plot illustrating the p-values from the eQTL analysis and a Manhattan plot for both cis and trans-eQTLs are provided in Supplementary Fig. 2. A complete list of cis and trans-eQTLs (FDR < 0.05) and the genes regulated by them can be found in Supplementary Table 2.
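The two ways of counting used above (one eQTL per SNP-gene pair versus one per unique genomic position) can be made concrete with a small sketch. The records and gene names below are hypothetical; only the totals checked at the end are taken from the text.

```python
# Hypothetical toy records: (SNP position, regulated gene, eQTL type).
pairs = [
    ("chr5:100", "GENE_A", "cis"),
    ("chr5:100", "GENE_B", "cis"),
    ("chr5:100", "GENE_C", "trans"),
    ("chr19:250", "GENE_D", "trans"),
    ("chr21:400", "GENE_E", "cis"),
]


def summarize(pairs):
    """Count eQTLs per SNP-gene pair and per unique genomic position."""
    n_pairs = {"cis": 0, "trans": 0}
    positions = {"cis": set(), "trans": set()}
    for pos, _gene, kind in pairs:
        n_pairs[kind] += 1          # every SNP-gene pair is one eQTL
        positions[kind].add(pos)    # a position is counted once per type
    return {
        "cis_pairs": n_pairs["cis"],
        "trans_pairs": n_pairs["trans"],
        "cis_positions": len(positions["cis"]),
        "trans_positions": len(positions["trans"]),
        "cis_and_trans_positions": len(positions["cis"] & positions["trans"]),
    }


summary = summarize(pairs)
print(summary)

# Totals reported in the text are consistent with this counting scheme.
reported_total = 36_916 + 14_408       # cis + trans SNP-gene pairs
reported_positions = 25_896 + 4_685    # unique cis + unique trans positions
print(reported_total, reported_positions)
```

In the toy data, chr5:100 contributes three eQTLs but only one cis position and one trans position, which is why the pair counts exceed the position counts.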

Ensembl Variant Effect Predictor (VEP) analysis showed that more cis and trans-eQTL were located on chromosome 23, while fewer local and distant variants were on chromosome 20 (Fig. 1b,c). Cis-eQTLs were predominantly located in 3'UTR, intronic, and downstream gene regions, while trans-eQTLs concentration was in 3'UTR, intronic, and intergenic regions. Among them, 23.8% and 21.9% were predicted to be synonymous variants, whereas 10.2% and 9.4% were classified as missense for cis and trans-eQTL, respectively (Fig. 1d,e and Supplementary Table 3).

Phenotype association analysis, gene annotation, and SNPs in ATAC-Seq peaks

To discover eQTLs associated with phenotypic variation, we conducted a phenotype association analysis. For this, the population was expanded to 374 animals, and after the population stratification test, PC1 (7.89%) and PC2 (6.96%) were included as covariates to account for the sire effect (Supplementary Table 4 and Supplementary Fig. 3).

A linear model with the effects of PC1, PC2, contemporary group (CG), and hot carcass weight as covariates was used to test the association of the 30,581 eQTLs (a sum of unique genomic positions of cis and trans-eQTLs) with BFT and IMF. Three eQTLs were associated with BFT (Supplementary Table 5) and 24 with IMF (Supplementary Table 6). The Circos plot shows the links between eQTLs, phenotypes, and their regulated genes (Fig. 2). A quantile–quantile plot and a Manhattan plot illustrating the p-values from the phenotype association analysis are provided in Supplementary Fig. 4.
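As a rough illustration of the model structure, the sketch below fits phenotype ~ intercept + SNP dosage + PC1 + PC2 + hot carcass weight by ordinary least squares. It is a simplified stand-in under stated assumptions: the contemporary group effect is omitted for brevity, hot carcass weight is expressed as a hypothetical offset in kg, all data are invented, and no p-value is computed. The software and exact parameterization used by the authors are not given in this section.

```python
def solve(a, b):
    """Solve the linear system a·x = b by Gaussian elimination
    with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def ols(rows, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(p)]
    return solve(xtx, xty)


# Hypothetical data for eight animals.
dosage = [0, 1, 2, 0, 1, 2, 1, 0]
pc1 = [0.1, -0.2, 0.3, 0.0, -0.1, 0.2, -0.3, 0.1]
pc2 = [0.0, 0.1, -0.1, 0.2, -0.2, 0.1, 0.0, -0.1]
hcw = [-10, 0, -5, 10, 5, -2, 2, 8]  # offset from a reference weight, kg

# Simulated phenotype with a known SNP effect of 0.5 and no noise.
y = [2.0 + 0.5 * d + 1.0 * a - 1.0 * b + 0.01 * h
     for d, a, b, h in zip(dosage, pc1, pc2, hcw)]

rows = [[1.0, d, a, b, h] for d, a, b, h in zip(dosage, pc1, pc2, hcw)]
beta = ols(rows, y)
print(round(beta[1], 6))  # estimated SNP effect
```

Because the simulated phenotype is noise-free, the fit recovers the SNP effect up to rounding; with real data the coefficient would be tested against its standard error.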

The biological process of immune response can describe some genes regulated by our eQTLs, including TAP Binding Protein Like (TAPBPL) and V-Set Domain Containing T Cell Activation Inhibitor 1 (VTCN1). In contrast, Protein Kinase C And Casein Kinase Substrate in Neurons 1 (PACSIN1) and Cytoplasmic FMR1 Interacting Protein 2 (CYFIP2) are involved in cytoskeleton remodeling. Also, we identified genes related to phospholipid metabolism and lipid biosynthesis, such as Lysophosphatidylcholine Acyltransferase 3 (LPCAT3), Phosphatidylinositol Transfer Protein Alpha (PITPNA), Diacylglycerol Kinase Theta (DGKθ), Growth Differentiation Factor 11 (GDF11), and Cytoplasmic FMR1 Interacting Protein 2 (CYFIP2). Trans-Glutathione S-Transferase Alpha 2 (GSTA2) is involved in oxidative stress, and Homeostatic Iron Regulator (HFE) is involved in iron transport pathways. Other genes have previously been associated with meat and carcass quality or are involved in fat-related phenotypes, such as Tripartite motif containing 45 (TRIM45), Transmembrane Protein 87A (TMEM87A), Transmembrane Protein 175 (TMEM175), Trans- Aldehyde Dehydrogenase 5 Family Member A1 (ALDH5A1), Glucosidase alpha neutral C (GANC), and Non-SMC Condensin I Complex Subunit D2 (NCAPD2).

Using MetaCore software, we performed an enrichment analysis of the regulated genes. Figure 3 shows the top ten Pathway Maps (Fig. 3a) and the process networks (Fig. 3b) enriched for genes related to IMF and BFT. This revealed some noteworthy pathways, such as immune response, cytoskeleton remodeling, iron transport, and phospholipid metabolism.

To investigate which eQTLs are located at potential regulatory regions, we performed ATAC-Seq in bovine muscle samples. We identified 33,734 ATAC-Seq peaks with an average width of 2193 base pairs (bp) using a Fraction of Reads in Peaks (FRiP) score of 0.2 (Supplementary Table 7). The peaks were selected based on the FRiP score values and were overlapped with Transcriptional Start Sites (TSSs) in each replicate as a quality metric (Supplementary Fig. 5). As expected, the peaks were centralized on TSSs, which indicates high accessibility and, thus, gene expression. We overlapped our 27 trait-associated eQTLs with ATAC-Seq peaks to identify putative causal variants. The overlap resulted in six variants associated with the modulation of gene expression and phenotype, and located in open chromatin regions. Thus, we considered these as candidate regulatory regions (Table 1).
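The overlap between eQTL positions and ATAC-Seq peaks amounts to a point-in-interval lookup, sketched below. The sketch assumes non-overlapping peaks with inclusive 1-based coordinates; the peak coordinates and the two negative examples are hypothetical, and only the position chr21:55370318 comes from the text. A production analysis would more likely use a dedicated interval tool.

```python
from bisect import bisect_right


def build_index(peaks):
    """Group (chrom, start, end) peaks by chromosome, sorted by start."""
    index = {}
    for chrom, start, end in peaks:
        index.setdefault(chrom, []).append((start, end))
    for intervals in index.values():
        intervals.sort()
    return index


def in_peak(index, chrom, pos):
    """Return True if pos falls inside a peak on chrom.
    Assumes peaks on a chromosome do not overlap each other."""
    intervals = index.get(chrom, [])
    starts = [s for s, _ in intervals]
    i = bisect_right(starts, pos) - 1  # last peak starting at or before pos
    return i >= 0 and intervals[i][0] <= pos <= intervals[i][1]


# Hypothetical peak coordinates.
peaks = [
    ("chr21", 55370000, 55371000),
    ("chr5", 1000, 2000),
]
index = build_index(peaks)

snps = [("chr21", 55370318), ("chr5", 2500), ("chr1", 100)]
hits = [(c, p) for c, p in snps if in_peak(index, c, p)]
print(hits)
```

Only SNPs that fall inside a peak are retained as candidates, mirroring the reduction from 27 trait-associated eQTLs to six in the text.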

Table 1 Relevant eQTLs associated with intramuscular fat (IMF) and backfat thickness (BFT) located in open chromatin regions in a Nellore cattle population.

In addition to overlapping our variants with ATAC-Seq peaks, we also consulted the ChromHMM model-based profile of chromatin states for cattle muscle (Functional Annotation of Animal Genomes consortium database, FAANG). This model uses known epigenetic signals of histone marks and CTCF sites to characterize regulatory elements along the chromatin¹². Figure 4 illustrates the genome location of a novel SNP on chr21:55370318 present in a predicted insulator. Visualization in the Integrative Genomics Viewer (IGV) of the other five eQTLs, located in peaks, can be seen in Supplementary Figs. 6, 7, and 8. Three of those five eQTLs, rs208816180, rs721933457 and rs469481754, are in regions predicted to be insulators. The rs523028000 and rs456412241 eQTLs are in a region with low signal and active enhancer, respectively.

Discussion

Our research group focuses on unraveling the genetic mechanisms underlying fat deposition in beef cattle. To achieve this, our prior investigations have employed a multi-faceted approach, utilizing Genome-Wide Association Studies (GWAS) to pinpoint relevant genomic regions¹³, RNA-seq analysis to characterize gene expression profiles in animals exhibiting divergent IMF levels¹⁴, and eQTL analysis to identify variants influencing gene expression within QTL regions¹⁴.

Herein, we expanded our search for variants by combining genotypes from 374 progenies, whole genome DNA-Sequencing of 26 Nellore sires, and transcribed SNPs obtained from RNA-Seq data from LT of 192 animals to generate a total of 4,522,914 SNPs imputed across 374 animals. The number of SNPs discovered considerably increased our chances of finding causative mutations. However, we had to use linkage disequilibrium to reduce the number of SNPs for eQTL and phenotype association analyses. This reduction was essential to decrease the number of multiple tests from SNPs capturing the same effect, and it brought our analyses down to 553,581 tag-SNPs. We also used ATAC-Seq from muscle to localize open chromatin regions. The combination of phenotype association analysis, eQTL, and ATAC-Seq data allowed us to identify mutations in open chromatin regions that control gene expression and are associated with fat deposition.

Most eQTLs identified were in 3'UTR, introns, intergenic, and downstream gene regions. These results corroborate the hypothesis that untranslated gene regions are involved with gene expression regulation¹⁵. It is essential to point out that the SNPs identified in this study may not be the actual causal mutation since any SNP within the same LD block has the potential to be the causal variant¹⁶. Our association analysis uncovered 27 eQTLs associated with phenotypic variation, in agreement with previous results on the importance of regulatory mutations for phenotypic variation^14,17.

When exploring our set of 27 eQTLs in the cattle Genotype-Tissue Expression atlas (cGTEx—https://cgtex.roslin.ed.ac.uk/), we identified four cis-eQTLs associated with intramuscular fat (IMF), confirming our results. These include rs380143588 (chromosome 5), rs378206508 (chromosome 5), rs385973669 (chromosome 5), and rs209748084 (chromosome 19). Notably, none of the eQTLs associated with backfat thickness (BFT) were identified in cGTEx. It is important to note that although certain eQTLs were not detected in this database, a subset of them comprises novel SNPs that have not yet undergone annotation. Additionally, considering that many of our eQTLs associated with IMF and BFT originate from RNA-seq data, SNPs in regulatory regions may not be comprehensively annotated in existing databases, which predominantly consist of SNPs in intronic and more conserved regions, primarily derived from SNP chips.

Three cis-eQTLs were associated with BFT. Among these, a novel SNP (chromosome 21:55370318) was found to modulate the expression of the Transglutaminase 5 (TGM5) gene. TGM5 belongs to the transglutaminase family (TGs), a group of calcium-dependent enzymes responsible for posttranslational modification of proteins through deamidation and amine incorporation processes. These proteins also serve as scaffolds, maintain membrane integrity, regulate cell adhesion, and modulate signal transduction^18,19. TGM5 was associated with high triglyceride levels in humans²⁰, although this gene's mode of action is unknown. Another member of the TGs family, TGM2, a more extensively studied gene, has been indicated as a novel negative regulator of adipogenesis²¹. In addition, higher expression levels of TGM2 were associated with animals of low genomic estimated breeding values (GEBV) for IMF in a previous study of our group with this same Nellore population¹⁷.

The cis-eQTL rs208816180, associated with BFT, regulates the expression of H2BC15 and HFE. The first gene, present in a small histone gene cluster, is a histone H2B family member involved in DNA replication²². The second gene, HFE, regulates iron (Fe²⁺) absorption by mediating the interaction of transferrin receptors with transferrin. This enables the protein to competitively bind to the transferrin receptor at the same site as iron-loaded transferrin molecules, preventing iron uptake²³. Previous studies suggest that reduced lipogenic activities can occur with lower iron content in the muscle of Nellore steers²⁴. This might happen because iron is essential to the adipocyte's physiological functions, such as differentiation, endocrine function, and energy supply²⁵. Another study discusses how HFE knockout in mice can alter lipid metabolism by modifying the expression of lipid metabolism markers²⁶.

The last gene regulated by a cis-eQTL associated with BFT is Cilia And Flagella Associated Protein 97 (CFAP97), which is predicted to be involved in spermatogenesis²⁷. Although its function in adipose tissue is poorly understood, this gene was associated with adiposity traits such as percent of intramuscular fat, abdominal fat, and blood lipid levels in broilers²⁸.

Regarding IMF, 24 eQTLs were associated, regulating 28 genes (six being novel genes). Of these, five genes are more directly involved in phospholipid metabolism or lipid biosynthesis: LPCAT3, PITPNA, DGKθ, GDF11, and CYFIP2. LPCAT3 is an isoform of lysophosphatidylcholine acyltransferase, which participates in phosphatidylcholine (a major phospholipid class) remodeling²⁹. This gene is the main isoform in primary metabolic tissues, such as the skeletal muscle. It has an important role in lipoprotein production and lipogenesis. In contrast, its deficiency reduces lipid absorption^30,31. LPCAT3 is also involved in the lipid organization of the plasma membrane and insulin sensitivity in skeletal muscle. Knockout of LPCAT3 increased plasma membrane lipid clustering and reduced phospholipid packing of cellular membranes, suggesting that this gene modulates plasma membrane lipid organization³².

PITPNA encodes a family of lipid-binding proteins that transfer molecules of phosphatidylinositol or phosphatidylcholine between membrane surfaces³³. One isoform of Diacylglycerol Kinase, DGKθ, is a transferase that generates phosphatidic acid (another phospholipid involved in cell signaling) by catalyzing the phosphorylation of diacylglycerol using ATP as the phosphate donor, having a role in lipid homeostasis³⁴.

Growth Differentiation Factor 11 (GDF11) is a member of the Transforming Growth Factor β superfamily. This gene has been reported to be involved in adipogenesis by reducing lipid deposition³⁵. Also, GDF11 inhibits peroxisome proliferator-activated receptor γ (PPAR-γ) expression, one of the central genes involved in adipocyte differentiation³⁶.

A recent study in cultured cells reported that CYFIP2 is involved in thermogenesis and regulation of lipid metabolism in white adipocytes, explicitly influencing the browning of white adipocytes³⁷. Other studies reported the possibility of the CYFIP2 locus being associated with diet-induced obesity and metabolic dysfunction in mice^38,39. CYFIP2 was also among the top 10 novel candidate genes for obesity in humans in another study⁴⁰.

The GSTA2 gene can also play a role in fat deposition and obesity, as this gene is related to oxidative stress, producing an antioxidant enzyme that reduces lipid peroxidation⁴¹. Oxidative stress can stimulate white adipose tissue deposition and alter food intake, triggering obesity. Studies demonstrated that oxidative stress can increase preadipocyte proliferation, adipocyte differentiation, and mature adipocyte size^42,43. GSTA2 may also interact with CCAAT/enhancer binding protein-β (C/EBPβ) which is a fat deposition promoter⁴⁴.

Some identified regulated genes are related to immune response and cytoskeleton remodeling pathways. TAPBPL and VTCN1 are involved in antigen presentation, allowing immune cells to monitor the body for intracellular infection^45,46. H2BC15 also has its expression regulated by several immune stimuli in fish, which is probably related to its relevance to immune response⁴⁷. PACSIN1 can transform the liposome membrane into tubes with different diameters, generating various membrane morphologies^48,49. CYFIP2 protein is also a component of the WAVE regulatory complex (WRC) that controls actin cytoskeleton organization⁵⁰.

Other regulated genes have already been associated with meat quality. The TRIM45 gene regulates cell proliferation, growth, and apoptosis and was highlighted in Gene Ontology (GO) terms, linked to meat quality⁵¹. In another study, analyzing gene expression by RNA sequencing in LT muscle of Maremmana and Chianina cattle breeds, TRIM45 was identified as a gene overexpressed in tender Chianina meat⁵². TMEM87A and TMEM175 are part of a huge family of transmembrane protein genes (TMEM). Variants in TMEM genes were associated with obesity (and body fat traits) in human adults and children^53,54. Moreover, a previous work with this Nellore population found TMEM39B as a hub gene in co-expression networks associated with intramuscular fat content traits⁴.

The ALDH5A1 gene, which is related to glycolipid metabolism pathways, has been identified to play roles in IMF deposition⁵⁵. GANC is an enzyme in glycogen metabolism, which is involved in the hydrolysis of this polysaccharide. This gene was linked to energy, carbohydrates, and lipid metabolism pathways. It was also identified as a candidate gene to explain differences in intramuscular fat observed between divergent lines of domestic rabbits⁵⁶. NCAPD2 enables histone binding activity and is involved in mitotic chromosome condensation and segregation^57,58.

Our enrichment analysis revealed pathways related to lipid metabolism. Regarding the phospholipid pathways, lipid intermediates, such as diacylglycerol and fatty acids released from phospholipids, play a crucial role in triglyceride synthesis within the body^59,60. Additionally, we identified genes associated with immune response and cytoskeleton remodeling. One of the consequences of obesity is the presence of chronic inflammation in adipose tissue. When there is an increase in adipocyte volume, the enlarged adipocyte begins producing proinflammatory chemokines, which in turn recruit immune cells to the tissue^61,62. Also, when the adipocyte enlarges to store lipids, the cell must undergo a remodeling of its cytoskeleton⁶³. This means that fat deposition, immune response, and cytoskeleton remodeling are associated, and with a gain of adipose tissue mass, genes and pathways related to immune response and cytoskeleton remodeling are likely upregulated.

Some of the genes regulated by the six eQTLs located in ATAC-Seq peaks in our study have been associated in the past with fat-related phenotypes, having their expression regulated by other variants^14,20. An example of this is the eQTLs rs721933457 and rs523028000, which, in our study, regulate NCAPD2 and TAPBPL, respectively. In another study with our current Nellore cattle population, these genes were regulated by different eQTLs that were associated with IMF¹⁴. Also, another variant located in TGM5 was associated with high triglyceride levels²⁰, but in our study the eQTL that regulates this gene is a novel SNP (chromosome 21:55370318). In these cases, ATAC-Seq data can help us determine that the eQTLs rs721933457 and rs523028000 and the novel SNP (chromosome 21:55370318) are more likely the putative causal mutations, as this analysis prioritizes variants located in regulatory regions.

When considering the regions where the eQTLs were found within ATAC-Seq peaks, integrating multiple epigenomic marks provides further insights into their biological relevance within a spatial context; these regions are commonly referred to as chromatin states⁶⁴. Chromatin states can be used to identify different genomic elements, such as active enhancers, transcription start sites, and insulators. This allowed us to identify regulatory regions and better explain gene expression regulation⁶⁵. When visualizing the eQTLs located in ATAC-Seq peaks in IGV, four of them were co-located within regions where the chromatin state was predicted to act as insulators, alongside potential CCCTC-binding factor (CTCF) binding sites. Insulators are regions in the DNA where some elements can bind to protect the expression of genes from signals emanating from their surrounding regions. Insulators can protect gene expression in two ways: by acting as an enhancer-blocking element or by functioning as a barrier, preventing the advance of condensed chromatin that could silence expression⁶⁶. CTCF is a highly conserved zinc finger protein that acts as a transcription factor by activating and repressing gene expression or by acting as an insulator protein. CTCF can recruit other transcription factors while binding to chromatin domain boundaries, preventing the advance of condensed chromatin and creating an open chromatin region^67,68. An eQTL located in a cis-regulatory element such as an insulator can, in turn, affect gene expression⁶⁹.

SNPs rs523028000 and rs456412241 are in a region with low signal and active enhancer, respectively. An active enhancer is a cis-regulatory region where some elements, such as transcription factors, can bind to regulate the intensity of gene transcription⁷⁰.

In conclusion, by integrating different methods, we could identify putative regulatory variants that are associated with BFT and IMF. Most of our genes and pathways regulated by eQTLs are directly or indirectly involved in lipid metabolism and fat deposition. Furthermore, we identified eQTLs that are in open chromatin regions, colocalized with a profile of noteworthy chromatin states, which assist us in pinpointing candidate regulatory variants. This finding can contribute to improving livestock traits of economic relevance by helping us unravel how these regulatory genomic variants affect fat traits in Nellore cattle.

Materials and methods

Animals, samples, and phenotypes

Experimental procedures related to animal handling and care were approved by the Institutional Animal Care and Use Committee Guidelines of the Brazilian Agricultural Research Corporation (EMBRAPA) (CEUA 01/2013). All methods were performed in accordance with the relevant guidelines and regulations. This study was carried out in compliance with the ARRIVE guidelines.

We used a population of 374 Nellore steers derived from an experimental herd of EMBRAPA, that originated from 34 unrelated bulls representing the main Brazilian Nellore genealogies. Between the years 2009–2011, animals were raised in grazing systems and finished in feedlots with the same handling and nutritional conditions. Steers were slaughtered at an average age of 25 months and 452 kg in a commercial slaughterhouse located in Bariri (São Paulo, Brazil), following the Brazilian Ministry of Agriculture, Livestock and Food Supply (MAPA) guidelines. For additional details see^71,72.

A muscle sample from the Longissimus thoracis (LT) was collected for RNA extraction. Approximately 5 g was collected from the right side of each carcass between the 12th and 13th ribs immediately after the animal's death and stored at −80 °C until analysis. At the beginning of the feedlot period, initial backfat thickness (iBFT, mm) was measured on the animal's back, between the 12th and 13th ribs, using a Pie Medical Aquila ultrasound device (Pie Medical, Inc. Maastricht, The Netherlands) equipped with a 17-cm 3.5 MHz transducer⁷³. For measurements of intramuscular fat content (IMF, %) and final backfat thickness (fBFT, mm), a beef sample of the LT muscle (12th–13th ribs, left side of the carcass) was collected 24 h after slaughter. For IMF analysis, beef samples of approximately 100 g were lyophilized and ground, then IMF was measured using the Ankom XT20 extractor, following the AOCS protocol⁹⁶; more details have been described in another study of our group⁷². The fBFT was measured using a graduated ruler, as previously described⁷¹.

DNA extraction and high-density genotyping data

Genotyping analysis was performed for the 374 animals at the Bovine Functional Genomics Laboratory ARS/USA and ESALQ Genomics Center (Piracicaba, São Paulo, Brazil). Steers and their sires were genotyped using the BovineHD 770 k BeadChip (Infinium BeadChip, Illumina, San Diego, CA, USA), which included 783,450 SNPs, following Illumina's protocol. The DNA of the steers was isolated from the blood through a salting-out method. As a quality control step, SNPs with call rate ≤ 95%, minor allele frequency (MAF) ≤ 5%, located in sexual chromosomes, and those not mapped in the Bos taurus ARS-UCD1.2 reference genome were excluded from further analysis. This analysis was already described in more detail by^72,74. Sires' DNA was extracted from straws of frozen semen by using a standard phenol–chloroform method; more details can be found in⁷⁵. Whole-genome sequence data of the 26 sires were obtained with the Illumina HiSeq 2500 System (Illumina Inc., San Diego, CA, USA), as previously described by⁷⁶. Briefly, reads were trimmed and filtered using the Trimmomatic v.0.36⁷⁷ program and mapped to the ARS-UCD1.2 Bovine reference genome using the Burrows–Wheeler Aligner (BWA) v.0.7.17⁷⁸. Samtools v.1.8⁷⁹ was used to sort the mapped reads by sequence coordinates. The SNPs were then called using the GATK ‘HaplotypeCaller’.
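The call-rate and MAF exclusion rules described above can be sketched as a per-SNP filter. The sketch assumes genotypes coded as 0/1/2 counts of one allele, with None for a missing call; the function names and toy genotype vectors are hypothetical.

```python
def call_rate(genotypes):
    """Fraction of animals with a non-missing genotype."""
    called = [g for g in genotypes if g is not None]
    return len(called) / len(genotypes)


def minor_allele_frequency(genotypes):
    """Minor allele frequency computed from called genotypes only."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    freq = sum(called) / (2 * len(called))
    return min(freq, 1 - freq)


def passes_qc(genotypes, min_call_rate=0.95, min_maf=0.05):
    """Exclude SNPs with call rate <= 95% or MAF <= 5%."""
    return (call_rate(genotypes) > min_call_rate
            and minor_allele_frequency(genotypes) > min_maf)


# Hypothetical SNPs genotyped in 20 animals.
common = [0] * 16 + [1] * 4               # MAF 0.10, call rate 1.00
rare = [0] * 19 + [1]                     # MAF 0.025
sparse = [0] * 14 + [1] * 4 + [None] * 2  # call rate 0.90

results = {name: passes_qc(g)
           for name, g in [("common", common), ("rare", rare), ("sparse", sparse)]}
print(results)
```

The strict inequalities follow the text, which excludes SNPs at exactly 95% call rate or 5% MAF.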

RNA-sequencing

RNA-Sequencing data acquisition was already described elsewhere¹⁷. Briefly, a subset of 192 animals from the 374 animals used for the phenotype association analysis were selected. A sample of 100 mg of the LT muscle was processed using the Trizol reagent (Life Technologies, Carlsbad, CA, USA), following the manufacturer's guidelines. After extraction, RNA integrity was verified using the Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA), and the samples presenting RNA integrity numbers (RIN) greater than 7 were considered for the next analyses. A total of 2 µg of RNA from each sample was used for the cDNA library preparation, according to the protocol described in the TruSeq RNA Sample Preparation kit v2 guide (Illumina, San Diego, CA, USA). The libraries were sequenced using the Illumina HiSeq2500 ultra-high-throughput sequencing system with the TruSeq SBS kit v3-HS (200 cycles), as described in¹⁷. All sequencing analyses were performed at ESALQ Genomics Center (Piracicaba, São Paulo, Brazil). After sequencing, the SeqyClean package v. 1.4.13 (https://github.com/ibest/seqyclean) was utilized to remove low complexity reads and adapter sequences from the library preparation step. For quality control visualization, FastQC software v. 0.10.1 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used. The RNA-sequencing dataset analyzed in this study can be found in the European Nucleotide Archive (ENA) repository (EMBL-EBI) under the following accession codes: PRJEB13188, PRJEB10898, and PRJEB19421.

Variant calling analysis and SNP annotation

SNPs were identified in the gene expression data using the Genome Analysis Toolkit (GATK) program v. 4.1.0.0 in Genomic Variant Call Format (GVCF) mode, following the program's best practices manual^80,81. Data from 192 animals with RNA-seq information were used to call the variants individually by haplotype, and the Ensembl Bos taurus database SNP (release 96) was used as a database of known variants. Identified SNPs that had variant quality based on Phred (Phred scaled polymorphism probability) > 30 and minimum variant coverage > 10 were also filtered for call rate > 95% and MAF (minor allele frequency) > 5%. Non-biallelic SNPs and SNPs located on sex chromosomes were not considered in the analysis. This variant acquisition step from RNA-Seq data was previously described in another study of our group¹⁴.

SNP imputation data

SNPs identified in the genomic DNA sequence of 26 sires were jointly imputed to 778 progenies with genotypes obtained from the BovineHD BeadChip 770 k panel (Illumina, San Diego, CA, USA), using the programs Eagle⁸² (for phasing) and Minimac3⁸³ (for imputation). Imputation accuracy was obtained using leave-one-out cross-validation, in which each sequenced animal was removed once from the reference population and included in the target population along with the progenies genotyped with the high-density panel. Imputation accuracy metrics were then calculated by comparing imputed alleles to the alleles observed in the DNA sequence of each sire. The allelic imputation error rate was estimated as the ratio between the number of incorrectly imputed alleles and the total number of imputed alleles. We also estimated the correlation between imputed and actual genotypes. Only SNPs with a correlation greater than 0.98 and an allelic imputation error rate < 2% were kept for further analysis, so positions with low imputation accuracy were discarded. Additionally, only SNPs on autosomal chromosomes and with an allele frequency greater than 5% were retained for the analysis.
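As an illustration of the two accuracy metrics, the sketch below computes an allelic error rate and a genotype correlation for one SNP. It assumes genotypes are coded as 0, 1, or 2 copies of an allele and that the number of wrongly imputed alleles per animal is the absolute difference between the observed and imputed codes; both the coding and the function names are assumptions for illustration, not the authors' scripts.

```python
from math import sqrt


def allelic_error_rate(observed, imputed):
    """Incorrectly imputed alleles divided by all imputed alleles (2 per animal)."""
    wrong = sum(abs(o - i) for o, i in zip(observed, imputed))
    return wrong / (2 * len(observed))


def genotype_correlation(observed, imputed):
    """Pearson correlation between observed and imputed genotype codes."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_i = sum(imputed) / n
    cov = sum((o - mean_o) * (i - mean_i) for o, i in zip(observed, imputed))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_i = sum((i - mean_i) ** 2 for i in imputed)
    if var_o == 0 or var_i == 0:
        return 0.0  # correlation is undefined for a monomorphic vector
    return cov / sqrt(var_o * var_i)


def keep_snp(observed, imputed, min_corr=0.98, max_error=0.02):
    """Thresholds from the text: correlation > 0.98 and error rate < 2%."""
    return (
        genotype_correlation(observed, imputed) > min_corr
        and allelic_error_rate(observed, imputed) < max_error
    )


# Invented genotypes for eight animals; one allele is imputed incorrectly.
observed = [0, 1, 2, 1, 0, 2, 1, 1]
imputed = [0, 1, 2, 1, 0, 2, 1, 2]
error = allelic_error_rate(observed, imputed)  # 1 wrong allele out of 16
```

With one wrong allele among 16, the error rate exceeds the 2% limit, so this toy SNP would be discarded.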

The 123,300 SNPs identified in RNA sequencing of 192 animals within the progenies (call rate < 95%) were subsequently jointly imputed to the panel formed by imputed DNA-Seq variants plus SNPs from the high-density panel, using the Eagle and Minimac3 programs for phasing and imputation, respectively. After imputation, 96,195 SNPs with R² (calculated by Minimac3) > 0.90 were kept. The SNPs were also filtered for allele frequency, removing monomorphic SNPs and SNPs with MAF < 5%. A graphical summary of the imputation analysis can be seen in Supplementary Fig. 9.
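The post-imputation filter can be sketched as follows, assuming the Minimac3 R² value for each SNP has already been read from the imputation output and that genotypes are coded as 0, 1, or 2 allele copies. The function names and example values are hypothetical.

```python
def minor_allele_frequency(genotypes):
    """MAF from genotypes coded as 0/1/2 copies of one allele."""
    allele_freq = sum(genotypes) / (2 * len(genotypes))
    return min(allele_freq, 1 - allele_freq)


def keep_imputed_snp(r2, genotypes, min_r2=0.90, min_maf=0.05):
    """Keep SNPs with R2 > 0.90; drop monomorphic SNPs and those with MAF < 5%."""
    return r2 > min_r2 and minor_allele_frequency(genotypes) >= min_maf


# Invented examples: a common SNP and a monomorphic one.
common = [0, 1, 2, 1]
monomorphic = [0, 0, 0, 0]
```

A monomorphic SNP has a MAF of zero, so the single MAF threshold removes it without a separate check.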

Population stratification test

Population stratification was investigated by principal component analysis (PCA) using the genotypes from the BovineHD 770 k BeadChip and our populations of 192 animals (for eQTL mapping) and 374 animals (for association analysis). We first filtered the variants for MAF > 5% and call rate > 95%. Then, using the PLINK software⁸⁴, we tested for population stratification with 446,498 genotypes and used the sires to test for sample clustering, since our population was sired by 34 unrelated bulls.

eQTL identification and functional annotation

The imputed SNPs were filtered for the subset of 192 animals with RNA-Seq information. Additionally, we performed quality control of the variants, keeping only SNPs with MAF > 5% and call rate > 95%, totaling 4,436,504 SNPs. PLINK v. 1.07⁸⁴ was then used to prune this set of imputed SNPs based on linkage disequilibrium (LD). Pruning used a pairwise r² threshold of 0.8 and a window of 100 SNPs, shifted 10 SNPs at each step, to obtain a subset of informative SNPs (tag-SNPs) within LD blocks. The R package Matrix eQTL v. 2.3⁸⁵ was used to identify cis- and trans-eQTLs. The inputs were the expression of 12,991 muscle genes, normalized as log2-CPM (counts per million mapped reads) and adjusted for lane and flow cell effects, and the genotype file containing the tag-SNPs. The first two principal components (PC1 and PC2), included to correct for putative effects of population stratification, and CG (animals from the same farm, year, and slaughter date) were fitted as covariates in the model. In this study, cis-eQTLs were defined as SNPs located up to 1 Mb away from the regulated gene, while trans-eQTLs were SNPs located > 1 Mb away from the gene. The additive effect of each gene-SNP pair was tested by linear regression, and the false discovery rate (FDR), based on the Benjamini–Hochberg methodology⁸⁶, was calculated separately for cis- and trans-eQTLs⁸⁵. The lists of cis- and trans-eQTLs (FDR < 5%) were annotated separately using VEP v. 101.0⁸⁷.
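The cis/trans definition and the FDR step can be illustrated with a short sketch. It is not the Matrix eQTL implementation. It assumes that distance is measured from the SNP to the nearest gene boundary and that SNPs on a different chromosome from the gene count as trans; both are assumptions, as are the function names and example coordinates.

```python
CIS_WINDOW_BP = 1_000_000  # 1 Mb, as defined in the text


def classify_eqtl(snp_chrom, snp_pos, gene_chrom, gene_start, gene_end):
    """Label a gene-SNP pair as 'cis' (within 1 Mb of the gene) or 'trans'."""
    if snp_chrom != gene_chrom:
        return "trans"  # assumed: different chromosome is always trans
    if gene_start <= snp_pos <= gene_end:
        distance = 0
    else:
        distance = min(abs(snp_pos - gene_start), abs(snp_pos - gene_end))
    return "cis" if distance <= CIS_WINDOW_BP else "trans"


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda idx: pvalues[idx])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Invented p-values for four gene-SNP pairs of the same class (cis or trans).
pvalues = [0.01, 0.04, 0.03, 0.005]
fdr = benjamini_hochberg(pvalues)
significant = [idx for idx, q in enumerate(fdr) if q < 0.05]
```

Running the adjustment once on the cis pairs and once on the trans pairs mirrors the separate FDR calculation described above.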

Phenotype association analysis

Association analysis between the 30,581 significant eQTLs (FDR < 5%) and the phenotypes was performed in PLINK v. 1.07⁸⁴, using a linear model and a SNP-by-SNP approach with adjustment for multiple testing, in a population of 374 animals, the same used by¹³. This analysis considered the same effects used in the eQTL mapping, namely PC1, PC2, and CG. Furthermore, following the model used in a previous study by our research group¹³, hot carcass weight was included in the model as a covariate. SNPs associated with the phenotype at FDR < 10%⁸⁶ were considered significant.

Gene annotation and functional enrichment

Annotation of eQTL-regulated genes was performed using the Ensembl Biomart tool (Ensembl Genes 104). To find molecular pathways in which genes regulated by the representative eQTL were involved, we used MetaCore software (https://portal.genego.com/) from Clarivate (London, GBR) with the Homo sapiens database and the list of annotated genes.

ATAC-Seq

We performed ATAC-Seq analysis on two LT samples from Nellore males, purchased from a commercial slaughterhouse, with two technical replicates per sample, using the protocol described by⁸⁸. Paired-end sequencing (2 × 100) was performed on a HiSeq 2500, and approximately 40 million reads were generated per library. Trimmomatic v. 0.36⁷⁷ was used to remove adapters from read ends, and the FastQC tool was used for quality control⁸⁹,⁹⁰. Then, we generated ATAC-Seq pileup files from FASTA files using the nf-core/atacseq pipeline implemented in Nextflow (https://nf-co.re/atacseq). A consensus peak set was obtained from the four samples by considering only regions with counts higher than zero in all of them. In these regions, the FRiP score was calculated for each of the replicates, and an ATAC-Seq peak region was considered when the average FRiP score across the four replicates was higher than 0.2, following the recommendations of the ENCODE (Encyclopedia of DNA Elements) consortium⁹¹. deepTools v. 3.5.1⁹² was used to create the matrix, heatmaps, and line plots of peaks overlapping transcription start sites (TSS) from individual and merged replicates as an additional quality control⁹³.
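The consensus and FRiP criteria can be sketched as below. The sketch assumes that per-region read counts and per-replicate read totals are already available; region labels, counts, and function names are invented for illustration and do not reproduce the nf-core/atacseq pipeline.

```python
def consensus_regions(counts_by_region):
    """Keep regions whose read count is above zero in every replicate."""
    return [
        region
        for region, counts in counts_by_region.items()
        if all(count > 0 for count in counts)
    ]


def frip(reads_in_peaks, total_reads):
    """Fraction of reads in peaks for one replicate."""
    return reads_in_peaks / total_reads


def passes_frip(frip_scores, threshold=0.2):
    """Average FRiP across replicates must exceed the 0.2 threshold."""
    return sum(frip_scores) / len(frip_scores) > threshold


# Invented read counts for two regions across four replicates.
counts_by_region = {
    "chr1:100-600": [12, 8, 5, 9],
    "chr1:900-1300": [4, 0, 7, 3],   # zero in one replicate, so excluded
}
consensus = consensus_regions(counts_by_region)

# Invented FRiP values for the four replicates.
scores = [0.25, 0.30, 0.22, 0.19]
```

Averaging the scores means that a single replicate slightly below 0.2 does not by itself fail the criterion, as in the toy values above.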

To explore the genomic overlap between eQTLs associated with the phenotypes and ATAC-Seq peaks, we used an in-house R script based on the subsetByOverlaps function from the GenomicRanges R/Bioconductor package⁹⁴. The Integrative Genomics Viewer (IGV – v.2.15.4) was used for data visualization⁹⁵. The ATAC-Seq dataset analyzed in this study can also be found in the ENA repository (EMBL-EBI) under the accession code PRJEB64479.
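The overlap step can be illustrated with a plain-Python stand-in for a positional overlap query. This is not the authors' R script and does not use GenomicRanges; it assumes 1-based, inclusive peak coordinates, and the SNP identifiers and positions are invented.

```python
def snps_in_peaks(snps, peaks):
    """Return IDs of SNPs whose position falls inside any peak.

    snps:  iterable of (snp_id, chrom, position)
    peaks: iterable of (chrom, start, end), 1-based inclusive (assumed)
    """
    peaks_by_chrom = {}
    for chrom, start, end in peaks:
        peaks_by_chrom.setdefault(chrom, []).append((start, end))

    hits = []
    for snp_id, chrom, pos in snps:
        intervals = peaks_by_chrom.get(chrom, [])
        if any(start <= pos <= end for start, end in intervals):
            hits.append(snp_id)
    return hits


# Invented eQTL positions and ATAC-Seq peaks.
eqtls = [
    ("snp_a", "3", 1_250),
    ("snp_b", "3", 9_000),
    ("snp_c", "7", 1_250),
]
atac_peaks = [("3", 1_000, 1_500), ("7", 5_000, 5_400)]
overlapping = snps_in_peaks(eqtls, atac_peaks)
```

The linear scan over peaks is adequate for a few dozen trait-associated eQTLs; a sorted or tree-based interval index would be the usual choice for genome-wide queries.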

Data availability

The datasets used in this study can be found in online repositories. The RNA-sequencing dataset analyzed in this study can be found in the European Nucleotide Archive (ENA) repository (EMBL-EBI) under the following accession codes: PRJEB13188, PRJEB10898, and PRJEB19421. The ATAC-Seq dataset analyzed in this study can also be found in the ENA repository (EMBL-EBI) under the accession code PRJEB64479. Accession: https://www.ebi.ac.uk/ena/browser/view/PRJEB13188; https://www.ebi.ac.uk/ena/browser/view/PRJEB10898; https://www.ebi.ac.uk/ena/browser/view/PRJEB19421; https://www.ebi.ac.uk/ena/browser/view/PRJEB64479.

References

Santos, D. et al. The most important attributes of beef sensory quality and production variables that can affect it: A review. Livest. Sci. 250, 104573 (2021).
Yokoo, M. J. et al. Genetic and environmental factors affecting ultrasound measures of longissimus muscle area and backfat thickness in Nelore cattle. Livest. Sci. 117, 147–154 (2008).
Newcom, D. W., Baas, T. J., Schwab, C. R. & Stalder, K. J. Genetic and phenotypic relationships between individual subcutaneous backfat layers and percentage of longissimus intramuscular fat in Duroc swine. J. Anim. Sci. 83, 316 (2005).
Cesar, A. S. M. et al. Identification of putative regulatory regions and transcription factors associated with intramuscular fat content traits. BMC Genom. https://doi.org/10.1186/s12864-018-4871-y (2018).
Nica, A. C. & Dermitzakis, E. T. Expression quantitative trait loci: Present and future. Philos. Trans. R. Soc. B https://doi.org/10.1098/rstb.2012.0362 (2013).
Fagny, M. et al. Exploring regulation in tissues with eQTL networks. Proc. Natl. Acad. Sci. USA https://doi.org/10.1073/pnas.1707375114 (2017).
Nicolae, D. L. et al. Trait-associated SNPs are more likely to be eQTLs: Annotation to enhance discovery from GWAS. PLoS Genet. https://doi.org/10.1371/journal.pgen.1000888 (2010).
Li, Y., Willer, C., Sanna, S. & Abecasis, G. Genotype imputation. Annu. Rev. Genom. Hum. Genet. 10, 387–406 (2009).
Sargolzaei, M., Chesnais, J. P. & Schenkel, F. S. A new approach for efficient genotype imputation using information from relatives. BMC Genom. https://doi.org/10.1186/1471-2164-15-478 (2014).
Vergara, Z. & Gutierrez, C. Emerging roles of chromatin in the maintenance of genome organization and function in plants. Genome Biol. 18, 96 (2017).
Kouzarides, T. Chromatin modifications and their function. Cell 128, 693–705 (2007).
Kern, C. et al. Functional annotations of three domestic animal genomes provide vital resources for comparative and agricultural research. Nat. Commun. 12, 1821 (2021).
Cesar, A. S. M. et al. Genome-wide association study for intramuscular fat deposition and composition in Nellore cattle. BMC Genet. https://doi.org/10.1186/1471-2156-15-39 (2014).
Silva-Vignato, B. et al. Integrative analysis between genome-wide association study and expression quantitative trait loci reveals bovine muscle gene expression regulatory polymorphisms associated with intramuscular fat and backfat thickness. Front. Genet. 13, 5238 (2022).
Barrett, L. W., Fletcher, S. & Wilton, S. D. Regulation of eukaryotic gene expression by the untranslated gene regions and other non-coding elements. Cell. Mol. Life Sci. 69, 3613–3634 (2012).
Mueller, J. C. Linkage disequilibrium for different scales and applications. Brief. Bioinform. 5, 355–364 (2004).
Cesar, A. S. M. et al. Putative regulatory factors associated with intramuscular fat content. PLoS ONE 10, e0128350 (2015).
Eckert, R. L. et al. Transglutaminase regulation of cell function. Physiol. Rev. 94, 383–417 (2014).
Iismaa, S. E., Mearns, B. M., Lorand, L. & Graham, R. M. Transglutaminases and disease: Lessons from genetically engineered mouse models and inherited disorders. Physiol. Rev. 89, 991–1023 (2009).
Larrieta-Carrasco, E. et al. Genetic variants in COL13A1, ADIPOQ and SAMM50, in addition to the PNPLA3 gene, confer susceptibility to elevated transaminase levels in an admixed Mexican population. Exp. Mol. Pathol. 104, 50–58 (2018).
Myneni, V. D., Melino, G. & Kaartinen, M. T. Transglutaminase 2—a novel inhibitor of adipogenesis. Cell Death Dis. 6, e1868–e1868 (2015).
Singh, R., Bassett, E., Chakravarti, A. & Parthun, M. R. Replication-dependent histone isoforms: A new source of complexity in chromatin structure and function. Nucleic Acids Res. 46, 9893–9894 (2018).
Hegde, R. et al. Human homeostatic iron regulator gene polymorphism in autistic population of India; a case-control study. Gene Rep. 26, 101518 (2022).
Diniz, W. J. S. et al. Iron content affects lipogenic gene expression in the muscle of Nelore beef cattle. PLoS ONE 11, e0161160 (2016).
Ma, W., Jia, L., Xiong, Q., Feng, Y. & Du, H. The role of iron homeostasis in adipocyte metabolism. Food Funct. 12, 4246–4253 (2021).
Tan, T. C. H. et al. Altered lipid metabolism in Hfe-knockout mice promotes severe NAFLD and early fibrosis. Am. J. Physiol. Gastrointest. Liver Physiol. 301, G865–G876 (2011).
Oura, S. et al. Cfap97d1 is important for flagellar axoneme maintenance and male mouse fertility. PLoS Genet. 16, e1008954 (2020).
Lake, J. A., Papah, M. B. & Abasht, B. Increased expression of lipid metabolism genes in early stages of wooden breast links myopathy of broilers to metabolic syndrome in humans. Genes 10, 746 (2019).
Hishikawa, D. et al. Discovery of a lysophospholipid acyltransferase family essential for membrane asymmetry and diversity. Proc. Natl. Acad. Sci. USA 105, 2830–2835 (2008).
Zhao, Y. et al. Identification and characterization of a major liver lysophosphatidylcholine acyltransferase. J. Biol. Chem. 283, 8258–8265 (2008).
Zhang, Q. et al. The structural basis for the phospholipid remodeling by lysophosphatidylcholine acyltransferase 3. Nat. Commun. 12, 6869 (2021).
Ferrara, P. J. et al. Lysophospholipid acylation modulates plasma membrane lipid organization and insulin sensitivity in skeletal muscle. J. Clin. Investig. 131, 135963 (2021).
Tilley, S. J. et al. Structure-function analysis of phosphatidylinositol transfer protein alpha bound to human phosphatidylinositol. Structure 12, 317–326 (2004).
Ma, Q., Gabelli, S. B. & Raben, D. M. Diacylglycerol kinases: Relationship to other lipid kinases. Adv. Biol. Regul. 71, 104–110 (2019).
Luo, H. et al. Growth differentiation factor 11 inhibits adipogenic differentiation by activating TGF-beta/Smad signalling pathway. Cell Prolif. 52, 12631 (2019).
Zhang, Y. et al. Growth differentiation factor 11 is a protective factor for osteoblastogenesis by targeting PPARgamma. Gene 557, 209–214 (2015).
Manigandan, S. & Yun, J. W. Loss of cytoplasmic FMR1-interacting protein 2 (CYFIP2) induces browning in 3T3-L1 adipocytes via repression of GABA-BR and activation of mTORC1. J. Cell. Biochem. 123, 863–877 (2022).
Rendina-Ruedy, E. et al. A comparative study of the metabolic and skeletal response of C57BL/6J and C57BL/6N mice in a diet-induced model of type 2 diabetes. J. Nutr. Metab. 2015, 1–13 (2015).
Heiker, J. T. et al. Identification of genetic loci associated with different responses to high-fat diet-induced obesity in C57BL/6N and C57BL/6J substrains. Physiol. Genom. 46, 377–384 (2014).
Keller, M. et al. Genome-wide DNA promoter methylation and transcriptome analysis in human adipose tissue unravels novel candidate genes for obesity. Mol. Metab. 6, 86–100 (2017).
Si, X., Zhou, Z., Strappe, P. & Blanchard, C. A comparison of RS4-type resistant starch to RS2-type resistant starch in suppressing oxidative stress in high-fat-diet-induced obese rats. Food Funct. 8, 232–240 (2017).
Higuchi, M. et al. Differentiation of human adipose-derived stem cells into fat involves reactive oxygen species and forkhead Box O1 mediated upregulation of antioxidant enzymes. Stem Cells Dev. 22, 878–888 (2013).
Furukawa, S. et al. Increased oxidative stress in obesity and its impact on metabolic syndrome. J. Clin. Investig. 114, 1752–1761 (2004).
Ki, S. H., Cho, I. J., Choi, D. W. & Kim, S. G. Glucocorticoid receptor (GR)-associated SMRT binding to C/EBPβ TAD and Nrf2 Neh4/5: Role of SMRT recruited to GR in GSTA2 gene repression. Mol. Cell. Biol. 25, 4150–4165 (2005).
Mantel, I., Sadiq, B. A. & Blander, J. M. Spotlight on TAP and its vital role in antigen presentation and cross-presentation. Mol. Immunol. 142, 105–119 (2022).
Samarawardana, P. & Shroyer, K. VTCN1 (V-set domain containing T cell activation inhibitor 1). Atlas Genet. Cytogenet. Oncol. Haematol. https://doi.org/10.4267/2042/38604 (2011).
Valero, Y., Arizcun, M., Esteban, M. Á., Cuesta, A. & Chaves-Pozo, E. Transcription of histones H1 and H2B is regulated by several immune stimuli in gilthead seabream and European sea bass. Fish Shellfish Immunol. 57, 107–115 (2016).
Wang, Q. et al. Molecular mechanism of membrane constriction and tubulation mediated by the F-BAR protein Pacsin/Syndapin. Proc. Natl. Acad. Sci. USA 106, 12700–12705 (2009).
Mahmood, Md. I., Noguchi, H. & Okazaki, K. Curvature induction and sensing of the F-BAR protein Pacsin1 on lipid membranes via molecular dynamics simulations. Sci. Rep. 9, 14557 (2019).
Schenck, A. et al. CYFIP/Sra-1 controls neuronal connectivity in Drosophila and links the Rac1 GTPase pathway to the fragile X protein. Neuron 38, 887–898 (2003).
Olivieri, B. F. et al. Differentially expressed genes identified through RNA-seq with extreme values of principal components for beef fatty acid in Nelore cattle. J. Anim. Breed. Genet. 138, 80–90 (2021).
Bongiorni, S. et al. Transcriptomic investigation of meat tenderness in two Italian cattle breeds. Anim. Genet. 47, 273–287 (2016).
Todendi, P. F. et al. Genetic risk score based on fat mass and obesity-associated, transmembrane protein 18 and fibronectin type III domain containing 5 polymorphisms is associated with anthropometric characteristics in South Brazilian children and adolescents. Br. J. Nutr. 121, 93–99 (2019).
Ma, M., Lee, J. H. & Kim, M. Identification of a TMEM182 rs141764639 polymorphism associated with central obesity by regulating tumor necrosis factor-α in a Korean population. J. Diabetes Compl. 34, 107732 (2020).
Luo, N. et al. Differential regulation of intramuscular fat and abdominal fat deposition in chickens. BMC Genom. 23, 308 (2022).
Sosa-Madrid, B. S. et al. The effect of divergent selection for intramuscular fat on the domestic rabbit genome. Animal 14, 2225–2235 (2020).
Schmiesing, J. A., Gregson, H. C., Zhou, S. & Yokomori, K. A human condensin complex containing hCAP-C–hCAP-E and CNAP1, a Homolog of Xenopus XCAP-D2, colocalizes with phosphorylated histone H3 during the early stage of mitotic chromosome condensation. Mol. Cell. Biol. 20, 6996–7006 (2000).
Ball, A. R. et al. Identification of a chromosome-targeting domain in the human condensin subunit CNAP1/hCAP-D2/Eg7. Mol. Cell. Biol. 22, 5769–5781 (2002).
Lim, H.-Y., Wang, W., Wessells, R. J., Ocorr, K. & Bodmer, R. Phospholipid homeostasis regulates lipid metabolism and cardiac function through SREBP signaling in Drosophila. Genes Dev. 25, 189–200 (2011).
Vance, J. E. Thematic review series: Glycerolipids. Phosphatidylserine and phosphatidylethanolamine in mammalian cells: Two metabolically related aminophospholipids. J. Lipid Res. 49, 1377–1387 (2008).
Akasheh, R. T., Pang, J., York, J. M. & Fantuzzi, G. New pathways to control inflammatory responses in adipose tissue. Curr. Opin. Pharmacol. 13, 613–617 (2013).
DeBari, M. K. & Abbott, R. D. Adipose tissue fibrosis: Mechanisms, models, and importance. Int. J. Mol. Sci. 21, 6030 (2020).
Choe, S. S., Huh, J. Y., Hwang, I. J., Kim, J. I. & Kim, J. B. Adipose tissue remodeling: Its role in energy metabolism and metabolic disorders. Front. Endocrinol. 7, 30 (2016).
Ernst, J. & Kellis, M. Discovery and characterization of chromatin states for systematic annotation of the human genome. Nat. Biotechnol. 28, 817–825 (2010).
Ernst, J. & Kellis, M. Chromatin-state discovery and genome annotation with ChromHMM. Nat. Protoc. 12, 2478–2492 (2017).
West, A. G., Gaszner, M. & Felsenfeld, G. Insulators: Many functions, many mechanisms. Genes Dev. 16, 271–288 (2002).
Kim, S., Yu, N.-K. & Kaang, B.-K. CTCF as a multifunctional protein in genome regulation and gene expression. Exp. Mol. Med. 47, e166–e166 (2015).
Holwerda, S. J. B. & de Laat, W. CTCF: The protein, the binding partners, the binding sites and their chromatin loops. Philos. Trans. R. Soc. B 368, 20120369 (2013).
Brown, C. D., Mangravite, L. M. & Engelhardt, B. E. Integrative modeling of eQTLs and cis-regulatory elements suggests mechanisms underlying cell type specificity of eQTLs. PLoS Genet. 9, e1003649 (2013).
Cramer, P. Organization and regulation of gene transcription. Nature 573, 45–54 (2019).
Tizioto, P. C. et al. Genome scan for meat quality traits in Nelore beef cattle. Physiol. Genom. https://doi.org/10.1152/physiolgenomics.00066.2013 (2013).
Cesar, A. S. M. et al. Differences in the skeletal muscle transcriptome profile associated with extreme values of fatty acids content. BMC Genom. 17, 1–16 (2016).
Nascimento, M. L. et al. Feed efficiency indexes and their relationships with carcass, non-carcass and meat quality traits in Nellore steers. Meat Sci. 116, 78–85 (2016).
Tizioto, P. C. et al. Genome scan for meat quality traits in Nelore beef cattle. Physiol. Genom. 45, 1012–1020 (2013).
Tizioto, P. C. et al. Candidate genes for production traits in Nelore beef cattle. Genet. Mol. Res. 11, 4138–4144 (2012).
Cardoso, T. F. et al. Multi-omics approach reveals miR-SNPs affecting muscle fatty acids profile in Nelore cattle. Genes 12, 67 (2021).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Brouard, J. S., Schenkel, F., Marete, A. & Bissonnette, N. The GATK joint genotyping workflow is appropriate for calling variants in RNA-seq experiments. J. Anim. Sci. Biotechnol. https://doi.org/10.1186/s40104-019-0359-0 (2019).
McKenna, A. et al. The genome analysis toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. https://doi.org/10.1101/gr.107524.110 (2010).
Loh, P.-R. et al. Reference-based phasing using the Haplotype Reference Consortium panel. Nat. Genet. 48, 1443–1448 (2016).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Shabalin, A. A. Matrix eQTL: Ultra fast eQTL analysis via large matrix operations. Bioinformatics https://doi.org/10.1093/bioinformatics/bts163 (2012).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57, 289–300 (1995).
McLaren, W. et al. The Ensembl variant effect predictor. Genome Biol. 17, 122 (2016).
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10 (2011).
Leggett, R. M., Ramirez-Gonzalez, R. H., Clavijo, B. J., Waite, D. & Davey, R. P. Sequencing quality assessment tools to enable data-driven informatics for high throughput genomics. Front. Genet. 4, 288 (2013).
The ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Ramírez, F. et al. deepTools2: A next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Alexandre, P. A. et al. Chromatin accessibility and regulatory vocabulary across indicine cattle tissues. Genome Biol. 22, 273 (2021).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Robinson, J. T. et al. Integrative genomics viewer. Nat. Biotechnol. 29, 24–26 (2011).
AOCS. Rapid determination of oil/fat utilizing high temperature solvent extraction. AOCS Official Procedure Am 5-04. Official Methods and Recommended Practices of the AOCS (American Oil Chemists Society, AOCS, 2004).

Acknowledgements

We thank both EMBRAPA and the University of São Paulo for the collaborative efforts. We also thank CAPES (Higher Education Personnel Improvement Coordination) for the scholarship to ISG (CAPES process number 88887.481433/2020-00), and CNPq (Brazilian National Council of Science) for science productivity fellowship to LCAR, LLC and GBM (CNPq process number 303457/2021-0–304353/2019-1–310714/2020-6). We also thank FAPESP (São Paulo Research Foundation) for providing financial support (FAPESP process number 2019/04089-2).

Author information

Authors and Affiliations

Department of Animal Science, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, SP, Brazil
Ingrid Soares Garcia, Bárbara Silva-Vignato, Juliana Petrini, Vinicius Henrique da Silva, Natália Silva Morosini, Carolina Purcell Goes, Thaís Ribeiro da Silva, Beatriz Delcarme Lima, Luan Gaspar Clemente, Gerson Barreto Mourão & Luiz Lehmann Coutinho
Department of Agroindustry, Food and Nutrition, College of Agriculture “Luiz de Queiroz”, University of São Paulo, Piracicaba, SP, Brazil
Aline Silva Mello Cesar
Embrapa Pecuária Sudeste, São Carlos, SP, Brazil
Juliana Afonso & Luciana Correia de Almeida Regitano

Contributions

ISG, BSV, JP, LCAR, ASMC and LLC participated in the experimental design. ISG, BSV, VHS and JP performed data analysis. ISG drafted the manuscript. ISG, BSV, ASMC, JA, JP, NSM, CPG, LGC, TRS, BDL, GBM, LCAR and LLC collaborated with interpretation and discussion of the results. LLC and LCAR provided the experimental environment and coordination. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Luiz Lehmann Coutinho.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Table 1.

Supplementary Table 2.

Supplementary Table 3.

Supplementary Table 4.

Supplementary Table 5.

Supplementary Table 6.

Supplementary Table 7.

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Garcia, I.S., Silva-Vignato, B., Cesar, A.S.M. et al. Novel putative causal mutations associated with fat traits in Nellore cattle uncovered by eQTLs located in open chromatin regions. Sci Rep 14, 10094 (2024). https://doi.org/10.1038/s41598-024-60703-5

Received: 19 October 2023
Accepted: 26 April 2024
Published: 02 May 2024
DOI: https://doi.org/10.1038/s41598-024-60703-5
