Abstract
Cotton is a significant cash crop and the primary source of natural fiber globally. Among the numerous diseases encountered in cotton production, Verticillium wilt is one of the most serious, caused by the pathogen Verticillium dahliae (V. dahliae). Unfortunately, there are no effective targeted methods to combat this disease. Genomic resources for Verticillium wilt resistance primarily exist in Gossypium barbadense (G. barbadense). Regrettably, there have been limited transcriptomic comparisons between V. dahliae-resistant and -susceptible varieties of G. barbadense due to the scarcity of susceptible resources. In this study, we conducted a transcriptome analysis on both V. dahliae-resistant and -susceptible varieties of G. barbadense at the 0, 12, 24 and 48 hours after V. dahliae inoculation. This comparative transcriptome analysis yielded high-quality data and offered new insights into the molecular mechanisms underlying cotton’s resistance against this destructive pathogen.
Similar content being viewed by others
Background & Summary
Cotton, as one of the prominent fiber crops, serves as vital renewable natural fiber source, which contributes to approximately 35% of global fiber production1,2. In 2022, China becomes the second-largest cotton-producing country worldwide, yielding a total production of 5.977 million tons3. However, various diseases cause the loss of cotton production, with cotton Verticillium wilt (CVW) emerging as one of the most widespread and great threat to cotton production4. Over 40% of the cotton planting area in China being affected by V. dahliae, and this disease has become a main threat to the sustainable development of cotton production5.
CVW is a soil-borne vascular fungal disease, which is primarily caused by the pathogen of V. dahliae6,7. Root exudates induces V. dahliae germinaion and V. dahliae initiates invasion of the host plant through the root system. The pathogen then penetrates the root epidermal gaps via the root tip or natural wounds, ultimately growing vertically into the root vascular tissue. Subsequently, the spores of V. dahliae rapidly spreads to the cotton stems and leaves, resulting in gradual wilting and potential death of the cotton plant8,9. The microsclerotia of V. dahliae can survive up to 10 years, coupled with its wide range of hosts and the lack of effective fungicides for prevention or eradication, poses significant challenges in controlling the disease5. Consequently, the most effective, economical, safe, and environmentally friendly integrated disease management approach for controlling Verticillium wilt is planting and breeding resistant cultivars.
Gossypium barbadense (G. barbadense) and Gossypium hirsutum (G. hirsutum) are two major cultivated allopolyploid species, originating from natural hybridization and polyploidization events between diploid A0 and D5 genomes approximately 1–2 million years ago (MYA)10. These two cotton species exhibit significant differences in traits, such as stress tolerance, fiber quality, and lint yield11. These variations can be attributed to different processes of natural selection and human domestication. G. hirsutum is extensively cultivated due to its superior yield performance and better adaptability, contributing to over 90% of the global cotton fiber production12,13. V.dahilae impacts more than half of the cotton acreage, and G. hirsutum is notably susceptible to V. dahliae. This susceptibility leads to a substantial reduction in both fiber quality and yield14. Notably, G. barbadense demonstrates high resistance or immunity to cotton V. dahliae, making it a valuable resource for identifying key genes associated with Verticillium wilt resistance and uncovering the corresponding regulatory mechanisms15,16. However, the lack of V. dahliae-susceptible G. barbadense germplasm resources has limited the research focus to comparing V. dahliae-resistant G. barbadense with either V. dahliae-resistant G. hirsutum or V. dahliae-susceptible G. hirsutum varieties17,18,19,20,21,22. Currently, there is only one study comparing transcriptomic differences between V. dahliae-resistant and -susceptible varieties of G. barbadense, but the transcriptome data has not been released16. The rare bioinformatic data limited the deep understanding of how immune network of G. barbadense play role in defeating the invasion of V. dahliae.
In this study, we performed a comparison of the dynamic changes in the transcriptomes of resistant and susceptible G. barbadense varieties before and after inoculation with V. dahliae using RNA-seq. Our study demonstrates that the disease-resistant verity exhibits higher expression levels of genes involved in the suberin biosynthetic process, phenylpropanoid biosynthetic process, and other biological pathways associated with the establishment of cell physical barriers under normal growth conditions. Furthermore, we observed that the transcriptome response of the resistant variety is less sensitive even after invasion by Verticillium wilt, suggesting that its stronger physical barrier may play a significant role in conferring resistance against V. dahliae.
Methods
Plant material
Two Gossypium barbadense varieties, namely V. dahliae-resistant G. barbadense ‘Xinhai 47’(BR) and the V. dahliae-susceptible G. barbadense ‘Shidahaigan 1’ (BS) were selected for pathogenicity assays and RNA-seq analysis. To facilitate proper growth of the seedlings, the germinated seeds were incubated in controlled conditions in an incubator with a photoperiod of 16 hours of light and 8 hours of darkness; maintaining day and night temperatures of 25 °C and 23 °C respectively.
Disease resistance identification
Two-leaf-stage cotton seedlings were infected with the highly aggressive defoliating V. dahliae strain “V991” by using the root irrigation method23.The strain was kindly provided by Associate Professor Yanfei Sun (Shihezi University). The concentration of “V991” spore liquid was adjusted to 106 spores/mL by using a hemocytometer, followed by the application of 20 mL per plant into the nutrient bowls. Subsequently, the plants were incubated for further cultivation. At 14 days post-infection (dpi), the BR plants exhibited mild symptoms of Verticillium wilt, characterized by slight yellowing of a few lower leaves. In contrast, the BS plants displayed more severe symptoms, including significant yellowing and leaf loss, with some plants eventually succumbing to the infection (Fig. 1a). The investigation of disease index and quantification of fungal biomass was conducted with reference to previous studies23,24. The disease index of BS was notably higher than that of BR at both 14 and 21 dpi (Fig. 1b). Assessing the cotton stem sections revealed browning in both transverse and longitudinal stem sections of BS plants after 14 days of V. dahliae inoculation, indicating a more severe condition compared to BR plants (Fig. 1c). Additionally, quantification of fungal biomass in the stems illustrated significantly higher accumulation in BS plants compared to BR plants at 14 and 21 dpi (Fig. 1d). Overall, these results convincingly demonstrated the significantly higher resistance of BR to V. dahliae than to BS.
RNA Extraction, Library Preparation and Sequencing
Root samples from both the BR and BS genotypes were collected at 0, 12, 24 and 48 hours after V. dahliae inoculation separately. The samples were immediately frozen in liquid nitrogen. All samples were harvested from five cotton plants with three independent biological replicates. In total, 24 samples (2 genotypes × 4-time points × 3 biological replicates) were subjected to RNA-seq. For total RNA extraction, 0.1 g of the finely ground samples was accurately weighed and using the plant polysaccharide polyphenol total RNA extraction kit from TianGen Biotech Co., Ltd. (Beijing). From the extracted total RNA, 3 µg of qualified RNA was used for further processing. Specifically, mRNA was purified using magnetic beads with Okigo (dT) oligo for enrichment. Subsequently, the mRNA was fragmented, and first-strand cDNA synthesis was performed using a random primer with a six-base sequence. The second-strand cDNA synthesis was carried out by adding buffer, dNTPs, and DNA polymerase I. Subsequently, the double-stranded cDNA was purified using AMPure XP beads. To prepare the cDNA library, the purified double-stranded cDNA underwent end repair, A-tailing, and sequencing adapter ligation. Fragment size selection was performed using AMPure XP beads, and PCR amplification was conducted to obtain the cDNA library. The resulting library was subjected to RNA sequencing on the Illumina HiSeq 4000 platform, generating 150 bp paired-end reads. A total of 48 clean reads were obtained from the 24 cotton samples. The quality assessment of these raw RNA-seq reads (48 in total) was conducted using FastQC18 software, and the summarized results were generated using MultiQC software tool25. The raw sequencing data underwent data quality control with the fastp program, which involved the removal of sequences containing adapters, poly-N sequences, and low-quality data26.
Differential expression genes (DEGs) identification and GO enrichment
The clean reads were aligned to the Gossypium barbadense L. (Hai7124) genome27 using hisat2 with default parameters28. FeatureCounts (Version 2.0.1) was used to calculate the read count of each gene, and gene expression levels were quantified using TPM29 (transcripts per kilobase of exon model per million mapped reads). Differentially expressed genes (DEGs) were identified using the DESeq R package (1.18.0)30 based on the criteria of |log2(fold-change)| ≥ 2 and an adjusted P value < 0.05. The gene Ontology (GO) enrichment analysis of DEGs was performed using the clusterProfiler31 R package, with a significance threshold set at an adjusted p-value < 0.05.
Validation of DEGs by quantitative real-time PCR (qRT-PCR)
We selected 6 DEGs and designed gene specific primers online using the qPrimerDB website (https://biodb.swu.edu.cn/qprimerdb/), with amplified fragments ranging in length from 80 to 250 bp. The primers used in qRT-PCR were listed in Supplementary Table S1.
We performed the qRT-PCR experiment on Roche LightCycle 480II, using SYBR Green Mix. Additionally, GbUBQ7 (DQ116441.1) was used as the internal reference gene and the relative expression level of the target gene is determined through Livak’s 2−△△CT method32.
Data Records
24 raw RNA-seq data reported in this paper have been deposited in the Genome Sequence Archive (Genomics, Proteomics & Bioinformatics 2021) in National Genomics Data Center (Nucleic Acids Res 2021), China National Center for Bioinformation/Beijing Institute of Genomics, Chinese Academy of Sciences (GSA: CRA012121)33 that are publicly accessible at https://ngdc.cncb.ac.cn/gsa/browse/CRA012121. Other data, such as the breakdown of each sample file on GSA, lists of raw gene counts, TPM of the 24 samples, the information of DEGs and results of GO enrichment analysis are also available at figshare34. (https://doi.org/10.6084/m9.figshare.24097941.v2).
Technical Validation
Quality assessment of sequencing data
The MultiQC analysis revealed that the quality scores of all sequences exceeded 30(Fig. 2a). Moreover, the per-sequence quality scores predominantly concentrated within the range of 35–40 (Fig. 2b), indicating a base error rate of less than 0.03%. The GC distribution of all sequences exhibited a normal distribution closely resembling the theoretical distribution (Fig. 2c), providing further evidence of the high quality of the reads. Throughout the data processing, an average of 5.9 million adapter sequences were filtered out from each sample, resulting in an average sample size of clean reads of 8.83 Gb (range 6.96–10.88 Gb). The percentage of clean reads was calculated as 99.4% (Table 1).
Principal component analysis (PCA) and different basal transcriptome identification
To evaluate the transcriptional differences between the BR and BS varieties, we performed principal component analysis (PCA) using the TPM values from 24 samples. The results demonstrated a clear distinction between the BR and BS varieties, regardless of V. dahliae inoculation (Fig. 3a, Figshare File 234). Remarkably, even in the absence of V. dahliae inoculation, the BR and BS samples exhibited distinct separation based on PC1. This observation suggests a potential association between the transcriptional dissimilarities of the two varieties and their distinct levels of disease resistance. A total of 1117 DEGs were identified between the BR and BS varieties at 0 hours post-inoculation (hpi), prior to V. dahliae infection. Of these DEGs, 643 exhibited significantly higher expression levels in BR than in BS, whereas 474 displayed significantly lower expression levels in BR (Fig. 3b, Figshare File 334). Among the top 20 GO terms in 643 DEGs, genes associated with physical barrier formation, such as GbCYP86A1 (GB_A07G1113), GbCER1 (GB_A10G0724) in suberin biosynthetic process were identified. Furthermore, a GO term related to Verticillium wilt resistance, which included cellular response to antibiotic pathways and melatonin biosynthetic process, was also significantly enriched in 643 DEGs (Fig. 3c, Figshare File 434). The most enriched processes among the 474 DEGs with higher expression in BS were related to carbon-oxygen lyase activity and isoprenoid biosynthesis (Fig. 3d, Figshare File 534).
Analysis of differentially expressed genes (DEGs) involved in the response to V. dahliae inoculation
To investigate the molecular mechanisms underlying the resistance of different varieties of G. barbadense to V. dahliae infection, we compared the gene expression changes at 12, 24, and 48 hpi with an uninfected sample (0 hpi). A total of 1080, 469, 548 DEGs were identified at 12, 24, and 48 hpi in the BR variety respectively. Similarly, in the BS variety, we found 997, 1147, and 944 DEGs at the same time points (Fig. 4a, Figshare File 634). Interestingly, BR demonstrated a high number of induced genes at all three time points, whereas BS displayed the opposite pattern. Analysis revealed 1498 nonredundant DEGs in BR and 2129 in BS, in response to V. dahliae at all three time points post-inoculation (Fig. 4b-c, Figshare File 734).
Among the top 20 GO terms, 8 terms were shared between BR and BS. The most significantly enriched terms included photosynthesis, light reaction, and response to high light intensity (Fig. 5a, Figshare File 834). Additionally, BS exhibited specific enrichment in phenylpropanoid metabolic process, response to hydrogen peroxide, and flavonoid metabolic process. These GO terms have previously been reported to play a role in cotton resistance to V. dahliae (Fig. 5b, Figshare File 934). The findings suggest that BR displayed insensitivity to the pathogen, whereas BS exhibited a rapid activation of resistance-related transcriptome responses upon inoculation with V. dahliae.
Validation results of RNA-seq data by qRT-PCR
Although the log2(fold-change) value from qRT-PCR did not precisely match that of RNA-seq, there was a strong positive correlation (R2 = 0.854) observed between the results of RNA-seq and qRT-PCR expression analyses, indicating the reliability of the RNA-seq results (Fig. 6).
In summary, we have presented a collection of high-quality transcriptome data illustrating the response of G. barbadense to V. dahliae. Our preliminary analysis indicates a potential association between the resistance of G. barbadense to V. dahliae and the presence of physical barriers. We believe that this transcriptome data offers valuable insights into the molecular mechanisms that underlie cotton’s resistance to V. dahliae, providing a novel approach for identifying key genes involved in such resistance.
Code availability
The following software used in the study was run with default parameters if not specifically stated in the Methods paragraph and no custom code was used for the purposes of this study.
FastQC: https://www.bioinformatics.babraham.ac.uk/projects/fastqc (version 0.11.9).
fastp: https://github.com/OpenGene/fastp(version 0.20.0).
hisat2: https://daehwankimlab.github.io/hisat2/download(version 2.2.1).
FeatureCounts: https://sourceforge.net/projects/subread(version 2.0.4).
Differential analysis: R package DESeq 2 1.38.3.
clusterProfile: R package clusterProfiler v4.6.2.
Veen: https://hiplot.cn/basic/venn.
References
Xiao, G., Zhao, P. & Zhang, Y. A pivotal role of hormones in regulating cotton fiber development. Front Plant Sci 10, 87, https://doi.org/10.3389/fpls.2019.00087 (2019).
Zhu, Y. et al. Interactions between Verticillium dahliae and cotton: pathogenic mechanism and cotton resistance mechanism to Verticillium wilt. Front Plant Sci 14, 1174281, https://doi.org/10.3389/fpls.2023.1174281 (2023).
Chen, X. et al. Long-term increases in continuous cotton yield and soil fertility following the application of cotton straw and organic manure. Agronomy 13, 2133, https://doi.org/10.3390/agronomy13082133 (2023).
Shaban, M. et al. Physiological and molecular mechanism of defense in cotton against Verticillium dahliae. Plant Physiol Biochem 125, 193–204, https://doi.org/10.1016/j.plaphy.2018.02.011 (2018).
Zhang, X. et al. Transcriptomic analysis of gene expression of Verticillium dahliae upon treatment of the cotton root exudates. BMC Genomics 21, 155, https://doi.org/10.1186/s12864-020-6448-9 (2020).
Fradin, E. F. & Thomma, B. P. Physiology and molecular aspects of Verticillium wilt diseases caused by V. dahliae and V. albo-atrum. Mol Plant Pathol 7, 71–86, https://doi.org/10.1111/j.1364-3703.2006.00323.x (2006).
Klimes, A., Dobinson, K. F., Thomma, B. P. & Klosterman, S. J. Genomics spurs rapid advances in our understanding of the biology of vascular wilt pathogens in the genus Verticillium. Annu Rev Phytopathol 53, 181–198, https://doi.org/10.1146/annurev-phyto-080614-120224 (2015).
Wang, J. et al. The Verticillium dahliae small cysteine-rich protein VdSCP23 manipulates host immunity. Int J Mol Sci 24, https://doi.org/10.3390/ijms24119403 (2023).
Zhao, F. & Dai, X. Infection process of Verticillium dahliae klebahn in cotton. Genomics and Applied Biology 28, 786–792 (2009).
Wen, X. et al. A comprehensive overview of cotton genomics, biotechnology and molecular biological studies. Sci China Life Sci https://doi.org/10.1007/s11427-022-2278-0 (2023).
Viot, C. R. & Wendel, J. F. Evolution of the cotton genus, Gossypium, and its domestication in the Americas. Crit. Rev. Plant Sci. 42, 1–33, https://doi.org/10.1080/07352689.2022.2156061 (2023).
Lacape, J. M. et al. A new interspecific, Gossypium hirsutum x G. barbadense, RIL population: towards a unified consensus linkage map of tetraploid cotton. Theor Appl Genet 119, 281–292, https://doi.org/10.1007/s00122-009-1037-y (2009).
Park, Y. H. et al. Genetic mapping of new cotton fiber loci using EST-derived microsatellites in an interspecific recombinant inbred line cotton population. Mol Genet Genomics 274, 428–441, https://doi.org/10.1007/s00438-005-0037-0 (2005).
Li, R. et al. Genome resource for the Verticillium wilt resistant Gossypium hirsutum cultivar Zhongzhimian No. 2. Mol Plant Microbe Interact 36, 68–72, https://doi.org/10.1094/mpmi-10-22-0205-a (2023).
He, L. et al. Advanced genes expression pattern greatly contributes to divergence in Verticillium wilt resistance between Gossypium barbadense and Gossupium hirsutum. Front Plant Sci 13, 979585, https://doi.org/10.3389/fpls.2022.979585 (2022).
Wang, F. et al. Multicopper oxidases GbAO and GbSKS are involved in the Verticillium dahliae resistance in Gossypium barbadense. J Plant Physiol 280, 153887, https://doi.org/10.1016/j.jplph.2022.153887 (2023).
Gao, W. et al. Proteomic and virus-induced gene silencing (VIGS) Analyses reveal that gossypol, brassinosteroids, and jasmonic acid contribute to the resistance of cotton to Verticillium dahliae. Mol Cell Proteomics 12, 3690–3703, https://doi.org/10.1074/mcp.M113.031013 (2013).
Sun, Q. et al. Analysis of sea-island cotton and upland cotton in response to Verticillium dahliae infection by RNA sequencing. BMC Genomics 14, 852, https://doi.org/10.1186/1471-2164-14-852 (2013).
Xiong, X. P. et al. The cotton lignin biosynthetic gene Gh4CL30 regulates lignification and phenolic content and contributes to Verticillium wilt resistance. Mol Plant Microbe Interact 34, 240–254, https://doi.org/10.1094/mpmi-03-20-0071-r (2021).
Xu, L. et al. Lignin metabolism has a central role in the resistance of cotton to the wilt fungus Verticillium dahliae as revealed by RNA-Seq-dependent transcriptional analysis and histochemistry. J Exp Bot 62, 5607–5621, https://doi.org/10.1093/jxb/err245 (2011).
Zhang, W. et al. Large-scale identification of Gossypium hirsutum genes associated with Verticillium dahliae by comparative transcriptomic and reverse genetics analysis. PLoS One 12, e0181609, https://doi.org/10.1371/journal.pone.0181609 (2017).
Zhang, Y. et al. Transcriptome profiling of Gossypium barbadense inoculated with Verticillium dahliae provides a resource for cotton improvement. BMC Genomics 14, 637, https://doi.org/10.1186/1471-2164-14-637 (2013).
Xiong, X. et al. The cotton WRKY transcription factor GhWRKY70 negatively regulates the defense response against Verticillium dahliae. Crop Journal 7, 393–402, https://doi.org/10.1016/j.cj.2018.10.005 (2019).
Atallah, Z. K., Bae, J., Jansky, S. H., Rouse, D. I. & Stevenson, W. R. Multiplex real-time quantitative PCR to detect and quantify Verticillium dahliae colonization in potato lines that differ in response to Verticillium wilt. Phytopathology 97, 865–872, https://doi.org/10.1094/phyto-97-7-0865 (2007).
Ewels, P., Magnusson, M., Lundin, S. & Kaller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics 32, 3047–3048, https://doi.org/10.1093/bioinformatics/btw354 (2016).
Chen, S. F., Zhou, Y. Q., Chen, Y. R. & Gu, J. Fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, 884–890, https://doi.org/10.1093/bioinformatics/bty560 (2018).
Hu, Y. et al. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat Genet. 51, 739–748, https://doi.org/10.1038/s41588-019-0371-5 (2019).
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc 11, 1650–1667, https://doi.org/10.1038/nprot.2016.095 (2016).
Zhao, Y. et al. TPM, FPKM, or normalized counts? A comparative study of quantification measures for the analysis of RNA-seq data from the NCI patient-derived models repository. J Transl Med 19, 269, https://doi.org/10.1186/s12967-021-02936-w (2021).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550, https://doi.org/10.1186/s13059-014-0550-8 (2014).
Wu, T. et al. ClusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation (Camb) 2, 100141, https://doi.org/10.1016/j.xinn.2021.100141 (2021).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method. Methods 25(4), 402–408, https://doi.org/10.1006/meth.2001.1262 (2001).
Genome Sequence Archive https://ngdc.cncb.ac.cn/gsa/browse/CRA012121 (2023).
Xiong, X. & Sun, C. Transcriptome data of samples from two Gossypium barbadense varieties before and after Verticillium dahliae inoculation. Figshare. https://doi.org/10.6084/m9.figshare.24097941.v2 (2023).
Acknowledgements
This project was supported by the China Postdoctoral Science Foundation (Grant No. 2021M703552), the Natural Science Foundation of Hubei Provincial Department of Science and Technology (2023AFB931), and the Natural Science Foundation of Hubei Provincial Department of Education (Q20224301).
Author information
Authors and Affiliations
Contributions
F.X. and C.F. designed the study, revised the manuscript. J.S. revised the manuscript. X.X. and C.S. performed the experiments and wrote the original draft. B.C. performed the sample collection, RNA isolation.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interest.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Xiong, X., Sun, C., Chen, B. et al. Transcriptomic datasets of Verticillium wilt resistant and non-resistant Gossypium barbadense varieties during pathogen inoculation. Sci Data 11, 11 (2024). https://doi.org/10.1038/s41597-023-02852-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-023-02852-2