Abstract
Sex chromosomes are highly variable in some taxonomic groups, but the evolutionary mechanisms underlying this diversity are not well understood. In terrestrial isopod crustaceans, evolutionary turnovers in sex chromosomes are frequent, possibly caused by Wolbachia, a vertically-transmitted endosymbiont causing male-to-female sex reversal. Here, we use surgical manipulations and genetic crosses, plus genome sequencing, to examine sex chromosomes in the terrestrial isopod Trachelipus rathkei. Although an earlier cytogenetics study suggested a ZZ/ZW sex chromosome system in this species, we surprisingly find multiple lines of evidence that in our study population, sex is determined by an XX/XY system. Consistent with a recent evolutionary origin for this XX/XY system, the putative male-specific region of the genome is small. The genome shows evidence of Y-linked duplications of the gene encoding the androgenic gland hormone, a major component of male sexual differentiation in isopods. Our analyses also uncover sequences horizontally acquired from past Wolbachia infections, consistent with the hypothesis that Wolbachia may have interfered with the evolution of sex determination in T. rathkei. Overall, these results provide evidence for the co-occurrence of multiple sex chromosome systems within T. rathkei, further highlighting the relevance of terrestrial isopods as models for the study of sex chromosome evolution.
Similar content being viewed by others
Introduction
Although sexual reproduction is shared by most eukaryotes, a variety of different cues can trigger individuals to follow a male, female, or hermaphroditic developmental plan (Conover and Kynard 1981; Tingley and Anderson 1986; Janzen and Phillips 2006; Ospina-Álvarez and Piferrer 2008; Verhulst et al. 2010). In many eukaryotes, sex is primarily determined genotypically, often involving sex chromosomes, although other mechanisms, such as polygenic systems and haplodiploidy, are also known (Vandeputte et al. 2007; Heimpel and de Boer 2008). Sex chromosomes in animals are usually grouped into two main classes: XY systems, in which males are heterogametic (XY) and females are homogametic (XX); and ZW systems, in which females are heterogametic (ZW) and males are homogametic (ZZ). However, non-genetic cues can also play an important role in some species. For instance, environmental factors, such as temperature or population density, influence or determine phenotypic sex in reptiles, fishes, and invertebrates (Conover and Kynard 1981; Tingley and Anderson 1986; Janzen and Phillips 2006). In some cases, cytoplasmic factors, including sex-reversing endosymbionts, such as Wolbachia, microsporidia, and paramyxids can serve as a sex-determining signal (Terry et al. 1998; Bouchon et al. 1998; Kageyama et al. 2002; Negri et al. 2006; Pickup and Ironside 2018).
Evolutionary theory holds that the formation of sex chromosomes begins when an autosome acquires a sex-determining locus (Rice 1996). Subsequently, recombination around the sex-determining locus is selected because of sexually antagonistic selection (Bergero and Charlesworth 2009). For instance, selection should favor mutations that are beneficial in males but deleterious in females when those alleles are linked to a dominant male-determining allele; recombination, on the other hand, would break up this linkage and result in females that carry these male-beneficial alleles. The non-recombining region is then expected to spread in the presence of continued sexually antagonistic selection, and may eventually span the whole sex chromosome, except for the usual presence of a small recombining pseudo-autosomal region (Charlesworth et al. 2005). Once recombination has ceased, the non-recombining sex chromosome, such as the Y chromosome in mammals or the W chromosome in birds, is expected to degenerate. Non-recombining genes frequently undergo pseudogenization, acquiring nonsense mutations or transposable element insertions (Charlesworth and Charlesworth 2000). At the same time, gene trafficking can occur when selection promotes the translocation of formerly autosomal genes to the sex chromosomes (Emerson et al. 2004).
Different species appear to be at different stages of sex chromosome evolution. For instance, the sex chromosomes of therian mammals are highly conserved, having originated ~160 million years ago (Potrzebowski et al. 2008; Veyrunes et al. 2008). The highly degenerated, heteromorphic Y chromosome represents an advanced stage of sex chromosome evolution. In other taxonomic groups, on the other hand, sex chromosomes appear to undergo more frequent evolutionary turnovers (Ross et al. 2009; Cioffi et al. 2013; Vicoso and Bachtrog 2015; Myosho et al. 2015; Pennell et al. 2018; Jeffries et al. 2018). Such young sex chromosomes may have little or no recombination suppression, differentiation in gene content, or sex chromosome dosage compensation, and may not be detectable by traditional cytogenetic methods because they are visually indistinguishable (homomorphic) (Gamble et al. 2014; Vicoso and Bachtrog 2015). Sex chromosomes may even be polymorphic within a species, with different sex-determining loci segregating within or among populations (Orzack et al. 1980; Traut 1994; Ogata et al. 2008; Meisel et al. 2016).
Unfortunately, we still have a limited understanding of why evolutionary turnovers of sex chromosomes are rare in some groups but frequent in others. A variety of models have been proposed to explain why these turnovers occur, including sexual antagonism, where a novel sex-determining allele spreads because of its association with another allele with sex-specific effects (van Doorn and Kirkpatrick 2007); the accumulation of deleterious mutations on the nonrecombining sex chromosome (Blaser et al. 2013); and the “hot potato” model, which suggests that the accumulation of both sexually antagonistic and deleterious mutations can lead to repeated sex chromosome turnovers (Blaser et al. 2014). In some organisms, interactions with vertically transmitted reproductive endosymbionts are also thought to influence the evolution of their hosts’ sex determination mechanisms (Rigaud et al. 1997; Cordaux et al. 2011). However, many of these models have been difficult to test in nature. This problem is exacerbated by the fact that, while sex chromosomes have been extensively studied in model organisms like Drosophila, studies are more sparse in non-model organisms.
One group that has received relatively little attention is crustaceans. Different crustacean species show a variety of distinct sex-determining mechanisms, yet there are very few crustacean species in which candidate master sex-determining genes have been identified (Chandler et al. 2017, 2018). Within crustaceans, perhaps one of the best-studied groups in terms of sex determination is the terrestrial isopods (Oniscidea). Terrestrial isopod species have a mix of XY and ZW systems, along with reports of a few parthenogenic species and populations (Fussey 1984; Johnson 1986; Rigaud et al. 1997). The bacterial endosymbiont Wolbachia also influences sex determination by causing male-to-female sex reversal in some isopod hosts (Bouchon et al. 1998; Cordaux et al. 2004). In fact, interactions with Wolbachia are thought to drive rapid evolutionary turnover of the sex chromosomes in terrestrial isopods. For instance, in the common pillbug Armadillidium vulgare, a copy of the Wolbachia genome horizontally integrated into the host genome (known as the f element) led to the origin of a new W chromosome (Leclercq et al. 2016). A recent phylogenetic analysis also identified several transitions in heterogametic systems along the isopod phylogeny, including closely related species pairs with different sex chromosome systems (Becking et al. 2017). Moreover, only a few species of terrestrial isopods are known to have heteromorphic sex chromosomes, in which the X and Y, or Z and W, chromosomes are distinguishable in cytogenetics experiments (Rigaud et al. 1997), and WW or YY individuals are often viable and fertile (Juchault and Rigaud 1995; Becking et al. 2019), suggesting that the W and Y chromosomes have not lost any essential genes in these species.
In this study, we examined sex determination in the widespread species Trachelipus rathkei. This species was previously established by cytogenetic methods to have heteromorphic, albeit slightly, Z and W sex chromosomes (Mittal and Pahwa 1980), and is nested within a clade that appears ancestrally to possess a ZZ/ZW sex determination mechanism (Becking et al. 2017). We sought to confirm female heterogamety by crossing females to sex-reversed males (which have female genotypes but male phenotypes), and assessing the sex ratio of the resulting progenies, which will differ depending on the sex chromosome system (XX neo-male × XX female yields all XX and therefore 100% female offspring; ZW neo-male × ZW female expected to produce 1/4 ZZ, 1/2 ZW, and 1/4 WW offspring, thus 75% female or 66.7% female depending on whether WW genotypes are viable). Surprisingly, we found that, at least in our focal population, sex is determined by an XX/XY system, suggesting a recent sex chromosome turnover. To test this hypothesis, we performed whole-genome sequencing. Consistent with a recent origin of an XX/XY sex-determination system, we find evidence that the putative male-specific region is small relative to the whole genome, and we identified a male-specific, partial duplication of the androgenic gland hormone (AGH) gene, a rare example of a candidate sex-determining gene in a crustacean. In addition, although our study population does not appear to harbor current Wolbachia infections, we find genomic evidence of past infections. Overall, our results are consistent with the hypothesis that Wolbachia endosymbionts may have interfered with the evolution of sex determination in T. rathkei.
Methods
Animal collection and husbandry
We sampled wild isopods from Rice Creek Field Station (RCFS) at SUNY Oswego in Oswego, NY. We captured animals using a combination of methods. First, we haphazardly searched through leaf litter, logs, and rocks. We also used “potato traps”, made by carving out a 1–2 cm diameter core from a potato and placing it in the litter for 1–2 weeks. Finally, we constructed pitfall traps from plastic cups buried in the ground with the rim of the cup flush with the ground. The primary species captured were Oniscus asellus and T. rathkei, but we also captured Philoscia muscorum, Hyloniscus riparius, Trichoniscus pusillus, and occasionally Cylisticus convexus. Species identification was performed in the field and confirmed in the lab, where we also determined the phenotypic sex of specimens.
Isopods were housed in plastic food storage containers with holes in the lids for air exchange, on a substrate of moistened soil. Containers were checked twice weekly. Animals were fed carrots and dried leaves ad libitum. The photoperiod was kept on a schedule of 18:6 light hours: dark hours in the summer and 14:10 in the winter. We isolated ovigerous females in individual containers and separated offspring from their mothers upon emergence from the marsupium. We initially sexed offspring at 6–8 weeks old and separated males from females to prevent sibling mating. We then double-checked offspring sex at roughly 2 weeks intervals thereafter until 4 months of age to watch for individuals that might have shown late signs of sexual differentiation. Terrestrial isopods are known to store sperm from a single mating to fertilize future broods. Therefore, for experimental crosses, we only used T. rathkei females that were born in the lab, separated from brothers as soon as they could be sexed, and which had not produced any offspring by 12 months of age.
Wolbachia testing
We used PCR assays to test for Wolbachia presence in T. rathkei individuals. DNA was extracted from one or two legs, depending on the size of the animal. We ruptured the leg tissue in 400 μL deionized water along with a few 0.5 mm zirconia/silica beads (enough to cover the bottom of the tube) using a bead beater machine. Samples were lysed following a protocol of 2500 RPM for 10 s, followed by 4200 RPM for 10 s, and finally 4800 RPM for 10 s. The tube was then visually inspected to confirm the leg was sufficiently pulverized. We then transferred the lysate to a new tube, added 60 µL of a 5% Chelex® 100 molecular biology grade resin suspension, and incubated it for 15 min at 100 °C. After incubation, we centrifuged the extract at 16,000 g for 3 min and reserved 80 µL of supernatant for PCR testing. We confirmed successful DNA extraction using the mitochondrial primers HCO2198/LCO1490 (Folmer et al. 1994). We performed PCRs in 10 µL reactions, using 4.95 µL of molecular biology grade water, 2 µL NEB OneTaq Buffer, 1 µL of mixed dNTPS at a final concentration of 2 mM for each dNTP, 1 µL of a 5 µM solution of each primer, and 0.05 µL of NEB OneTaq. For the mitochondrial primer set, PCR conditions included an initial denaturation of 94 °C for 1 min; five cycles of 94 °C denaturations for 30 s, 45 °C annealings for 90 s, and 68 °C extensions for 60 s. The samples then underwent 35 cycles of 94 °C for 30 s, 51 °C for 90 s, and 68 °C for 60 s. This was followed by a final extension step of 68 °C for 5 min. To test for Wolbachia, we performed PCR using Wolbachia-specific primers targeting the wsp (81f/691r) and ftsZ (ftsZf1/ftsZr1) genes (Werren et al. 1995; Braig et al. 1998). We performed PCRs in 10 µL reactions, using 4.95 µL of molecular biology grade water, 2 µL NEB OneTaq Buffer, 1 µL of mixed dNTPs at a final concentration of 2 mM for each dNTP, 1 µL of either wsp or ftsZ primers, and 0.05 µL of NEB OneTaq. PCR conditions contained an initial denaturation of 95 °C for 5 min, followed by 36 cycles of 95 °C for 60 s, 54 °C for 60 s, and 68 °C for 3 min. This was followed by a final extension step at 68 °C for 10 min. Positive PCR tests would not necessarily be able to distinguish between true infection and a copy of the Wolbachia genome horizontally integrated into the host genome, but the absence of a PCR product should be a reliable indicator that these Wolbachia sequences are not present (at least at detectable levels).
Androgenic gland implantation and crosses
To test whether sex is determined by a ZZ/ZW or XX/XY system of sex determination in our population of T. rathkei, we performed crosses between females and experimentally sex-reversed neo-males. Juvenile female T. rathkei were implanted with live androgenic glands, according to Becking et al. (2017). Male donors and female recipients were selected from large lab-reared broods with even (~1:1) sex ratios. An adult male was sacrificed by decapitation, and live androgenic glands were dissected into Ringer solution (393 mM NaCl, 2 mM KCl, 2 mM CaCl2·2H2O, 2 mM NaHCO3). Female recipients were between 5 and 8 weeks old, an age at which males and females begin to become distinguishable by the appearance of external male genitalia, but at which sexual development is not complete. Occasionally, young males may still be mistaken for females at this stage, if the male genitalia is not sufficiently developed, and thus it is common for some fraction of the recipients in these experiments to carry male genotypes (Becking et al. 2017); nevertheless, it is important for transplant recipients to be as young as possible for sex reversal to be complete. The androgenic gland was injected using a pulled glass pipette into a hole pierced with a dissecting needle in the sixth or seventh segment of the juvenile female’s person. Recipients were isolated in a small plastic container with a moist paper towel for recovery and observation. Experimental animals were monitored for signs of male development. Any surviving animal that failed to develop male genitalia by 4 months post-implantation was considered to be a failed injection. After maturation, adult neo-males were placed in individual containers with 1–3 previously unmated females. Crosses were monitored twice weekly to check for signs of reproduction in females. Gravid females were then isolated into their own containers until parturition.
Genome sequencing
We performed whole-genome sequencing using a combination of Illumina, PacBio, and Oxford Nanopore sequencing, with multiple sequencing samples of each sex (Supplementary Table 1). Because we expected the T. rathkei genome to be large, repetitive, and highly polymorphic, and because we expected to need to isolate DNA from multiple individuals, we established a partially inbred laboratory line using offspring from a single female collected from RCFS. We mated brothers and sisters from this female for two generations in the lab prior to collecting genetic samples from the third generation for sequencing. DNA was collected for sequencing using the Qiagen DNEasy Blood and Tissue Kit following the manufacturer’s instructions. DNA was quantified using the Qubit DNA Broad Range assay kit, and the A260/280 value was checked with a Nanodrop spectrophotometer. Samples were stored at −80 °C prior to being shipped to the sequencing center. Illumina sequencing was performed at the State University of New York at Buffalo Genomics and Bioinformatics Core Facility.
For PacBio sequencing, we had to pool DNA from multiple individuals to obtain sufficient quantities of DNA for library preparation. We performed separate DNA extractions from three individuals of each sex as above. Then, we pooled the DNA from the three individuals of each sex and concentrated it using Ampure XP beads (Beckman-Coulter). Briefly, we washed the beads three times in molecular biology grade water, once in Qiagen buffer EB, and finally re-suspended the beads in their original buffer. We then added equal volumes of Ampure XP suspension to the DNA samples, mixed them on a shaker for 15 min, placed the tubes on a magnetic bead separator, and removed the supernatant. We washed the beads twice with 1.5 mL of 70% ethanol, and finally eluted the DNA samples in 30 µL of Qiagen buffer EB. Sequencing libraries were prepared and sequenced at the University of Delaware Sequencing and Genotyping Center on a PacBio RSII.
We also supplemented our PacBio dataset with Oxford Nanopore sequencing data. We isolated DNA from a single T. rathkei female and two separate males using a Qiagen DNEasy Kit as described above. We then performed sequencing on Oxford Nanopore Minion flowcells (R9.4) with the Rapid Sequencing Kit (SQK-RAD004) following the manufacturer’s instructions.
Genome assembly
We performed a hybrid assembly combining the short- and long-read sequence data, closely mirroring the bioinformatics pipeline used to generate previously published isopod genome assemblies (Chebbi et al. 2019; Becking et al. 2019). We first removed adapters and trimmed the Illumina sequencing data using Trimmomatic v. 0.36 (Bolger et al. 2014); we removed leading and trailing bases, as well as internal windows of at least 4 bp, with a mean quality score of 5 or lower, and discarded any reads shorter than 36 bp after trimming. We then used SparseAssembler (Ye et al. 2012) to assemble the cleaned Illumina data from sample Mpool. This dataset was chosen because it had decent coverage (~38×), was generated from a PCR-free sequencing library, and came from male samples (so that Y chromosome sequences would be present); additional male sequencing samples were excluded from this initial assembly to minimize the number of sequence polymorphisms that would be present in the data with additional samples. This first assembly was performed using two different kmer sizes (k = 51 and k = 61). After performing preliminary quality checks using Quast (Gurevich et al. 2013), we decided to proceed with the k = 61 assemblies, which had a longer total length and N50 (Supplementary Table 2). However, because we suspected the genome might still contain high levels of heterozygosity despite two generations of inbreeding, we used Redundans (Pryszcz and Gabaldón 2016) to remove putative allelic contigs from the Illumina-only assembly; we set identity and overlap thresholds of 95%.
Prior to performing hybrid assembly, we used the short reads to correct sequencing errors in the long reads using FMLRC (Wang et al. 2018) with the default settings, except requiring a minimum count of 3 to consider a path (−m 3). PacBio and Oxford Nanopore reads derived from female isopods were corrected using Illumina sample Fpool, while long reads from male samples were corrected using sample Mpool.
We next performed hybrid assembly using DBG2OLC (Ye et al. 2016), which accepts a short-read assembly (rather than raw short-read sequence data) and long-read sequence data (in this case, our combined PacBio and Oxford Nanopore reads) as input. We tested out a range of different parameter values: from the Redundans-filtered assembly, we first removed contigs less than 100 bp or 200 bp; we tested kmer sizes of 17 and 19; for the kmer coverage threshold, we tried values of 2 and 5; and for the minimum overlap, we tried values of 10 and 30. We used an adaptive threshold of 0.01. These assemblies ranged in size from ~5.2 Gb to 8.5 Gb; we selected three assemblies across the range of total sizes for further processing.
We next corrected errors in these assemblies, caused by the relatively high error rates in long-read sequence data. In the standard DBG2OLC pipeline, the resulting contigs are corrected using the contigs from the short-read assembly and from the long reads using Sparc (Ye and Ma 2016); however, in our initial attempts, large portions of the assemblies went uncorrected, perhaps because we had relatively low-coverage long-read data. Therefore, instead, we performed three rounds of error correction using Pilon (Walker et al. 2014), by mapping the trimmed Illumina sequence reads to each assembly using bbmap (first two rounds; Bushnell et al. 2017) and bwa mem (third round; with the parameters −A 1 −B 1 −O 1 −E 1 −k 11 −W 20 −d 0.5 −L 6 for mapping to an error-prone assembly; Li 2013).
Finally, we assessed the quality of each of the three candidate assemblies using BUSCO v.3.0.2 (Simão et al. 2015), with the arthropod reference gene set, and selected the assembly with the greatest number of BUSCO reference genes present for further analysis.
To remove contaminants from the final assembly, we generated blob plots using Blobtools v.1.0 (Laetsch and Blaxter 2017). To accomplish this, we BLASTed all contigs against the NCBI nucleotide (nt) database using megablast (Morgulis et al. 2008) and against Uniprot reference proteomes using diamond blastx (Buchfink et al. 2015). We then removed any contigs that were identified as coming from plants, fungi, viruses, or bacteria, except for those matching Wolbachia.
Genome annotation
We used RepeatModeler v.1.0.10, which uses RECON (Bao and Eddy 2002), RepeatScout (Price et al. 2005), and Tandem Repeat Finder (Benson 1999), to construct a custom repeat library for T. rathkei. Because we were unable to run RepeatModeler successfully using the full assembly, we ran RepeatModeler on a random subset of 40% of the contigs; this should still successfully identify the most repetitive elements in the genome as long as all repeat families are well represented in the subset. We then masked the assembly using RepeatMasker 4.0.7 and used the data generated by RepeatMasker on the divergence between each individual repeat and the consensus sequence for that family to examine the history of transposable element activity in T. rathkei (Tarailo-Graovac and Chen 2009).
We annotated coding sequences using the MAKER pipeline (Cantarel et al. 2008). We initially ran MAKER v2.31.9 using assembled transcript sequences (est2genome = 1) from previously available data from one wild-caught male and one wild-caught female T. rathkei from the same population (Becking et al. 2017). This transcriptome was generated by assembling the male and female samples separately, each with two assemblers, Trinity v2.8.4 (Grabherr et al. 2011) and TransLiG v1.3 (Liu et al. 2019), and then merging and filtering the transcriptome assemblies with EvidentialGene v4 (Gilbert 2019). This initial annotation also incorporated protein alignments against Uniprot-Swissprot (version March 2020), and the resulting output was used to train SNAP (Korf 2004). To train AUGUSTUS (Stanke et al. 2006) we used the output from the BUSCO quality assessment described earlier. We then completed a final round of MAKER using the trained gene models, retaining the transcript and protein alignments from earlier as evidence.
Development of sex-linked PCR markers
We used multiple approaches to develop male-specific, putatively Y-linked PCR markers. Initial attempts to perform a hybrid Illumina-PacBio genomic assembly with only male samples and then identify contigs with zero coverage in females were unsuccessful. We therefore developed a complementary approach by looking for male-specific k-mers using just the raw Illumina sequencing reads. We chose a value of k = 21 because it should be large enough that most k-mers will not occur more than once in the genome sequence, yet small enough to minimize the impact of sequencing errors (Vurture et al. 2017). We used kmc v.3.1.0 (Kokot et al. 2017) to count all the canonical 21-mers in each of the Illumina sequencing datasets (in other words, each 21-mer and its reverse complement were considered to be the same k-mer during counting). We then searched for k-mers that occurred at least eight times in the Mpool Illumina sequencing dataset and a total of at least three times combined across the lower coverage M2, M5, M6, and wildM samples, but which were completely absent from all-female samples. We then extracted all Illumina sequence reads containing these candidate male-specific k-mers using mirabait v.4.0.2 (Chevreux et al. 1999) and assembled them using Spades v.3.11.1 (Bankevich et al. 2012). We also performed a reciprocal analysis looking for female-specific k-mers (which are not expected in an XX/XY system), searching for k-mers that occurred at least eight times in the Fpool Illumina sequencing dataset, a total of at least three times combined across the F3, F4, and wildF samples, but which were completely absent from all-male samples.
To test male-specificity of these contigs, we used PCR. We developed PCR primers for a subset of candidate male-specific contigs. To identify the best candidates, we first mapped raw sequencing reads from all male and female Illumina samples to the full genome sequence plus the candidate male-specific contigs, and identified contigs that had coverage in male samples but not female samples; we also avoided contigs that showed evidence of containing repeat elements, after BLAST searches against the whole genome assembly. We designed primers using PRIMER3 (Untergasser et al. 2012). In these PCRs, primers were initially screened using template DNA from two male samples and two female samples; primers that showed evidence of sex specificity after this first PCR were re-tested using a larger number of samples. PCR primers were initially tested using a cycle of 98 °C for 3 min, followed by 40 cycles of 98 °C for 15 s, 50 °C for 35 s, and 68 °C for 60 s; this was followed by a final extension step of 68 °C for 10 min. For samples that did not amplify under this program, a gradient PCR was run to determine optimal annealing temperature. All PCRs were performed using the same recipe and reaction conditions as the Wolbachia PCRs described above.
We also identified open reading frames (ORFs) in these candidate male-specific contigs using Transdecoder v.4.0.0 (Haas and Papanicolaou 2016) and annotated the ORFs using Trinotate v.3.1.1 (Bryant et al. 2017). Subsequently, we designed additional primers targeting one of the candidates ORFs (F: 5′–ATTCTTGACTCTCCCCACGA–3′; R: 5′–TCTCCAACTACGATTTCGTTAATT–3′).
Results
No Wolbachia and balanced sex ratios in T. rathkei
Among the 100+ individuals captured and tested between 2015 and 2017, no T. rathkei from RCFS conclusively tested positive for Wolbachia. This was not due to inadequate testing protocols; for instance, a captive population of Porcellio laevis housed in our lab shows nearly a 100% infection rate using the same methods (not shown). Approximately 150 T. rathkei broods were raised in the lab from either mated, wild-caught females or first-generation crosses. The mean and median brood sizes of this species in our lab were 27.1 and 22.5 offspring, respectively, and the vast majority of these broods had a balanced sex ratio (Supplementary Table 3). Thus, the prevalence of Wolbachia and other sex ratio distorters is at most very low in this population of T. rathkei. In addition, some wild-caught females produced broods even after several months to a year in isolation in the lab (Supplementary Table 3), confirming that this species is capable of long-term sperm storage.
Crossing sex-reversed individuals indicate an XY sex-determination system
Five juveniles implanted with androgenic glands survived to mature into males; they were crossed with virgin females from families with normal sex ratios. Each putative neo-male was paired with 2–3 females, and each female produced 1–3 broods of offspring. Two of these males sired broods with balanced sex ratios (not significantly different from the null hypothesis of a 1:1 ratio of males to females; Table 1). These males were likely individuals that would have developed into males even without the AG implantation but were initially misidentified as juvenile females probably due to incomplete sexual differentiation at that early stage. Thus, these crosses are uninformative with respect to the sex-determination system. A similar rate of “failed” crosses was observed in a recent study following identical protocols in other isopod species, including species without any evidence of sex chromosome polymorphism, suggesting that sexing juveniles for AG implantation at these early stages is difficult due to the possibility of incomplete sexual differentiation (Becking et al. 2017). Three other males produced only female offspring, consistent with an XX/XY system (XX neo-male × XX female yields all XX and therefore 100% female offspring) but not a ZZ/ZW system (ZW neo-male × ZW female expected to produce 1/4 ZZ, 1/2 ZW, and 1/4 WW offspring, thus 75% female or 66.7% female depending on whether WW genotypes are viable; Table 1).
Genome assembly
All sequencing data have been deposited in the NCBI SRA under the project accession number PRJNA633105 (sequencing runs are SRR11797353-SRR11797365, SRR4000573, and SRR4000567); the draft assembly has been deposited under accession number GCA_015478945.1. The draft genome assembly of T. rathkei is approximately 5.2 Gb in total length. The genome is highly repetitive, consisting of ~70% repetitive elements. Transposable elements constitute the largest repeat category, with LINEs, followed by DNA elements and LTRs, being the most represented (Fig. 1). All repeat families seem to have a single divergence peak of around 7–10% (Fig. 1).
Despite its large size, the draft assembly is likely only partially complete, with ~25% of arthropod BUSCO genes missing (Table 2). For an independent assessment of assembly completeness, we also estimated the proportion of transcripts from the previously available transcriptome dataset that were present in the assembly. There were 15,805 transcripts assembled from that previously available transcriptome whose best hits in blastn searches against the NCBI nt database and diamond blastx searches against Uniprot-Swissprot were from other arthropods (and thus were unlikely to come from bacterial contaminants); of those, only 53% had nearly full-length matches in the genome (≥90% of the transcript length at ≥90% sequence identity), suggesting some missing data and/or remaining uncorrected sequencing errors in the draft assembly as well. At a more relaxed cutoff of at least 50% of the transcript having BLAST hits in the genome at ≥90% sequence identity, then 91% of transcripts appear to be represented somewhere in the assembly, suggesting that most transcripts are indeed present in the assembly but maybe partial fragments rather than whole gene sequences.
We screened the T. rathkei genome for Wolbachia nuclear insertions by BLASTing the assembled contigs against a collection of Wolbachia genome sequences and then BLASTing the matching regions against all representative bacterial genomes from RefSeq to rule out false positives. After this filtering step, we were left with 1010 high confidence matches (best BLAST hit in a Wolbachia genome, e-value < 1 × 10−6) spread across 719 contigs, with a total length of ~350 kb for the matching sequences (Supplementary Table 4), much smaller than a typical full Wolbachia genome of about 1–1.6 Mb on average (Sun et al. 2001). These may represent independent small insertions into the host genome, or one or larger insertions that were subsequently broken up by mobile elements or other genomic rearrangements, or which assembled into separate contigs due to insufficient data. These likely horizontally acquired sequences were closely related to Wolbachia strain wCon from the isopod Cylisticus convexus (Badawi et al. 2018), the feminizing Wolbachia strain wVulC, and the f element of A. vulgare (Leclercq et al. 2016) (Fig. 2).
Searching for candidate sex-determining genes
We identified ~6.04 × 106 21-mers as potentially male-specific, suggesting there is a minimum of 6 Mb of male-specific sequence content in the genome. However, when we isolated the raw Illumina sequencing reads containing those 21-mers and assembled them, we obtained 89.4 Mb of assembled sequences, suggesting the male-specific region may be as large as ~90 Mb, but still shares significant similarity with the X chromosome. Even if up to 90 Mb of the sequence is partially sex-linked, this represents just 1.7% of the genome.
As a negative control, we also screened for k-mers specific to females. We identified 9.11 × 105 female-specific 21-mers using the same criteria as our analysis for male-specific k-mers, or ~15% of the number of male-specific k-mers. The observation that there are far more male-specific 21-mers than there are female-specific 21-mers in our dataset suggests that most of these are from real male-specific genomic sequences.
Of the initial 16 candidate Y-linked PCR markers designed from anonymous sequences, none showed the expected pattern of male-specific amplification in our early tests (Supplementary Table 5). This may be due to (1) the highly repetitive nature of the T. rathkei genome, despite our best efforts to target primers to non-repetitive sequences, or (2) low divergence between X- and Y-linked copies.
Because the candidate male-specific contigs were assembled from Illumina data only and thus short and fragmented, we were unable to screen them for annotated candidate sex-determining genes using the typical MAKER pipeline. However, we were able to identify ORFs and annotate them like transcripts using Trinotate (Bryant et al. 2017). Three contigs in the male-specific assembly showed homology to the AGH gene upon annotation, suggesting there may be a Y-linked duplication of the AGH gene. Therefore, we designed PCR primers specifically targeting one of the Y-linked AGH-like sequences (AGHY1 on NODE_44048_length_535, see Methods for primer sequences; same PCR cycling conditions as the other candidate sex-specific primers). These primers resulted in a PCR product of the expected size (195 bp) in all-male samples screened (7/7), but not in any of the female samples (0/7), all of which were unrelated wild-caught individuals, confirming the male-specificity of this AGH allele.
This AGH sequence could be either a male-specific duplication of the AGH gene or a Y-linked allele that has diverged from an X-linked copy (in other words, gametologs). To distinguish between these possibilities, we examined the sequencing depth of these genes and of other putatively single-copy genes (identified in the BUSCO analysis) in male and female Illumina sequencing data. If the male-specific AGH sequence is a gametolog of an X-linked sequence, we would expect the total sequencing depth of all AGH sequences (putative autosomal and putative Y-linked) to be the same in both the pooled male and pooled female samples, with the female sample having a higher average sequencing depth for the putative X-linked AGH sequences (since they would be homozygous for the X-linked gametologs, while males would be hemizygous for the X-linked gametologs). If, on the other hand, the male-specific AGH sequences are Y-linked duplicates, and not allelic to the other AGH sequences in our assembly, we would expect the shared autosomal AGH sequences to have similar sequencing depth in both male and female samples, and the combined sequencing depth of all AGH sequences (putative autosomal and putative Y-linked) would be higher in the male sample. Our results were consistent with the latter scenario, suggesting these are Y-linked duplicates rather than gametologs (Fig. 3). Note that the sequencing depth of AGHY1 and AGHY2, though much lower in the female sample than in the male sample, is still non-zero in the female sample, probably because of ambiguously mapped reads due to high similarity between the Y-linked and autosomal copies.
Because the male-specific AGH sequences were found only in our Illumina data, we were unable to assemble them into long contigs, even after repeated attempts to assemble them individually with different assemblers and parameter values (not shown); all these contigs were ~600 bp or less in length. Thus we are unable to determine whether these are complete duplicates of the whole gene or fragments. Nevertheless, phylogenetic analysis suggests that one of the Y-linked duplicates is a copy of the other, rather than an independent duplication of an autosomal copy, and based on branch lengths they are as divergent from one another as AGH orthologs in different species (Fig. 5). In addition, these Y-linked copies seem to lack an intron that is present in the autosomal copies (Fig. 4), suggesting they may have originated via retrotransposition. Note that AGHY1 and AGHY2 seem to be duplicates of the same region of the AGH gene (Fig. 4), suggesting they are indeed two duplicates (the fact that both copies have appreciable sequencing depth in the pooled male sample, and are ~20% divergent at the DNA sequence level, suggests that these are real duplicates and not assembly artifacts; moreover, they cannot be allelic sequences because all the males in the Illumina samples that they were assembled from were siblings, carrying the same Y chromosome). AGHY3, on the other hand, covers a different region of the AGH gene (Fig. 4), so AGHY3 may not be an additional third duplicate, but may instead be a part of AGHY1 or AGHY2 that just assembled into a separate contig.
We also find some evidence of additional autosomal duplicates of the AGH gene. Two contigs in the full assembly contained annotated transcripts with at least partial homology to the expressed transcript identified as the AGH sequence, and a third contained no annotated genes but still showed high sequence similarity to AGH in BLAST searches. However, not all of the annotated exons in the first two copies matched the expressed transcript, and there were unannotated portions of the same contigs that did show sequence similarity to the transcript (Fig. 4). Moreover, some of the matching portions of the assembled contigs had less than 90% sequence identity to the expressed transcript, and analysis of the sequencing depth of these regions reveals that one has very low coverage, suggesting it may be an assembly artifact (see below). Thus, we cannot rule out the possibility that some of these possible autosomal duplicates represent assembly and/or annotation artifacts. If they are real, these autosomal duplicates appear to be specific to Trachelipus, occurring after its divergence from Porcellio (Fig. 5), but they may still be nonfunctional.
Discussion
We have shown that, at least in our upstate New York population, sex determination in the terrestrial isopod T. rathkei is based on an XX/XY sex chromosome system. Two independent lines of evidence support this finding: first, crosses between females and sex-reversed neo-males yielded all-female offspring (Table 1), consistent with an XX/XY system but not a ZZ/ZW system (Becking et al. 2017); second, we have identified PCR primers that only amplify a product in male samples, indicating the presence of a male-specific genomic region, i.e., a Y chromosome.
Our findings run counter to a previously published study showing evidence of female heterogamety in this species based on cytogenetics; in that study, female germ cells contained one set of unpaired chromosomes (presumably, the Z and W sex chromosomes), while male germ cells did not (Mittal and Pahwa 1980). There are multiple possible explanations for this contradiction. First, it is possible that the previous study incorrectly identified the species of study specimens, as no information on the identification is given in the paper; however, T. rathkei is relatively easy to distinguish from other cosmopolitan terrestrial isopod species by its five pairs of pleopodal “lungs” (most superficially similar species such as Porcellio scaber have only two pairs; Hatchett 1947; Shultz 2018). In addition, that study was published before feminizing Wolbachia was widely recognized in terrestrial isopods. It is therefore theoretically possible that the females used in that study carried an XY genotype but were feminized by Wolbachia, while the males in that study might have carried a YY genotype, perhaps resulting from a cross between an XY father and a sex-reversed XY or YY mother which failed to transmit Wolbachia (Becking et al. 2019).
Perhaps the most likely explanation is sex chromosome differences between populations. Indeed, this would not be unprecedented, as sex determination in terrestrial isopods is thought to evolve rapidly (Rigaud et al. 1997; Cordaux et al. 2011; Becking et al. 2017), and within-species sex chromosome polymorphisms are documented in a few other species. For instance, two subspecies of Porcellio dilatatus, P. dilatatus dilatatus and P. dilatatus petiti have XX/XY and ZZ/ZW systems, respectively (Juchault and Legrand 1964; Legrand et al. 1974; Becking et al. 2017). In addition, multiple sex determining elements segregate in populations of the common pillbug A. vulgare (Juchault et al. 1992), including a novel W chromosome that resulted from the integration of an almost entire Wolbachia genome into the host genome (Leclercq et al. 2016). Outside terrestrial isopods, sex chromosome polymorphisms are also documented in a range of other arthropods and vertebrates (Orzack et al. 1980; Franco et al. 1982; Ogata et al. 2008; Rodrigues et al. 2013). T. rathkei is probably non-native in North America where this study was conducted (Jass and Klausmeier 2000), as well as perhaps in India where the prior study on cytogenetics was done (Mittal and Pahwa 1980). Given its cosmopolitan distribution and the fact that other terrestrial isopods have moderate to high levels of genetic diversity (Romiguier et al. 2014), it might not be especially surprising for T. rathkei to harbor multiple polymorphic sex-determining loci. Nevertheless, the XX/XY system seems to be fixed, or at least the majority, in our population: multiple segregating sex-determining factors within a single population usually result in sex-biased broods (Denholm et al. 1986; Basolo 1994), including in other isopods like A. vulgare (Rigaud and Juchault 1993); however, we observed only a few (out of 131) sex-biased broods from wild-caught gravid females or lab-reared females outside of our sex reversal experiments and never observed any single-sex broods. Moreover, the male-specific primer pair we designed amplified successfully in all tested wild-caught males, and none of the wild-caught females, albeit with modest sample sizes. Hopefully, future follow-up work can further characterize geographic variation in sex determination in this species.
Regardless of whether or not sex determination is polymorphic in our population of T. rathkei, we propose that the ZZ/ZW sex chromosomes in this species are more likely to be ancestral and that the XX/XY system is derived. First, T. rathkei is nested within a clade that mostly consists of ZZ/ZW species (Becking et al. 2017). Moreover, the previous study finding a ZZ/ZW system in T. rathkei was based on the presence of heteromorphic sex chromosomes in female meiotic spreads (Mittal and Pahwa 1980), suggesting that the Z and W have been diverging long enough to be cytogenetically distinguishable, in contrast to other isopods examined so far showing homomorphic sex chromosomes (Rigaud et al. 1997). In addition, the putative male-specific region of the T. rathkei genome does not contain any genes that are essential for male reproduction or spermatogenesis, since phenotypic males with an XX genotype still sired offspring in our sex-reversal experiments, and the male-specific region is only a small portion of the total genome, similar to other terrestrial isopods examined so far (Chebbi et al. 2019; Becking et al. 2019). This male-specific region is probably at least 6 Mb and has an upper size limit of around 90 Mb, but these estimates include sequences that retain high similarity to X-linked copies; indeed, most of the candidate male-specific primers we tested failed to show sex-specific amplification patterns. These observations suggest that this Y chromosome may be evolutionarily young since it has not had time to accumulate major differences from the X. Given that we found genomic evidence of a past association with Wolbachia in this species and that infection by Wolbachia has been found in other T. rathkei populations (Cordaux et al. 2012), this scenario is consistent with the hypothesis that transitions in sex determination mechanisms may be triggered by Wolbachia and other endosymbionts that manipulate host reproduction (Rigaud et al. 1997; Cordaux et al. 2011). If other populations of T. rathkei with different sex determination mechanisms can be identified, it may be possible to leverage this system to further study the mechanisms and selective forces influencing transitions in sex determination mechanisms. In addition, studies of sex determination in a phylogenetic context involving other members of the family Trachelipodidae would shed further light on the origins of the X and Y chromosomes in T. rathkei.
The draft genome assembly of T. rathkei is especially large, at around 5.2 Gb, with approximately 29% GC content. The actual genome is likely to be even larger, given that ~25% of the BUSCO arthropod orthologs were missing in our assembly. By comparison, genomes of pillbugs in the genus Armadillidium tend to be smaller at around 1.2–2 Gb in size (Chebbi et al. 2019; Becking et al. 2019), but other terrestrial isopods have genomes ranging to over 8 Gb (Gregory 2020), and other crustacean relatives such as amphipods also have large genomes (Rees et al. 2007; Rivarola-Duarte et al. 2014; Kao et al. 2016), so T. rathkei is not out of the ordinary for this group.
The T. rathkei genome contains a large proportion of repetitive elements, in particular transposable elements (Fig. 1). The most common transposable element families are LINEs, DNA elements, and LTRs, similar to A. vulgare and A. nasatum (Chebbi et al. 2019; Becking et al. 2019, 2020). The distribution of divergence values, with a single mode around 7–10% divergence (Fig. 1), suggests that most repeat families expanded around the same time as previously shown in A. vulgare and A. nasatum; however, unlike in A. vulgare, T. rathkei shows no evidence of a second more recent burst in DNA element activity. Simple repeats also comprise a substantial portion of the genome; even manually looking through the assembled contigs reveals a high abundance of (TA)x repeats. It would be interesting to examine the repeat content of the male-specific portion of the genome. Unfortunately, however, we were only able to recover male-specific sequences from the short-read Illumina data, and this portion of the genome assembly is highly fragmented, precluding a more detailed analysis. Hopefully, additional long-read sequencing data will allow us to examine transposable element dynamics in this area in the future.
We found many contigs with high similarity to the Wolbachia genome (Supplementary Table 4), even though we were unable to detect current Wolbachia infections in our population using PCR. This is not surprising given that horizontal transfers of Wolbachia DNA into host genomes is common (Dunning Hotopp 2011), and that Wolbachia is relatively common in terrestrial isopods and arthropods in general (Cordaux et al. 2012; Pascar and Chandler 2018; Medina et al. 2019) and has been found in other populations of T. rathkei. These Wolbachia insertions are closely related to other Wolbachia strains from isopods, including feminizing strains (Cordaux et al. 2004; Leclercq et al. 2016). This suggests that T. rathkei may have been infected with a feminizing Wolbachia strain in the past, even though no firm conclusion can be drawn solely from phylogenetic evidence. If so, it is conceivable that Wolbachia may have been involved in the sex chromosome turnover we characterized in T. rathkei, as previously hypothesized (Rigaud et al. 1997; Cordaux and Gilbert 2017).
Male differentiation in terrestrial isopods is controlled by the androgenic gland hormone, AGH. AGH is a peptide hormone similar in structure to insulin and is secreted by the androgenic gland (Martin et al. 1999). AGH expression is sufficient to transform juvenile female isopods into fertile males (Martin et al. 1999). Presumably, in wild-type males, the primary sex-determining signal triggers the differentiation of the androgenic glands during development, which then secretes AGH. Interestingly, the draft genome of T. rathkei contains multiple AGH-like sequences, unlike A. vulgare, which has a single copy (Chebbi et al. 2019). While some of these may be assembly artifacts, there is evidence of at least two partial Y-linked sequences (assembled from Illumina sequencing reads containing male-specific k-mers), of which one was confirmed by PCR to be male-specific. These duplications seem to be specific to T. rathkei (Fig. 5), though other members of the genus Trachelipus or the family Trachelipodidae have yet to be examined. Consistent with this, a past study found no evidence of any expressed AGH duplications in other terrestrial isopod species except Porcellio gallicus (Cerveau et al. 2014).
In many other species, novel sex chromosomes have arisen via duplication of a sex-determining gene. For instance, duplicates of the vertebrate gene Dmrt1 have evolved into master sex-determining signals on the W and Y chromosomes, respectively, in the frog Xenopus laevis (Yoshimoto et al. 2008) and the medaka Oryzias latipes (Matsuda et al. 2002, 2007; Nanda et al. 2002), while a Y-linked duplicate of the anti-Müllerian hormone gene is a candidate master sex-determining gene in the teleost fish Odontesthes hatcheri (Hattori et al. 2012). The presence of Y-linked AGH copies in T. rathkei, and no other obvious ORFs homologous to known sex determination or sex differentiation genes, makes these genes obvious candidates for the master male-determining signal in T. rathkei. Sex-specific genomic regions like the Y and W chromosomes are also expected to acquire sexually antagonistic alleles (van Doorn and Kirkpatrick 2007, 2010; Charlesworth 2017). Thus, if functional these duplicates might instead provide male fitness benefits rather than serving as a master male sex-determining gene. Unfortunately, we were unable to assemble full copies of these Y-linked AGH homologs because they only showed up in our Illumina data, not in our low-coverage long-read data. Future deep sequencing using long reads should further clarify the molecular evolution of these genes. In addition, expression studies should determine which of these genes are expressed, in what tissues, and at what stages.
We have shown that the terrestrial isopod T. rathkei uses an XX/XY sex chromosome system, at least in upstate New York, in contrast to a past cytogenetic study suggesting a ZZ/ZW mechanism (Mittal and Pahwa 1980). In line with this, whole-genome sequencing and follow-up PCRs demonstrate the existence of male-specific, Y-linked copies of the AGH gene in this species. These findings highlight the role of gene duplication in the evolution of sex chromosomes and they further establish terrestrial isopods as models to study the evolution of sex determination systems and the mechanisms underlying their transitions.
Data availability
Data can be accessed at NCBI under the following Accession ID: PRJNA633105.
References
Badawi M, Moumen B, Giraud I, Grève P, Cordaux R (2018) Investigating the molecular genetic basis of cytoplasmic sex determination caused by Wolbachia endosymbionts in terrestrial isopods. Genes 9:290
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS et al. (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477
Bao Z, Eddy SR (2002) Automated de novo identification of repeat sequence families in sequenced genomes. Genome Res 12:1269–1276
Basolo AL (1994) The dynamics of fisherian sex-ratio evolution: theoretical and experimental investigations. Am Naturalist 144:473–490
Becking T, Chebbi MA, Giraud I, Moumen B, Laverré T, Caubet Y et al. (2019) Sex chromosomes control vertical transmission of feminizing Wolbachia symbionts in an isopod. PLoS Biol 17:e3000438
Becking T, Gilbert C, Cordaux R (2020) Impact of transposable elements on genome size variation between two closely related crustacean species. Anal Biochem 600:113770
Becking T, Giraud I, Raimond M, Moumen B, Chandler C, Cordaux R et al. (2017) Diversity and evolution of sex determination systems in terrestrial isopods. Sci Rep 7:1084
Benson G (1999) Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 27:573–580
Bergero R, Charlesworth D (2009) The evolution of restricted recombination in sex chromosomes. Trends Ecol Evol 24:94–102
Blaser O, Grossen C, Neuenschwander S, Perrin N (2013) Sex-chromosome turnovers induced by deleterious mutation load. Evolution 67:635–645
Blaser O, Neuenschwander S, Perrin N (2014) Sex-chromosome turnovers: the hot-potato model. Am Naturalist 183:140–146
Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120
Bouchon D, Rigaud T, Juchault P (1998) Evidence for widespread Wolbachia infection in isopod crustaceans: molecular identification and host feminization. Proc R Soc Lond Ser B 265:1081–1090
Braig HR, Zhou W, Dobson SL, O’Neill SL (1998) Cloning and characterization of a gene encoding the major surface protein of the bacterial endosymbiont Wolbachia pipientis. J Bacteriol 180:2373–2378
Bryant DM, Johnson K, DiTommaso T, Tickle T, Couger MB, Payzin-Dogru D et al. (2017) A tissue-mapped axolotl de novo transcriptome enables identification of limb regeneration factors. Cell Rep 18:762–776
Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60
Bushnell B, Rood J, Singer E (2017) BBMerge—accurate paired shotgun read merging via overlap. PLoS ONE 12:e0185056
Cantarel BL, Korf I, Robb SMC, Parra G, Ross E, Moore B et al. (2008) MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res 18:188–196
Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T (2009) trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25:1972–1973
Cerveau N, Bouchon D, Bergès T, Grève P (2014) Molecular evolution of the androgenic hormone in terrestrial isopods. Gene 540:71–77
Chandler JC, Elizur A, Ventura T (2018) The decapod researcher’s guide to the galaxy of sex determination. Hydrobiologia 825:61–80
Chandler JC, Fitzgibbon QP, Smith G, Elizur A, Ventura T (2017) Y-linked iDmrt1 paralogue (iDMY) in the Eastern spiny lobster, Sagmariasus verreauxi—the first invertebrate sex-linked Dmrt. Dev Biol 430:337–345
Charlesworth D (2017) Evolution of recombination rates between sex chromosomes. Philos Trans R Soc B 372:20160456
Charlesworth B, Charlesworth D (2000) The degeneration of Y chromosomes. Philos Trans R Soc Lond B 355:1563–1572
Charlesworth D, Charlesworth B, Marais G (2005) Steps in the evolution of heteromorphic sex chromosomes. Heredity 95:118–128
Chebbi MA, Becking T, Moumen B, Giraud I, Gilbert C, Peccoud J et al. (2019) The genome of armadillidium vulgare (Crustacea, Isopoda) provides insights into sex chromosome evolution in the context of cytoplasmic sex determination. Mol Biol Evol 36:727–741
Chevreux B, Wetter T, Suhai S (1999). Genome sequence assembly using trace signals and additional sequence information. In: German conference on bioinformatics, Citeseer, 99, pp 45–56.
Cioffi MB, Liehr T, Trifonov V, Molina WF, Bertollo LAC (2013) Independent sex chromosome evolution in lower vertebrates: a molecular cytogenetic overview in the erythrinidae fish family. CGR 141:186–194
Conover DO, Kynard BE (1981) Environmental sex determination: interaction of temperature and genotype in a fish. Science 213:577–579
Cordaux R, Bouchon D, Grève P (2011) The impact of endosymbionts on the evolution of host sex-determination mechanisms. Trends Genet 27:332–341
Cordaux R, Michel-Salzat A, Frelon-Raimond M, Rigaud T, Bouchon D (2004) Evidence for a new feminizing Wolbachia strain in the isopod Armadillidium vulgare: evolutionary implications. Heredity 93:78–84
Cordaux R, Gilbert C (2017) Evolutionary Significance of Wolbachia-to-Animal Horizontal Gene Transfer: Female Sex Determination and the f Element in the Isopod Armadillidium vulgare. Genes 8:186. https://doi.org/10.3390/genes8070186
Cordaux R, Pichon S, Hatira HBA, Doublet V, Grève P, Marcadé I et al. (2012) Widespread Wolbachia infection in terrestrial isopods and other crustaceans Zookeys 176:123–131
Darriba D, Posada D, Kozlov AM, Stamatakis A, Morel B, Flouri T (2020) ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models Mol Biol Evolution 37:291–294
Denholm I, Franco MG, Rubini PG, Vecchi M (1986) Geographical variation in house-fly (Musca domestica L.) sex determinants within the British Isles. Genet Res 47:19–27
van Doorn GS, Kirkpatrick M (2007) Turnover of sex chromosomes induced by sexual conflict. Nature 449:909–912
van Doorn GS, Kirkpatrick M (2010) Transitions between male and female heterogamety caused by sex-antagonistic selection. Genetics 186:629–645
Dunning Hotopp JC (2011) Horizontal gene transfer between bacteria and animals. Trends Genet 27:157–163
Edgar RC (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinforma 5:113
Emerson JJ, Kaessmann H, Betrán E, Long M (2004) Extensive gene traffic on the mammalian X chromosome. Science 303:537–540
Folmer O, Black MB, Hoeh WR, Lutz R, Vrijenhoek RC (1994) DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates. Mol Mar Biol Biotechnol 3:294–299
Franco MG, Rubini PG, Vecchi M (1982) Sex-determinants and their distribution in various populations of Musca domestica L. of Western Europe. Genet Res 40:279–293
Fussey GD (1984) The distribution of the two forms of the woodlouse Trichoniscus pusillus Brandt (Isopoda: Oniscoidea) in the British Isles: a reassessment of geographic parthenogenesis. Biol J Linn Soc 22:309–321
Gamble T, Geneva AJ, Glor RE, Zarkower D (2014) Anolis sex chromosomes are derived from a single ancestral pair. Evolution 68:1027–1041
Gilbert DG (2019). Longest protein, longest transcript or most expression, for accurate gene reconstruction of transcriptomes? bioRxiv. 829184.
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I et al. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol 29:644–652
Gregory TR (2020). Animal genome size database.
Gurevich A, Saveliev V, Vyahhi N, Tesler G (2013) QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075
Haas B, Papanicolaou A (2016). TransDecoder (find coding regions within transcripts). http://transdecoder.github.io
Hatchett SP (1947) Biology of the isopoda of Michigan. Ecol Monogr 17:47–80
Hattori RS, Murai Y, Oura M, Masuda S, Majhi SK, Sakamoto T et al. (2012) A Y-linked anti-Müllerian hormone duplication takes over a critical role in sex determination. PNAS 109:2955–2959
Heimpel GE, de Boer JG (2008) Sex determination in the hymenoptera. Annu Rev Entomol 53:209–230
Janzen FJ, Phillips PC (2006) Exploring the evolution of environmental sex determination, especially in reptiles. J Evolut Biol 19:1775–1784
Jass J, Klausmeier B (2000) Endemics and immigrants: North American Terrestrial Isopods (Isopoda, Oniscidea) North of Mexico. Crustaceana 73:771–799
Jeffries DL, Lavanchy G, Sermier R, Sredl MJ, Miura I, Borzée A et al. (2018) A rapid rate of sex-chromosome turnover and non-random transitions in true frogs. Nat Commun 9:4088
Johnson C (1986) Parthenogenetic reproduction in the philosciid isopod, Ocelloscia Floridana (Van Name 1940). Crustaceana 51:123–132
Juchault P, Legrand J (1964) Démonstration de l’homogamétie femelle par croisement de deux femelles génétiques chez les Oniscoïdes Porcellio dilatatus et Helleria brevi-cornis. Comp Rend l’Acad Sci Paris 258:2685–2686
Juchault P, Rigaud T (1995) Evidence for female heterogamety in two terrestrial crustaceans and the problem of sex chromosome evolution in isopods. Heredity 75:466–471
Juchault P, Rigaud T, Mocquard J-P (1992) Evolution of sex-determining mechanisms in a wild population of Armadillidium vulgare Latr. (Crustacea, Isopoda): competition between two feminizing parasitic sex factors. Heredity 69:382–390
Kageyama D, Nishimura G, Hoshizaki S, Ishikawa Y (2002) Feminizing Wolbachia in an insect, Ostrinia furnacalis (Lepidoptera: Crambidae). Heredity 88:444–449
Kao D, Lai AG, Stamataki E, Rosic S, Konstantinides N, Jarvis E et al. (2016) The genome of the crustacean Parhyale hawaiensis, a model for animal development, regeneration, immunity and lignocellulose digestion. eLife 5:e20062
Kokot M, Długosz M, Deorowicz S (2017) KMC 3: counting and manipulating k-mer statistics. Bioinformatics 33:2759–2761
Korf I (2004) Gene finding in novel genomes. BMC Bioinform 5:59
Laetsch DR, Blaxter ML (2017) BlobTools: Interrogation of genome assemblies. F1000Res 6:1287
Leclercq S, Thézé J, Chebbi MA, Giraud I, Moumen B, Ernenwein L et al. (2016) Birth of a W sex chromosome by horizontal transfer of Wolbachia bacterial symbiont genome. Proc Natl Acad Sci USA 113:15036–15041
Legrand J, Juchault P, Artault J, Mocquard J, Picaud J (1974) Le statut systématique de la «forme» petiti vandel de Porcellio dilatatus brandt, récoltée à l’île saint-honorat (Alpes-Maritimes). Critères morphologiques, génétiques et physiologiques. Bull Soc Zool Fr 99:461–471
Li H (2013). Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:13033997 [q-bio]
Liu J, Yu T, Mu Z, Li G (2019) TransLiG: a de novo transcriptome assembler that uses line graph iteration. Genome Biol 20:81
Martin G, Sorokine O, Moniatte M, Bulet P, Hetru C, Dorsselaer AV (1999) The structure of a glycosylated protein hormone responsible for sex determination in the isopod, Armadillidium vulgare. Eur J Biochem 262:727–736
Matsuda M, Nagahama Y, Shinomiya A, Sato T, Matsuda C, Kobayashi T et al. (2002) DMY is a Y-specific DM-domain gene required for male development in the medaka fish. Nature 417:559–563
Matsuda M, Shinomiya A, Kinoshita M, Suzuki A, Kobayashi T, Paul-Prasanth B et al. (2007) DMY gene induces male development in genetically female (XX) medaka fish. PNAS 104:3865–3870
Medina P, Russell SL, Corbett-Detig R (2019). Deep data mining reveals variable abundance and distribution of microbial reproductive manipulators within and among diverse host species. bioRxiv. 679837
Meisel RP, Davey T, Son JH, Gerry AC, Shono T, Scott JG (2016) Is multifactorial sex determination in the house fly, musca domestica (L.), stable over time? J Hered 107:615–625
Mittal OP, Pahwa S (1980) Cytogenetical studies on two terrestrial isopods (Crustacea), Philoscia lodnensis Ramakrishna and Porcellio rathkei Brandt. Cytobios 29:17–24
Morgulis A, Coulouris G, Raytselis Y, Madden TL, Agarwala R, Schäffer AA (2008) Database indexing for production MegaBLAST searches. Bioinformatics 24:1757–1764
Myosho T, Takehana Y, Hamaguchi S, Sakaizumi M (2015) Turnover of sex chromosomes in celebensis group medaka fishes. G3: Genes Genomes Genet 5:2685–2691
Nanda I, Kondo M, Hornung U, Asakawa S, Winkler C, Shimizu A et al. (2002) A duplicated copy of DMRT1 in the sex-determining region of the Y chromosome of the medaka, Oryzias latipes. PNAS 99:11778–11783
Negri I, Pellecchia M, Mazzoglio PJ, Patetta A, Alma A (2006) Feminizing Wolbachia in Zyginidia pullula (Insecta, Hemiptera), a leafhopper with an XX/X0 sex-determination system. Proc R Soc B: Biol Sci 273:2409–2416
Ogata M, Hasegawa Y, Ohtani H, Mineyama M, Miura I (2008) The ZZ/ZW sex-determining mechanism originated twice and independently during evolution of the frog, Rana rugosa. Heredity 100:92–99
Orzack SH, Sohn JJ, Kallman KD, Levin SA, Johnston R (1980) Maintenance of the three sex chromosome polymorphism in the platyfish, Xiphophorus maculatus. Evolution 34:663–672
Ospina-Álvarez N, Piferrer F (2008) Temperature-dependent sex determination in fish revisited: prevalence, a single sex ratio response pattern, and possible effects of climate change. PLoS ONE 3:e2837
Pascar J, Chandler CH (2018) A bioinformatics approach to identifying Wolbachia infections in arthropods. PeerJ 6:e5486
Pennell MW, Mank JE, Peichel CL (2018) Transitions in sex determination and sex chromosomes across vertebrate species. Mol Ecol 27:3950–3963
Pickup J, Ironside JE (2018) Multiple origins of parasitic feminization: thelygeny and intersexuality in beach-hoppers are caused by paramyxid parasites, not microsporidia. Parasitology 145:408–415
Potrzebowski L, Vinckenbosch N, Marques AC, Chalmel F, Jégou B, Kaessmann H (2008) Chromosomal gene movements reflect the recent origin and biology of therian sex chromosomes. PLoS Biol 6:e80
Price AL, Jones NC, Pevzner PA (2005) De novo identification of repeat families in large genomes. Bioinformatics 21:i351–i358
Pryszcz LP, Gabaldón T (2016) Redundans: an assembly pipeline for highly heterozygous genomes. Nucleic Acids Res 44:e113–e113
Rees DJ, Dufresne F, Glémet H, Belzile C (2007) Amphipod genome sizes: first estimates for arctic species reveal genomic giants. Genome 50:151–158
Rice WR (1996) Evolution of the Y sex chromosome in animals. BioScience 46:331–343
Rigaud T, Juchault P (1993) Conflict between feminizing sex ratio distorters and an autosomal masculinizing gene in the terrestrial isopod Armadillidium vulgare Latr. Genetics 133:247–252
Rigaud T, Juchault P, Mocquard J-P (1997) The evolution of sex determination in isopod crustaceans. BioEssays 19:409–416
Rivarola-Duarte L, Otto C, Jühling F, Schreiber S, Bedulina D, Jakob L et al. (2014) A first Glimpse at the genome of the Baikalian amphipod Eulimnogammarus verrucosus. J Exp Zool Part Bn 322:177–189
Rodrigues N, Betto‐Colliard C, Jourdan‐Pineau H, Perrin N (2013) Within-population polymorphism of sex-determination systems in the common frog (Rana temporaria). J Evolut Biol 26:1569–1577
Romiguier J, Gayral P, Ballenghien M, Bernard A, Cahais V, Chenuil A et al. (2014) Comparative population genomics in animals uncovers the determinants of genetic diversity. Nature 515:261–263
Ross JA, Urton JR, Boland J, Shapiro MD, Peichel CL (2009) Turnover of sex chromosomes in the stickleback fishes (Gasterosteidae). PLoS Genet 5:e1000391
Shultz JW (2018) A guide to the identification of the terrestrial Isopoda of Maryland, U.S.A. (Crustacea). ZooKeys 801:207–228
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212
Stamatakis A (2014) RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313
Stanke M, Keller O, Gunduz I, Hayes A, Waack S, Morgenstern B (2006) AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res 34:W435–W439
Sun LV, Foster JM, Tzertzinis G, Ono M, Bandi C, Slatko BE et al. (2001) Determination of Wolbachia genome size by pulsed-field gel electrophoresis. J Bacteriol 183:2219–2225
Tarailo-Graovac M, Chen N (2009) Using repeatmasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinform 25:4–10
Terry RS, Smith JE, Dunn AM (1998) Impact of a novel, feminising microsporidium on its crustacean host. J Eukaryot Microbiol 45:497–501
Tingley GA, Anderson RM (1986) Environmental sex determination and density-dependent population regulation in the entomogenous nematode Romanomermis culicivorax. Parasitology 92:431–449
Traut W (1994) Sex determination in the fly Megaselia scalaris, a model system for primary steps of sex chromosome evolution. Genetics 136:1097–1104
Untergasser A, Cutcutache I, Koressaar T, Ye J, Faircloth BC, Remm M et al. (2012) Primer3—new capabilities and interfaces. Nucleic Acids Res 40:e115–e115
Vandeputte M, Dupont-Nivet M, Chavanne H, Chatain B (2007) A polygenic hypothesis for sex determination in the European sea bass dicentrarchus labrax. Genetics 176:1049–1057
Verhulst EC, Beukeboom LW, van de Zande L (2010) Maternal control of haplodiploid sex determination in the wasp nasonia. Science 328:620–623
Veyrunes F, Waters PD, Miethke P, Rens W, McMillan D, Alsop AE et al. (2008) Bird-like sex chromosomes of platypus imply recent origin of mammal sex chromosomes. Genome Res 18:965–973
Vicoso B, Bachtrog D (2015) Numerous transitions of sex chromosomes in diptera. PLoS Biol 13:e1002078
Vurture GW, Sedlazeck FJ, Nattestad M, Underwood CJ, Fang H, Gurtowski J et al. (2017) GenomeScope: fast reference-free genome profiling from short reads (B Berger, Ed.). Bioinformatics 33:2202–2204
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S et al. (2014) Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE 9:e112963
Wang JR, Holt J, McMillan L, Jones CD (2018) FMLRC: hybrid long read error correction using an FM-index. BMC Bioinforma 19:50
Werren J H., Zhang W, Guo LR (1995). Evolution and phylogeny of Wolbachia: reproductive parasites of arthropods. Proc R Soc Lond Ser B: Biol Sci 261: 55–63.
Ye C, Hill CM, Wu S, Ruan J, Ma Z (2016) DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Sci Rep 6:31900
Ye C, Ma Z (Sam) (2016) Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads. PeerJ 4:e2016.
Ye C, Ma ZS, Cannon CH, Pop M, Yu DW (2012) Exploiting sparseness in de novo genome assembly. BMC Bioinform 13:S1
Yoshimoto S, Okada E, Umemoto H, Tamura K, Uno Y, Nishida-Umehara C et al. (2008) A W-linked DM-domain gene, DM-W, participates in primary ovary development in Xenopus laevis. Proc Natl Acad Sci USA 105:2469–2474
Acknowledgements
We thank the editor and three anonymous reviewers for constructive comments on earlier drafts of this manuscript. We also appreciate the computing time and assistance provided by the National Center for Genome Analysis Support at Indiana University, especially Tom Doak and Sheri Sanders. This research was funded by National Science Foundation grant NSF-DEB 1453298 to CHC. It was also supported in part by Lilly Endowment, Inc., through its support for the Indiana University Pervasive Technology Institute. This research is based upon work supported by the National Science Foundation under Grant Nos. DBI-1062432 2011, ABI-1458641 2015, and ABI-1759906 2018 to Indiana University. Any opinions, findings, and conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation, the National Center for Genome Analysis Support, or Indiana University.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Associate editor: Aurora Ruiz-Herrera
Supplementary information
Rights and permissions
About this article
Cite this article
Russell, A., Borrelli, S., Fontana, R. et al. Evolutionary transition to XY sex chromosomes associated with Y-linked duplication of a male hormone gene in a terrestrial isopod. Heredity 127, 266–277 (2021). https://doi.org/10.1038/s41437-021-00457-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41437-021-00457-2