Population genomics in two cave-obligate invertebrates confirms extremely limited dispersal between caves

Balogh, Andras; Ngo, Lam; Zigler, Kirk S.; Dixon, Groves

doi:10.1038/s41598-020-74508-9

Download PDF

Article
Open access
Published: 16 October 2020

Population genomics in two cave-obligate invertebrates confirms extremely limited dispersal between caves

Andras Balogh¹,
Lam Ngo²,
Kirk S. Zigler² &
…
Groves Dixon¹

Scientific Reports volume 10, Article number: 17554 (2020) Cite this article

2362 Accesses
9 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Caves offer selective pressures that are distinct from the surface. Organisms that have evolved to exist under these pressures typically exhibit a suite of convergent characteristics, including a loss or reduction of eyes and pigmentation. As a result, cave-obligate taxa, termed troglobionts, are no longer viable on the surface. This circumstance has led to an understanding of highly constrained dispersal capabilities, and the prediction that, in the absence of subterranean connections, extreme genetic divergence between cave populations. An effective test of this model would involve (1) common troglobionts from (2) nearby caves in a cave-dense region, (3) good sample sizes per cave, (4) multiple taxa, and (5) genome-wide characterization. With these criteria in mind, we used RAD-seq to genotype an average of ten individuals of the troglobiotic spider Nesticus barri and the troglobiotic beetle Ptomaphagus hatchi, each from four closely located caves (ranging from 3 to 13 km apart) in the cave-rich southern Cumberland Plateau of Tennessee, USA. Consistent with the hypothesis of highly restricted dispersal, we find that populations from separate caves are indeed highly genetically isolated. Our results support the idea of caves as natural laboratories for the study of parallel evolutionary processes.

Parallel and convergent genomic changes underlie independent subterranean colonization across beetles

Article Open access 29 June 2023

Population expansion, divergence, and persistence in Western Fence Lizards (Sceloporus occidentalis) at the northern extreme of their distributional range

Article Open access 15 April 2022

Life on a beach leads to phenotypic divergence despite gene flow for an island lizard

Article Open access 03 February 2023

Introduction

Caves are unique habitats with environmental conditions fundamentally distinct from the surface. The most conspicuous of these is the complete absence of light, precluding the use of visual cues for hunting, foraging, locating mates, and evading predators¹. Moreover, as photosynthesis is not possible, cave communities depend nearly entirely on trophic input from the surface. Caves are also more stable in temperature and humidity than surface habitats². As a result, adaptation to life underground typically involves extensive evolutionary change³.

Some organisms have evolved under these conditions to the extent that they are never found outside of caves. These organisms, termed troglobionts², often bear a suite of distinctive characteristics, including loss or reduction of eyes, pigment loss, elongated appendages, improved non-visual sensory mechanisms, reduced metabolic rates, longer lifespans, and lower rates of reproduction^2,3,4. These features represent widely repeated cases of convergent evolution, that can be shaped even by fine-scaled niche partitioning within subterranean habitats⁵. Many of these phenotypes have obvious tradeoffs for fitness on the surface. For instance, pigment loss, selectively neutral or possibly even advantageous in the cave environment⁶, will decrease crypsis on the surface, a trait known to undergo particularly strong purifying selection⁷. While most studies the selective pressures on crypsis come from surface-dwelling species⁸, the importance of camouflage among other arthropods is suggestive that similar pressures would exist for troglobionts traversing surface terrain. Intolerance to variation in temperature and humidity may also preclude surface viability². Hence, the surface is a hostile environment for troglobionts. With this idea in mind, Culver and Pipan² pointed out that caves are like islands in a sea of surface habitat.

Previous studies have shown that troglobiont migration is indeed highly limited. For instance, at the species level, endemism is less the exception than the rule^9,10. This is especially true in the eastern United States, where up to 45% of troglobionts are single-cave endemics¹¹. These exceptional rates of endemism are consistent with restricted gene flow and frequent speciation. Population genetic studies lend further support. Examining COI in the troglobiotic spider Nesticus barri, Snowman et al.¹² found extensive haplotype divergence and limited sharing of haplotypes between caves, indicating that migration was minimal to nonexistent over distances greater than 15 km. Another study, examining COI in several troglobionts, including N. barri and the beetle Ptomaphagus hatchi, provided similar findings¹³, indicating that restricted migration is likely general to terrestrial troglobionts.

Here, we evaluate the hypothesis that dispersal of terrestrial troglobiont invertebrates is extremely limited between caves. To emphasize the severity of isolation, we sampled individuals from caves located close together on the southern Cumberland Plateau, one of the most biodiverse karst areas in the United States^9,10,14 (Fig. 1; Table 1). The southern Cumberland Plateau is also one of the most cave-rich regions in North America, with more than 4000 caves known from a six-county area in southern Tennessee and northeast Alabama¹⁵. To ensure the generality of the hypothesis, we focused on two species with distinct natural histories: the spider N. barri and the beetle P. hatchi. N. barri is part of a complex of 28 species found across the southeastern United States¹⁶. Their tendency to live in dark, moist habitats has led to numerous instances of cave habitation, with roughly one third of species in the group either troglophiles (frequent cave dwellers that are also found on the surface) or troglobionts¹⁷. N. barri demonstrates typical troglomorphic features, lacking eyes and with reduced pigment, although it still possesses reproductive seasonality¹⁸. The genus Ptomaphagus includes about 60 species in North America, again with roughly one third either troglophiles or troglobionts⁴. Diversification of the genus throughout the southern Cumberland Plateau is thought to have occurred through progressive vicariance, as the Cumberland Plateau eroded over the last 6 million years¹⁹. Like other troglobiotic Ptomaphagus, P. hatchi has greatly reduced eyes and is wingless. On the southern Cumberland Plateau, N. barri and P. hatchi have largely overlapping ranges, with each species known from dozens of caves^12,19, and both species are common in the caves they inhabit. Finally, where previous studies made use of one, or at most a handful loci, we take a genome-wide approach using 2bRAD²⁰. This method has the advantage of interrogating thousands of loci across the genome, allowing for more confident estimates of population divergence and neutral diversity^21,22. This is the first study to investigate these species at the genomic scale.

Table 1 Cave and sampling information.

Full size table

With this approach, we test several predictions of the hypothesis of extremely limited dispersal between caves. First, each cave should harbor a genetically distinct population. Individuals from any cave should be most similar to others from the same cave. Hence, unsupervised methods such as hierarchical clustering, and principal component analysis should sort samples based on their cave of origin. Second, when modeled as mixtures of ancestral populations, individuals should demonstrate limited admixture between caves. Third, we should observe extensive genetic differentiation between caves. We test this third prediction using pairwise estimates of F_ST and d_XY. An important corollary of our hypothesis is that intermediate subterranean connectivity, via habitats such as horizontal fissures⁵ or Mesovoid Shallow Substratum (MSS)^23,24 is also highly limited or nonexistent.

Materials and methods

Sampling

We collected specimens from four caves on the edge of the southern Cumberland Plateau in Franklin County, Tennessee (Table 1). The caves were chosen based on proximity, location, and previous knowledge of the presence of N. barri and P. hatchi^13,25. Distances between the caves ranged from 3 to 13 km (Fig. 1). The caves are distributed across two adjacent watersheds, with Solomon’s Temple (ST) and Sewanee Blowhole (SB) in the Upper Elk River watershed that drains to the north and west of the study area, whereas Grapevine Cave (GV) and Buggytop Cave (BT) are in the Guntersville Lake watershed that drains the study area to the south (Fig. 1; Table 1). Three of the caves (ST, GV, and BT) have maps on record with the Tennessee Cave Survey, so for those caves, we had a precise measure of the total human-accessible cave passage. The fourth cave (SB) has not been mapped, so we relied on the estimated length of the cave as noted in the Tennessee Cave Survey records.

Sampling was conducted between 21 September and 25 October 2018. We collected Ptomaphagus and Nesticus by hand during visual encounter surveys as two researchers moved through the cave. Sampling was opportunistic, rather than random, but was dispersed throughout the cave. An initial survey of Buggytop Cave yielded only a few Ptomaphagus, so food baits (tuna) were placed in the cave for 24 h and live specimens were subsequently collected at the baits. Sample size per cave ranged from 6 to 16 individuals per species (Table 1). Specimens were placed into 100% EtOH in the field and subsequently stored at − 20 °C. Sampling was permitted by the Tennessee Wildlife Resources Agency (Permit #1385) and the Tennessee Department of Environment and Conservation (Permit #2013-026).

Library preparation

Most of the DNA extractions were performed using the entire body of the individual. If a particular sample seemed large enough (mainly applied to Nesticus barri) the legs were saved while the cephalothorax and abdomen were used. QIAGEN’s DNeasy Blood & Tissue Kit (Cat. No. 69504 or 69506) was used following the kit’s protocol with the exception of using 50 µl Buffer AE for elution. Concentrations of each DNA isolation were initially checked by nanodrop and confirmed with the Quant-IT Picogreen DS DNA assay (Life Technologies cat. no. P7589). The 2b-RAD library preparation was carried out as described previously^20,26,27,28. Briefly, DNA isolations were normalized to ~ 12.5 ng/µl. Samples with concentrations lower than 12.5 ng/µl DNA (~ 30 samples), were fully dehydrated in a vacuum centrifuge and resuspended to a target concentration of ~ 12.5 ng/µl. Digestion reactions had concentrations of 1 × NEB buffer #3 and 10 µM SAM mixed with 1 total U of BcgI restriction enzyme and 50 ng genomic DNA in a total volume of 6 µl. Digests were incubated at 37 °C for 1 h followed by 20 min at 65 °C for heat inactivation. Ligation reactions had concentrations of 1 × T4 ligase buffer, and 0.25 µM each adapter with 400 total U of T4 DNA ligase and 6 µl of digested DNA in a total volume of 20 µl. Ligation reactions were incubated at 4 °C overnight, followed by 20 min at 65 °C for heat inactivation. Inclusion of internal barcodes in the i7 adapter allowed for pooling sets of samples at this point. Amplification and additional barcoding reactions were performed on these pools. These reactions had concentrations of 312 µM each dNTP, 0.2 µM each of p5 and p7, 0.15 µM appropriate TruSeq-Un primer, 0.15 µM appropriate primer, 1 × Titanium Taq buffer, and 1 × Taq polymerase mixed with 4 µl of pooled ligation in a total volume of 20 µl. Thermocycler conditions were 70 °C for 30 s, followed by 14 cycles of 95 °C for 20 s, 65 °C for 3 min, and 72 °C for 30 s. The final library was pooled into a single tube and sequenced at the University of Texas at Austin’s Genome Sequencing and Analysis Facility on a single lane on the Illumina Hiseq 2500 platform.

Data processing

Raw reads were trimmed and demultiplexed based on internal ligation barcodes using custom Perl scripts²⁸. Reads were quality filtered using fastq_quality_filter from the Fastx Toolkit (https://hannonlab.cshl.edu/fastx_toolkit/). Generation of de novo loci was performed using cd-hit²⁹ and custom Perl scripts as described previously^20,26,27,28. These tags were assembled into 30 equally sized pseudo-chromosomes for mapping. Re-mapping of reads to these de novo loci was done with bowtie2³⁰. Sorting and indexing of bam files in preparation for genotyping were done with samtools³¹.

Genotype analyses

We analyzed the genotype data in two ways. The first method depended on hard genotype calls, in which the genotype of each individual at each site is either called exactly or filtered to missing data based on arbitrary cutoffs. While simpler to implement, these hard genotype calls can introduce biases, because they can fail to capture statistical uncertainty inherent to individual genotypes from Next-Generation Sequencing data³². A second, alternative approach is to estimate sample allele frequency spectra directly from base calls and quality metrics in the alignment data, allowing for population genetic inferences without making individual genotype calls³². We processed hard genotype calls primarily using VCFtools³³. Estimation of genotype likelihoods, allele frequency spectra, and additional population genetic inferences were implemented using Angsd^34,35,36. Throughout the manuscript we emphasize the results produced using Angsd, using analyses from hard genotype calls primarily for corroboration. All steps used for both sets of analyses, along with scripts for statistical analysis and figure generation are included in the git repository³⁷.

Hard genotype calls were made as described previously³⁸ using mpileup and bcftools³⁹. Genotype calls with depth lower than 2, as well as indels, singletons, sites with more than two alleles, or less than 75% of samples genotyped were removed using VCFtools³³. Sites with excess heterozygosity (p value < 0.1) (likely paralogs) were removed based on the—hardy output from VCFtools.

For the analyses performed with Angsd, quality filtering included a minimum mapping score of 20 (− minMapQ 20), a minimum quality score of 32 (− minQ 32), and minimum representation among samples > 80% (− minInd). Filters based on p values, including the strand bias p value (− sb_pvalue)⁴⁰, the hetbias p value (− hetbias_pval)⁴¹, and SNP p value (− snp_pval), were set to 0.05.

Summarizing genetic variation

To summarize genetic variation, we used Angsd to calculate pairwise differences between samples using the − IBS 1 option and a minimum minor allele frequency of 1% (− minMaf). Here pairwise distance between samples i and j (d_ij) is calculated as:

$${d}_{ij}=\frac{{\sum }_{m}^{M}1-{I}_{{b}_{j}}\left({b}_{i}\right)}{M}$$

where M is the total number of sites with at least 1 read from each individual, and 1 − I_bj(b_i) is the indicator function which is equal to one when the two individuals have the same base and zero otherwise³⁴. This distance matrix was used for hierarchical clustering and multidimensional scaling. We determined the optimal number of mixture model components for this distance matrix based on BIC using the Mclust package⁴². Admixture analysis was performed on genotype likelihoods output by Angsd using NGSadmix⁴³.

Genetic differentiation between populations based on SNP calls

We estimated F_ST for each pair of populations using Angsd^35,44. This method computes the posterior expectation of genetic variance between populations (designated A), and total expected variance (designated B). These values (A and B) are closely related to the alpha and beta estimates described in Reynolds et al.⁴⁵. The unweighted F_ST is computed as the mean of the per-site ratios of A and B and the weighted F_ST is computed as the ratio of the sum of As to the sum of Bs³⁴. The unweighted and weighted F_ST values reported in Table 2 and Fig. 4 are these estimates computed from all sites in the dataset. Based on communications that unweighted F_ST values tend to be too noisy³⁶ we focus on the weighted F_ST values.

Table 2 Estimates of pairwise genetic differentiation between caves.

Full size table

Pairwise F_ST and d_XY were also calculated for each pair of caves based on hard genotypes. We used VCFtools to calculate pairwise Weir and Cockerham’s F_ST⁴⁶, and a custom R script to calculate d_XY for unphased data as:

$${d}_{XY}=\sum _{ij}{x}_{i}{y}_{j}{k}_{ij}$$

where x_i and y_j are the frequencies of the ith allele from population X and the jth allele from population Y respectively, and k_ij is 1 when i and j differ, and 0 if they are the same⁴⁷. The Weir and Cockerham’s F_ST and the d_XY values reported in Table 2 and Fig. 4 are the averages of these statistics across all hard genotyped SNPs. We used our Weir and Cockerham’s F_ST values to estimate pairwise migration rates according to the following formula⁴⁷:

$$E\left({F}_{ST}\right)=\frac{1}{1+4{N}_{e}m}.$$

Nucleotide diversity

To calculate nucleotide diversity using genotype likelihoods, we first generated genotype likelihoods as described above. We then used the custom Python script HetMajorityProb.py^27,28 to remove sites where the heterozygosity rate appeared higher than 50%, as these were likely paralogs spuriously lumped as single loci. We then estimated nucleotide diversity from the folded site frequency spectra using Angsd^35,48. The value was then averaged across the 30 pseudochromsomes from the reference created during de novo locus generation described above. These averages are the values reported in Fig. 5.

We also calculated nucleotide diversity (π) from hard genotype calls. Here we first determined the allele frequencies for each species in each cave using VCFtools. We then calculated the expected heterozygosity (h) for each site as:

$$h=\frac{n}{n-1}\left(1-\sum {p}_{i}^{2}\right)$$

where n is the number of sequences, and p_i is the frequency of the ith allele at the site⁴⁷. We then calculated π as the sum of expected heterozygosities across sites:

$$\pi =\sum_{j=1}^{S}{h}_{j}$$

where S is the number of segregating sites and h_j is the expected heterozygosity of the site⁴⁷. We report this value per site by dividing by the total number of interrogated positions. Effective population size was estimated by dividing this per site value by four times the number of mutations per site per generation (4µ). The mutation rate estimate was based on Drosophila⁴⁹.

Results

Sequencing

Sequencing produced 236 million raw reads. After demultiplexing and PCR duplicate removal, a total of 65.8 million reads remained. Mean fold coverage per sample tended to be higher for P. hatchi (1.2 ± se 0.08 million reads) than N. barri (0.462 ± se 0.04 million reads). De novo locus generation produced 130,061 loci for N. barri and 264,486 loci for P. hatchi. Filtering of hard genotype calls from mpileup produced 7620 biallelic SNPs for N. barri and 12,879 for P. hatchi. Filtering of genotype likelihoods produced with Angsd identified 13,629 SNPs for N. barri and 31,000 for P. hatchi. Final sample sizes for each species are given in Table 1.

Delineating populations

For both species, the four caves harbored genetically distinct populations. Hierarchical clustering based on pairwise differences separated individuals by cave (Fig. 2a,d). Overall topology of clustering was the same for both species, with caves from the same watershed clustering together (Figs. 1, 2a,d). Multidimensional scaling produced similar results, with clear clustering by cave along the first three axes for N. barri (Fig. 2b,c) and the first two for P. hatchi (Fig. 2e,f). Both analyses indicated that SB and ST caves were more similar for N. barri than P. hatchi.

Admixture analysis further supported the caves as independent populations. For both species, the optimal number of mixture model components based on BIC was k = 4 or 5, with only marginal improvements in BIC for the 5th component (Fig. S2). When the number of ancestral populations (k) was set to four, ancestry estimates matched fully with source cave (Fig. 3). When k was set to 5, both species showed evidence of two groups within the largest cave, Buggy Top, rather than admixture between caves. Together, these analyses indicate that the four caves are indeed genetically distinct populations.

Genetic differentiation between caves

Genetic differentiation between caves was high. Pairwise weighted F_ST (Angsd) ranged from 0.33 to 0.52 for N. barri, and 0.3 to 0.36 for P. hatchi. Unweighted F_ST (Angsd) and Weir and Cockerham’s F_ST estimated from hard genotype calls were lower, but still considerable, with a minimum value of 0.12 (Table 2; Fig. 4).

Nucleotide diversity

In both species, estimates of nucleotide diversity based on genotype likelihoods indicated surprising levels of diversity that varied with cave length. For individual caves, per site nucleotide diversity (π) ranged from 1.17 × 10^–3 to 2.43 × 10^–3. For all but the longest cave (Buggytop Cave; BT), the beetle P. hatchi had higher nucleotide diversity than the spider N. barri (Table 3). While the sample size was small, we found that for both species, nucleotide diversity correlated positively with cave length (Fig. 5). In a multiple linear regression model of nucleotide diversity that included species, cave length, and cave name (R² = 0.92), only cave length was significant when controlling for the other independent variables (p < 0.05). Assuming a mutation rate of 2.8 × 10^–9 estimated for Drosophila⁴⁹, the effective population sizes based on the π estimates for P. hatchi ranged from 1.4 × 10⁵ in Solomon’s Temple to 2.2 × 10⁵ in Buggytop, with a range of 1.0 × 10⁵ to 2.3 × 10⁵ for N. barri (Table 3). We note that the actual mutation rate for these species is unknown and that the estimate for Drosophila serves only as a rough approximation. Hence the resulting Ne estimates are highly uncertain, and differences between species could be influenced by differences in mutation rate as well as demography. Nucleotide diversity estimates based on hard genotype calls were proportionally similar, but on average eightfold lower than from genotype likelihoods (Table 3). This likely resulted from greater stringency during filtering of variant sites from the hard genotype calls. Estimates of π from hard genotype calls were similarly positively associated with cave length (Fig. S3). Based on our estimates of Weir and Cockerham’s F_ST between caves (Fig. 4; Table 2), we computed pairwise migration rates in terms of effective population size as M = 4Ne x m. This statistic ranged from 1.23 to 4.12 for Nesticus (mean = 2.19) and from 2.20 to 5.48 (mean = 3.71) for Ptomaphagus. Hence, the maximum migration rate m for any pair of caves was estimated to be less than 1.5 × 10^–5. While our estimates of Ne are highly uncertain, this low value demonstrates that migration between caves is indeed highly restricted.

Table 3 Nucleotide diversity for each species and cave estimated using genotype likelihoods (Angsd) and from hard genotype calls (mpileup).

Full size table

Discussion

We used genome-wide genotyping to examine population structure of two troglobionts from the southern Cumberland Plateau in Tennessee. Despite relatively small distances between caves (no two caves were more than 15 km apart), we detected strong population structure for both species. Hierarchical clustering, PCA, and Admixture clearly identified each cave as a genetically distinct population (Figs. 2, 3). Pairwise estimates of genetic differentiation further supported these results, with a minimum weighted F_ST of 0.33 for N. barri and 0.30 for P. hatchi (Table 2), indicating “very great differentiation”⁵⁰. For comparison, a recent 2bRAD study on the coral Acropora millepora across the Great Barrier Reef, including sites located over 1200 km apart, detected a maximum pairwise F_ST of 0.014²⁷. Kamimura et al.⁵¹ reported a similar comparison. Using 16 microsatellite markers, they compared populations of cave-dwelling barklice (genus Neotrogla) from two Brazilian caves less than 1 km apart. While the number of individuals was small (3 individuals from one cave and 8 from another) they reported significant genetic differentiation, with F_ST = 0.043. They contrasted this with a similar F_ST value of 0.042, for populations of Drosophila americana located more than 200 km apart⁵². While it is difficult to directly compare F_ST between different types of markers, such as SNPs and microsatellites, these comparisons highlight the relatively strong genetic isolation between populations of terrestrial cave-dwelling species. The tendency of cave populations toward greater genetic differentiation was also shown in the Mexican blind cavefish complex (Astyanax mexicanus). Here, using 26 microsatellite loci, applied to 11 cave and 10 surface populations, Bradic et al.⁵³ detected generally greater differentiation between cave populations than surface populations, with F_ST ranging from 0.2 to 0.5 between cave populations and a maximum of 0.09 for surface populations. These findings extend the idea, even among aquatic vertebrates, that cave populations exchange migrants much less readily than surface populations. These diverse cases reiterate the historical understanding that highly localized distributions of subterranean organisms are shaped by limited dispersal capabilities⁵⁴.

Based on hierarchical clustering and admixture analyses (Figs. 2, 3), we expected to find greater genetic differentiation between caves located in different watersheds. This pattern was consistent for N. barri, but not for P. hatchi (Fig. 4). Although all the several genetic difference estimates were tightly correlated (Fig. S1), only absolute genetic distance (d_XY from hard genotype calls) was consistently lower within watersheds for P. hatchi.

Based on π estimates, the effective population sizes of both species were surprisingly large (Table 3), especially as caves are expected to harbor relatively small populations². Because Next-Generation sequencing reads originate from single molecules, they are subject to several sources of error such as DNA damage, PCR errors, and sequencing errors, which can inflate estimates of nucleotide diversity⁵⁵. De novo locus generation likely further contributes to this bias. Fortunately, these error sources can be reasonably expected to influence all samples similarly²². Hence, while we believe absolute estimates of Ne reported here are likely inflated, and should be considered cautiously, relative comparisons of diversity are still reliable. This idea is illustrated by the concurrent associations between cave length and π estimated using Angsd and from hard genotypes, despite the roughly eightfold difference between them in absolute terms (Table 3; Fig. 5; Figure S2).

Based on our results, we conclude that gene flow between caves is rare. This is consistent with the inability of terrestrial troglobionts to traverse even small distances between caves (Fig. 1). Hence the analogy of caves to islands in a sea of surface habitat holds for these species^2,12. It is thought that migration of troglobionts must occur via subterranean connections^2,5. One possibility for such connections would be Mesovoid Shallow Substratum (MSS). These intermediate habitats can form as crevices beneath streambeds of temporary watercourses (alluvial MSS) and can harbor both epigean and subterranean fauna²³. Consistent with this concept, hierarchical clustering of populations for both species paired caves by watershed, rather than physical distance (Figs. 1, 2). This possibly reflects greater frequency of rare subterranean connections, or more recent variance, between caves located in the same watershed. While the ranges of both our study species are limited, they still encompass dozens of cave populations across areas substantially larger than our study area^12,19. These numerous populations, coupled with the extreme genetic differentiation observed here, suggests that extremely rare migration into an unoccupied cave can establish a population that subsequently becomes genetically distinct. Rare migrants may still occasionally reach the cave, but at such a low frequency that the population remains genetically distinct from other caves just one or a few km distant.

For both species, we detected a positive association between nucleotide diversity and cave length (Fig. 5; Fig. S2). While the number of caves in our study was too small for confident statistical analysis, the strength of the correlation, and the similar patterns observed for the two different species, is an interesting trend consistent with the intuitive idea that larger caves harbor larger, more genetically diverse populations. Souza-Silva et al.⁵⁶ reported a similar pattern, linking caves’ invertebrate species richness with their linear development. One explanation for this pattern is greater availability of food, and/or microhabitats in longer caves⁵⁶. For instance, larger caves are known to harbor richer and more abundant bat populations⁵⁷, that would provide larger guano deposits⁵⁶. Another interesting possibility is that linear development increases caves’ exposure to small-scale interconnections, such as canaliculi⁵⁶, or other intermediate subterranean environments^5,23,24. This scenario would be particularly powerful in shaping richness if individuals in these intermediate connections were attracted into the larger caves by food sources such as guano⁵⁶. At the regional scale, the number of caves^14,58, rather than total karst area, is a significant predictor of troglobiont richness, indicating that while intermediate subterranean environments may be important for connectivity, caves are still a dominant factor shaping community diversity. The positive trend identified here (Fig. 5) is suggestive that similar processes may shape genetic diversity of individual troglobiont populations. Further application of genomic tools, as applied for the first time to the species in this study, will help shed light on how generally this pattern occurs.

Data availability

Demultiplexed reads for all samples are available on the NCBI SRA database (PRJNA601737). All scripts used for data processing, statistical analysis, and plotting figures, as well as intermediate data files, are available on github: https://github.com/grovesdixon/caveRAD. The permanent release of this repository linked to this publication is available here https://doi.org/10.5281/zenodo.4034777.

References

Rétaux, S. & Casane, D. Evolution of eye development in the darkness of caves: adaptation, drift, or both?. EvoDevo 4, 1–12 (2013).
Article Google Scholar
Culver, D. C. & Pipan, T. The Biology of Caves and Other Subterranean Habitats (Oxford University Press, Oxford, 2019).
Book Google Scholar
Poulson, T. L. & White, W. B. The cave environment. Science (80–) 165, 971–981 (1969).
Article ADS CAS Google Scholar
Peck, S. B. Evolution of adult morphology and life-history characters in cavernicolous Ptomaphagus beetles. Evolution (N. Y). 40, 1021–1030 (1986).
Google Scholar
Trontelj, P., Borko, Š & Delić, T. Testing the uniqueness of deep terrestrial life. Sci. Rep. 9, 1–9 (2019).
Article CAS Google Scholar
Polo-Cavia, N. & Gomez-Mestre, I. Pigmentation plasticity enhances crypsis in larval newts: associated metabolic cost and background choice behaviour. Sci. Rep. 7, 1–10 (2017).
Article CAS Google Scholar
Cook, L. M. & Saccheri, I. J. The peppered moth and industrial melanism: evolution of a natural selection case study. Heredity (Edinb.). 110, 207–212 (2013).
Article CAS PubMed Google Scholar
Stevens, M. & Merilaita, S. Animal camouflage: current issues and new perspectives. Philos. Trans. R. Soc. B. Biol. Sci. 364, 423–427 (2009).
Article Google Scholar
Culver, D. C., Master, L. L., Christman, M. C. & Hobbs, H. H. Obligate cave fauna of the 48 contiguous United States. Conserv. Biol. 14, 386–401 (2000).
Article Google Scholar
Niemiller, M. L. & Zigler, K. S. Patterns of cave biodiversity and endemism in the Appalachians and Interior Plateau of Tennessee, USA. PLoS ONE 8, e64177 (2013).
Article ADS PubMed PubMed Central Google Scholar
Christman, M. C., Culver, D. C., Madden, M. K. & White, D. Patterns of endemism of the eastern North American cave fauna. J. Biogeogr. 32, 1441–1452 (2005).
Article Google Scholar
Snowman, C. V., Zigler, K. S. & Hedin, M. Caves as islands: mitochondrial phylogeography of the cave-obligate spider species Nesticus barri (Araneae: Nesticidae). J. Arachnol. 38, 49–56 (2010).
Article Google Scholar
Dixon, G. B. & Zigler, K. S. Cave-obligate biodiversity on the campus of Sewanee: The University of the South, Franklin County, Tennessee. Northeast. Nat. 10, 251–266 (2011).
Google Scholar
Christman, M. C. & Culver, D. C. The relationship between cave biodiversity and available habitat. J. Biogeogr. 28, 367–380 (2001).
Article Google Scholar
Zigler, K. S., Niemiller, M. L. & Fenolio, D. B. Cave Biodiversity of the Southern Cumberland Plateau. In 2014 National Speleological Society Convention Guidebook, 159–163 (National Speleological Society, 2014).
Hedin, M. C. Speciational history in a diverse clade of habitat-specialized spiders (Araneae: Nesticidae: Nesticus): inferences from geographic-based sampling. Evolution (N. Y.). 51, 1929–1945 (1997).
Google Scholar
Hedin, M. & Dellinger, B. Descriptions of a new species and previously unknown males of Nesticus (Araneae: Nesticidae) from caves in Eastern North America, with comments on species rarity. Zootaxa 19, 1–19 (2005).
Article Google Scholar
Carver, L. M., Perlaky, P., Cressler, A. & Zigler, K. S. Reproductive seasonality in Nesticus (Araneae: Nesticidae) cave spiders. PLoS ONE 11, 7–8 (2016).
Article CAS Google Scholar
Leray, V. L., Caravas, J., Friedrich, M. & Zigler, K. S. Mitochondrial sequence data indicate “Vicariance by Erosion” as a mechanism of species diversification in North American Ptomaphagus (Coleoptera, Leiodidae, Cholevinae ) cave beetles. 57, 35–57 (2019).
Wang, S., Meyer, E., McKay, J. K. & Matz, M. V. 2b-RAD: a simple and flexible method for genome-wide genotyping. Nat. Methods 9, 808–810 (2012).
Article CAS PubMed Google Scholar
Rokas, A. & Abbot, P. Harnessing genomics for evolutionary insights. Trends Ecol. Evol. 24, 192–200 (2009).
Article PubMed Google Scholar
Nunziata, S. O. & Weisrock, D. W. Estimation of contemporary effective population size and population declines using RAD sequence data. Heredity 120, 196–207. https://doi.org/10.1038/s41437-017-0037-y (2018).
Article CAS PubMed Google Scholar
Ortuño, V. M. et al. The ‘alluvial mesovoid shallow substratum’, a new subterranean habitat. PLoS ONE 8, 1–16 (2013).
Article CAS Google Scholar
Mammola, S. et al. Ecology and sampling techniques of an understudied subterranean habitat: the Milieu Souterrain Superficiel (MSS). Naturwissenschaften 103, 88 (2016).
Article PubMed CAS Google Scholar
Wakefield, K. R. & Zigler, K. S. Obligate subterranean fauna of Carter State Natural Area, Franklin County, Tennessee. Speleobiol. Notes 4, 24–28 (2012).
Google Scholar
Dixon, G. B. et al. Genomic determinants of coral heat tolerance across latitudes. Science (80–). 348, 1460–1462 (2015).
Article ADS CAS Google Scholar
Matz, M. V., Treml, E. A., Aglyamova, G. V. & Bay, L. K. Potential and limits for rapid genetic adaptation to warming in a Great Barrier Reef coral. PLoS Genet. 14, 1–19 (2018).
Article CAS Google Scholar
Matz, M. V. 2bRAD_denovo git repository. https://github.com/z0on/2bRAD_denovo (2019). Accessed 12 September 2020.
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Article CAS PubMed Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Nielsen, R., Korneliussen, T., Albrechtsen, A., Li, Y. & Wang, J. SNP calling, genotype calling, and sample allele frequency estimation from new-generation sequencing data. PLoS ONE 7, e37558 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Article CAS PubMed PubMed Central Google Scholar
Korneliussen, T. S. ANGSD web page. https://www.popgen.dk/angsd/index.php/ANGSD (2013). Accessed 12 September 2020.
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: analysis of next generation sequencing data. BMC Bioinf. 15, 1–13 (2014).
Article Google Scholar
ANGSD. angsd git repository. https://github.com/ANGSD/angsd (2014). Accessed 12 September 2020.
Dixon, G. caveRAD git repository. https://github.com/grovesdixon/caveRAD (2019). Accessed 12 September 2020.
Dixon, G., Kitano, J. & Kirkpatrick, M. The origin of a new sex chromosome by introgression between two stickleback fishes. Mol. Biol. Evol. 36, 28–38 (2019).
Article CAS PubMed Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article CAS PubMed PubMed Central Google Scholar
Guo, Y. et al. The effect of strand bias in Illumina short-read sequencing data. BMC Genom. 13, 1–11 (2012).
Article CAS Google Scholar
Vieira, F. G., Fumagalli, M., Albrechtsen, A. & Nielsen, R. Estimating inbreeding coefficients from NGS data: impact on genotype calling and allele frequency estimation. Genome Res. 23, 1852–1861 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fraley, C. & Raftery, A. E. Model-based methods of classification: using the mclust software in chemometrics. J. Stat. Softw. 18, 1–13 (2007).
Article Google Scholar
Skotte, L., Korneliussen, T. S. & Albrechtsen, A. Estimating individual admixture proportions from next generation sequencing data. Genetics 195, 693–702 (2013).
Article CAS PubMed PubMed Central Google Scholar
Fumagalli, M. et al. Quantifying population genetic differentiation from next-generation sequencing data. Genetics 195, 979–992 (2013).
Article PubMed PubMed Central Google Scholar
Reynolds, J., Weir, B. S. & Cockerham, C. C. Estimation of the coancestry coefficient: basis for a short-term genetic distance. Genet. Soc. Am. 105, 767–779 (1983).
Article CAS Google Scholar
Weir, B. S. & Cockerham, C. C. Estimating F-statistics for the analysis of population structure. Evolution (N. Y.). 38, 1358–1370 (1984).
CAS Google Scholar
Hahn, M. W. Molecular Population Genetics (Oxford University Press, Oxford, 2018).
Google Scholar
Korneliussen, T. S., Moltke, I., Albrechtsen, A. & Nielsen, R. Calculation of Tajima’s D and other neutrality test statistics from low depth next-generation sequencing data. BMC Bioinf. 14, 1–14 (2013).
Keightley, P. D., Ness, R. W., Halligan, D. L. & Haddrill, P. R. Estimation of the spontaneous mutation rate per nucleotide site in a Drosophila melanogaster full-sib family. Genetics 196, 313–320 (2014).
Article CAS PubMed Google Scholar
Wright, S. Variability Within and Among Natural Populations (University of Chicago Press, Chicago, 1978).
Google Scholar
Kamimura, Y., Abe, J., Ferreira, R. L. & Yoshizawa, K. Microsatellite markers developed using a next-generation sequencing technique for Neotrogla spp. (Psocodea: Prionoglarididae), cave dwelling insects with sex-reversed genitalia. Entomol. Sci. 22, 48–55 (2019).
Article Google Scholar
Schäfer, M. A., Orsini, L., McAllister, B. F. & Schlötterer, C. Patterns of microsatellite variation through a transition zone of a chromosomal cline in Drosophila americana. Heredity (Edinb). 97, 291–295 (2006).
Article PubMed CAS Google Scholar
Bradic, M., Beerli, P., García-De Leán, F. J., Esquivel-Bobadilla, S. & Borowsky, R. L. Gene flow and population structure in the Mexican blind cavefish complex (Astyanax mexicanus). BMC Evol. Biol. 12, 9 (2012).
Article PubMed PubMed Central Google Scholar
Juan, C., Guzik, M. T., Jaume, D. & Cooper, S. J. B. Evolution in caves: Darwin’s ‘wrecks of ancient life’ in the molecular era. Mol. Ecol. 19, 3865–3880 (2010).
Article PubMed Google Scholar
Pool, J. E., Hellmann, I., Jensen, J. D. & Nielsen, R. Population genetic inference from genomic sequence variation. Genome Res. 20, 291–300 (2010).
Article CAS PubMed PubMed Central Google Scholar
Silva, M. S., Martins, R. P. & Ferreira, R. L. Cave lithology determining the structure of the invertebrate communities in the Brazilian Atlantic Rain Forest. Biodivers. Conserv. 20, 1713–1729 (2011).
Article Google Scholar
Brunet, A. K. & Medellín, R. A. The species-area relationship in bat assemblages of tropical caves. J. Mammal. 82, 1114–1122 (2001).
Article Google Scholar
Culver, D. C., Christman, M. C., Elliott, W. R., Hobbs, H. H. & Reddell, J. R. The North American obligate cave fauna: regional patterns. Biodivers. Conserv. 12, 441–468 (2003).
Article Google Scholar

Download references

Acknowledgements

This study was supported by the National Science Foundation Grant IOS-1755277 to Mikhail Matz. Data analysis was performed with the help of the Texas Advanced Computing Center. We thank Mikhail Matz for support as well as assistance with analysis and composition.

Author information

Authors and Affiliations

Department of Integrative Biology, University of Texas, PAT Building Room 427, 2401 Speedway, Austin, TX, USA
Andras Balogh & Groves Dixon
Department of Biology, University of the South, Sewanee, TN, USA
Lam Ngo & Kirk S. Zigler

Authors

Andras Balogh
View author publications
You can also search for this author in PubMed Google Scholar
Lam Ngo
View author publications
You can also search for this author in PubMed Google Scholar
Kirk S. Zigler
View author publications
You can also search for this author in PubMed Google Scholar
Groves Dixon
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.D. and K.S.Z developed the project. L.N. and K.S.Z. conducted the field work. A.B. and G.D. conducted the lab work and data analyses. A.B. and G.D. drafted the manuscript. All authors reviewed, corrected, and approved the final version.

Corresponding author

Correspondence to Groves Dixon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Balogh, A., Ngo, L., Zigler, K.S. et al. Population genomics in two cave-obligate invertebrates confirms extremely limited dispersal between caves. Sci Rep 10, 17554 (2020). https://doi.org/10.1038/s41598-020-74508-9

Download citation

Received: 16 July 2020
Accepted: 28 September 2020
Published: 16 October 2020
DOI: https://doi.org/10.1038/s41598-020-74508-9

This article is cited by

Genomic data identify genetic structure in Enoploctenus cyclothorax (Araneae: Ctenidae), revealing two distinct taxonomic units in the southern region of the Brazilian Atlantic Forest
- Mariana Costa Terra
- Antonio Domingos Brescovit
- Renata da Rosa
Conservation Genetics (2024)
Exploratory behaviour divergence between surface populations, cave colonists and a cave population in the water louse, Asellus aquaticus
- Gergely Horváth
- Kata Kerekes
- Gábor Herczeg
Behavioral Ecology and Sociobiology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Parallel and convergent genomic changes underlie independent subterranean colonization across beetles

Population expansion, divergence, and persistence in Western Fence Lizards (Sceloporus occidentalis) at the northern extreme of their distributional range

Life on a beach leads to phenotypic divergence despite gene flow for an island lizard

Introduction

Materials and methods

Sampling

Library preparation

Data processing

Genotype analyses

Summarizing genetic variation

Genetic differentiation between populations based on SNP calls

Nucleotide diversity

Results

Sequencing

Delineating populations

Genetic differentiation between caves

Nucleotide diversity

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Figures.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Genomic data identify genetic structure in Enoploctenus cyclothorax (Araneae: Ctenidae), revealing two distinct taxonomic units in the southern region of the Brazilian Atlantic Forest

Exploratory behaviour divergence between surface populations, cave colonists and a cave population in the water louse, Asellus aquaticus

Comments

Search

Quick links