Main

Ancient DNA research has traced the principal ancestors of early European farmers to highly similar Neolithic populations of Greece and western Anatolia, beginning in the seventh millennium bc (refs 1, 2); however, the later history of these regions down to the Bronze Age, a transformational period in the history of Eurasia4,6,9, is less clear. There is limited genetic evidence suggesting migrations from both the east (the area of Iran and the Caucasus), reaching Anatolia by at least ~3800 bc (ref. 4), and the north (eastern Europe and Siberia) contributing ‘Ancient North Eurasian’ ancestry6,10 to all modern Europeans. The timing and impact of these migrations in the Aegean is, however, unknown.

During the Bronze Age, two prominent archaeological cultures emerged in the Aegean. The culture of the island of Crete, sometimes referred to as ‘Minoan’11, was Europe’s first literate civilization, and has been described as ‘Europe’s first major experience of civilization’12. However, the Linear A syllabic ideographic and Cretan hieroglyphic scripts used by this culture remain undeciphered, obscuring its origins. Equally important was the civilization of the ‘Mycenaean’ culture of mainland Greece, whose language, written in the Linear B script, was an early form of Greek13. Cretan influence in mainland Greece and the later Mycenaean occupation of Crete link these two archaeological cultures, but the degree of genetic affinity between mainland and Cretan populations is unknown. Greek is related to other Indo-European languages, leading to diverse theories tracing its earliest speakers from the seventh millennium down to ~1600 bc, and proposing varying degrees of population change (Supplementary Information section 1).

Genome-wide ancient DNA data provide a new source of information about the people of the Bronze Age, who were first known through the ancient poetic and historical traditions starting with Homer and Herodotus, later through the disciplines of archaeology and linguistics, and, more recently, by the limited information from ancient mitochondrial DNA14,15. Here we answer several questions. First, do the labels ‘Minoan’ and ‘Mycenaean’ correspond to genetically coherent populations or do they obscure a more complex structure of the peoples who inhabited Crete and mainland Greece at this time? Second, how were the two groups related to each other, to their neighbours across the Aegean in Anatolia, and to other ancient populations from Europe1,2,6,8,9,10 and the Near East2,3,4,5,9,16,17? Third, can inferences about their ancestral origins inform debates about the origins of their cultures? Fourth, how are the Minoans and Mycenaeans related to Modern Greeks, who inhabit the same area today?

We generated genome-wide data from 19 ancient individuals (Fig. 1a, Extended Data Table 1 and Supplementary Information section 1). These comprised ten Minoans from Crete (approximately 2900–1700 bc; labelled Minoan_Odigitria, from Moni Odigitria near the southern coast of central Crete; and Minoan_Lasithi, from the cave of Hagios Charalambos in the highland plain of Lasithi in east Crete). Four Mycenaeans were included from mainland Greece (approximately 1700–1200 bc; from the western coast of the Peloponnese, from Argolis, and the island of Salamis). An additional individual from Armenoi in western Crete (approximately 1370–1340 bc; labelled Crete_Armenoi) postdated the appearance of Mycenaean culture on the island. Our dataset also included a Neolithic sample from Alepotrypa Cave at Diros Bay in the southern Peloponnese (about 5400 bc), adding to previously published samples from northern Greece2 (collectively labelled Greece_N). Finally, it included three Bronze Age individuals (approximately 2800–1800 bc; labelled Anatolia_BA) from Harmanören Göndürle in southwestern Anatolia (Turkey), adding knowledge about genetic variation in Anatolia after the Neolithic/Chalcolithic periods1,2,4,17 (Supplementary Information section 1). We processed the ancient remains, extracted DNA, and prepared Illumina libraries in dedicated clean rooms (Methods and Supplementary Table 1), and, after initial screening for mitochondrial DNA, used in-solution hybridization18 to capture ~1.2 million single nucleotide polymorphisms (SNPs)6,19 on the ancient samples. We assessed contamination by examining the rate at which they matched the mitochondrial consensus sequence (Supplementary Table 2) and the rate at which male samples were heterozygous on the X chromosome (Methods). We combined the dataset of the 19 ancient individuals with 332 other ancient individuals from the literature, 2,614 present-day humans genotyped on the Human Origins array, and 2 present-day Cretans (Methods).

Figure 1: Samples and PCA.
figure 1

a, Geographical locations of newly reported ancient data. Lines point to sampling locations; jitter is added to show the number of sampled individuals per location. b, Three hundred and thirty-four ancient individuals projected onto the first 2 principal components computed on a sample of 1,029 present-day West Eurasians4,5,10,31, including 30 Modern Greek samples from Greece and Cyprus. For abbreviations of population names, see Methods.

PowerPoint slide

We performed principal component analysis (PCA)20 (Methods), projecting ancient samples onto the first two principal components inferred from present-day West Eurasian populations10 that form two south–north parallel clines in Europe and the Near East along principal component 2. Minoans and Mycenaeans were centrally positioned in the PCA (Fig. 1b), framed to the left by ancient populations from mainland Europe and the Eurasian steppe, to the right by ancient populations from the Caucasus and Western Asia, and to the bottom by Early/Middle Neolithic farmers from Europe and Anatolia. The Neolithic samples from Greece clustered with these farmers and were distinct from the Minoans and Mycenaeans. The Bronze Age individuals from southwestern Anatolia were also distinct, intermediate between Anatolian and Levantine populations towards the bottom, and populations from Armenia, Iran, and the Caucasus towards the top. ADMIXTURE analysis (Methods and Extended Data Fig. 1) showed that Minoans and Mycenaeans both possessed a ‘pink’ genetic component (K = 8 and greater) shared with Bronze Age southwestern Anatolians, Neolithic Central Anatolians from Tepecik-Çiftlik17, a Chalcolithic northwestern Anatolian1, and western Anatolians from Kumtepe16. This component was maximized in the Mesolithic/Neolithic samples from Iran4,5 and hunter–gatherers from the Caucasus3 (Extended Data Fig. 1). It was not found in the Neolithic of northwestern Anatolia, Greece, or the Early/Middle Neolithic populations of the rest of Europe, only appearing in the populations of the Late Neolithic/Bronze Age in mainland Europe6, introduced there by migration from the Eurasian steppe1,6.

Beyond the visual impressions of PCA and ADMIXTURE, we formally tested the relationships among populations from our study and the literature, using f4-statistics of the form f4(X, Y; Test, Chimp) that evaluated whether Test shared more alleles with X or Y. We found that Test populations from Iran, the Caucasus, and eastern Europe shared more alleles with Minoans and Mycenaeans than with the Neolithic population of Greece (Extended Data Fig. 2a, b). The Minoans from the Lasithi plateau in the highlands of eastern Crete and from the coast of southern Crete (Extended Data Fig. 2c) were consistent with being a homogeneous population. Mycenaeans differed from these Minoans in sharing significantly fewer alleles with Neolithic people from the Levant, Anatolia, Greece, and mainland Europe (Extended Data Fig. 2d). In comparison, the Bronze Age Anatolians shared fewer alleles with ancient Europeans and more with ancient populations of Iran and the Levant (Extended Data Fig. 3). We used f3-statistics of the form f3(Ref1, Ref2; Test) that, if negative, showed that Test was admixed from sources related to the Ref1, Ref2 source populations. We did not find significantly negative (Ref1, Ref2) pairs for Minoans or Bronze Age Anatolians (z > −2.5), but did for Mycenaeans (−4.9 < z < −3.0; Extended Data Fig. 4), involving early farmers from the Levant, Anatolia, Greece, and the rest of Europe as one source, and Iran or the Eurasian steppe or steppe-influenced Europeans as the other.

We modelled Bronze Age populations using the qpAdm/qpWave6 framework (Methods and Supplementary Information section 2), which relates a set of ‘left’ populations (admixed population and ancestral source populations) with a set of ‘right’ populations (diverse outgroups) and allows testing for the number of streams of ancestry from ‘right’ to ‘left’ and estimation of admixture proportions. This analysis showed that all Bronze Age populations from the Aegean and Anatolia are consistent with deriving most (approximately 62–86%) of their ancestry from an Anatolian Neolithic-related population (Table 1). However, they also had a component (approximately 9–32%) of ‘eastern’ (Caucasus/Iran-related) ancestry. It was previously shown that this type of ancestry was introduced into mainland Europe via Bronze Age pastoralists from the Eurasian steppe, who were a mix of both eastern European hunter–gatherers and populations from the Caucasus and Iran4,6; our results show that it also arrived on its own, at least in the Minoans, without eastern European hunter–gatherer ancestry. This ancestry need not have arrived from regions east of Anatolia, as it was already present during the Neolithic in central Anatolia at Tepecik-Çiftlik17 (Supplementary Information section 2). The eastern influence in the Bronze Age populations from Greece and southwestern Anatolia is also supported by an analysis of their Y chromosomes. Four out of five males belonging to Minoans, Mycenaeans, and southwestern Anatolians (Supplementary Information section 3) belonged to haplogroup J, which was rare or non-existent in earlier populations from Greece and western Anatolia who were dominated by Y-chromosome haplogroup G2 (refs 1, 2, 17). Haplogroup J was present in Caucasus hunter–gatherers3 and a Mesolithic individual from Iran4, and its spread westwards may have accompanied the ‘eastern’ genome-wide influence.

Table 1 Admixture modelling of Bronze Age populations

The Minoans could be modelled as a mixture of the Anatolia Neolithic-related substratum with additional ‘eastern’ ancestry, but the other two groups had additional ancestry: the Mycenaeans had approximately 4–16% ancestry from a ‘northern’ ultimate source related to the hunter–gatherers of eastern Europe and Siberia (Table 1), while the Bronze Age southwestern Anatolians may have had ~6% ancestry related to Neolithic Levantine populations. The elite Mycenaean individual from the ‘royal’ tomb at Peristeria in the western Peloponnese did not differ genetically from the other three Mycenaean individuals buried in common graves. To identify more proximate sources of the distinctive eastern European/north Eurasian-related ancestry in Mycenaeans, we included later populations as candidate sources (Supplementary Information section 2), and could model Mycenaeans as a mixture of the Anatolian Neolithic and Chalcolithic-to-Bronze Age populations from Armenia (Table 1). Populations from Armenia possessed some ancestry related to eastern European hunter–gatherers4, so they, or similar unsampled populations of western Asia, could have contributed it to populations of the Aegean. This model makes geographical sense, since a population movement from the vicinity of Armenia could have admixed with Anatolian Neolithic-related farmers on either side of the Aegean. However, Mycenaeans can also be modelled as a mixture of Minoans and Bronze Age steppe populations (Table 1 and Supplementary Information section 2), suggesting that, alternatively, ‘eastern’ ancestry arrived in both Crete and mainland Greece, followed by about 13–18% admixture with a ‘northern’ steppe population in mainland Greece only. Such a scenario is also plausible: first, it provides a genetic correlate for the distribution of shared toponyms in Crete, mainland Greece, and Anatolia discovered in ref. 21; second, it postulates a single migration from the east; third, it proposes some gene flow from geographically contiguous areas to the north where steppe ancestry was present since at least the mid-third millennium bc (refs 6, 9). We validated inferences from qpAdm by treating source populations as ‘ghosts’ and re-estimating mixture proportions4, by examining the correspondence between qpAdm estimates and PCA4 (Extended Data Fig. 5), and by comparing simulated individuals of known ancestry against the Mycenaeans (Extended Data Fig. 6).

Geographical structure may have prevented the spread of the ‘northern’ ancestry from the mainland to Crete, contributing to genetic differentiation. Such a structure may, in principle, be long-standing, even before the advent of the Neolithic in the seventh millennium bc. Alternatively, both ‘northern’ and ‘eastern’ ancestry may have arrived in the Aegean at any time between the Early Neolithic and the Late Bronze Age. Wider geographical and temporal sampling of pre-Bronze Age populations of the Aegean may better trace the advent of ‘northern’ and ‘eastern’ ancestry in the region. However, sampled Neolithic samples from Greece, down to the Final Neolithic ~4100 bc (ref. 2), do not possess either type of ancestry, suggesting that the admixture we detect probably occurred during the fourth to second millennium bc time window. Other proposed migrations, such as settlement by Egyptian or Phoenician colonists22, are not discernible in our data, as there is no measurable Levantine or African influence in the Minoans and Mycenaeans, thus rejecting the hypothesis that the cultures of the Aegean were seeded by migrants from the old civilizations of these regions. On the other hand, migrants from areas east or north of the Aegean, while numerically less influential than the locals, may have contributed to the emergence of the third to second millennium bc Bronze Age cultures as ‘creative disruptors’ of local traditions, bearers of innovations, or through cultural interaction with the locals, coinciding with the genetic process of admixture23. Relative ancestral contributions do not determine the relative roles in the rise of civilization of the different ancestral populations; nonetheless, the strong persistence of the Neolithic substratum does suggest a key role for the locals in this process.

Phenotype prediction from genetic data has enabled the reconstruction of the appearance of ancient Europeans1,24 who left no visual record of their pigmentation. By contrast, the appearance of the Bronze Age people of the Aegean has been preserved in colourful frescos and pottery, depicting people with mostly dark hair and eyes25. We used the HIrisPlex26 tool (Supplementary Information section 4) to infer that the appearance of our ancient samples matched the visual representations (Extended Data Table 2), suggesting that art of this period reproduced phenotypes naturalistically.

We estimated the fixation index, FST, of Bronze Age populations with present-day West Eurasians, finding that Mycenaeans were least differentiated from populations from Greece, Cyprus, Albania, and Italy (Fig. 2), part of a general pattern in which Bronze Age populations broadly resembled present-day inhabitants from the same region (Extended Data Fig. 7). Modern Greeks occupy the intermediate space of the PCA along principal component 1 (Fig. 1b) between ancient European and Near Eastern populations, such as those of the Bronze Age. They are not, however, identical to Bronze Age populations, as they are above them along principal component 2 (Fig. 1b). This is because Neolithic farmers shared fewer alleles with Modern Greeks than with Mycenaeans (Extended Data Fig. 8), consistent with additional later admixture27,28.

Figure 2: Genetic differentiation of Bronze Age populations to present-day populations.
figure 2

The FST inbreeding coefficient (Methods) between newly reported populations and present-day West Eurasian populations. This shows a pattern of genetic affinity between Bronze Age and present-day populations from the corresponding broad geographical regions: a, Mycenaeans; b, Minoans from Hagios Charalambos (Lasithi regional unit); c, Minoans from Moni Odigitria (Herakleion regional unit); d, southwestern Bronze Age (BA) Anatolians. The same pattern also applies to Bronze Age populations from other regions of West Eurasia (Extended Data Fig. 5).

PowerPoint slide

The Minoans and Mycenaeans, sampled from different sites in Crete and mainland Greece, were homogeneous, supporting the genetic coherency of these two groups. Differences between them were modest, viewed against their broad overall similarity to each other and to the southwestern Anatolians, sharing in both the ‘local’ Anatolian Neolithic-like farmer ancestry and the ‘eastern’ Caucasus-related admixture. Two key questions remain to be addressed by future studies. First, when did the common ‘eastern’ ancestry of both Minoans and Mycenaeans arrive in the Aegean? Second, is the ‘northern’ ancestry in Mycenaeans due to sporadic infiltration of Greece, or to a rapid migration as in Central Europe6? Such a migration would support the idea that proto-Greek speakers29 formed the southern wing of a steppe intrusion of Indo-European speakers. Yet, the absence of ‘northern’ ancestry in the Bronze Age samples from Pisidia, where Indo-European languages were attested in antiquity, casts doubt on this genetic–linguistic association, with further sampling of ancient Anatolian speakers needed. Whatever the answer to these questions, the discovery of at least two migration events into the Aegean in addition to the first farming dispersal before the Bronze Age, and of additional population change since that time, supports the view that the Greeks did not emerge fully formed from the depths of prehistory, but were, indeed, a people ‘ever in the process of becoming’30.

Methods

No statistical methods were used to predetermine sample size. The experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.

Ancient DNA

An overview of which steps in processing the ancient samples were undertaken in which laboratory is provided in Supplementary Table 1.

Dublin, Ireland

The inner ear area of each petrous bone was identified, isolated, then ground to a fine powder. Cleaning and isolation of the cochlea was performed using aluminium oxide powder in a sandblasting chamber. Once isolated, it was decontaminated by ultraviolet irradiation for 7.5 min on each side, ground on a mixer mill to a weight of about 50 mg, and finally transferred to a sterile Eppendorf tube. All procedures were conducted in clean and dedicated ancient DNA facilities.

Seattle, Washington, USA

Teeth processed in this laboratory were decontaminated and pulverized to powder in clean and dedicated ancient DNA facilities following previously published methods11.

Leipzig, Germany

As previously described32, sampling, extraction, and preparation of double-indexed, double-stranded libraries took place in the clean room facilities of the Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany (MPI-EVA), followed by enrichment of human mitochondrial DNA33. Enriched libraries were sequenced on an Illumina GAIIx platform for 2 × 76 + 7 cycles and the resulting data were mapped to the revised Cambridge Reference Sequence using the EAGER pipeline to evaluate DNA preservation (Supplementary Table 2). These libraries were then shipped to Boston, Massachusetts, USA, where nuclear target enrichment was performed (see below).

Tübingen, Germany

Pre-PCR steps took place in the clean room facilities of the Institute for Archaeological Sciences at the University of Tübingen, Germany. After surface irradiation with ultraviolet light, the tooth was sawn apart transversally at the border of crown and root, and dentine powder from the inside the crown was sampled using a sterile dentistry drill. Extraction, library preparation, and enrichment of human mitochondrial DNA used the same protocols as described for MPI-EVA, with the addition of an updated extraction protocol34. Sequencing of shotgun and mitochondrial-DNA-enriched libraries took place at the facilities of the Frauenklinik of the University of Tübingen, on an Illumina MiSeq for 2 × 150 + 8 cycles or on an Illumina HiSeq 2500 for 2 × 101 + 8 cycles (Supplementary Table 2).

Additional libraries were produced including full or partial35 repair with uracil-DNA glycosylase and endonuclease VIII to remove deaminated bases. In-solution enrichment was performed using previously reported protocols6,18. Two SNP sets of 394,577 SNPs (‘390k capture’6) or 1,237,207 SNPs (‘1240k capture’1) were targeted. Sequencing took place in the facilities of the Frauenklinik, University of Tübingen, on an Illumina HiSeq 2500 for 2 × 101 + 8 cycles and at the facilities of the University of Kiel on a HiSeq 4000 for 2 × 150 + 8 cycles. One uracil-DNA glycosylase-treated library (I0071) was sent to Boston, Massachusetts, for nuclear target enrichment (see below).

Boston, Massachusetts, USA

The bone powders, prepared from petrous bones in Dublin, Ireland, were sent to Boston, where DNA extractions and barcoded library preparations without uracil removal were performed in the Harvard Medical School cleanroom following previously described protocols34,35,36. At the screening stage, libraries were (1) shotgun sequenced and (2) sequenced after enriching for the human mitochondrial DNA37 together with some nuclear loci to approximate the nuclear coverage and mitochondrial contamination.

All four libraries (barcoded) prepared in Boston, three libraries (indexed) prepared in Leipzig, and one library (indexed) prepared in Tübingen, were used to perform 390k (ref. 6) and 840k (ref. 19) or 1240k (=390k + 840k) targeted capture of a total of 1,233,013 SNPs, following the in-solution target enrichment protocol in ref. 18 and sequenced either on an Illumina HiSeq 2500 or an Illumina NextSeq 500 (see Supplementary Table 1 for details).

For each sample, each SNP position was represented by a randomly chosen sequence, restricting to those with a minimum mapping quality (MAPQ ≥ 10), sites with a minimum sequence quality (≥20), and removing two bases at the ends of reads4.

Testing for contamination

Modern human contamination of the mitochondrial DNA was assessed using the software schmutzi38, which took into account that the consensus sequence should be reconstructed from reads showing characteristics of ancient DNA and originating from a single individual (Supplementary Table 2). We assessed contamination by examining heterozygosity on the X chromosome in five males (possessing only one copy of the X chromosome) using ANGSD39 (Supplementary Information section 3); this was in the range 0.3–4%. Indirect evidence that the females in our dataset (for which X-chromosome-based contamination estimation was impossible) were authentic was furnished by their clustering with male samples and distinctiveness from present-day Greek or central European populations that may have possibly contaminated them (Fig. 1b). We also computed f4-statistics of the form f4(Males, Females; Test, Chimp) for populations that had both male and female individuals for all ancient or present-day Test populations in our dataset. If female samples were substantially contaminated from a source related to Test, these statistics would be significantly negative; however, we found that the Z-score of these statistics was −1.6 < Z < 2.5. We thus included both male and female samples in our analysis to maximize sample size instead of restricting it to damaged molecules for females8.

Modern human data

We used a dataset of 2,614 individuals genotyped on the Affymetrix Human Origins array4,5,10,31, including 28 Modern Greek (from Greece and Cyprus) samples previously described10. We also included data from two Modern Greeks from Crete whose whole-genome sequences were published as part of the Simons Genome Diversity Project40. We also analysed Modern Greek data from Thessaly and Central Greece41 and diverse regions27,42 genotyped on Illumina arrays.

Datasets

We analysed two datasets: HO, which includes the Affymetrix Human Origins genotyping data together with 351 ancient humans (including samples from the literature1,2,3,4,5,7,8,9,10,16,17,43,44,45,46,47,48,49,50,51 and the newly reported data) on 591,642 autosomal SNPs; and the HOIll dataset, which does not include the Human Origins data, but has a larger number of 1,054,671 autosomal SNPs4. We did not use previously performed genotype calls of data from the literature, but re-processed them, beginning with the original data release format (FASTQ or BAM). The main analysis dataset was HOIll, except for analyses that included modern populations, in which case the HO dataset was analysed. For the analysis of Illumina genotype data of Modern Greeks (Extended Data Fig. 6), a total of 489,148 autosomal SNPs were analysed.

Abbreviations used

For brevity, we used the following abbreviations in population names, following the convention of ref. 4: CHG, Caucasus hunter–gatherers; EHG, Eastern European hunter–gatherers; WHG, Western European hunter–gatherers; SHG, Scandinavian hunter–gatherers; N, Neolithic; EN, Early Neolithic; MN, Middle Neolithic; ChL, Chalcolithic; LNBA, Late Neolithic/Bronze Age; BA, Bronze Age; EBA, Early Bronze Age; EMBA, Early/Middle Bronze Age; MLBA, Middle/Late Bronze Age; IA, Iron Age.

PCA

PCA was performed in the smartpca program of EIGENSOFT20, using default parameters and the lsqproject: YES10 and numoutlieriter: 0 options. PCA was performed on 1,029 present-day West Eurasians and 334 ancient samples were projected (Fig. 1b); Upper Palaeolithic individuals before the appearance of the Villabruna cluster8 plot in the middle of present-day West Eurasian variation and are not shown.

ADMIXTURE analysis

ADMIXTURE analysis52 of the HO dataset was performed after pruning for linkage disequilibrium in PLINK53,54 with parameters indep-pairwise 200 25 0.4, after which 299,971 SNPs were retained. Twenty replicates of the analysis were performed with different random seeds, and the highest likelihood replicate for each value of K was retained. We show the K = 2 to K = 17 results for the 351 ancient and 30 Modern Greek samples in Extended Data Fig. 1.

f-statistics

The f3- and f4-statistics were computed in ADMIXTOOLS31 using the programs qp3Pop and qpF4ratio with default parameters, and qpDstat with f4mode: YES. Standard errors were computed with a block jack-knife55. When an ancient population was the target for f3-statistics, we set the inbreed: YES parameter, as our data were represented by pseudo-haploid genotypes, which introduced artificial genetic drift that masked the negative signal of admixture31.

Testing for the number of streams of ancestry and estimating mixture proportions

We used the qpWave6,56,57/qpAdm6 framework, which relates a set of ‘left’ populations (the population of interest and candidate ancestral sources) to a set of ‘right’ populations (diverse outgroups), testing for the number of streams of ancestry from ‘right’ to ‘left’ and estimating mixture proportions.

Simulations of admixed individuals

We simulated admixed individuals (Supplementary Information section 2) given a set of sources and mixture proportions by first sampling (at each SNP) one of the sources (according to the mixture proportions), and then one of the individuals from that population (with equal probability). Because of missingness, the data-generating mixture proportions did not correspond precisely to the actual ancestry of simulated individuals and we corrected for this bias (Supplementary Information section 2). We noted the maximum absolute value of the Z-score of the statistic f4(Mycenaean, Simulated; A, B), where A, B were two outgroup populations to test whether, for a particular choice of ancestry of Simulated, it formed a clade with the sampled Mycenaeans.

Estimation of FST coefficients

We estimated FST in smartpca20 with the default parameters inbreed: YES57, and fstonly: YES.

Phenotypic inference

The ancient samples had low coverage (median 0.87×) and thus diploid genotypes could not be reliably assessed for them. However, we could use the low coverage data to compute allele frequencies in all individuals and the Bronze Age Aegean using a likelihood approach1. We then sampled from the posterior distribution of the genotypes g given the read counts r of the reference allele and t of the total reads covering a site. We took 100 random genotype samples per individuals and submitted them to HIrisPlex26, obtaining an estimate of the uncertainty of phenotype inference (Supplementary Information section 4 and Extended Data Table 2).

Data availability

The aligned sequences are available through the European Nucleotide Archive under accession number PRJEB20914. Genotype datasets used in analysis are available at https://reich.hms.harvard.edu/datasets. All other data are available from the corresponding authors upon reasonable request.