SHaploseek is a sequencing-only, high-resolution method for comprehensive preimplantation genetic testing

Backenroth, Daniel; Altarescu, Gheona; Zahdeh, Fouad; Mann, Tzvia; Murik, Omer; Renbaum, Paul; Segel, Reeval; Zeligson, Sharon; Hakam-Spector, Elinor; Carmi, Shai; Zeevi, David A.

doi:10.1038/s41598-023-45292-z

Download PDF

Article
Open access
Published: 21 October 2023

SHaploseek is a sequencing-only, high-resolution method for comprehensive preimplantation genetic testing

Daniel Backenroth¹^na1,
Gheona Altarescu^2,3^na1,
Fouad Zahdeh⁴,
Tzvia Mann⁴,
Omer Murik⁴,
Paul Renbaum⁵,
Reeval Segel⁵,
Sharon Zeligson⁵,
Elinor Hakam-Spector⁵,
Shai Carmi¹^na1 &
…
David A. Zeevi⁴^na1

Scientific Reports volume 13, Article number: 18036 (2023) Cite this article

658 Accesses
1 Citations
Metrics details

Subjects

Abstract

Recent advances in genomic technologies expand the scope and efficiency of preimplantation genetic testing (PGT). We previously developed Haploseek, a clinically-validated, variant-agnostic comprehensive PGT solution. Haploseek is based on microarray genotyping of the embryo’s parents and relatives, combined with low-pass sequencing of the embryos. Here, to increase throughput and versatility, we aimed to develop a sequencing-only implementation of Haploseek. Accordingly, we developed SHaploseek, a universal PGT method to determine genome-wide haplotypes of each embryo based on low-pass (≤ 5x) sequencing of the parents and relative(s) along with ultra-low-pass (0.2–0.4x) sequencing of the embryos. We used SHaploseek to analyze five single lymphoblast cells and 31 embryos. We validated the genome-wide haplotype predictions against either bulk DNA, Haploseek, or, at focal genomic sites, PCR-based PGT results. SHaploseek achieved > 99% concordance with bulk DNA in two families from which single cells were derived from grown-up children. In embryos from 12 PGT families, all of SHaploseek’s focal site haplotype predictions were concordant with clinical PCR-based PGT results. Genome-wide, there was > 99% median concordance between Haploseek and SHaploseek’s haplotype predictions. Concordance remained high at all assayed sequencing depths ≥ 2x, as well as with only 1ng of parental DNA input. In subtelomeric regions, significantly more haplotype predictions were high-confidence in SHaploseek compared to Haploseek. In summary, SHaploseek constitutes a single-platform, accurate, and cost-effective comprehensive PGT solution.

A deep catalogue of protein-coding variation in 983,578 individuals

Article 20 May 2024

Bidirectional epigenetic editing reveals hierarchies in gene regulation

Article Open access 17 May 2024

High-resolution genome-wide mapping of chromosome-arm-scale truncations induced by CRISPR–Cas9 editing

Article Open access 29 May 2024

Introduction

In preimplantation genetic testing (PGT), DNA is extracted from biopsies obtained from in-vitro fertilized (IVF) embryos and tested for various molecular variants and chromosomal aberrations. Traditional PGT methods have focused on familial single variants for monogenic (Mendelian) diseases and required family-specific assay preparation. Over the past decade, advances in genome-wide technologies for genotyping and sequencing led to the development of several methods for comprehensive PGT, providing all-in-one solutions for the testing of monogenic disorders, large structural variations, and aneuploidy (PGT-M, PGT-SR, and PGT-A, respectively). Implementations include Karyomapping¹, OnePGT/haplarithmisis^2,3,4, HaploPGT⁵, MARSALA⁶, GENType⁷, FHLA⁸, and others^{9,10,11,12,13}. These methods are based on microarray genotyping, genome-wide genotyping by sequencing, whole-genome sequencing, or their combinations. For PGT-M/SR, the data for each embryo must be accompanied by sequencing/genotyping of the embryo’s parents (and, usually, at least one other relative) to determine whether the embryo has inherited the haplotype carrying the pathogenic variant. For a recent review, see reference ¹⁴.

We previously developed Haploseek^15,16, an accurate, comprehensive PGT method designed to be highly affordable. In Haploseek, for PGT-M/SR of inherited variants, the parents and another first degree relative (an already born child, a sample from chorionic villus sampling/amniocentesis, or a grandparent) are genotyped on a microarray. In parallel, embryo biopsies undergo ultra-low-pass sequencing (~ 0.4 × depth). The genotyping and sequencing data enter a hidden Markov model, which reconstructs the genome-wide haplotypes transmitted to each embryo. Copy number variants are detected based on the observed sequencing depth. Haploseek was validated as a cost-effective solution for PGT-A, PGT-SR, and PGT-M, including in families of various ethnicities or with consanguinity¹⁵. It has been used clinically to test over 1,100 embryos across 300 IVF cycles. Nonetheless, Haploseek uses two different clinical-grade platforms—microarray and next generation sequencing—which is labor intensive and prone to procedural errors. Further, the genome sequencing of the embryos generates data on SNVs not covered by the array, leading to information loss.

Here, we describe the development and clinical validation of SHaploseek, a comprehensive PGT solution based on whole-genome sequencing as the only molecular platform. We validated the accuracy of SHaploseek’s haplotype predictions on a genome-wide scale. We also validated a new low input sequencing protocol for pre-case work-up when DNA is scarce. In subtelomeric genomic regions, SHaploseek has greater diagnostic yield compared to Haploseek. With these improvements, our SHaploseek workflow becomes a reliable universal solution for PGT of most disease-causing variants in the human genome.

Methods

Ethics declaration

Ethical approval for this study was obtained from the Shaare Zedek Medical Center institutional review board. DNA, tissue culture samples, and human embryo biopsies in this study were donated to the Shaare Zedek Medical Genetics Institute for research with informed consent according to Shaare Zedek institutional review board guidelines and as set forth in the Declaration of Helsinki.

An overview of SHaploseek

SHaploseek is based on low-pass genome sequencing. In addition to the parents, at least one other (first degree) relative must be recruited in order to provide information on the parental haplotypes. The DNA of parents and relatives are sequenced at depth around 1−4x. DNA from the embryo biopsies is amplified and sequenced to very low depth (0.2–0.4x). A hidden Markov model (HMM) uses information from all individuals to infer the parental haplotypes transmitted to each embryo. Copy number variants are detected by another HMM.

Evaluation of SHaploseek based on cell culture experiments

For our first evaluation of SHaploseek, we used DNA extracted from parents, grown-up children, and cell culture isolates from the same children. We used Families 1 and 2 from our previous work¹⁶. In each family, we previously performed microarray genotyping of the parents and three or four children. We then designated one child in each family as a phasing reference (referred to as “Child1”) and performed ultra-low-pass (0.2–0.4x) sequencing on whole-genome amplified DNA from single lymphoblast cells derived from each of the other siblings. We combined the array data for the parents and Child1 with the sequencing data for the single cells to reconstruct genome-wide haplotypes for each sibling. Here, for SHaploseek, we replaced the microarray genotyping of the parents and Child1 with low-pass sequencing. We sequenced each of them at just under 9x sequencing depth (Table 1) and then randomly down-sampled each genome to 4x, 2x, or 1x depth for downstream analysis. We inferred the genome-wide haplotypes of each sibling and evaluated their accuracy by comparing them to haplotypes derived from microarray genotypes of bulk DNA from the grown-up children.

Table 1 The families who participated in this study and the unsampled sequencing depth of each family member.

Full size table

Evaluation of SHaploseek on PGT-M cases

For our second evaluation, we used day 5 blastocyst biopsies that already underwent PGT with both Haploseek¹⁵ and (for most embryos) classical PCR of informative polymorphic markers surrounding the variant of interest. Among the 12 families, four had an already born child as a phasing reference and eight had one or more grandparents (Table 1). With Haploseek, the parents and other relative(s) were genotyped on microarrays. Here, we sequenced these individuals at depth in the range 5–22x (Table 1), as well as down-sampled to 4x, 2x, and 1x. We inferred the genome-wide haplotypes of each embryo and evaluated the accuracy of our predictions by comparing them to those of Haploseek/PCR.

Sequencing the parents and reference individuals and variant calling

For both cell culture experiments and PGT-M evaluations, we sent 1 µg of DNA from the parent and reference individuals to BGI (Hong Kong, China) for genome sequencing. The achieved sequencing depths are listed in Table 1. We aligned the sequencing reads to the human reference genome (hg19) using the Burrows-Wheeler aligner (BWA)¹⁷. We called single-nucleotide variants using bcftools¹⁸ in (autosomal + chrX) regions delineated by the gnomAD hg19-v0-wgs_evaluation_regions.v1.interval_list.bed file¹⁹. These variant calls were only used for initial site filtering, as the phasing method is fully probabilistic (see below). We used bam-readcount²⁰ to count reference and alternate alleles at each single-nucleotide variant (SNV) position reported by bcftools. For the down-sampling experiments, we used samtools to down-sample the original alignments to a prespecified mean sequencing depth prior to variant calling and allele counting.

Whole-genome amplification and low-pass embryo genome sequencing

DNA from either cell culture isolates or blastocyst biopsies was whole-genome amplified (WGA) using the PicoPLEX^® Single Cell WGA Kit (TaKaRa Bio), converted into a genome sequencing library using the Nextera XT library prep kit (Illumina), and sequenced on a NextSeq 500 instrument (Illumina) at 0.2–0.4 × depth, all as part of our previous studies^15,16. We aligned the sequencing reads to hg19 using BWA and counted reads mapping to reference and alternate alleles using bam-readcount. For Haploseek, we considered SNV positions matching those on the CytoScan^® 750 K array (Thermo Fisher). For SHaploseek, we considered, independently in each family, SNV positions identified by bcftools where an alternate allele was present in at least one of the parents or reference individuals. All analyses included both the autosomes and the X chromosome.

Haplotype prediction for Haploseek

Genome-wide haplotypes for the embryos/cell culture isolates were inferred using an HMM as part of our previous work^15,16.

Haplotype prediction for SHaploseek

Our HMM for haplotype prediction is similar to that we have previously developed for Haploseek¹⁶. However, for SHaploseek, we modeled the likelihood of observing the sequencing reads (given an assumed genotype configuration) not only from the embryos or single cells (as in Haploseek), but also from the parents and other relatives. We provide full details in the Supplementary Materials and Methods. The method is limited to the other relative being a sibling or a grandparent of the embryo. In the latter case, the pathogenic variant must be inherited from one of the grandparents.

Identification of copy number variants

We detected copy number gains/losses longer than 5 Mb by a second HMM. The CNV HMM runs exclusively on each embryo’s low depth sequencing data and does not require parent and relative DNA. The method was previously described¹⁶ and extensively validated for clinical PGT-A and PGT-SR applications¹⁵ (see also Table S1 here). Given that the method was incorporated into the SHaploseek workflow without any modification, we do not further discuss it here.

For distinguishing between balanced and normal translocation carriers, we use the same strategy as we previously described in Fig. 4 of reference ¹⁶.

Outputs of SHaploseek

SHaploseek generates two types of outputs. The first is a “binary” haplotype prediction, based on the Viterbi path returned by the HMM. A prediction is provided separately for the maternal and paternal chromosomes of the embryo. Consider the case when the reference relative is a previously born child. In this case, for each SNV and for each embryo, the prediction is whether the chromosome is (or is not) identical to that of the reference child. For the case when the reference relative is a grandparent, the prediction is whether the chromosome is coming from that grandparent or from the other grandparent (of the same parent). The second output is the “marginal”, or posterior, probability returned by the HMM. At each SNV, this provides the probability, given the entire data, that the embryo has inherited a haplotype identical to that of the reference child or grandparent. A confident haplotype prediction is reflected by a marginal probability close to 0 or 1. We denote sites with marginal probability > 0.99 or < 0.01 as high-confidence (“pass”).

Evaluating the accuracy of SHaploseek

For validating SHaploseek’s single cell or embryo biopsy haplotype predictions, we only considered array sites that had high-confidence haplotype calls in all compared methods. We then dichotomized all marginal probabilities by rounding them to 0 or 1, which generated binary haplotype predictions that could be compared across methods. For the single cell data, we compared the haplotypes predicted by SHaploseek to those predicted based on the array genotypes of the corresponding grown-up children¹⁶. These genotypes are based on bulk DNA and can therefore be considered as “ground-truth”. For the day 5 embryo biopsies, we compared the haplotype predictions of SHaploseek to those of Haploseek. At PGT-M loci, we compared the predictions of SHaploseek to those based on PCR amplification of informative polymorphic markers surrounding the variant-based PGT. None of the PGT-M loci was directly covered by the array.

SHaploseek resequencing experiments

For five arbitrarily selected families (Table 1), we repeated the SHaploseek analysis using just 1 ng of input DNA for the parents and reference individuals. We converted the genomic DNA into a genome sequencing library using the Nextera XT library prep kit. We normalized and pooled the resultant libraries and then converted them into single stranded circular DNA using the MGIEasy Universal Library Conversion Kit (App –A; MGI) according to the manufacturer’s protocol. We converted the ssDNA library with Illumina adapters into a DNA nanoball sequencing library using the High-throughput Sequencing Primer Kit (App-C; MGI) and loaded the library onto an FCS PE150 flow cell for 2 × 150 paired end sequencing on the DNBSEQ-G400RS (MGI) high throughput sequencer. We finally used the same embryo data and computational pipeline as described above to predict the haplotypes for the corresponding embryos.

Visualizations and other statistics

We previously developed a user-friendly web browser interface to visualize Haploseek’s outputs¹⁶. Here, we updated the interface to accommodate the higher density of SNVs in the sequencing data generated by SHaploseek. We summarized and plotted other data with BoxPlotR²¹ and Statistics Kingdom²², online platforms for data analysis and visualization.

Results

Replacing microarrays with low-pass sequencing accurately resolves haplotypes in single cells from tissue culture samples

For our initial evaluation of SHaploseek, we compared bulk and single cell data from members of two families, as in our previous studies^15,16. In these families, bulk DNA was available from parents and three or four grown-up children each. In our previous experiments, we genotyped the parents and a single reference child from each family using microarrays. We then performed ultra-low-pass genome sequencing of single cells extracted from tissue cultures derived from the remaining children and predicted the haplotypes transmitted to those children. Here, for SHaploseek, we replaced microarray genotyping of the trio (parents and reference child) by genome sequencing at depths ≈9x, and, by down-sampling, 4x, 2x, and 1x. We inferred the haplotypes transmitted to each child based on a hidden Markov model (HMM; Materials and Methods).

We quantified the performance of SHaploseek using two metrics. The first is a measure of the ability of SHaploseek to generate confident output, defined as the proportion of microarray SNV sites where SHaploseek reported high-confidence haplotype predictions. The second is a measure of phasing accuracy, defined as the concordance between the haplotype predictions of SHaploseek, at high-confidence sites, and the haplotypes inferred by microarray genotyping of bulk DNA from the corresponding grown-up children (Materials and Methods).

We report the results in Fig. 1, separately for each family, and alongside performance metrics of Haploseek. In Family 1, the proportion of sites with high-confidence haplotype calls, out of ≈ 200,000 array SNVs, was 89–99% across methods, children, and sequencing depths (Fig. 1a). The haplotype phasing accuracy at these sites exceeded 99.8% under all conditions Fig. 1b).

In Family 2, parental consanguinity posed an additional challenge for SHaploseek, as we have previously observed for Haploseek¹⁶. In what we define as regions of consanguinity (ROCs; regions where the parents share both haplotypes; see ¹⁵), phasing can be ambiguous, compromising the accuracy of all haplotype phasing methods. Indeed, the presence of multiple ROCs in Family 2 (covering at least 10% of all array SNVs¹⁶) led to a noticeably lower number of high-confidence sites in that family (71–88%; Fig. 1c). Nonetheless, SHaploseek phasing accuracy in Family 2 non-ROCs exceeded 99.4% at trio sequencing depth of 2 × or higher (Fig. 1d). At 1 × depth, phasing accuracy was compromised, especially in Child 2 (Fig. 1c,d).

Clinical validation of SHaploseek with human embryo biopsies from Haploseek PGT cycles

For clinical validation of SHaploseek, we used 12 families in which PGT-M was previously performed with Haploseek (Table 1; Families 9 through 49; Table S1). For each family, we used existing genome-wide sequencing data for the embryo biopsies (one to four embryos per family; depth 0.2–0.4x) and leftover DNA from the embryos’ parents and reference individuals. Four of the families had a previously born child as a phasing reference (ten embryos overall), while the other eight each had one or more grandparents (21 embryos; Table 1; Table S1). We sequenced the parents and reference individuals at depths ≈ 5–22x, and, by down-sampling, 4x, 2x, and 1x. We evaluated the accuracy of SHaploseek by reporting the proportion of SNVs with high-confidence haplotype prediction in both methods (Fig. 2a,c), and, in these SNVs, comparing the genome-wide phasing predictions of SHaploseek to those of Haploseek (Fig. 2b,d).

At high-confidence sites, SHaploseek has very high phasing accuracy (median ≈ 99%) at all assayed sequencing depths in families with a child reference (Fig. 2b), and at sequencing depths ≥ 2x in grandparent families (Fig. 2d). The median proportion of array SNVs with high-confidence haplotype calls was 80–91% across all sequencing depths in child families (Fig. 2a) and 70–88% in grandparent families (Fig. 2c). The decline is mostly explained by the number of grandparents sequenced: the proportion of high-confidence sites was comparable (at ≈ 88% for 2 × depth) between families with a child reference and families where both grandparents (of the focal parent) were sequenced. In families where only a single grandparent was sequenced, the proportion of high confidence sites dropped to ≈ 75%. In grandparent families, the median proportion of high-confidence sites dropped from ≈ 80 to ≈ 70% between 2× and 1 × sequencing depths (Fig. 2c), and was accompanied by a sharp decline in phasing accuracy (Fig. 2d).

We next evaluated SHaploseek for the original PGT-M indications, comparing its results against either Haploseek, or, for most embryos, also a PCR-based analysis. The evaluation, for various autosomal dominant and recessive disorders, is listed in Table S1. For all sequencing depths, whenever SHaploseek generated a high-confidence haplotype call, it was concordant with the Haploseek/PCR result.

Clinical validation of SHaploseek after low-pass sequencing of minuscule amounts of parental and reference DNA

Our results suggest that SHaploseek can accurately infer genome-wide haplotypes using low-pass (1–4 × depth) parental (and phasing reference) genomes. In the following, we attempted to (i) confirm the accuracy of SHaploseek on low-pass data without the need for down-sampling; and (ii) simulate a clinical case with very low-input, “precious” samples. This can occur, for example, when the reference individual is deceased or when the DNA of a reference child is only available from chorionic villus or amniotic fluid sampling of an aborted fetus. We arbitrarily selected five of the 12 clinical PGT-M families (Table 1) for resequencing at a target depth of 3× based on libraries from just 1ng of input DNA. The DNA samples differed in quality, which led to a wide range of resequencing depths (1.5–5.2x; Table 1). Two of the resequenced families were ‘child’ families (six embryos) and the other three were ‘grandparent’ families (eight embryos). As above, we evaluated the proportion of array SNVs where SHaploseek (and Haploseek) generated high-confidence haplotype predictions, as well as the concordance between the two methods in these sites (Fig. 3). The proportion of high-confidence SNVs was 67–87% in the grandparent family embryos and 86–94% in the child families, and the concordance exceeded 99% in all embryos (99.6% in child families). In PGT-M indication sites, the resequenced SHaploseek predictions were concordant with the Haploseek/PCR-based predictions for all embryos for which SHaploseek generated a high-confidence call (Table S1).

SHaploseek generates higher density and higher quality haplotype predictions than Haploseek in subtelomeric genomic regions

Our experience applying Haploseek to clinical PGT-M cycles was that predictions had low-confidence at SNVs within 5 Mb of a telomere or an acrocentric centromere. This is likely due to the small number of flanking SNVs available near the telomeres, which provides insufficient phasing information. We hypothesized that SHaploseek, which is not limited to a small set of array SNVs, may obtain higher confidence haplotype calls in subtelomeric regions.

To test our hypothesis, we first examined the number of subtelomeric SNVs available for analysis in each method. Indeed, the number of SNVs available for SHaploseek was at least sixfold higher than Haploseek (Fig. 4a). The higher SNV density at subtelomeric regions in SHaploseek translated into significantly higher proportions of high-confidence haplotype calls (Fig. 4b). The higher proportions were evident at the unsampled, 4x, and 2x sequencing depths, as well as in the experiments with samples ‘resequenced’ from very low input DNA.

To illustrate the advantage of SHaploseek in subtelomeric regions over Haploseek, we considered Family 47, with PGT-M indication of a pathogenic variant in the subtelomeric PKD1 gene on chr16p and a single embryo (113f.-2; Fig. 4c). In Haploseek, all array SNVs within or flanking PKD1 had low-confidence calls and thus no result could be reported (Fig. 4c). In contrast, both PCR-based PGT-M and SHaploseek (for all considered depths of sequencing) confidently identified a wild type PKD1-flanking haplotype (Table S1). The number of subtelomeric SNVs with high-confidence calls in SHaploseek increased with the sequencing depth (Fig. 4c).

Discussion

We described an enhanced, sequencing-only reimplementation of the clinical-grade Haploseek comprehensive PGT method. Our new method, SHaploseek, retains the established user-friendly interface and the universality, affordability, and high accuracy characteristics of Haploseek, along with presenting multiple new advantages. First, it features a single experimental pipeline based on a single platform. Given that sequencing library prep is easily automated, this implies that the hands-on time for family haplotype construction is reduced from 24 h using microarrays (for Haploseek) to eight hours using sequencing (for SHaploseek). Second, SHaploseek has higher resolution haplotype predictions, which improves diagnostic sensitivity in difficult subtelomeric regions. Third, the low-pass sequencing in SHaploseek offers sample multiplexing opportunities and reduced per-sample costs. For example, in our locale, a 48-sample batch of microarrays (for parents and family members) costs $8600 USD in reagents, whereas low-pass genome sequencing costs $4850 USD. This translates into a reagent cost saving of $3750 in total, or $78 USD per sample. Fourth, the waiting time for family haplotype construction can be almost entirely eliminated if the family members are sequenced together with the embryos in the same sequencing run.

In the burgeoning new field of sequencing-based comprehensive PGT solutions, SHaploseek is competitive with recently developed methods. OnePGT² and scGBS⁴, low-pass DNA nanoball sequencing¹¹, and GENtype⁷ are all sequencing-based PGT methodologies, with the ability, like SHaploseek, to perform PGT-M, PGT-A, and PGT-SR in a single molecular assay. OnePGT, scGBS, and GENtype utilize reduced representation sequencing to reduce costs associated with genome-wide haplotype construction^4,7. However, the addition of restriction enzyme digestion, size selection, and adapter ligation to the NGS library prep protocol translates into a longer and more laborious (albeit mostly automated) process than what is required for SHaploseek, where DNA fragmentation and adaptor ligation are performed by a transposase in a single 5-min step. The low-pass DNA nanoball sequencing PGT method¹¹ is similar to SHaploseek in chemistry, but seems less cost-effective, as it requires sequencing the parents and embryos at depths 10x and 4x, respectively¹¹, compared to ≈ 3x and ≈ 0.4x, respectively, with SHaploseek.

Regarding limitations of this study, SHaploseek requires DNA from a family member (beyond the couple) as a phasing reference. While this requirement is common to most comprehensive PGT methods, recent methods such as GENtype and Chen et al.¹¹ can work without a phasing reference, provided that the variant of interest can be confidently genotyped in at least one embryo^7,11. However, the identification of an embryo bearing the pathogenic variant of interest could require multiple PGT cycles, and may result in misdiagnosis in case of recombination near the variant when only one embryo is available as a phasing reference. Another limitation of the current work is that no follow up was performed on the pregnancy rates and birth outcomes of the embryos assessed.

As mentioned above, one of SHaploseek’s advantages is higher diagnostic yield in subtelomeric regions. In our experience with Haploseek, clinically relevant genes (i.e., FANCA, IKBKG, TSC2, or PKD1), submicroscopic deletions, and tandem duplications in these regions were difficult to test, as haplotype calls were often low-confidence and the risk of embryo misdiagnosis was too high to allow reporting. The increased certainty of SHaploseek in subtelomeric regions is thus a welcome addition to the PGT toolbox. We expect diagnostic yields in these regions to further improve with future versions of SHaploseek, given that repetitive low complexity subtelomeric sequences are now resolved with increasing accuracy by endeavors such as the T2T project²³.

In our previous work, we predicted that a more convenient, robust, and ubiquitous sequencing-only implementation of Haploseek would not be far away on the horizon¹⁵. Here we report, in conclusion, that SHaploseek realizes this important goal while retaining the high fidelity and all other positive aspects of the original Haploseek implementation.

Data availability

Deidentified source data for this study is available from the corresponding author upon request.

References

Thornhill, A. R. et al. Karyomapping-a comprehensive means of simultaneous monogenic and cytogenetic PGD: Comparison with standard approaches in real time for Marfan syndrome. J. Assist. Reprod. Genet. 32, 347–356. https://doi.org/10.1007/s10815-014-0405-y (2015).
Article PubMed PubMed Central Google Scholar
Masset, H. et al. Multi-centre evaluation of a comprehensive preimplantation genetic test through haplotyping-by-sequencing. Hum. Reprod. 34, 1608–1619. https://doi.org/10.1093/humrep/dez106 (2019).
Article CAS PubMed Google Scholar
Zamani Esteki, M. et al. Concurrent whole-genome haplotyping and copy-number profiling of single cells. Am. J. Hum. Genet. 96, 894–912. https://doi.org/10.1016/j.ajhg.2015.04.011 (2015).
Article CAS PubMed PubMed Central Google Scholar
Masset, H. et al. Single-cell genome-wide concurrent haplotyping and copy-number profiling through genotyping-by-sequencing. Nucleic Acids Res. 50, e63. https://doi.org/10.1093/nar/gkac134 (2022).
Article CAS PubMed PubMed Central Google Scholar
Xie, P. et al. A novel multifunctional haplotyping-based preimplantation genetic testing for different genetic conditions. Hum. Reprod. 37, 2546–2559. https://doi.org/10.1093/humrep/deac190 (2022).
Article CAS PubMed Google Scholar
Yan, L. et al. Live births after simultaneous avoidance of monogenic diseases and chromosome abnormality by next-generation sequencing with linkage analyses. Proc. Natl. Acad. Sci. U. S. A. 112, 15964–15969. https://doi.org/10.1073/pnas.1523297113 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
De Witte, L. et al. GENType: All-in-one preimplantation genetic testing by pedigree haplotyping and copy number profiling suitable for third-party reproduction. Hum. Reprod. 37, 1678–1691. https://doi.org/10.1093/humrep/deac088 (2022).
Article CAS PubMed Google Scholar
Zhang, S. et al. A comprehensive and universal approach for embryo testing in patients with different genetic disorders. Clin. Transl. Med. 11, e490. https://doi.org/10.1002/ctm2.490 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kumar, A. et al. Whole-genome risk prediction of common diseases in human preimplantation embryos. Nat. Med. 28, 513–516. https://doi.org/10.1038/s41591-022-01735-0 (2022).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Murphy, N. M., Samarasekera, T. S., Macaskill, L., Mullen, J. & Rombauts, L. J. F. Genome sequencing of human in vitro fertilisation embryos for pathogenic variation screening. Sci. Rep. 10, 3795. https://doi.org/10.1038/s41598-020-60704-0 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, S. et al. Comprehensive preimplantation genetic testing by massively parallel sequencing. Hum. Reprod. https://doi.org/10.1093/humrep/deaa269 (2020).
Article PubMed PubMed Central Google Scholar
Treff, N. R. et al. Validation of concurrent preimplantation genetic testing for polygenic and monogenic disorders, structural rearrangements, and whole and segmental chromosome aneuploidy with a single universal platform. Eur. J. Med. Genet. 62, 103647. https://doi.org/10.1016/j.ejmg.2019.04.004 (2019).
Article PubMed Google Scholar
Yuan, P. et al. A whole-genome sequencing-based novel preimplantation genetic testing method for de novo mutations combined with chromosomal balanced translocations. J. Assist. Reprod. Genet. 37, 2525–2533. https://doi.org/10.1007/s10815-020-01921-4 (2020).
Article PubMed PubMed Central Google Scholar
De Rycke, M. & Berckmoes, V. Preimplantation genetic testing for monogenic disorders. Genes https://doi.org/10.3390/genes11080871 (2020).
Article PubMed PubMed Central Google Scholar
Zeevi, D. A. et al. Expanded clinical validation of Haploseek for comprehensive preimplantation genetic testing. Genet. Med. https://doi.org/10.1038/s41436-021-01145-6 (2021).
Article PubMed Google Scholar
Backenroth, D. et al. Haploseek: A 24-hour all-in-one method for preimplantation genetic diagnosis (PGD) of monogenic disease and aneuploidy. Genet. Med. 21, 1390–1399. https://doi.org/10.1038/s41436-018-0351-7 (2019).
Article CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. https://doi.org/10.1093/bioinformatics/btp324 (2009).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 1, 1. https://doi.org/10.1093/gigascience/giab008 (2021).
Article CAS Google Scholar
gnomAD. <https://gnomad.broadinstitute.org/downloads> (
bam-readcount. <https://github.com/genome/bam-readcount> (
BoxPlotR. <http://shiny.chemgrid.org/boxplotr/> (
Statistics Kingdom. <https://www.statskingdom.com/> (
Nurk, S. et al. The complete sequence of a human genome. Science 376, 44–53. https://doi.org/10.1126/science.abj6987 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was funded by a Shaare Zedek intramural Grant to D.A.Z. S. C. and D.A.Z. also received funding from the Hebrew University of Jerusalem Center for Interdisciplinary Data Science Research (CIDR; Grant No. 3035000322).

Author information

These authors contributed equally: Daniel Backenroth, Gheona Altarescu, Shai Carmi and David A. Zeevi.

Authors and Affiliations

Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
Daniel Backenroth & Shai Carmi
PGT Unit, Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, Israel
Gheona Altarescu
Faculty of Medicine, Hebrew University, Jerusalem, Israel
Gheona Altarescu
Translational Genomics Lab, Medical Genetics Institute, Shaare Zedek Medical Center, Bayit Str. 12, P.O.Box 3235, 91031, Jerusalem, Israel
Fouad Zahdeh, Tzvia Mann, Omer Murik & David A. Zeevi
Medical Genetics Institute, Shaare Zedek Medical Center, Jerusalem, Israel
Paul Renbaum, Reeval Segel, Sharon Zeligson & Elinor Hakam-Spector

Authors

Daniel Backenroth
View author publications
You can also search for this author in PubMed Google Scholar
Gheona Altarescu
View author publications
You can also search for this author in PubMed Google Scholar
Fouad Zahdeh
View author publications
You can also search for this author in PubMed Google Scholar
Tzvia Mann
View author publications
You can also search for this author in PubMed Google Scholar
Omer Murik
View author publications
You can also search for this author in PubMed Google Scholar
Paul Renbaum
View author publications
You can also search for this author in PubMed Google Scholar
Reeval Segel
View author publications
You can also search for this author in PubMed Google Scholar
Sharon Zeligson
View author publications
You can also search for this author in PubMed Google Scholar
Elinor Hakam-Spector
View author publications
You can also search for this author in PubMed Google Scholar
Shai Carmi
View author publications
You can also search for this author in PubMed Google Scholar
David A. Zeevi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.B. contributed conceptualization, formal analysis, investigation, methodology, software, validation, and manuscript review & editing. G.A. contributed conceptualization, formal analysis, funding acquisition, investigation, supervision, validation, and manuscript review & editing. F.Z. contributed data curation, investigation, methodology, software, validation, and data visualization. T.M. contributed methodology and validation. O.M. contributed data curation, investigation, and methodology. P.R. contributed funding acquisition, resources, and supervision. R.S. contributed investigation, resources, and supervision. S.Z. contributed formal analysis, investigation, and methodology. E.H.S. contributed formal analysis, investigation, and supervision. S.C. contributed conceptualization, funding acquisition, investigation, methodology, supervision, and manuscript review & editing. D.A.Z. contributed conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, supervision, validation, writing of the original manuscript draft, and manuscript review & editing. All authors approved the final version of the manuscript.

Corresponding author

Correspondence to David A. Zeevi.

Ethics declarations

Competing interests

D.B. is an employee and shareholder at The Janssen Pharmaceutical Companies of Johnson & Johnson. All other authors report no conflicts of interest.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Backenroth, D., Altarescu, G., Zahdeh, F. et al. SHaploseek is a sequencing-only, high-resolution method for comprehensive preimplantation genetic testing. Sci Rep 13, 18036 (2023). https://doi.org/10.1038/s41598-023-45292-z

Download citation

Received: 15 June 2023
Accepted: 18 October 2023
Published: 21 October 2023
DOI: https://doi.org/10.1038/s41598-023-45292-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.