Introduction

An estimated 5.7% of admissions to neonatal intensive care units (ICUs) could be due to chromosomal or monogenic disorders [1]. Obtaining a rapid etiological diagnosis remains a major challenge for guiding care management in this urgent context. In the absence of a clinical diagnosis, exome or genome sequencing (ES or GS) is now often recommended as a first-tier diagnostic test [2] because of the vast number of genetic diseases–over 8,000 [3, 4]–and their considerable clinical and genetic heterogeneity.

Clinical GS is less developed than clinical ES in most institutions but appears more suitable in an urgent context since no enrichment phase is needed. In 2012, Kingsmore et al. provided a proof of concept for GS in an ultra-rapid circuit, first retrospectively on two individuals, then prospectively on 5 individuals with a diagnosis in 2/7 (29%) in less than 50 hours [5]. To reduce the duration of bioinformatics analysis and variant interpretation, they targeted genes involved in nearly 600 recessive diseases. Two years later, they applied this strategy on “rapid” GS in 16 children in neonatal ICU, expanding the analysis to around 4,000 monogenic diseases [6]. They obtained a diagnostic rate of almost 70% in approximately 43 days. Since then, “rapid” ES and GS have been applied prospectively in a number of paediatric ICU cohorts. The median time to diagnosis using ES or GS ranges from 1 to 43 days and the diagnostic yield from 20% to 72% (Table 1) [4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26], showing that the definition of “rapid” ES or GS is not universal in the literature, and highly depends on the context (year of the study, country, resources). Some teams have succeeded in reducing turnaround time by optimizing phenotypic data collection or data interpretation. For example, Clark et al. have implemented a fully automated system to obtain a genetic disease diagnosis with a median turnaround time of about 20 h [27]. Because the arrival of samples cannot be planned, some authors used a full sequencing run for a single sample [8], while others allocated additional resources (in particular to enable a more flexible batching of samples) [21]. This allowed them to optimize the time to results but required a considerable extra upfront investment. In this article, we will use the term “accelerated” GS. We will refer to “rapid” when referring to the general literature on the subject, regardless of the actual turnaround time.

Table 1 Literature review of rapid exome and genome studies in neonatal and paediatric population.

We report on the FASTGENOMICS pilot study in newborns and infants requiring an urgent diagnosis for care orientation. It was designed to evaluate the feasibility of implementing trio-GS to deliver a result in less than 45 days, while minimizing extra costs by adapting the organization to integrate urgent requests as a priority into the usual, not urgent workflow, and to identify the technical or organizational obstacles encountered.

Materials and methods

FASTGENOMICS is a French national, multicentre, prospective pilot study in newborns and infants suspected of genetic disease and hospitalized in neonatal or paediatric ICU.

Individuals

We included newborns and infants proposed for inclusion by their referring clinical geneticist from eight reference centers for developmental diseases across France under the four conditions: (1) hospitalized in ICU with an urgent need for an etiological diagnosis to guide medical care; (2) suspected genetic cause and no obvious clinical diagnosis; (3) proband and both biological parents available for blood samples; and (4) written informed consent from both parents. The inclusion was validated by the two clinical geneticists coordinating the project. After the genetic medical consultation, whole blood samples were collected from the proband and their parents. Inclusions were discontinued after 37 individuals had been enrolled, as provided in the financing plan.

From the end of December 2018 to February 2020, 37 newborns or infants (14 females, 23 males) were included. Twelve had ultrasound features during pregnancy and seven of them had normal prenatal array-CGH. The median age of probands at inclusion was 27 days. All were born from healthy parents, including four consanguineous couples. The overall clinical features of the probands are described in Table S1 and mainly included neonatal hypotonia (17/37), epilepsy (13/37), and/or multiple congenital abnormalities (7/37). One individual died before the results were available.

Methods

We implemented a dedicated accelerated circuit with a time limit for each step of the process and staff (a technician and a biologist), for whom FASTGENOMICS became a high priority. To evaluate the feasibility of this rapid process, while avoiding potential pitfalls and resolving existing issues, we measured the duration from the dispatch of the blood samples to the sending of the report and we recorded all unexpected and adverse events at each of the 8 steps in the process (Fig. 1). Additional methods can be found in the supplemental materials.

Fig. 1: Flowchart of the study.
figure 1

Tasks with a shaded background were performed by the including centers. Tasks with vertical stripes were performed by the Dijon Bourgogne University Hospital. The task with horizontal stripes was performed by the CNRGH.

Step 1 – Sample reception. The inclusion center sent the blood samples to the coordinating lab (Dijon Bourgogne University Hospital) using a dedicated transporter. Estimated maximum duration: 2 calendar days.

Step 2 – Pre-analytic registration and techniques. After individuals were registered in the laboratory’s information system, standard procedures were followed to extract and qualify genomic DNA, as well as to genotype SNPs to ensure sample matching (Supplemental data). Estimated maximum duration: 8 calendar days.

Step 3 – Shipping of samples to the sequencing platform. The coordinating lab used a dedicated transporter to ship the DNA samples to the sequencing platform at the Centre National de Recherche en Génomique Humaine (CNRGH, Évry), avoiding deliveries during the weekend. Estimated maximum duration: 2 calendar days.

Step 4 – Sequencing. The preparation of the genomic DNA libraries was done according to manufacturer instructions (Supplemental data). Trio-GS was performed using a NovaSeq6000 (Illumina, CA, USA). Estimated maximum duration: 21 calendar days.

Step 5 – Bioinformatics analysis. The coordinating lab downloaded raw sequencing data from the CNRGH platform and transferred it to the computer cluster of the University of Burgundy (CCuB) to run bioinformatics analyses (Supplemental data). Estimated maximum duration: four calendar days.

Step 6 – Data interpretation. Data was independently interpreted by one cytogeneticist and two molecular geneticists (Supplemental data). Estimated maximum duration: 5 calendar days.

Step 7 – Multidisciplinary team meeting. All readers compared their results during a specific multidisciplinary team meeting, organized as quickly as possible, including at least two senior genomic geneticists and one clinical geneticist. Each selected variant was ranked into one of the five categories from the ACMG recommendations. Only the pathogenic and likely pathogenic results were considered etiological diagnoses. Variants of uncertain significance (VUS) were returned to the referring clinician when the multidisciplinary team estimated that the implication in the phenotype was very likely and/or when additional tests could be proposed to confirm the pathogenicity of the variants. Estimated maximum duration: 2 calendar days.

Step 8 – Communication of the results. The molecular diagnosis was emailed to the referring clinician. Estimated duration: 1 calendar day.

Steps 5 (bioinformatics) and 6 (interpretation) could be done during the weekend but steps 1 and 3 (transport), 2 and 4 (pre-analytic registration and techniques), or 7 (multidisciplinary team meeting) could not.

In order to calculate the minimum reliably achievable turnaround time of the analysis, we evaluated the duration that could be held for each of the eight steps of our diagnostic routine, with a prioritization over routine cases at each step, but without requiring additional staff. After adding up the estimated duration of each step, we found that the result should be returned to the clinician within 45 calendar days.

We divided the study into two phases: phase 1, including the first two months to identify the flaws in the initial experimental protocol; phase 2, covering the twelve following months and including adjustments in order to manage the obstacles and to reduce the waiting time for the remaining subjects.

Variant confirmation and parental segregation were performed by Sanger sequencing, qPCR or Multiplex Ligation-dependent Probe Amplification (MLPA) if IGV visualization was uncertain. All causal variants and VUS identified in this study were submitted to ClinVar (accession IDs in supplemental materials).

Limitation of extra costs

Extra costs were mainly limited by not requesting rapid sequencing to the platform. The costs of staff and consumables were compared when sequencing was part of their routine flow or not. Consumable costs covered DNA quantification, quality control, and normalization before library preparation, PCR-free library preparation for GS (Illumina TruSeq DNA PCR-free), quality control before sequencing, standard sequencing on a shared S4 V1.5 Illumina flow cell, or rapid sequencing on a dedicated S1 or SP V1.5 Illumina flow cell in order to reach at least 30X for each sample and bioinformatics analyses to produce FASTQ. Staff costs covered all the people involved in the steps above. Costs were expressed in euros excluding VAT. We also evaluated the additional resources needed per trio in the accelerated circuit set-up for this study as compared to the standard circuit of the lab.

Results

Process steps

Overall duration

In this pilot study, which included 37 unrelated newborns or infants in neonatal or paediatric ICU, GS median duration was 42 days (ranging from 27 to 92 days) from the dispatch of the blood sample to the communication of the results to the referring clinician.

Ten individuals were included in phase 1 and 27 individuals in phase 2. During phase 1, the time required for steps 5 (bioinformatics analysis), 6 (data interpretation) and 8 (results to clinician) was shorter than or equal to the expected duration (Fig. 2, Table S2). We identified several recurring unexpected events that led to delays. Corrective measures or protocol adaptations (details in the discussion) led to complete resolution for steps 2 (pre-analytic registration and techniques) and 7 (multidisciplinary team meeting), and to a decrease in duration for step 4 (sequencing). Steps 1 and 3 (transport of samples) could not be improved within the study timeframe.

Fig. 2: Actual duration by individual over time, step by step from blood sample dispatch to communication of the results to the referring clinician.
figure 2

The vertical bar separates phase 1 from phase 2.

Finally, we were able to reduce the median duration from blood sampling to the communication of the results to the referring clinician from 68 days in phase 1 to 40 days in phase 2 (Fig. 2, Table S2), below our objective of 45 days.

Unexpected and adverse events

The times required from the dispatch of blood samples to the communication of the results for each case are presented in Fig. 2.

Step 1 – Sample reception. For individuals 3, 6, and 36, the transport from the inclusion center to the coordinating lab took 3, 6, and 4 days instead of 1 day.

Step 2 – Pre-analytic registration and techniques. For 8 individuals, the blood samples were received on Friday or the day before a public holiday, causing the extraction to take three calendar days. For 2 individuals, step 2 took respectively 9 and 12 days instead of 8, because the dedicated staff responsible for extracting DNA in phase 1 and sending the samples to the sequencing platform was temporarily absent.

Step 3 – Shipping of samples to the platform. For individual 37, the transport time from the coordinating lab to the sequencing platform took 3 days instead of 1.

Step 4 – Sequencing. During phase 1, sequencing exceeded the expected maximal duration (21 days) in all cases except for the first 2 individuals, who were treated on dedicated flow cells. For case 21, sequencing had to be performed a second time because the sequencing data was contaminated due to an equipment problem during the library preparation step, consequently adding 33 days to the analysis time.

Step 7 – Multidisciplinary team meeting. During phase 1, team meetings had to be delayed for cases 6, 9, and 10.

There were no delays in steps 5 (bioinformatics analysis), 6 (data interpretation), or 8 (communication of the results) (Fig. 2).

Molecular results

Twenty-five causal variants (pathogenic or likely pathogenic) were identified in 18/37 individuals (49%) and 10 VUS in 8/37 (22%) individuals (Table 2, Table S3). We found 34 single nucleotide variants (SNVs) or indels, 1 large intragenic deletion, 1 complex rearrangement, and 1 uniparental isodisomy. The uniparental isodisomy of chromosome 1 was found in case 22 after the identification of a homozygous variant inherited from the mother in a 2 Mb loss of heterozygosity region. We had to confirm the variant by Sanger sequencing for case 19 because some reads around it had a poor mapping quality score and by MLPA for case 27 to confirm the LAMA2 duplication. The complex rearrangement was suspected to be caused by the partial duplication of LAMA2 and TMEM244, followed by a partial deletion of LAMA2 and TMEM244 in the original copy of the genes (supplemental Figure S1). The groups with the highest rate of positive diagnoses were those with multiple congenital abnormalities (57%) or epilepsy (54%).

Table 2 Results for the 37 included individuals.

For some cases, variants classification was a long process or is still pending despite expert advice and/or additional investigations. For case 3, a missense VUS in PIGN was reclassified as probably pathogenic based on functional studies and on expert advice provided nine months after the provisional report, allowing us to propose a prenatal diagnosis to the family. For case 8, an X-linked variant in the PIGA gene inherited from the mother led to familial segregation, which was inconclusive, and was reclassified as probably pathogenic after functional studies. For cases 31 and 36, expression studies on muscle biopsy have been suggested to help reclassify compound heterozygous VUS in the TTN gene. For case 33, the identification of candidate variants in the SATB1 genes, not yet associated with a neurodevelopmental human condition at the time of the study, led us to submit it in GeneMatcher to collect similar cases [28] and allowed to identify a new disorder [29]. We, therefore, expect the final diagnostic rate to be nearly 57%.

Limitation of extra-costs

Extra-costs were mainly limited by not requesting a rapid sequencing to the platform, which would have cost 7900 € per trio (6300 € for consumables and 1600 € for staff), compared to 2590 € (2500 € for consumables and 90 € for staff) in their routine flow. The additional resources per trio of the accelerated circuit were 40 minutes of technician time (step 2), 10 minutes of bioinformatician time (step 5), 30 min of administrative time (steps 1, 3, and 7), and 50 min of geneticist time (steps 2 and 6). The cost of the bioinformatics adaptations made before the study was estimated at 7 h.

Discussion

“Rapid” ES or GS has already been tested in other centers, with a median duration ranging from 1 day to 6 weeks. A definition of “rapid” ES or GS is seriously lacking in the literature [11, 23]. We therefore decided to use the term of “accelerated” GS, to be in line with our main objective of integrating priority GS for neonates and children hospitalized in ICU in the diagnostic process on a national level (network of 8 French reference centers). This study was not designed to evaluate the costs, but to allow accelerated trio-GS without high additional expenses. Therefore, standard laboratory procedures were used as much as possible. Some changes to the usual routine organization were implemented before the study. While our team usually relies on ES for routine diagnosis, GS was preferred because it has been shown that GS is more effective for detecting exonic variants, structural variants (SVs) including CNVs (copy number variations), and balanced chromosomal rearrangements [30, 31]. In addition, GS is more suitable for urgent situations because of the shorter preparation time for samples before the sequencing itself, due to the absence of exon capture. The bioinformatics pipeline implemented for this study was adapted to reduce the computation time from 4 to 2 days.

The prospective investigation performed to identify barriers or organizational obstacles has helped shorten the process throughout the course of the study. While we were unable to meet the objective of 45 days from blood sampling to results during phase 1, with a median duration of 68 and a mean duration of 63 days, the results of the corrective measures were clearly visible during phase 2 (Fig. 2 and Table S2). As the company in charge of the transport of blood samples did not explain the reasons for the delays in steps 1 and 3, we were unable to address this issue. To minimize delays and meet deadlines, samples were tracked during each step, requiring frequent interactions between the referring clinician and the laboratory. To further reduce the sample shipment duration, we would have to switch to a more expensive contract, notably to handle deliveries on non-working days. To avoid both steps 1 and 3, we could also perform the analyses locally, which would require a sequencer per hospital. For step 2, we estimated at the beginning of the study that DNA would be extracted within one working day after the reception of the blood samples. This goal was not met during phase 1, so we revised the division and rotation of tasks to increase efficiency without interrupting the routine activity of the diagnostic laboratory. To further reduce the pre-analytic registration and techniques duration, duties on weekends and public holidays would have to be set up, to allow extraction, quality checks, and shipment of the samples for sequencing without waiting until the beginning of the following week. The main issue was the time required for sequencing (step 4), which almost consistently exceeded the expected maximum duration. The handling of samples was modified to set up a priority circuit with technician time dedicated to this project on the CNRGH platform. In addition, the high-throughput library preparation and sequencing workflow was modified to be able to include the samples received each week without delay. The coordinating lab also had to inform the platform managers in real time of the inclusion of samples into the flow to better organize GS in its activity. The obstacle for even faster sequencing is the threefold cost in terms of sequencers, consumables, and staff. It should be noted that in most studies on rapid circuits, sequencing was carried out on local dedicated platforms. France, however, has opted to establish a national network for genomics centralized on two platforms. The expected intense flow of samples to these platforms in the future may improve sequencing time without the need for a dedicated machine. Steps 5, 6, and 8, which were all performed by the coordinating lab, were successfully completed within the estimated time frame. Bioinformatics analysis (step 5) is limited by the available hardware and the software optimization. For example, the use of phenotypic-driven analysis and GPUs could further reduce the time needed for bioinformatics computing [32]. In phase 1, the multidisciplinary meeting (step 7) was planned during data interpretation (step 6). The short notice prevented quorum from being reached in time for 3 cases. In phase 2, there were no further delays for the step 7 thanks to a schedule planned as soon as the sequencing data was received (step 5). The duration of steps 6 to 8 depended on the availability of the geneticists, both when the data was available for interpretation and later when the interpretation was completed to plan the multidisciplinary meeting.

We can assume that our high diagnosis yield stems from the inclusion criteria requiring a genetic cause to be suspected. We found 34 SNVs or indels and 2 SVs. We have shown that the percentage of positive variants increases with additional investigations (asking for expert advice, extended familial segregation, functional studies…). After reevaluation, we have increased the molecular diagnostic rate from 18/37 (49%) to 21/37 individuals (57%). Interestingly, GS identified one case that could not have been resolved by ES. In case 27, we indeed identified a deletion in LAMA2 inherited from the father and a complex rearrangement leading to a disruption of LAMA2 inherited from the mother. ES would have identified the duplication in the rearrangement and the deletion but would not have provided a hypothesis for the underlying mechanism nor the link between the duplication and the loss-of-function of the gene.

In previous studies, “rapid” GS resulted in a potential change in patient management in 27% to 72% of positive cases, including changes in therapy, palliative care, targeted surveillance, genetic counseling, and prenatal diagnosis for future pregnancies [4, 7,8,9,10,11,12,13, 17, 33, 34]. The results of the FASTGENOMICS study show an undeniable clinical interest for patients, with an accelerated diagnosis that could lead to earlier care management decisions and adaptations (including therapeutics, active reanimation, consultation of a specialist in the pathology, reevaluation of surgical indications, complementary investigations and even discontinuation of medical care). Based on the literature, specific changes in therapeutics alone could potentially benefit 7/18 positive patients (39%). L-tyrosine has been suggested to be beneficial in nemaline myopathy (#256030, case 4) [35], although the effectiveness of this treatment has not been proven [36]. Interferon-α2a or -α2b have been shown to increase hemoglobin and decrease iron overload, and successful allogenic bone marrow transplantation has been described in cases resistant to interferon therapy in congenital dyserythropoietic anaemia type Ia (#224120, case 11) [37]. In some cases, the initiation of sodium channel blocker therapy is recommended for treating seizure in individuals with familial benign infantile seizures due to a SCN2A pathogenic variant (#607745, case 15) [38]. Avoiding valproic acid and divalproate is recommended in cases with mitochondrial DNA depletion syndrome 4 A (Alpers type) (#203700, case 18) because of the risk of precipitating liver disease [39]. Inclusion in a gene therapy trial (NCT01840657 and NCT03199469) can be proposed to improve neuromuscular and respiratory symptoms in children with MTM1-associated myotubular myopathy (#310400, case 20). Supplementation of oral galactose has showed clinical and metabolic improvements of glycosylation in CDG syndrome type It (#614921) (case 22) [40]. Vinpocetine or clonazepam have been reported in autosomal dominant epileptic encephalopathy type 43 (#617113, case 24) as having potential efficacy in reducing seizures [41, 42].

We have confirmed the importance of frequent interactions and solid communication, the risks of relying on a small team for specific tasks and the need to train backups, the impact on routine activities, and the importance of obtaining interpretations from both molecular geneticists and cytogeneticists for their expertize in SNVs, indels, and SVs. Speeding up the time to diagnosis using GS while including patients from multiple hospitals but relying on only one laboratory and one sequencing platform and while limiting additional costs presented considerable logistical challenges. Nonetheless, our collaborative community was able to accelerate access to GS in urgent situations and promote a unified approach across different sites in France through infrastructure organization, regular feedback, and sharing of expertise.

This pilot study demonstrated the feasibility of national accelerated diagnosis by trio-GS for newborns and infants requiring an urgent diagnostic orientation for their care, with limited additional cost. Since the extra costs are minor compared to those of the standard circuit, centers that already use ES or GS for diagnosis could implement such an accelerated circuit. A larger study is underway to investigate the impact of these results on patient care management in a larger cohort, including the present one, at short and medium term, in comparison with the classic circuit. Nevertheless, this accelerated circuit must certainly coexist with a rapid circuit, which, although more expensive, can be used in situations where the need for a diagnosis is particularly urgent.