Introduction

Neonatal brain injury, with its two main subtypes of hypoxic-ischemic encephalopathy (HIE) in the late preterm and term infant and germinal matrix hemorrhage-intraventricular hemorrhage (GMH-IVH)/periventricular white matter injury (PVWMI) in the premature infant remains a focus of pediatric research due to its relatively high prevalence, the limited understanding and consensus on diagnostic criteria, the paucity of therapeutic options, and the serious long-term outcomes of neurological disabilities and death.1,2,3,4,5,6,7,8,9,10,11

Currently, clinical tools are limited to assess severity of brain injury, provide prognoses, and monitor intervention efficacy, promoting further research into multidimensional approaches to fill the gaps of knowledge.12,13,14 Within the multidimensional approach, blood-based biomarkers provide the potential for serial non-invasive measures to identify pathological processes based on abnormal molecular concentrations. Blood biomarkers have been extensively researched in adult traumatic brain injury and neurodegenerative diseases.15,16,17

Such studies have identified several potential biomarkers representing multiple facets of the injury continuum, including CNS-insult, inflammatory, and trophic proteins, that are now being investigated in the neonatal population. Glial fibrillary acidic protein (GFAP) and neurogranin (NRGN) are astrocyte or neuronal-specific proteins studied as potential brain-specific CNS-insult indicators. GFAP is an astrocyte cytoskeletal intermediate filament protein specific to central nervous system tissues and is released into the blood after brain injury.17 GFAP has been shown to be a significant diagnostic and potentially prognostic biomarker in infants with HIE and PVWMI.18,19,20,21,22,23 Neurogranin is a brain-specific protein kinase C substrate that plays a role in the excitatory neurons of the cortex and hippocampus and higher concentrations have been found in the circulation of patients with traumatic brain injury and neurodegeneration.15,24,25,26,27 In neonates with neonatal encephalopathy, NRGN, measured on DOL 0-1, has been associated with worse encephalopathy scores and neurodevelopmental outcomes.28 Interleukins have been investigated as potential biomarkers of neonatal brain injury among infants with neonatal encephalopathy, with elevated levels of IL-6, IL-8, and IL-10 measured within 24 h from birth associated with brain injury severity, abnormal neuroimaging findings on MRI, and impaired neurological outcomes.21,28,29,30,31 Neuronal trophic factors may play a role in neuronal cell repair and recovery after injury. Brain-derived neurotrophic factor (BDNF) is a trophic protein, shown to play a role in neuronal proliferation, differentiation, and plasticity.32,33,34,35 Decreased levels of BDNF were associated with worse encephalopathy grade and worse neurodevelopmental outcomes in neonates with encephalopathy.28,29 In addition, vascular endothelial growth factor (VEGF) is another trophic protein upregulated after tissue injury.36 In the literature, there is controversy over whether high or low VEGF is associated with encephalopathy severity.21,37,38

From the published studies, evidence shows a panel of multiple inflammatory, neuronal, and glial biomarkers measured at multiple time points would be advantageous for clinical diagnosis, prognosis, therapeutic monitoring, and identifying new therapeutic targets.39 However, a review of these biomarkers suggests they have not been studied enough to be used in routine clinical care and need validation to develop practical tests.40

The major obstacle to adopting many of these biomarkers into clinical practice is the lack of normative data in the neonatal population and the variability among gestational ages. Therefore, we aimed to determine the serum concentration of six pre-clinical brain injury-associated biomarkers which included CNS-insult (NRGN, GFAP), inflammatory (IL-6, IL-8, IL-10), and trophic (BDNF, VEGF) proteins in non-brain injured neonates from 23 to 41 weeks’ gestation.

Methods

Patients

In this multicenter prospective observational cohort study, we identified and analyzed the concentrations of CNS-insult (GFAP, NRGN), inflammation (IL-6, IL-8, IL-10), and trophic-factor (BDNF, VEGF) markers from a cohort of non-brain injured neonates born between 23-41 weeks’ gestation admitted to the level 4 neonatal intensive care unit (NICU) at the 48 bed Johns Hopkins Children’s Center (2009-2019) and 97 bed Johns Hopkins All Children’s Hospital (2015–2019).2,13,14,41,42 As both units accommodate inborn and outborn patients, only infants admitted in the first 24 h of life were included. Infants with genetic syndromes, chromosomal abnormalities, and major congenital anomalies were not approached for consent. Infants with evidence of brain insult or predetermined brain injury risk factors at birth, infants who developed signs/symptoms of brain injury during the hospital stay or had neuroradiological abnormalities on any brain imaging, and infants with an abnormal neurologic exam at the time of discharge were excluded from the study (Fig. 1). The neonates in this cohort had serial serum samples collected on days of life (DOL) 0–7. The study received institutional review board approval at both hospitals, and signed parental informed consent was obtained for each participant within the infant’s first 30 days of life and prior to NICU discharge (IRB00026068).

Fig. 1: Sequential exclusion criteria for the neonatal cohort.
figure 1

Exclusion criteria included conditions, clinical signs, or imaging that indicated brain injury or infection in the eligible neonates born between 23 and 41 weeks’ gestation admitted to the NICU.

Laboratory methods

Samples were procured from remnant serum obtained daily as part of routine patient care. Per College of American Pathologists (CAP) and clinical policy regulations, remnant serum was retained in the clinical laboratory for 48 h after all clinical testing was completed to allow for add-on testing or results validation. Once available for research use, appropriate samples for patients who provided consent as well as those potentially eligible for the study who had not yet provided consent were biobanked. Stored samples were subsequently discarded if consent was not obtained within 30 days. Biobanking entailed immediate transfer of samples into a 1.4 mL low-retention polypropylene cryovial and freezing at −80 °C. Prefreezing characterization included a description of sample quality by grading hemolysis level (none, mild, marked, or gross) using a comparison to a visual standard. All sample aliquots were exposed to 1–2 freeze/thaws prior to assaying. To minimize any effect post-collection handling may have on biomarkers levels, samples were only frozen once and thawed once and all samples were processed in the same manner. A detailed methodology describing participant engagement, consent, and the biobanking process with vulnerable populations is available.43

A custom multiplex enzyme-linked immunosorbent assay (ELISA) was developed to measure BDNF, IL-6, IL-8, IL-10, and VEGF simultaneously using robotically spotted capture antibodies on the 96-well plate format (Meso Scale Discovery [MSD], Rockville, MD) as previously reported.28 The lower limits of quantification for the BDNF, IL-6, IL-8, IL-10, and VEGF assays were 11.84, 0.15, 0.10, 0.15, and 0.41 pg/mL, respectively, with interassay coefficient percent of variation were calculated using a standardized interassay control of normal human plasma and were 24.7, 8.5, 8.5, 16.6, and 5.7, respectively. All assays were performed in the same laboratory and in a single batch utilizing 44 consecutive assay plates over 10 weeks.

A custom duplex ELISA was developed to measure GFAP and NRGN simultaneously using robotically spotted capture antibodies on the 96-well plate format (MSD, Rockville, MD) as reported previously.28 The lower limits of quantification for both the GFAP and NRGN assays were 0.009 ng/mL, with interassay percent coefficients of variation of 13.7 and 27.7, respectively.

Statistical analysis

Demographic data are presented as median and interquartile range (IQR) or percentages, as appropriate. For biomarker values that were detectable, but below the lower limit of quantification, a value was imputed by dividing the lower limit of quantification by two, noting that medians are robust to the specific choice of imputations. To investigate the relationship between each biomarker and GA, we used regression analyses adjusted for DOL. For the secreted molecules BDNF, VEGF, and interleukins, we utilized a quantile regression model to fit the median biomarker level based on the covariates, in which one individual sample per patient was selected at random with 1000 such bootstrapped iterations done to ensure that every sample had an equal chance of selection for the model—the standard error of the mean was determined based on the empirical distribution from the bootstrapped iterations. For the CNS-necrosis markers, we created a model to investigate the relationship between detectable or non-detectable GFAP and NRGN and GA. Detectable GFAP was analyzed using mixed-effects logistic regression to account for multiple measurements within the same person. Mixed-effects logistic regression did not computationally converge for NRGN; thus, we used a bootstrapping approach with logistic regression, with 1000 iterations each randomly choosing one measurement per person, similar to the discussion above. For all statistical analyses, a p-value of ≤0.05 was considered significant. Statistical analysis was performed using Stata (Version 15, StataCorp LLC, College Station, TX).

To identify sex differences in biomarkers values, we tested differences in the distribution of biomarkers between male and female infants using the rank-sum test. We accounted for within-person correlation of ranks by estimating the empirical distribution of the rank-sum statistic with 50,000 permutations of the male/female variable to determine the empirical p-value. The Monte Carlo error for the p-value ranged between 0.0008 and 0.0022 for the different biomarkers.

Results

Assay performance

The assays were performed within acceptable limits with the overall interassay percent coefficient of variation <25%. NRGN had the highest variation (27.7%), likely related to most samples being at the limit of detection. IL-10, GFAP, and NRGN were the lowest concentration analytes. It was expected that GFAP and NRGN would be at the limits of detection in a non-brain injury neonatal cohort. IL-10 was the lowest concentration analyte with a median value of 0.60 pg/mL (IQR 0.05–1.91) and below the limit of detection for many samples over all GAs.

Demographics

There were 5428 samples assessed for eligibility from 1219 neonates between 23 and 41 weeks’ gestation admitted to the JHH or All Children’s NICU. Exclusion criteria included cord pH ≤7.15, infection bacteremia within the first 7 days of life, GMH-IVH, periventricular leukomalacia, intracranial hemorrhage, stroke, seizures, abnormal brain MRI or cranial ultrasound, or therapeutic hypothermia. This resulted in 3232 serum samples to analyze from 745 enrollees. The maternal and neonatal characteristics from this cohort are presented in Table 1. The maternal population had a median gravida of 2 (IQR 1–4) and parity of 1 (IQR 0–2), with 43.0% spontaneous vaginal delivery, 20.3% with preeclampsia, and 6.4% with abruption. The distribution of GA had a median of 32.3 weeks (IQR 29–37 weeks) (Supplement Fig. 1). The neonatal cohort had a median weight of 1720 grams (IQR 1150–2790), a 5-min Apgar score of 8 (IQR 7–9), and cord pH of 7.3 (IQR 7.24–7.34). The cohort was 55.6% male, 8.7% had meconium, and 11.5% had intrauterine growth restriction.

Table 1 Descriptive statistics of maternal and neonatal characteristics.

Relationship of biomarkers to gestational age (GA)

For the cohort in aggregate, the descriptive statistics of the biomarker concentrations are shown in Table 2, including comparison by sex, which revealed no significant differences except BDNF was higher in females (p = 0.037). The relationships between the trophic and inflammatory biomarkers and GA are demonstrated in Fig. 2 and Table 3. The trophic-factor BDNF demonstrated a significant gestation-dependent increase and increased 3.7% for every week of gestation (p < 0.001). VEGF, another trophic factor, demonstrated no gestational concentration effect (p = 0.910). The inflammatory cytokines demonstrated significant gestational effects. As shown in Fig. 2, IL-8 increased 8.9% for every week of gestation (p < 0.001). IL-10 decreased during gestation but was not significant (coefficient = −0.041, p = 0.179). IL-6 also showed no relationship with GA (coefficient = 0.005, p = 0.557).

Table 2 Descriptive statistics of biomarker concentrations.
Fig. 2: Relationship of biomarker concentration with gestational age.
figure 2

Each data point represents the biomarker concentration of a serum sample from an individual patient. The line represents the median for each gestational age and the shaded area represents the 95% confidence interval that the median is within that range. a BDNF increases by 3.7% for every gestational week (p < 0.001). b VEGF shows no relationship with gestational age (coefficient = −0.001, p = 0.910). c IL-6 shows no relationship with gestational age (coefficient = 0.005, p = 0.557). d IL-8 increases 8.9% with every gestational week (p < 0.001). e IL-10 shows no relationship with gestational age (coefficient = −0.041, p = 0.179).

Table 3 Biomarker relationships with gestational age and day of life adjusted for clinical chorioamnionitis.

The biomarker levels of the CNS-necrosis factors are shown in Fig. 3. For analysis, we evaluated the relationship between the amount of biomarker detectable and GA. Overall, 50% of samples had detectable GFAP and 33% had detectable NRGN. The odds of having detectable GFAP did not change for samples collected from infants less than 27 weeks’ gestation (p = 0.100) or between 27 and 36 weeks’ (p = 0.089) (Table 3). However, after 36 weeks’ gestation, the odds of having detectable GFAP increased by 53% for every gestational week (p = 0.002). Additionally, from 23 to 36 weeks’ gestation, the odds of having detectable NRGN decreased by 3% each week (p = 0.042). Then, after 36 weeks’ gestation, the odds of having detectable NRGN increased by 11% (p = 0.002).

Fig. 3: Relationship of CNS-necrosis biomarkers with gestational age and day of life (DOL).
figure 3

Each data point represents the biomarker concentration of a serum sample from an individual patient. GFAP concentration plotted over a gestational age and c DOL. NRGN concentration plotted over b gestational age and d DOL.

Relationship of biomarkers to day of life (DOL)

The relationship between DOL and biomarker concentration is presented in Fig. 4. Unlike BDNF, VEGF, IL-6, 8, and 10 had significant daily variation. VEGF increased by 10.8% for each DOL (p < 0.001). IL-6 decreased by 15.8% for each DOL (p < 0.001). IL-8 increased each DOL by 18.9% (p < 0.001). IL-10 decreased by 81.4% for each DOL from cord blood through DOL 3 (p < 0.001). After DOL 3, IL-10 showed no relationship with DOL (p = 0.825). BDNF had no significant relationship with DOL (p = 0.152).

Fig. 4: Relationship of biomarker concentration with day of life (DOL).
figure 4

Each data point represents the biomarker concentration of a serum sample from an individual patient. The line represents the median for each gestational age and the shaded area represents the 95% confidence interval that the median is within that range. a BDNF shows no significant relationship with DOL (coefficient 0.021, p = 0.152). b VEGF increases by 10.8% each DOL (p < 0.001). c IL-6 decreases 15.8% each DOL (p < 0.001). d IL-8 increases 18.9% each DOL (p < 0.001). e IL-10 decreases by 81.4% each DOL less than DOL 3.

The brain necrosis markers GFAP and NRGN had significant DOL variation with the odds of having detectable GFAP decreased by 15% each DOL (p < 0.001) (Table 4). Similarly, the odds of having detectable NRGN decreased by 8% for each DOL (p = 0.008).

Table 4 CNS-necrosis biomarker relationships with gestational age and day of life adjusted for clinical chorioamnionitis.

Clinical chorioamnionitis sensitivity analysis

In our cohort, maternal clinical chorioamnionitis was present in 20% of participants. To investigate whether the associations of either GA or DOL on the biomarkers were due to confounding by clinical chorioamnionitis, we performed a sensitivity analysis for clinical chorioamnionitis and, surprisingly, did not find any significant differences after adjustment for clinical chorioamnionitis (Tables 3 and 4). For VEGF however, after adjusting for clinical chorioamnionitis, there was a marginally significant association with GA, increasing 1.7% for each gestational week (p = 0.044).

Discussion

To our knowledge, this is the first study to report biomarker values in non-brain injured neonates across the GA spectrum and the first 7 DOL. For GA, BDNF and IL-8 increased each gestational week, while VEGF, IL-6, and IL-10 showed no relationship. For DOL, VEGF and IL-8 increased each DOL, IL-6 decreased each DOL, and IL-10 decreased each DOL for DOL 0-3 only. BDNF was not associated with DOL, and no changes were observed between days 0 and 7 of life. For CNS-insult markers, the odds of having detectable GFAP and NRGN increased each week after 36 weeks’ GA yet, the odds of having detectable GFAP and NRGN decreased with each DOL.

Biomarkers are being studied in neonates to diagnose injury, prognosticate on outcome, monitor disease progression, and potentially unveil new therapeutic interventions. In neonatal critical care, biomarkers have not yet been incorporated into clinical practice as the report pre-clinical studies are relatively small sample and oftentimes lack long-term developmental outcome data. These studies have also varied widely in the specific biomarkers or combination of biomarkers analyzed, the body fluid source, and the laboratory analysis methods. Interpretation of results has also proven to be challenging given the lack of biomarker normative data, the heterogeneity inherent in the wide GA range, the differing patterns of injury between the premature and term infant, the multifactorial disease pathophysiology, and the salient and not yet elucidated role of stress, infection, and different clinical variables in disease expression and progression.

The strengths of this study include the incorporation of two remote NICUs, a large sample size, and standardized sample collection, storage, and laboratory assays. The analysis utilized sophisticated statistical methodology to develop a biomarker model that incorporated multiple measurements from the same individual, in which we investigated both the median and mean models (Figs. 5 and 6). For some of the biomarkers, such as IL-10, the median model was more appropriate because of the larger number of undetectable data points, whereas for other biomarkers there was no difference between the mean and median model. Therefore, we decided to display the median models in the paper for consistency.

Fig. 5: Relationship of biomarker concentration with gestational age in mean model.
figure 5

Each data point represents the biomarker concentration of a serum sample from an individual patient. The line represents the geometric mean for each gestational age and the shaded area represents the 95% confidence interval that the mean is within that range. a BDNF increases by 3.6% for every gestational week (p < 0.001). b VEGF shows no relationship with gestational age (coefficient = 0.019, p = 0.088). c IL-6 shows no relationship with gestational age (coefficient = −0.011, p = 0.243). d IL-8 increases 6.1% with every gestational week (p < 0.001). e IL-10 decreases 7.0% with every gestational week (p < 0.001).

Fig. 6: Relationship of biomarker concentration with day of life (DOL) in mean model.
figure 6

Each data point represents the biomarker concentration of a serum sample from an individual patient. The line represents the geometric mean for each gestational age and the shaded area represents the 95% confidence interval that the mean is within that range. a BDNF increased 5.6% each DOL (p < 0.001). b VEGF increased 31.4% each DOL (p < 0.001). c IL-6 decreased 14% each DOL (p < 0.001). d IL-8 increased 17% each day (p < 0.001). e IL-10 decreased 50.3% each DOL for samples collected before DOL 3 (p < 0.001) and increased by 13.6% after DOL 3 (p < 0.001).

Positive and definite correlation between BDNF serum levels and advancing GA was found, unaffected by clinical chorioamnionitis. BDNF was the only biomarker affected by sex with values higher in females. Although significant, it is unclear if the modest increase is biologically relevant, but sex-related differences in serum BDNF have been reported in adults.44 BDNF has been shown to play a crucial modulatory role in many CNS processes, including synaptic plasticity, new synapse formation, neuronal survival, dendritic branching, and neurotransmitter excitation and inhibition.45,46,47 Similar findings were found in animal studies, where the elevation of BDNF levels resulted in an increased rate and extent of myelination and to a lesser degree, the axon caliber. This increased myelin content and caliber was maintained in adulthood.48 Even though it remains unclear as to what extent serum levels of BDNF reflect the brain concentrations,49,50,51 it is reasonable to assume that the linear increase of BDNF with GA could stem from normal brain growth.

Our findings of undetectable levels of NRGN and GFAP are also important. As these are biomarkers of CNS necrosis, they would not be expected to be elevated in non-brain injured neonates. As demonstrated, however, a number of neonates had quantifiable levels of NRGN and GFAP after birth. An explanation for our findings could be a subclinical brain injury, such has been demonstrated in neonatal and infant cardiac surgery.52,53 Additionally, numerous studies have shown long-term neurological and behavioral deficits in premature infants in the absence of grossly abnormal neuro-radiographic imaging, whether in the form of serial cranial ultrasounds or an MRI performed close to term corrected gestational age. Therefore, NRGN and GFAP are favorable candidates for biomarkers of neonatal brain injury because elevated levels would differ greatly from our study showing low levels of detection of these biomarkers in neonates without brain injuries. The mechanisms driving the temporal relationship of when GFAP and NRGN are detectable are likely gestationally regulated. Levels of GFAP and NRGN both increase in the brain with normal development.54,55 This was evident when GFAP was used in a VLBW IVH cohort where neonates with periventricular white matter injury had elevated GFAP levels, but the concentrations were lower than previously reported in full-term neonates with HIE.20,28 It is likely that the day of life changes in these molecules are related to the effect of the birth process.

We found that IL-8 increased, and IL-6 and IL-10 were unchanged with normal gestational development. Although there are numerous reports of interleukin concentrations to suggest neonatal sepsis, this is the first report to identify the normal gestational levels of IL-6, 8, and 10. In a 2016 cross-sectional study, Romero et al found that neonates born to mothers with chorioamnionitis, in the presence or absence of intra-amniotic inflammation with or without bacteria, had significantly higher median umbilical cord blood plasma concentrations of IL-6, IL-8, and IL-10.56 Similarly, Goepfert et al reported increased umbilical cord blood levels of IL-6 in preterm infants born after spontaneous preterm labor or premature rupture of membranes compared to preterm births done for maternal or fetal indications.57 In our study, we found that chorioamnionitis did not affect the association of biomarkers with GA and DOL. From our study, the progressive increase in IL-8 and nonsignificant absolute decrease in IL-10 is intriguing and possibly reflects the normal maturation of the immune system.

Furthermore, although our gestational age models are adjusted for DOL, we wanted to investigate directly how the biomarkers changed over DOL. DOL and BDNF did not have any relationship, suggesting that perhaps BDNF does not change as quickly as the other biomarkers, which all significantly changed with DOL. VEGF and IL-8 increased and IL-6 and IL-10 decreased with each DOL. Interestingly, after DOL 3, IL-10 no longer changed with DOL, suggesting a plateau in IL-10 concentration. Furthermore, these relationships were not affected by clinical chorioamnionitis. This is in agreement with previous studies that showed IL-6 decreased over the first seven days of life.58 This suggests that inflammatory biomarkers and VEGF should be adjusted for DOL in any analysis looking at biomarker concentration.

This study has several limitations. One limitation is the difficulty in identifying truly neurologically intact neonates among a NICU cohort of infants with GAs as low as 23 weeks. Our exclusion criteria were rigorous to identify infants with clinical brain injury, even among the ELBW infants, but it is well known that premature infants remain at risk for subtle neurological deficits even in the absence of grossly normal neuro-radiographic imaging. Not accounting for clinical variables of brain hypoperfusion like hypotension or symptomatic patent ductus arteriosus is also a limitation. Despite all our effort to screen for, and identify “normal, non-brain injured” neonates, the pool of patients admitted for varying NICU support does not fully represent an ideal healthy neonatal population, but fairly represents the neonatal population where these biomarkers would have clinical utility. Another limitation is that the samples used in this study were clinical remnants with delayed pre-analytic processing. However, all samples were processed in the same manner to minimize any pre-analytic variability. Lastly, when analyzing the effect of infection and inflammation, only the absence of bacteremia in the first 7 days of life and no evidence of clinical chorioamnionitis were used as variables. Another limitation is that only one of the sites documented whether clinical chorioamnionitis was associated with histological abnormalities on placenta pathology. Future biomarker studies investigating inflammatory biomarkers are needed with broader delineation of perinatal risk factors of infection/inflammation that would include the diagnosis of histologic chorioamnionitis, the presence and duration of prolonged rupture of membranes, presence, and frequency of urinary tract infections during pregnancy, as well as type and duration of maternal antibiotics treatment prior to delivery. Such future studies will also need to factor in the non-specific clinical findings in the first 7 days of life of hypotension, need for pressor support, leukocytosis, leukopenia, neutropenia, elevated serial CRP levels, the use of extended course of antibiotics, and the diagnosis of clinical sepsis, among others—all potentially suggestive of an inflammatory process.

Also, although harder to achieve, as we only sampled the first seven days of life, it is unclear if postnatal concentrations of the molecules studied, have plateaued or if there are continued developmental changes.

Conclusions

The gestational effect on serum concentrations of BDNF and IL-8 may be important biologically, suggesting that circulating levels will need to be gestationally adjusted for clinically relevant assessment of brain injury. Additionally, VEGF and the interleukins are dynamic in the immediate postnatal period, and DOL should be included in adjustments for clinically relevant assessment in neonates.