Introduction

Over the last decades efforts have been made to redefine normal labour duration1,2,3,4,5,6. Contemporary research suggests that the first stage of labour can last longer than previously understood without evidence of an increase in risks for the mother and neonates7,8. Strategies supporting greater patience and less interventions during the first stage of labour have been tested and adopted in some settings3,5,9,10, however, it is an ongoing challenge in maternity care to balance the benefits of expectantly managing longer labour durations without jeopardizing the health of the neonate7,11,12.

Prior evidence evaluating the benefits and harms of expectant labour duration during the active phase is inconclusive regarding potential consequences for the neonate4,12,13,14. More research has been accomplished addressing second stage duration and adverse neonatal outcomes, potentially based on the idea that second stage is the most demanding period of labour for both neonate and woman15,16,17.

Labour is a continuous process and time-based interventions during both first and second stage are influenced by overall labour duration in complex ways18,19. In addition, many studies dichotomize labour duration into ‘prolonged’ and ‘not prolonged’ with varying time thresholds making it difficult to estimate a possible effect of a gradually increasing labour duration13,15,20. Recent research of the active phase of labour proposes dilation of 5 cm as the threshold separating latent and active phase, which is the updated definition according to World Health Organization (WHO)21. This study aimed to investigate the possible associations between active first stage of labour and neonatal outcomes in a contemporary cohort of women.

Informed by our previous research on labour duration, including both latent, active phase and second stage, we explored associations between a wide range of labour duration measures and adverse neonatal outcomes during the spontaneous onset of labour2,17,18,22. The objective of this study was to (a) investigate the associations between first stage duration and adverse neonatal outcomes (total effect), and (b) quantify the extent to which these associations are mediated by second stage labour duration (indirect effect).

Material and methods

Data sources

We conducted a population-based cohort study, using data the Stockholm-Gotland Obstetric Cohort23. This prospective dataset includes 334,138 maternal and neonate dyads with singleton births between January 2008 and June 202023. Standardized recorded variables and data is captured directly from the electronic medical record system, by the unique personal identify number of the mother and infant, including diagnoses according to the Swedish version of the International Classification of Diseases (ICD), Tenth Revision. Furthermore, the data contain granular information embedded in standardized partographs with serial information (i.e., the date and time points of an individual’s cervical dilation). The cohort is linked to the Swedish Neonatal Quality (SNQ) register, which collects data on all infants born in Sweden who are admitted for neonatal care, including delivery rooms deaths and ICD codes. A validation of the SNQ register from 2019 demonstrated excellent accuracy24,25.

Study population

We included nulliparous women with singleton term gestation pregnancies (≥ 37 + 0 weeks) in vertex presentation who experienced spontaneous labour onset, (n = 99,313, Target Population). Figure 1 show a flowchart for this selection of the population. From this population, we identified the “Complete case cohort”, restricted to women with a documented notation of date and time of 5 cm cervical dilation in the partograph, this cohort was used for sensitivity analysis.

Figure 1
figure 1

Flow chart for the target population, complete case cohort (for sensitivity analysis) and main cohort: Study Cohort N = 46,040 women.

For 15,241 women with missing notations for 5 cm but with measurements of cervical dilatation before and after 5 cm (see “Imputation section” for further description), we estimated and imputed the date time 5 cm and hence included these in the Study cohort which consisted of 46,040 women (Figure S1). Main analysis was conducted on the Study cohort. Data did not capture gender preferences; the standardized gendered terms are used (woman) for the individuals included in this study.

Exposure

The exposure of interest was active phase, i.e. active first stage duration. For each included woman we identified by the first partograph notation of cervical dilation of 5 cm (known or estimated). The end of first stage was defined by the first timepoint that full cervical dilation was recorded in the partograph by the caregiver for the patient in real time. This definition of cervical dilation in relation to active phase was chosen to correspond with the updated WHO partograph and definitions21. Consequently, women with a caesarean delivery prior to a fully dilated cervix, during active first stage were not included.

Imputation

The onset of active first stage (i.e. timepoint of cervical dilation of 5 cm) was not available in a large proportion of women (n = 60,237, 60%). This is to be expected given wide variability in both timing of hospital admission and of cervical examinations in labours with spontaneous onset. To improve precision and minimize the risk of selection bias, 10 different imputation patterns were hierarchically used to estimate onset of active first stage for cases where partograph data of 5 cm was missing but information on cervical dilation before and after 5 cm was documented in the partograph. Each woman with a recorded data timepoint for cervical dilation pre and post 5 cm were subject for imputation, information about the imputation extensively described in2 (Figure S1). This method was applied to reduce selection bias in the study population and this has been used in previous publication, published elsewhere26 From the Target population we defined the complete case cohort (n = 31,351) as only including observations with a notation of cervical dilation of 5 cm. The final Study cohort included the complete cases plus “15,241” cases with the imputation of the onset of the active first stage with using information from at least 1 notation of cervical dilation < 5 cm and another between 6 and 10 cm, of which, 1% cases (n = 552) do not have documented notation of the end of active phase that were therefore removed from the main analysis in the Study Cohort (Fig. 1).

Outcomes

Adverse neonatal outcomes were retrieved from the SNQ register and categorized hierarchically into two separate composite adverse neonatal outcomes, severe and moderate. Severe neonatal outcomes were defined as a composite of neonatal outcome conditions with a high risk of death and/or major neurodevelopmental impairments. Moderate neonatal outcomes were defined as a composite of neonatal outcome conditions with a low risk of death or major impairments if adequate treatment is administered within an appropriate time frame (Table S1 and Table S2).

Covariates

We selected potential confounding variables based on biological plausibility, a priori knowledge and by a Directed Acyclic Graph (DAG)27. Early pregnancy BMI, maternal age28 and gestational week were selected as confounders in the final analysis. Gestational week at birth was used as a proxy for birthweight13,29. Other covariates considered and tested in the initial analysis (but removed in the final analysis to avoid over adjustment, in line with the framework of DAG) as confounders were hypertensive disease (gestational hypertension, and preeclampsia), pre-gestational diabetes, gestational diabetes, epidural analgesia, fetal position, birthweight and oxytocin use during first stage (Fig. S3).

Main analysis

Descriptive statistics of baseline characteristics were stratified into four categories (c1-c4), based on the distribution of the active first stage duration (Table 1). Continuous variables were transformed into categorical variables. Our primary independent variable was the duration of the active first stage, and to explore how duration-category were associated with neonatal outcomes, we created four categories of duration of the active first stage by increasing distribution intervals, namely; c1: < 50th percentile (median), c2: 50th to 75th percentile, c3: > 75th to 90th percentile and c4: > 90th percentile (Table 1).

Table 1 Maternal and delivery characteristics, categorize into percentiles of first stage duration for the Study cohort including 46 040 women.

Poisson regression with robust standard errors was applied to investigate the association between active first stage of labour duration and the relative risk of either severe or moderate neonatal outcome. In the regression analysis, c1 was used as reference. Results are presented as crude and adjusted relative risks with corresponding 95% confidence intervals (CI) (Table 3 and Figs. 2, 3). There were few missing observations on covariates (< 4%), and no imputation methods were applied to them. SAS version 9.4 (SAS Institute Inc., Cary, NC, USA) was used for data management and analyses (Table 2).

Figure 2
figure 2

The absolute risk and proportions for severe or moderate neonatal outcomes for the association between increasing active first stage of labour duration presented along with the adjusted relative risk of severe or moderate neonatal outcomes in the forest plot. Forest plot for the association between increasing active first stage of labour duration and the relative risk of either severe or moderate neonatal outcome, adjusted relative risk presented.

Figure 3
figure 3

Stratified adverse neonatal outcomes depicted linearly in relation to the active first stage of labour duration for the points of investigated percentiles in the Study Cohort of 46 040 women. Y axis corresponding to absolute risks in % and X corresponds to the continuous exposure “active first stage of labour duration”, categorized.

Table 2 Descriptive data on the duration of active first stage of labour in hours in the Study cohort and the Complete case cohort (reported as percentiles) and stratified by adverse neonatal outcome (Severe, Moderate, No adverse).

Mediation analysis

A mediation analysis was performed to investigate the extent to which associations between first stage and adverse neonatal outcomes were mediated by second stage of labour duration (Table 4, Figs. S2, S3)16,17,18. This approach, extensively described elsewhere, builds on a counterfactual framework30,31,32,33,34. Total effect decomposes into the product of the direct effect and mediated effect, and we also estimated the proportion mediated on the risk difference scale. Here, the mediator, second stage, was modelled both as binary and as a continuous variable (Fig. S2, Table 4), following the significant findings in the main analysis (Table 3). We used the CAUSALMED procedure in SAS for all estimates.

Table 3 Active first stage of labour duration and relative/absolute risks of adverse neonatal outcomes in the Study cohort, using the first category (< 50th percentile) of active first stage of labour duration as the reference.
Table 4 Mediation analysis for comparing increasing active first stage labour’s association to adverse neonatal outcomes; effect of active first stage of labour duration (Total effect TE), mediated through the causal mediator, second stage labour duration, (Natural indirect effect NIE).

Sensitivity analysis

We performed several sensitivity analyses. First, to account for the possible influence of maternal co-morbidity, we tested several potential confounders in the adjusted analysis. Second, to verify the categorization (percentiles of first stage duration) of our exposure, we reran analyses of relative risks using alternative exposure categorization groups (Table S3). We also investigated the results using first timepoint of contractions as start of the exposure, this to compare if patterns of duration between the outcomes (severe, moderate, no adverse) was robust when using another starting point (Table S4). Third, to further evaluate the robustness of the imputed timepoints for 5 cm, all analyses were also performed in the “Complete case cohort” (Table S5). Four, to examine the potential influence of oxytocin use during first and/or second stage of labour, we performed adjusted analysis including oxytocin as a confounder, additionally we performed an adjusted analysis including also birthweight in the adjusted model. (Table S6). The last sensitivity analysis was performed to describe the distribution of duration and numbers of excluded cases of adverse neonatal outcomes due to caesarean delivery during the active first stage (Fig. 1, Table S7).

Details of ethical approval

Permission for this study was obtained from Swedish Ethical Review Authority (In Swedish: Etikprövningsmyndigheten, Sweden, http://etikprovningsmyndigheten.se) and approved by the regional ethical committee (IRB) at Karolinska Institutet, Stockholm No 2019-02818, 2009/275-31. The studies in the project are based on previously collected medical record and register data and the personal identification numbers has been replaced by anonymous serial numbers by the Swedish National Board of Health and Welfare. Analyses were conducted on de-identified data and informed consent was collected from caregivers, in accordance with the ethical approvals. All methods were performed in accordance with relevant guidelines and regulations.

Results

Main analysis

Among the 46,040 women in the Study cohort, 94.2% experienced a delivery with normal neonatal outcomes. We identified 979 (2.1%) deliveries with a severe outcome, and 1712 (3.7%) with a moderate outcome (Table 1). The median duration in the Study Cohort was 5.1 h. Among deliveries with severe neonatal outcomes the median duration was 5.7 h and among moderate neonatal outcomes the median was 6.2 h. Absolute risk for moderate outcomes increased from 2.8% (c1) to 6.2 (c4), and absolute risk for severe outcomes from 1.9% (c1) to 3.0% (Table 1, Table 3, Figs. 2, 3). With increasing active first stage duration, the proportions of women with a caesarean delivery during second stage increased, from 4.3% (< 50th percentile, c1) to 13.3% (> 90th percentile, c2) (Table 1).

There was an association between increasing active first stage duration and the relative risk of adverse neonatal outcomes which persisted in the adjusted analysis for moderate neonatal outcomes for all duration categories (c2-c4) compared to the reference (c1). Compared to women with an active first stage < 5.1 h (c1), women with an active first stage of labour between 5.1 and 7.5 h (c2) had an increased risk of moderate adverse neonatal outcomes (aRR 1.4, 95% CI 1.24, 1.58). The most pronounced relative risk was observed among women with an active first stage beyond 10.1 h (c4) compared to women in (c1) (aRR 2.11, 95% CI 1.83, 2.43) (Figs. 2, 3, Table 3).

Increasing labour duration prior to 10.1 h was not associated with an increased relative risk of severe outcomes to the same extent as for the moderate neonatal outcomes. However, when comparing women with labours beyond 10.1 h (c4) to women with < 5.1 h (c1), the relative risk of severe neonatal outcomes was 1.53 (95% CI 1.26, 1.87) (Figs. 2, 3, Table 3).

Results from mediation analysis

We then assessed the role of second stage duration on the association between active first stage duration and adverse neonatal outcomes (Fig. S2). When modelling second stage as a continuous mediator, approximately 21% of the total effect was mediated through the duration of second stage (Table 4, model 4).

For severe neonatal outcomes, among women with an active first stage duration > 10.1 h versus less than 10.1 h, 14% of the total effect was mediated through the second stage duration (Table 4, model 6). To conclude, this analysis showed that a most of the risk on adverse neonatal outcomes were due to active first stage duration since a smaller percentage (21% and 14% respectively) was related to the indirect effect of an increasing second stage duration.

Results from sensitivity analyses

In a sensitivity analysis, neither hypertensive disease, gestational hypertension or preeclampsia nor gestational diabetes significantly altered the point estimates for risk of adverse neonatal outcomes and were therefore not included as confounders in the final analysis. In the second sensitivity analysis, as expected the results were marginally changed (Table S3). Same pattern was also observed when we tested a different start of the exposure (Table S4). Sensitivity analysis performed in the Complete case cohort, showed that results were consistent with findings from the main cohort (Table S5). Including oxytocin and birthweight as a confounder only marginally reduced the risk estimates (Table S6). In our last sensitivity analysis, we performed a restricted analysis for women experiencing an adverse neonatal outcome and not included in the population due to a caesarean delivery during first stage. They had slightly shorter overall active first stage duration at all investigated percentiles. Here it was obvious that a majority of these cases 43.8%, n = 74 (moderate), and 52% n = 31 (severe) showed an early sign of fetal distress, and had a caesarean delivery before 5.1 h (50th percentile) (Table S7).

Comment

Principal findings

Increasing active first stage duration in our population of nulliparous women with term pregnancies and spontaneous onset was associated with increased relative risks of both moderate and severe neonatal outcomes. Effect decomposition further suggested that most of this association was not mediated by second stage of labour duration.

Results in the context of what is known

These study results contribute to prior findings showing an association between longer active first stage of labour durations and adverse neonatal outcomes in high income settings. Our findings echo previous publications by Blankenship et al.12 who reported increasing neonatal morbidity for labours > 11.3 h. Different from ours, their study included parous and induced labours and investigated a slightly different composite neonatal outcome. Similarly, the most recent study from the Consortium on Safe Labour (based on data from 2002 to 2008) concluded that slower labour (≥ 4 h) at 6–7 cm was not associated with risk but allowing arrest of dilation of ≥ 4 h between 8 and 9 cm increased the risk for adverse neonatal outcomes6. Considered together, these findings suggest that increasing active first stage of labour duration, rather than a strict duration threshold at a specific cervical dilation, may increase risk.

Creating composite neonatal outcomes is not straightforward. Some neonatal outcomes are associated with potentially debilitating or life-threatening consequences while others are not. For this reason, we chose to analyses adverse neonatal outcomes indicative of serious concerns for long term consequences separately from those that are less likely to have long term consequences for the neonate, when adequately treated. For example, the underlying etiology for the severe outcome intracranial hemorrhage (ICH) may have been developed antenatally or acquired during labour, however the consequences could be fatal and actions to prevent ICH is important. Although we used a more clinically relevant composite neonatal outcome, our findings appear similar to previous publications6,7,12,14, with an important finding that neonates with a composite of severe outcomes had shorter distribution of duration than moderate outcomes.

The mediation analysis addressed the questions: (a) “Is second stage labour duration mediating the association between active first stage of labour and adverse neonatal outcome?” and (b) “Is the mediated effect of second stage of labour duration similar for moderate and severe neonatal outcomes?” Study results indicate that first stage duration is independently associated with adverse neonatal outcomes, given that only 21% and 13% was mediated through second stage duration for severe and moderate outcomes, respectively.

There is conflicting evidence regarding the association between first and second stage of labour duration and adverse neonatal outcomes: some studies report no increased risk15,35,36, while others have found that a prolonged duration is associated with increased neonatal morbidity16,17,20,37,38. This may be partly explained by the fact that studies addressing the impact of labour duration have used a predefined binary threshold of 95th or 90th percentiles duration (first or second stage) and estimated risks for discrete additional increments in duration (i.e. hours) beyond these thresholds. This may indicate that predefined binary thresholds at 95th and 90th percentiles duration are not clinically meaningful. Also, discordances in the neonatal outcomes investigated hamper comparisons between different studies1,12,13,14,15,17,20,35,38.

Clinical and research implications

Our study offers novel findings that challenge the value of using points of statistical distribution to define normal vs. abnormal labour duration threshold, focusing on the continuous process of labour duration. The findings from this study may safeguard against potential unintended fatal consequences of practice change based on average separate thresholds for slow progression during first and second stage of labour. Vice versa, potential unintended consequences of a “better safe than sorry regime” approach needs to be balanced against the risk of long-term consequences for both mother and neonate when terminating labour solely due to ‘slow labour’.

Other factors need also to be considered in the context of studying labour duration; uterine contractions cause intermittent decreases in uteroplacental blood flow, resulting in temporarily reduced blood and oxygen delivery to the fetus39,40. Most fetuses tolerate these normal labour processes very well. We hypothesis that other factors such as placental insufficiency, inflammation or sub-clinical infection that, when combined with longer labour, may increase the risk. There may also be some fetuses who are more sensitive to medical practices, such as oxytocin augmentation, higher dosage and time augmented are reported for women with longer duration. By widening our focus beyond labour duration to consider combinations of anthropometrics and medical practices that increase or decrease fetal tolerance for labour, we may ultimately be more successful in defining clinical interventions to identify fetuses who cannot tolerate labour much beyond the median duration vs. fetuses who are impervious to longer durations. These cases are apparent in the sensitivity analysis restricted to the few cases of adverse neonatal outcomes among women with caesarean deliveries due to fetal distress (Fig. 1, Table S7), where the majority of cases were found in the < 50th percentile, 5.1 h.

Studies suggest that identifying a firm, universal labour duration threshold for when neonatal risk increases is challenging and may not be feasible. There is an urgent need to better elucidate the physiological and patho-physiological factors that influence women and neonates during spontaneous onsets to better guide clinicians Incidence rates of adverse neonatal outcomes in term gestations (> 37 weeks)- among low-risk women are low and relative risk or odds ratios commonly used may be difficult to interpret. For these reasons, the absolute neonatal risk related to labour duration may be a more appropriate measure for clinical guidance12,13,14.

Strengths and limitations

Strengths of our study include the use of a large population-based cohort linked data with the validated SNQ quality register providing the most robust outcome data available25. Compared to previous research, this study was adequately powered for comparison at higher percentiles of duration despite the prior known rarity of some of the adverse neonatal outcomes. After imputation for start of active phase for women with missing notation of (first stage of labour), i.e. 5 cm cervical dilation, we were able to be consistent with recommendations for active labour duration research21,41. Since we were also able to include women with dilation of 3.4 cm we could reduce both selection bias and misclassification of the exposure, which is the well-established major research challenge in studies investigating duration. A series of sensitivity analysis performed that only slightly changed the results, demonstrating robust results in the final study cohort. Using mediation analysis to quantify the effect of different levels of duration in a broader perspective has to our knowledge not been applied for studies investigating the impact of labour duration on adverse neonatal outcomes30,31,32,33.

Limitations in our analyses is that our model did not include latent phase duration (a possible confounder) and selection bias introduced by including only women who reached the second stage of labour. The influence of latent phase and the differences in duration between spontaneous onsets in different cohorts has been reported and described elsewhere2,18. Importantly, this investigated cohort does not include any women experiencing a caesarean delivery during the active first stage of labour, how labour duration in relation to their neonatal outcomes has not been investigated in this study, with the exemption those included in the sensitivity analysis in Table S7. We did not expand our cohort to include any women with > 5 cm dilation upon arrival, since we could not accurately measure the start of exposure for them, hence it is possible that the study cohort is biased since a few women with faster labours (being dilated more than 5 cm upon arrival) could have been excluded from the cohort. We do not think that that would have influenced our robust estimates, since being dilated more than 5 cm is uncommon for nulliparous women arriving at the delivery wards in Sweden. Mediation analysis was restricted to second stage duration, other potential mediators of interest (augmentation with oxytocin, epidural, mode of delivery) were not investigated. Though we were able to adjust for several confounders and also tested in sensitivity analysis a wide range of covariates, we were unable to control for unknown factors. Residual confounding may also exist to some extent due to how categorization of known confounders was performed. One more limitation is that despite the data were captured from the partograph prospectively, the intention of this study is describing association. Hence, the study is not designed for predictions for at what exact timepoint of duration clinicians should intervene to avoid adverse outcome. The generalizability is restricted to similar populations and settings for spontaneous onsets, where the caesarean delivery rate during first stage of labour is low.

Conclusions

We report an association between increasing active first stage duration and increased risks of adverse neonatal outcomes. The absolute risk of experiencing an adverse neonatal was low in this study population. Given the findings and the overall high use of interventions to mitigate slow labour, future strategies are needed to identify neonates who are vulnerable to longer labours for targeted intervention strategies and clinical guidance.