Diagnosing an overcrowded emergency department from its Electronic Health Records

Marzano, Luca; Darwich, Adam S.; Jayanth, Raghothama; Sven, Lethvall; Falk, Nina; Bodeby, Patrik; Meijer, Sebastiaan

doi:10.1038/s41598-024-60888-9

Article
Open access
Published: 30 April 2024

Luca Marzano¹,
Adam S. Darwich¹,
Raghothama Jayanth¹,
Lethvall Sven²,
Nina Falk²,
Patrik Bodeby² &
…
Sebastiaan Meijer¹

Scientific Reports volume 14, Article number: 9955 (2024)

Abstract

Emergency department overcrowding is a complex problem that persists globally. Data of visits constitute an opportunity to understand its dynamics. However, the gap between the collected information and the real-life clinical processes, and the lack of a whole-system perspective, still constitute a relevant limitation. An analytical pipeline was developed to analyse one year of production data following the patients that came from the ED (n = 49,938) at Uppsala University Hospital (Uppsala, Sweden) by involving clinical experts in all the steps of the analysis. The key issues internal to the ED were the high volume of generic or non-specific diagnoses from non-urgent visits, and the delayed decision regarding hospital admission caused by several imaging assessments and lack of hospital beds. Furthermore, the external pressure of highly frequent re-visits by geriatric patients, psychiatric patients, and patients with unspecified diagnoses dramatically contributed to the overcrowding. Our work demonstrates that through analysis of production data of the ED patient flow and participation of clinical experts in the pipeline, it was possible to identify systemic issues and directions for solutions. A critical factor was to take a whole-system perspective, as it opened the scope to the boundary effects of inflow and outflow in the whole healthcare system.

Introduction

Emergency departments (EDs) are essential components in healthcare systems by providing critical care to patients requiring immediate medical attention¹. ED overcrowding is characterized by an increased number of patients seeking care, resulting in long wait times, treatment delays, and reduced quality of care^2,3,4,5.

This problem persists globally^1,6 despite the differences between healthcare policies in different countries^7,8, Sweden being no exception^6,9,10,11. Previous studies showed a high workload for the main Swedish hospitals¹², pointing out the multifaceted nature of operational errors^11,13, negative patient experience of long waiting times¹⁴, and the decreasing availability of beds accompanied by an increasing number of patients visiting the ED⁶.

This problem is challenging because of the complexity of the system operations and diversity of clinical profiles of the patients^15,16. Indeed, a high volume of patients visiting EDs corresponds to a wide range of medical conditions, from patients that need basic care to those with an urgent need for intervention due to the severity of the conditions, with a constrained number of resources to treat them often subjected to cost pressures^16,17.

In recent years, the use of real-world data in clinical practice to inform clinical decisions and systems operations has attracted significant interest^18,19,20. Healthcare production data and Electronic Health Records (EHRs) present an opportunity to comprehensively analyse ED overcrowding and enhance healthcare system operations and management^19,20,21.

Several techniques to exploit real-world data have been proposed and discussed to address the challenge of ED overcrowding in operational research^21,22. These techniques span from traditional approaches such as multivariate linear models²² and simulation process modelling^23,24, to novel techniques based on machine learning^25,26 and process mining^27,28.

Most data-driven approaches retrospectively analyse the data to explore, explain and predict operational variables, such as admissions, re-visits, triage, diagnosis, and length of stay^{16,25,29,30,31,32,33,34,35,36,37,38}. Simulation studies have been used for the purpose of performance evaluation and testing layout planning^39,40,41,42 with a focus on the optimization of scheduling management^43,44. Process mining has been applied for the extraction of clinical pathways directly from EHRs⁴⁵ to improve capacity management⁴⁶ and to cluster patient trajectories based on similar clinical characteristics^47,48. Few participatory approaches involving experts have been used to investigate this problem from the perspective of the different actors involved (e.g., explore the possibility to use past medical records to inform admission decisions, and study of re-visits through created personas from the data records^49,50,51,52) and dashboard development to visualize key performance indicators (KPIs) in real-time^53,54.

However, the gap between real-world data and the actual processes that occur in emergency departments constitutes a key limitation^29,35,55. Indeed, the gap between real operations and abstraction made from event log data is considered a substantial challenge^56,57. This not only limits the effectiveness of pure data-driven approaches but also affects the simulation and process mining approaches^27,58,59. Moreover, the reliability of data-driven approaches is limited by the discrepancies between real-world data primary users and collected information from the clinical experts³⁵.

Previous works mainly refer to supporting better operational decisions⁴³, often attempting to optimise a single key performance indicator (KPI) or specific flows treating the ED as an isolated system⁶⁰, but with limited focus on the policy-level analysis to solve the overcrowding problem^41,61. Moreover, the focus of previous data-driven analysis has been on the volume of flows rather than clinical variability^{16,33,62,63,64,65}, missing considerations on how the complexity of medical evaluation can impact prompt decisions^16,17,34.

Despite the large number of published works and the variety of approaches, further research is still necessary to understand the potential of healthcare data for informing reductions in overcrowding and enhancing the quality of care in the ED. In fact, studying the complexity and the multi-constrained nature of overcrowding makes it necessary to consider the effect of processes happening outside the ED⁴¹. For example, the efficiency of ED discharge could be affected by the delay of hospital admission due to overcrowding of the wards, the so-called boarding⁶⁶, or further pressure can originate from factors outside the hospital⁶⁷.

The involvement of experts in the analytical process is necessary to address these challenges, increase the understanding of phenomena beyond the limitations of real-world data, and explore future design strategies⁶⁸. Hence, a whole-system approach is required to develop reliable solutions for practical applications^15,34,69.

To summarise, there is a need to develop approaches that go beyond purely empirical analysis in order to leverage real-world data to address ED overcrowding. Therefore, we aimed to develop a pipeline to analyse ED data from a whole-system perspective that strives to overcome the limitations of the information in the data and to discuss in depth the causes of, and potential solutions to, the overcrowding. The ED whole-system perspective is given by involving clinical experts in all the analysis steps and by integrating external data or information that is not collected in the ED data regarding the admitting wards and the processes happening outside the hospital.

This pipeline was designed to analyse a real-world case study that consisted of one year (2019) of hospital production data following patients that visited the Uppsala University Hospital ED. The Uppsala ED constituted an ideal case study because of the reported serious shortcomings and hospital overcrowding in the timespan of the data records^6,9,10,11.

Hospital emergency department production data

The Uppsala University Hospital’s (Sweden) ED production data from 2019 were analysed (n = 33,881 patients for n = 49,938 total event logs). It is the only emergency department in Uppsala city and the largest in Uppsala county, and it operates 24 h with two main access points: directly from the ambulance entrance, or through a walk-in reception. Previously, these data were used to inform a simulation study that aimed to improve the ED acute flows by testing which kinds of interventions the hospital needed to reach a 4-h length of stay target⁷⁰.

Tables 1 and 2 summarise the cohort. The following variables were included for each record: age, sex, ADAPT triage code⁷¹ (red: “life-threatening”, orange: “seriously ill”, yellow: “ill”, green: “need of assessment”, blue: “minor injuries or illnesses that can be quickly treated and discharged”, and white: “no need of urgent care or monitoring”), chief complaint (reason for the visit), arrival with ambulance (y/n), imaging scan (y/n), main diagnosis in ICD10 codes (https://icd.who.int/browse10/2019/en), waiting time (from arrival to first contact), length of stay (from arrival to discharge in the ED), and the reason for discharge (sent home, admitted to a hospital ward, death, or other reasons). The ward for each patient admitted to the hospital was also reported. Moreover, any recorded reasons for the ED visit (e.g., referral) and the specific method of arrival if not by ambulance (e.g., pedestrian, or special transport from geriatric or psychiatric facilities) were retrieved (Supplementary Table 1).

Table 1 Summary of the cohort data.

Table 2 Summary of the cohort data.

During the analysis, the hospital records regarding the number of assigned patients and available beds for each hospital ward were also available. Supplementary Table 2 summarises the patients admitted to the hospital, stratified by speciality of the ward. It also reports how many patients were allocated to the right ward. It was possible to retrieve this information with the aid of the clinical experts by looking up the medical alarm unit of the Uppsala internal system associated with the admitted patients and comparing it with the speciality of the ward. According to the clinical experts, this information was relevant to study because wrong admissions are usually correlated with the lack of available places in the right wards.

Methods

Ethics declaration

Ethical approval for the use of the data for the purpose of the presented research was granted by Uppsala University Hospital (case number: FOU2024-00,078). The need for informed consent was waived by Uppsala University Hospital. The entire research was performed in conformance with the WMA Declaration of Helsinki.

Analytical pipeline

The analytical pipeline is shown in Fig. 1. Prior to commencing the analysis, the clinical experts described the processes and protocols behind the ED data (step a.1). All data analyses were carried out using RStudio, version 2022.12.0+353. RStudio was also used to create the plots presented in the results. An important step was the integration of information regarding external factors that influence ED performance but are not collected in the data records (step a.2). These include additional information at a higher level of granularity about the real process, and external factors such as patients coming from special facilities and community needs (e.g., psychiatric and geriatric care).

After contextualizing and explaining the variables, we abstracted the flow characteristics from the data by dividing it into three components (step a.3): input, throughput, and output flow. This follows previously proposed approaches to model ED flows and categorize interventions into types⁷² and associated key performance indicators (KPIs)⁶⁰.

Once the description of the flow from arrival to discharge from the ED had been abstracted from the data, the impact of patient volume on ED functioning was studied (step b.1). This was done by identifying the KPIs that can be computed from the records. Each KPI was computed in relation to the abstraction component to which it belongs. Time series were used to study the daily metrics, and aggregated statistical distributions were used for the hourly and absolute values.

The identified KPIs were associated with the flow components in the following way:

Input: number of arrivals, total and with the ambulance, and patient re-visits;
Throughput: rate of performed imaging assessment and time distributions for waiting time and length of stay;
Output: rate of discharges, admissions to the hospital, and number of fatalities.
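
The grouping of KPIs by flow component can be illustrated with a minimal Python sketch (the original analysis was carried out in RStudio). The record layout, field names, and sample values below are hypothetical and are not the hospital's schema; a re-visit is flagged here simply as any visit by a patient already seen earlier in the data, which is an assumption.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical visit records; field names are illustrative only.
visits = [
    {"patient": "p1", "arrival": "2019-01-01 08:10", "ambulance": True,
     "scan": True, "outcome": "admitted"},
    {"patient": "p2", "arrival": "2019-01-01 09:30", "ambulance": False,
     "scan": False, "outcome": "home"},
    {"patient": "p1", "arrival": "2019-01-02 11:00", "ambulance": False,
     "scan": True, "outcome": "home"},
]


def daily_kpis(records):
    """Aggregate input, throughput and output KPIs per calendar day."""
    seen = set()  # patients already observed, used to flag re-visits
    days = defaultdict(lambda: {"arrivals": 0, "ambulance": 0, "revisits": 0,
                                "scans": 0, "admissions": 0, "discharges": 0})
    # ISO-like timestamps sort chronologically as plain strings.
    for r in sorted(records, key=lambda rec: rec["arrival"]):
        day = datetime.strptime(r["arrival"], "%Y-%m-%d %H:%M").date().isoformat()
        k = days[day]
        # Input KPIs
        k["arrivals"] += 1
        k["ambulance"] += int(r["ambulance"])
        k["revisits"] += int(r["patient"] in seen)
        # Throughput KPI
        k["scans"] += int(r["scan"])
        # Output KPIs
        k["admissions"] += int(r["outcome"] == "admitted")
        k["discharges"] += int(r["outcome"] == "home")
        seen.add(r["patient"])
    return dict(days)


kpis = daily_kpis(visits)
```

Each day's dictionary can then be plotted as a time series, one line per KPI.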

Following the volume analysis, we investigated how KPIs are connected to the clinical variability of the patients (steps b.2 and b.3). In this part of the framework the patient-based variables (e.g., age, sex, chief complaint) were explored in connection with the ones resulting from clinical decisions (e.g., triage, ICD10 diagnosis, scans, and final discharge/admission decision). For the input KPIs the clinical patterns were explored with aggregate statistics and stratification of time series. Special attention was given to the chief complaint and ICD10 diagnosis by designing an interaction matrix between these two variables to assess the variability of clinical decisions. Re-visits by individual patients were studied by mining chief complaint-ICD10 sequences from each visit to identify patterns and longitudinal correlations with previous visits.
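
An interaction matrix of this kind amounts to a cross-tabulation of the two variables. The following Python sketch is one possible construction, not the authors' implementation: the sample pairs are invented, and grouping codes by their leading letter is a simplifying assumption that only approximates the ICD10 chapter structure.

```python
from collections import Counter

# Hypothetical (chief complaint, ICD10 code) pairs, one per visit.
pairs = [
    ("abdominal pain", "R104"),
    ("abdominal pain", "K590"),
    ("chest pain", "R074"),
    ("chest pain", "I489"),
    ("abdominal pain", "R104"),
]


def interaction_matrix(pairs):
    """Count visits per (chief complaint, leading ICD10 letter) cell."""
    counts = Counter((complaint, code[0]) for complaint, code in pairs)
    complaints = sorted({c for c, _ in counts})
    groups = sorted({g for _, g in counts})
    # Nested dict: rows are chief complaints, columns are ICD10 letter groups.
    return {c: {g: counts.get((c, g), 0) for g in groups} for c in complaints}


matrix = interaction_matrix(pairs)
```

Rendering the nested dictionary as a heatmap, optionally one panel per triage code, gives a view of how widely the diagnoses spread for each reason for the visit.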

For the throughput KPIs the time distributions were stratified by the clinical variables, and combinations of variables were studied with heatmaps of the metrics. This made it possible to explore whether there were operational bottlenecks or patterns in patients with long waiting times or lengths of stay in the ED. A multivariate linear regression was performed as a preliminary assessment of the association between length of stay and the variables.
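
For reference, a regression of this type can be written in the standard form below. The notation is generic and is an assumption, since the source does not give the model specification: LOS_i is the length of stay of visit i, x_ij are the p explanatory variables (categorical variables entering as indicator variables), and R² is the usual coefficient of determination used to judge goodness of fit.

```latex
\mathrm{LOS}_i = \beta_0 + \sum_{j=1}^{p} \beta_j x_{ij} + \varepsilon_i,
\qquad
R^2 = 1 - \frac{\sum_i \bigl(\mathrm{LOS}_i - \widehat{\mathrm{LOS}}_i\bigr)^2}
               {\sum_i \bigl(\mathrm{LOS}_i - \overline{\mathrm{LOS}}\bigr)^2}
```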

The output flow was analysed together with the input flow component. Furthermore, Sankey diagrams were adopted to picture the variability of the variables as a function of the stage in the flow, thus connecting the input-throughput variability with the final decision. The final decision, including the specific ward of admission within the hospital, was considered at this stage to move from the ED to a whole-system perspective. During this step the information regarding hospital ward availability was integrated.

Clinical experts informed the analysis of patient volume and clinical variability by identifying logistic and clinical aspects to investigate, including the feature selection of variables of interest to connect volume KPIs with the clinical characteristics of the patients. Finally, clinical experts were involved to evaluate, validate, and interpret the series of outcomes obtained from the pipeline (step c.1 and c.2). This step included a discussion on the operational management aspects of overcrowding and possible future interventions (step c.2).

In this work, special attention was given to the interaction between the chief complaint and the ICD-10 code of the first diagnosis, since these two variables were representative of the interaction between the patient and the ED practitioners' decisions. As for the volume and clinical variability analysis, the main question was how to stratify the ED flows. Clinicians suggested considering four main stratifications: patients in need of urgent care, patients with a non-urgent need of care who are simple to process (“see and treat”), patients requiring complex examination in the ED for whom there would be a competing decision between discharge and admission to a hospital ward, and geriatric patients who need basic care. The geriatric flow, which is connected to processes external to the hospital, was the one that most concerned the clinicians. The management of the urgent care flow in competition with non-urgent and complex patients was studied in the previous simulation work⁷⁰.

Results

Table 3 summarises the key results with the associated feedback from clinicians and the potential research for future interventions. Figure 2 plots the daily values of the KPIs across the year.

Table 3 Key results regarding the main sources of Uppsala ED overcrowding followed by clinical feedback and future research perspectives.

Non-urgent patients and generic or non-specific diagnosis

Most patients who visited the ED in 2019 did not need urgent care: 39.1% of total visits had a yellow triage code, 20.3% white, and 19.2% green (Table 1). 82.2% of pedestrians visited the ED without a referral (Supplementary Table 1). Figure 3 shows the heatmap of the yearly reported chief complaint and ICD10 main diagnosis stratified by triage, which captures the magnitude of clinical variability as a function of the patient-clinician interaction (patient: chief complaint; clinician: ICD10 diagnosis). This plot shows how heterogeneous the clinical information is regardless of the urgency of care, and that for any reason for the ED visit the main diagnosis can fall in any kind of ICD10 category. This can be deduced from the fact that the majority of patients in well-defined clusters, such as abdominal and chest pain (n = 12,464; 24.9%), are diagnosed with the redundant ICD10 code referring to the generic symptom (R104X “Abdominal Pain, unspecified” and R074 “Chest pain, unspecified”). Except for some groups of patients with defined categories, such as patients with fractures or cardiovascular diseases, it becomes hard to identify more specific categorizations from the data.

Another surprising aspect related to the main diagnosis emerges from the most frequent complete ICD10 diagnosis codes (Table 2). Most of the diagnoses were from the generic symptoms category (ICD10 group R), but surprisingly most of the codes from the other ICD10 groups were also non-specific diagnoses (e.g., M549 “Back pain, unspecified”, I489 “Atrial fibrillation and atrial flutter, unspecified”, M798G “Pain, nonspecific in lower leg”, and N390 “urinary tract infection, site not specified”). Interestingly, the Z711 code (“feared health complaint in whom no diagnosis is made”), which identifies patients who do not need urgent care from the ED, was the second most common diagnosis after generic symptoms.

Length of stay underlines the saturation of the ED

The length of stay was long with a large variability (mean ± standard deviation: 5.79 ± 4.21 h). Regardless of triage, chief complaint, or ICD-10 category, the length of stay was similar, with a high number of outliers with long stays in the ED for every category (Fig. 4). As expected, some partial differences were detected when stratifying waiting time by triage, but long waiting times and outliers were also associated with patients with urgent care codes, thus showing patterns similar to those of the length of stay distributions.

As shown in Fig. 4, the length of stay of patients for whom imaging assessment was requested (6.51 ± 4.81 h) was clearly longer and more variable than that of patients for whom it was not (3.96 ± 3.39 h). According to the clinical experts, the number of scans performed in the ED (Fig. 2) is currently extremely high, and a possible cause is unnecessary imaging assessments requested by doctors with limited experience when evaluating patients with complex clinical profiles.
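
A stratified mean ± standard deviation such as the one above can be computed as in the following Python sketch. The sample values are invented for illustration, and the use of the sample (n − 1) standard deviation is an assumption, since the source does not state which estimator was used.

```python
from statistics import mean, stdev

# Hypothetical (length of stay in hours, imaging requested) pairs.
stays = [(2.0, False), (4.0, False), (6.0, False),
         (5.0, True), (7.0, True), (9.0, True)]


def los_summary(stays):
    """Return {imaging flag: (mean, sample standard deviation)} in hours."""
    groups = {}
    for hours, scanned in stays:
        groups.setdefault(scanned, []).append(hours)
    return {flag: (mean(values), stdev(values)) for flag, values in groups.items()}


summary = los_summary(stays)
```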

The multivariate regression confirmed the high impact of scans on length of stay and identified the reason for discharge and age as relevant (see Supplementary Results). However, the low coefficient of determination (R² = 0.26) indicated that a linear model explained little of the variability in length of stay. This confirms that the saturation of the ED reflected in the data makes multivariate prediction of the length of stay challenging.

Effects of the overcrowded wards to the ED efficiency

The most common hospital admissions from ED were to the surgery, acute medicine, orthopaedic, cardiology, and stroke wards (Table 2 and Fig. 5).

According to the hospital records, all these wards were overcrowded during the entire year, which points to the probable effect of hospital boarding in increasing the length of stay of ED patients waiting for an available bed in the ward. This can be seen in Table 4, where the most frequent admitting wards (admitting on almost all days of 2019) are reported with the daily admissions from the ED and the actual availability of the ward, represented by the difference between the total number of patients assigned and the number of beds. Supplementary Table 3 reports the same information as Table 4 for all the wards.
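
The availability measure described here, patients assigned minus beds, can be turned into a simple overcrowding indicator. The Python sketch below uses invented ward records; the function name, the tuple layout, and the choice to summarise the measure as the share of overcrowded days are all assumptions made for illustration.

```python
# Hypothetical daily ward records: (ward, patients assigned, beds).
ward_days = [
    ("surgery", 30, 28),
    ("surgery", 27, 28),
    ("cardiology", 22, 20),
    ("cardiology", 21, 20),
]


def overcrowded_share(ward_days):
    """Per ward, the fraction of days on which assigned patients exceeded beds."""
    total_days, over_days = {}, {}
    for ward, assigned, beds in ward_days:
        total_days[ward] = total_days.get(ward, 0) + 1
        # A positive difference means more patients assigned than beds.
        over_days[ward] = over_days.get(ward, 0) + int(assigned - beds > 0)
    return {ward: over_days[ward] / total_days[ward] for ward in total_days}


share = overcrowded_share(ward_days)
```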

Table 4 The most frequent admitting wards of ED patients.

There was a pattern of hospital admission for older patients (Table 2). This correlation with hospital admission explains why age was also relevant in the length of stay regression. More than half of these elderly admitted patients arrived by ambulance. Furthermore, patients with generic symptoms had a huge impact on hospital admissions for all the wards (Fig. 5, Supplementary Fig. 1, and Supplementary Table 2). The Sankey diagram in Fig. 5 shows that these patients were admitted across wards in the hospital, so the high clinical variability of the data is also reflected in the ED process abstraction. This aspect was highlighted by the spider-net obtained from the ED-hospital ward pathways extracted by applying a directly-follows graph process mining algorithm (Supplementary Fig. 3). In detail, Supplementary Table 2 underlines that the pressure on the surgery ward came mainly from patients with abdominal pain, on cardiology from patients with chest pain, and on acute medicine from potentially highly fragile geriatric patients (difficulty breathing). Supplementary Fig. 4 shows that misallocated patients were admitted everywhere in the hospital (17.6% of the total records, Supplementary Table 3). The wards where this phenomenon was most common during the year were neuro, thorax, “ear, nose and throat”, gynaecology, and “plastic and maxillofacial surgery” (Supplementary Table 4).
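
A directly-follows graph counts how often one event is immediately followed by another across all traces. The Python sketch below shows the basic idea on invented traces; it is not the tool or configuration used in the study, and the event names are hypothetical.

```python
from collections import Counter

# Hypothetical traces: the ordered events recorded for each visit.
traces = [
    ["arrival", "triage", "imaging", "surgery ward"],
    ["arrival", "triage", "discharge"],
    ["arrival", "triage", "imaging", "discharge"],
]


def directly_follows(traces):
    """Count how often event a is immediately followed by event b."""
    edges = Counter()
    for trace in traces:
        # Pair each event with its immediate successor in the same trace.
        edges.update(zip(trace, trace[1:]))
    return edges


dfg = directly_follows(traces)
```

When many distinct pathways each carry only a few visits, the resulting graph has numerous low-weight edges, which is what produces a dense, web-like picture.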

Patients re-visiting ED: a global resonant pressure

Figure 2 shows that patients who re-visited the ED significantly affected all the KPIs during the entire year (33% of ambulance arrivals, 35.5% of scans, and 29.1% of hospital admissions out of the total yearly visits). We detected few patients who re-visited the ED more than 10 times (n = 96, with a maximum of 65 re-visits for a single patient), but the cumulative effect of these visits together with the re-visits of the other patients across the year had a resonant impact on the ED resources. The analysis of the chief complaint and ICD10 code recorded at each successive re-visit showed the typical profiles of these patients (Supplementary Table 5). We detected three main patterns: patients with repeated generic symptoms before receiving a specific diagnosis after several re-visits (e.g., consecutive visits with abdominal pain R104 before ileus K590 was diagnosed), patients with psychological issues and consecutive cases of self-inflicted injury or poisoning, and highly fragile older patients who need basic care (e.g., general weakness or constipation).
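
Extracting per-patient sequences of this kind can be sketched in Python as follows. The visit tuples, the function name, and the `min_visits` threshold are hypothetical; the example simply reproduces the R104 to K590 pattern mentioned above on invented data.

```python
from collections import defaultdict

# Hypothetical visits: (patient id, visit date, chief complaint, ICD10 code).
visits = [
    ("p1", "2019-02-01", "abdominal pain", "R104"),
    ("p2", "2019-02-03", "chest pain", "R074"),
    ("p1", "2019-03-10", "abdominal pain", "R104"),
    ("p1", "2019-04-02", "abdominal pain", "K590"),
]


def revisit_sequences(visits, min_visits=2):
    """Return chronological (complaint, ICD10) sequences for repeat visitors."""
    by_patient = defaultdict(list)
    # ISO dates sort chronologically as plain strings.
    for patient, _date, complaint, icd in sorted(visits, key=lambda v: v[1]):
        by_patient[patient].append((complaint, icd))
    return {p: seq for p, seq in by_patient.items() if len(seq) >= min_visits}


sequences = revisit_sequences(visits)
```

Counting identical sequences across patients would then reveal recurring patterns, such as repeated generic codes followed by a specific diagnosis.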

As mentioned before, clinical experts were already aware of the importance of solving the issue of geriatric patients. The geriatric flow comprises both those residing in Uppsala’s geriatric facilities and those living independently at home. These individuals, often highly fragile and requiring basic care, presented a unique challenge, particularly those living at home, for whom logistical difficulties in the discharge process frequently led to prolonged lengths of stay exceeding three days. From the data it was not possible to clearly identify the geriatric patients not living in the special facilities, even with the aid of the clinical experts, because their characteristics were similar to those of patients with non-specific diagnoses.

The analysis provided further information regarding the impact of re-visits on the ED, thus also underlining the competing management of the other sub-flows. Re-visiting patients with psychological profiles were recognized by the clinicians as a known issue for the ED. As for the delay of specific diagnoses, the information in the data was not sufficient to detect and stratify these patients into more precise sub-flows, although it still underlined the impact of this bulk of patients.

Discussion

In this paper we designed a comprehensive pipeline to analyse healthcare production data following ED patient flows, with the aim of leveraging the potential of real-world data to study the overcrowding phenomenon. The approach shown in Fig. 1 was designed to account for the challenges of real-world data in all the steps of the analysis through the involvement of clinical experts, thus making it possible to overcome the limitations of the data and explore overcrowding of the Uppsala University Hospital ED from a whole-system perspective. To the authors' knowledge, this is the first study of ED flows using healthcare production data with such a wide overview of data information, processes, and interaction with hospital wards and external processes.

In traditional data-driven approaches, clinical experts are usually involved only in the final step, where outcomes are discussed. The involvement of clinical experts in all steps of the pipeline (Fig. 1, steps a-c) was fundamental to contextualise the data with medical and operational knowledge, and to inform the analysis and the findings for a proper discussion on how to solve the overcrowding of the ED. This approach underlined the gap between data records and actual operations, and how difficult it is to integrate decision-making reasoning with the data.

From the multi-objective analysis (Fig. 1, steps b-c) it emerged that there were multiple sources that led to the ED overcrowding. These rely on both clinical and organisational factors and are connected to internal and external processes of the ED environment. This is a result we would expect because it is well-known that the management of overcrowding in EDs is a complex multi-constrained problem due to the interaction between logistic and clinical aspects^16,17,34.

In detail, the results discussed in Table 2 revealed that the main sources of the ED saturation were connected to the high number of patients classified as non-urgent with generic symptoms, the delayed specific diagnosis and hospital admission decision for which multiple imaging evaluations were required, the delayed admission to the hospital because of the lack of available beds in the wards, and the external pressure of highly frequent re-visits by geriatric patients, psychiatric patients, and patients with repeated generic symptoms before receiving a specific diagnosis.

The aggregated analysis of the outcomes (Fig. 1, step c.1) made it possible to estimate the magnitude of causes of the overcrowding known a priori (e.g., patients seeking basic care and the geriatric flow) and to reveal novel insights (e.g., the global impact of the cumulative re-visits). The limitation of the information in the data emerged when it was not possible to define well-separated sub-flows from the clinical variables, even with the clinical feedback.

The retrospective evaluation (Fig. 1, step c.2) provided hints regarding aspects to focus on in the future for improving the understanding of overcrowding and exploring key strategies.

Regarding the internal improvement of ED operations, a key aspect to discuss will be how to make the evaluation of patients with a non-urgent need of care, but who are difficult to evaluate, faster and more accurate. These were the patients with delayed decisions regarding discharge or hospital admission requiring several imaging evaluations, and the ones frequently visiting the ED with a generic diagnosis before receiving a specific one. Another internal aspect to discuss is the improvement of the collected data so that it can be re-used for future analysis.

There is the need for a deeper discussion regarding the efficacy of the primary care systems outside the hospital. The ED pressure would be drastically decreased if patients could seek basic or non-urgent care outside the ED (e.g., geriatric flow, green triage, or patients with feared health complaint). Furthermore, a deeper study regarding the overcrowding of hospital wards and the management of highly frequent visits of psychiatric patients would be beneficial for the ED distress.

The proposed approach made it possible to study multiple components of emergency flows and several KPIs concurrently, including considerations of where patients are admitted and whether they will re-visit the ED. This overcame the limitations of previous studies that focused merely on specific flows or single KPIs, especially analyses of throughput interventions that lacked considerations regarding inflows and outflows⁶⁰. Furthermore, our approach connected considerations regarding the volume of flows with their clinical variability, thus enriching the insights of previous analyses where these components were considered separately^{16,33,36,51,63,64,65,73}.

Our approach demonstrated the key role of clinical experts' involvement in data-driven approaches for improving the understanding of overcrowding. This aspect of the pipeline made it possible to bridge the gap between data and clinical processes and to explore the gap between the collected data and their practical utility³⁵. So far, the utilization of real-world data has focused more on operational management than on discussion of healthcare policies^41,61, and it is well known that there is a lack of qualitative approaches to healthcare problems⁷⁴.

In data-driven approaches, the widely recognized principle of 'garbage-in garbage-out' cautions against relying on insufficient or unreliable data to solve complex tasks. However, when it comes to using data to address real-world healthcare challenges, this paradigm should not be seen as a disruptive barrier, but as an occasion to discuss how to improve and leverage the collected information and how this could provide insights for the improvement of system operations.

From our whole-system analysis, it emerges that pure data-driven approaches would not be a definitive solution for analysing ED overcrowding. In contrast, this paper shows that by adopting an inclusive approach, not only can we enhance real-world data potential to improve operational decisions within the emergency department, but it also provides an opportunity to facilitate policy-making discussions that encompass broader aspects affecting the healthcare system, such as engagement with local municipal or regional authorities. For example, from our results it emerged that an improvement of geriatric and psychiatric pathways, and a serious discussion regarding primary care delivery, would be crucial to decrease pressure on the ED.

From the obtained results, ED resources appear to be squeezed from all directions, from primary healthcare delivery to the overcrowding of hospital wards, which affects the ED admission process through boarding. Finally, we can identify the origin of possible solutions by analysing this mismatch between community needs and the delivery of care from the whole-system perspective, rather than isolating emergency medicine. The take-home message is that we should learn beyond purely empirical approaches by involving clinicians and managers, and from there we can start to design future solutions to ED overcrowding, looking beyond the walls of the ED and the hospital.

Despite the significance of our work, certain limitations should be acknowledged. First, the study was conducted at a single centre in Uppsala, Sweden, which may limit the generalizability of the findings to other healthcare settings. Furthermore, the analysis was based on data from a one-year time window, which may not fully capture long-term trends and variations in the ED workflow. The available data also lacked detailed clinical variables, such as blood test results, and offered limited granularity regarding clinicians' decision-making process, because the analysed records were health care production data. Enriching the data with this information would benefit multivariate regression models of the length of stay, since the current information is confounded by the saturation of the system.

Moreover, the discussion and expert input primarily involved clinical practitioners, and not other stakeholders and actors in the healthcare system. The absence of comprehensive discussions with external stakeholders, such as policymakers, administrators, and patients, may have limited the breadth of insights and potential solutions generated from the analysis.

It is essential to recognize these limitations as they highlight the need for future research to address these gaps. This could include conducting multi-centre studies to validate the findings across different healthcare contexts, extending the time window of analysis to capture long-term dynamics, and enhancing data collection efforts to include more detailed clinical variables. Additionally, engaging a broader range of stakeholders in the analysis and decision-making process can lead to more comprehensive and impactful strategies for addressing the challenges faced by EDs and improving overall healthcare delivery.

As mentioned before, overcrowding in EDs is an international problem^1,6, and despite the large body of operational research there is still much work to do to solve it^21,22, especially in light of the current discussion on real-world evidence and healthcare data^19,26,75. In the current discussion regarding data-driven healthcare in international settings, our pipeline could be of interest for participatory approaches and for facilitating discussions about the problem from the perspectives of different healthcare policies.

Conclusions

Our analysis reveals insights into ED overcrowding and makes it possible to identify systemic issues and directions for solutions. The whole-systems perspective opened the scope to the boundary effects of inflow and outflow of the ED inside the hospital. Finally, our approach demonstrates that, to unlock the potential of real-world data in studying the challenge of ED overcrowding, we need to look at systems beyond the walls of the ED and the hospital.

Data availability

The dataset used and analyzed during the current study and computer code are available from the corresponding author on reasonable request.

References

Hoot, N. R. & Aronsky, D. Systematic review of emergency department crowding: Causes, effects, and solutions. Ann. Emerg. Med. 52, 126–136 (2008).
Hirshon, J. M. The rationale for developing public health surveillance systems based on emergency department data. Acad. Emerg. Med. 7, 1428–1432 (2000).
Austin, E. E. et al. Strategies to measure and improve emergency department performance: a scoping review. Scand. J. Trauma Resusc. Emerg. Med. 28, 1–14 (2020).
Aringhieri, R., Bruni, M. E., Khodaparasti, S. & van Essen, J. T. Emergency medical services and beyond: Addressing new challenges through a wide literature review. Comput. Oper. Res. 78, 349–368 (2017).
Soremekun, O. A., Terwiesch, C. & Pines, J. M. Emergency medicine: An operations management view. Acad. Emerg. Med. 18, 1262–1268 (2011).
Lindner, G. & Woitok, B. K. Emergency department overcrowding: Analysis and strategies to manage an international phenomenon. Wien Klin Wochenschr 133, 229–233 (2021).
Pines, J. M. et al. International perspectives on emergency department crowding. Acad. Emerg. Med. 18, 1358–1370 (2011).
Mistry, B. et al. Accuracy and reliability of emergency department triage using the emergency severity index: An international multicenter assessment. Ann. Emerg. Med. 71, 581-587.e3 (2018).
Wretborn, J., Ekelund, U. & Wilhelms, D. B. Differentiating properties of occupancy rate and workload to estimate crowding: A Swedish national cross-sectional study. J. Am. Coll. Emerg. Phys. Open 3, e12648 (2022).
Blom, M. C., Jonsson, F., Landin-Olsson, M. & Ivarsson, K. The probability of patients being admitted from the emergency department is negatively correlated to in-hospital bed occupancy—A registry study. Int. J. Emerg. Med. 7, 1–7 (2014).
Ugglas, B., Lindmarker, P., Ekelund, U., Djarv, T. & Holzmann, M. J. Emergency department crowding and mortality in 14 Swedish emergency departments, a cohort study leveraging the Swedish Emergency Registry (SVAR). PLoS ONE 16, e0247881 (2021).
Wretborn, J., Starkenberg, H., Ruge, T., Wilhelms, D. B. & Ekelund, U. Validation of the modified Skåne emergency department assessment of patient load (mSEAL) model for emergency department crowding and comparison with international models; an observational study. BMC Emerg. Med. 21, 21 (2021).
Källberg, A. S. et al. Contributing factors to errors in Swedish emergency departments. Int. Emerg. Nurs. 23, 156–161 (2015).
Rantala, A., Nordh, S., Dvorani, M. & Forsberg, A. The meaning of boarding in a Swedish accident & emergency department: A qualitative study on patients’ experiences of awaiting admission. Healthcare 9, 66 (2021).
Kannampallil, T. G., Schauer, G. F., Cohen, T. & Patel, V. L. Considering complexity in healthcare systems. J. Biomed. Inform. 44, 943–947 (2011).
Hahn, B., Zuckerman, B., Durakovic, M. & Demissie, S. The relationship between emergency department volume and patient complexity. Am. J. Emerg. Med. 36, 366–369 (2018).
Norberg, G., Wireklint Sundström, B., Christensson, L., Nyström, M. & Herlitz, J. Swedish emergency medical services’ identification of potential candidates for primary healthcare: Retrospective patient record study. Scand. J. Primary Health Care 33, 311–317 (2015).
Scobie, S. & Castle-Clarke, S. Implementing learning health systems in the UK NHS: Policy actions to improve collaboration and transparency and support innovation and better use of analytics. Learn. Health Syst. 4, e10209 (2020).
Varela-Rodríguez, C., Rosillo-Ramirez, N., Rubio-Valladolid, G. & Ruiz-López, P. Editorial: Real world evidence, outcome research and healthcare management improvement through real world data (RWD). Front. Public Health 10, 1064580 (2022).
Schurman, B. The Framework for FDA’s real-world evidence program. Appl. Clin. Trials 28, 15–17 (2019).
Saghafian, S., Austin, G. & Traub, S. J. Operations research/management contributions to emergency department patient flow optimization: Review and research prospects. IIE Trans. Healthc. Syst. Eng. 5, 101–123 (2015).
Wiler, J. L., Griffey, R. T. & Olsen, T. Review of modeling approaches for emergency department patient flow and crowding research. Acad. Emerg. Med. 18, 1371–1379 (2011).
Gunal, M. M. A guide for building hospital simulation models. Health Syst. 1, 17–25 (2012).
Boyle, L. M., Marshall, A. H. & Mackay, M. A framework for developing generalisable discrete event simulation models of hospital emergency departments. Eur. J. Oper. Res. 302, 337–347 (2022).
Pianykh, O. S. et al. Improving healthcare operations management with machine learning. Nat. Mach. Intell. 2(5), 266–273 (2020).
Beckmann, J. S. & Lew, D. Reconciling evidence-based medicine and precision medicine in the era of big data: Challenges and opportunities. Genome Med. 8, 1–11 (2016).
Munoz-Gama, J. et al. Process mining for healthcare: Characteristics and challenges. J. Biomed. Inform. 127, 103994 (2022).
Chen, K., Abtahi, F., Carrero, J.-J., Fernandez-Llatas, C. & Seoane, F. Process mining and data mining applications in the domain of chronic diseases: A systematic review. Artif. Intell. Med. 144, 102645 (2023).
Ferrão, J. C., Oliveira, M. D., Gartner, D., Janela, F. & Martins, H. M. G. Leveraging electronic health record data to inform hospital resource management: A systematic data mining approach. Health Care Manag. Sci. 24, 716–741 (2021).
Perdahl, T., Axelsson, S., Svensson, P. & Djärv, T. Patient and organizational characteristics predict a long length of stay in the emergency department—A Swedish cohort study. Eur. J. Emerg. Med. 24, 284–289 (2017).
Liu, Y. et al. Development and validation of a practical machine-learning triage algorithm for the detection of patients in need of critical care in the emergency department. Sci. Rep. 11(1), 1–9 (2021).
Chmiel, F. P. et al. Using explainable machine learning to identify patients at risk of reattendance at discharge from emergency departments. Sci. Rep. 11(1), 1–11 (2021).
Handel, D. A., Sun, B., Augustine, J. J., Shufflebarger, C. M. & Fu, R. Association among emergency department volume changes, length of stay, and leaving before treatment complete. Hosp. Top. 93, 53–59 (2015).
Burton, C., Elliott, A., Cochran, A. & Love, T. Do healthcare services behave as complex systems? Analysis of patterns of attendance and implications for service delivery. BMC Med. 16, 138 (2018).
Sudat, S. E., Robinson, S. C., Mudiganti, S., Mani, A. & Pressman, A. R. Mind the clinical-analytic gap: Electronic Health Records and COVID-19 pandemic response. J. Biomed. Inf. 116, 103715 (2021).
Howell, S. C., Wills, R. A. & Johnston, T. C. Should diagnosis codes from emergency department data be used for case selection for emergency department key performance indicators? Austr. Health Rev. 38, 38 (2014).
Abad-Grau, M. M., Ierache, J., Cervino, C. & Sebastiani, P. Evolution and challenges in the design of computational systems for triage assistance. J. Biomed. Inform. 41, 432–441 (2008).
Chen, T.-L. et al. Imbalanced prediction of emergency department admission using natural language processing and deep neural network. J. Biomed. Inform. 133, 104171 (2022).
Fone, D. et al. Systematic review of the use and value of computer simulation modelling in population health and health care delivery. J. Public Health Med. 25, 325–335 (2003).
Gul, M. & Guneri, A. F. A comprehensive review of emergency department simulation applications for normal and disaster conditions. Comput. Ind. Eng. 83, 327–344 (2015).
Günal, M. M. & Pidd, M. Discrete event simulation for performance modelling in health care: A review of the literature. J. Simul. 4, 42–51 (2010).
Paul, S. A., Reddy, M. C. & DeFlitch, C. J. A systematic review of simulation studies investigating emergency department overcrowding. Simulation 86, 559–571 (2010).
Yousefi, M., Yousefi, M. & Fogliatto, F. S. Simulation-based optimization methods applied in hospital emergency departments: A systematic review. Simulation 96, 791–806 (2020).
Brailsford, S. & Vissers, J. OR in healthcare: A European perspective. Eur. J. Oper. Res. 212, 223–234 (2011).
Rismanchian, F. & Lee, Y. H. Process mining-based method of designing and optimizing the layouts of emergency departments in hospitals. HERD Health Environ. Res. Des. J. 10, 105–120 (2017).
van Hulzen, G., Martin, N., Depaire, B. & Souverijns, G. Supporting capacity management decisions in healthcare using data-driven process simulation. J. Biomed. Inform. 129, 104060 (2022).
Ceglowski, R., Churilov, L. & Wasserthiel, J. Combining data mining and discrete event simulation for a value-added view of a hospital emergency department. J. Oper. Res. Soc. 58, 246–254 (2007).
Chen, J., Sun, L., Guo, C., Wei, W. & Xie, Y. A data-driven framework of typical treatment process extraction and evaluation. J. Biomed. Inform. 83, 178–195 (2018).
Ajmi, I. et al. Mapping patient path in the Pediatric Emergency Department: A workflow model driven approach. J. Biomed. Inform. 54, 315–328 (2015).
Ben-Assuli, O., Shabtai, I. & Leshno, M. The impact of EHR and HIE on reducing avoidable admissions: Controlling main differential diagnoses. BMC Med. Inform. Decis. Mak. 13, 1–10 (2013).
Ben-Assuli, O., Sagi, D., Leshno, M., Ironi, A. & Ziv, A. Improving diagnostic accuracy using EHR in emergency departments: A simulation-based study. J. Biomed. Inform. 55, 31–40 (2015).
Jacob, R., Wong, M. L., Hayhurst, C., Watson, P. & Morrison, C. Designing services for frequent attenders to the emergency department: A characterisation of this population to inform service design. Clin. Med. 16, 325–329 (2016).
Franklin, A. et al. Dashboard visualizations: Supporting real-time throughput decision-making. J. Biomed. Inform. 71, 211–221 (2017).
Martinez, D. A. et al. An electronic dashboard to monitor patient flow at the Johns Hopkins Hospital: Communication of key performance indicators using the Donabedian model. J. Med. Syst. 42, 133 (2018).
Jin, F. et al. Gap between real-world data and clinical research within hospitals in China: A qualitative study. BMJ Open 10, e038375 (2020).
Suriadi, S., Andrews, R., ter Hofstede, A. H. M. & Wynn, M. T. Event log imperfection patterns for process mining: Towards a systematic approach to cleaning event logs. Inf. Syst. 64, 132–150 (2017).
van Zelst, S. J., Mannhardt, F., de Leoni, M. & Koschmider, A. Event abstraction in process mining: Literature review and taxonomy. Granul. Comput. 6(3), 719–736 (2020).
Vanbrabant, L., Martin, N., Ramaekers, K. & Braekers, K. Quality of input data in emergency department simulations: Framework and assessment techniques. Simul. Model Pract. Theory 91, 83–101 (2019).
Kuo, Y.-H., Leung, J. M. Y., Tsoi, K. K. F., Meng, H. M. & Graham, C. A. Embracing big data for simulation modelling of emergency department processes and activities. In 2015 IEEE International Congress on Big Data 313–316 (IEEE, 2015). https://doi.org/10.1109/BigDataCongress.2015.52.
Vanbrabant, L., Braekers, K., Ramaekers, K. & Van Nieuwenhuyse, I. Simulation of emergency department operations: A comprehensive review of KPIs and operational improvements. Comput. Ind. Eng. 131, 356–381 (2019).
Zhang, X. Application of discrete event simulation in health care: A systematic review. BMC Health Serv. Res. 18, 1–11 (2018).
Kang, S. W. & Park, H. S. Emergency department visit volume variability. Clin. Exp. Emerg. Med. 2, 150–154 (2015).
McCrum, M. L., Lipsitz, S. R., Berry, W. R., Jha, A. K. & Gawande, A. A. Beyond volume: Does hospital complexity matter? An analysis of inpatient surgical mortality in the United States. Med. Care 52, 235–242 (2014).
Welch, S. J. et al. Volume-related differences in emergency department performance. Jt. Commun. J. Qual. Patient Saf. 38, 395–402 (2012).
Lee, D. C. et al. The impact of hospital closures and hospital and population characteristics on increasing emergency department volume: A geographic analysis. Popul. Health Manag. 18, 459–466 (2015).
Carmen, R., Van Nieuwenhuyse, I. & Van Houdt, B. Inpatient boarding in emergency departments: Impact on patient delays and system capacity. Eur. J. Oper. Res. 271, 953–967 (2018).
George, G., Jell, C. & Todd, B. S. Effect of population ageing on emergency department speed and efficiency: A historical perspective from a district general hospital in the UK. Emerg. Med. J. 23, 379 (2006).
Rundo, L., Pirrone, R., Vitabile, S., Sala, E. & Gambino, O. Recent advances of HCI in decision-making tasks for optimized clinical workflows and precision medicine. J. Biomed. Inform. 108, 103479 (2020).
Franklin, A. et al. Opportunistic decision making and complexity in emergency care. J. Biomed. Inform. 44, 469–476 (2011).
Abourraja, M. N., et al. A data-driven discrete event simulation model to improve emergency department logistics.
Farrokhnia, N. & Göransson, K. E. Swedish emergency department triage and interventions for improved patient flows: A national update. Scand. J. Trauma Resusc. Emerg. Med. 19, 1–5 (2011).
Welch, S. J. Using data to drive emergency department design: A metasynthesis. HERD Health Environ. Res. Des. J. 5, 26–45 (2012).
Berkowitz, D., Chamberlain, J. & Provost, L. P. Addressing challenges of baseline variability in the clinical setting: Lessons from an emergency department. Pediatr. Qual. Saf. 4, e216 (2019).
Im, D., Pyo, J., Lee, H., Jung, H. & Ock, M. Qualitative research in healthcare: Data analysis. J. Prev. Med. Public Health 56, 100 (2023).
Schad, F. & Thronicke, A. Real-world evidence-current developments and perspectives. Int. J. Environ. Res. Public Health 19, 10159 (2022).

Acknowledgements

This project is a contribution to the Centre for Data-Driven Health (CDDH), KTH Royal Institute of Technology (https://www.kth.se/cddh).

Funding

Open access funding provided by Royal Institute of Technology.

Author information

Authors and Affiliations

Department of Biomedical Engineering and Health Systems, KTH Royal Institute of Technology, Stockholm, Sweden
Luca Marzano, Adam S. Darwich, Raghothama Jayanth & Sebastiaan Meijer
Uppsala University Hospital, Uppsala, Sweden
Lethvall Sven, Nina Falk & Patrik Bodeby

Authors

Luca Marzano
Adam S. Darwich
Raghothama Jayanth
Lethvall Sven
Nina Falk
Patrik Bodeby
Sebastiaan Meijer

Contributions

L.M. wrote the manuscript. L.M., A.S.D., J.R. and S.L. designed the research. L.M., A.S.D., J.R., S.L., S.M., N.F. and P.B. performed the research. L.M., A.S.D., J.R. and S.M. analyzed the data and harmonized and aggregated the results.

Corresponding author

Correspondence to Luca Marzano.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Marzano, L., Darwich, A.S., Jayanth, R. et al. Diagnosing an overcrowded emergency department from its Electronic Health Records. Sci Rep 14, 9955 (2024). https://doi.org/10.1038/s41598-024-60888-9

Received: 16 November 2023
Accepted: 29 April 2024
Published: 30 April 2024
DOI: https://doi.org/10.1038/s41598-024-60888-9
