Wearable sensor derived decompensation index for continuous remote monitoring of COVID-19 diagnosed patients

Richards, Dylan M.; Tweardy, MacKenzie J.; Steinhubl, Steven R.; Chestek, David W.; Hoek, Terry L. Vanden; Larimer, Karen A.; Wegerich, Stephan W.

doi:10.1038/s41746-021-00527-z

Download PDF

Article
Open access
Published: 08 November 2021

Wearable sensor derived decompensation index for continuous remote monitoring of COVID-19 diagnosed patients

npj Digital Medicine volume 4, Article number: 155 (2021) Cite this article

3548 Accesses
14 Citations
82 Altmetric
Metrics details

Subjects

Abstract

The COVID-19 pandemic has accelerated the adoption of innovative healthcare methods, including remote patient monitoring. In the setting of limited healthcare resources, outpatient management of individuals newly diagnosed with COVID-19 was commonly implemented, some taking advantage of various personal health technologies, but only rarely using a multi-parameter chest-patch for continuous monitoring. Here we describe the development and validation of a COVID-19 decompensation index (CDI) model based on chest patch-derived continuous sensor data to predict COVID-19 hospitalizations in outpatient-managed COVID-19 positive individuals, achieving an overall AUC of the ROC Curve of 0.84 on 308 event negative participants, and 22 event positive participants, out of an overall study cohort of 400 participants. We retrospectively compare the performance of CDI to standard of care modalities, finding that the machine learning model outperforms the standard of care modalities in terms of both numbers of events identified and with a lower false alarm rate. While only a pilot phase study, the CDI represents a promising application of machine learning within a continuous remote patient monitoring system.

Observational study on wearable biosensors and machine learning-based remote monitoring of COVID-19 patients

Article Open access 23 February 2021

Early adverse physiological event detection using commercial wearables: challenges and opportunities

Article Open access 23 May 2024

Use of artificial intelligence to develop predictive algorithms of cough and PCR-confirmed COVID-19 infections based on inputs from clinical-grade wearable sensors

Article Open access 05 April 2024

Introduction

To date, more than 130 million people have been infected with SARS-CoV-2 (the virus that causes COVID-19)¹. Of those infected, approximately one in five are at risk for severe decompensation, resulting in hospitalization^2,3,4. While much has been learned over the past year about the transmission of the virus, little is known about the factors that predict a symptom-free disease course versus an acute disease course requiring interventions like oxygen, hospitalization and/or mechanical ventilation. Determining which patient will decompensate (have worsening COVID-19 illness) in order to appropriately triage to home or hospital is challenging^5,6.

One approach to care delivery that has received greater focus during the pandemic is remote patient monitoring (RPM)^7,8,9. Driving the uptake of this technology is the necessity to minimize in-person patient interactions as well as the need to leverage acute care facilities for the sickest patients and manage patients at home if possible¹⁰. The goal of RPM in the COVID-19 use-case is to keep healthcare professionals safe and optimize patient care^11,12,13. For the most part, RPM in the context of COVID-19 has included intermittent monitoring (point measurements while the patient is awake) to assess oxygen saturation (pulse oximeter) and temperature (thermometer)^14,15. Patients are then instructed to call their healthcare professional if the values for these measurements are outside the parameters established by the healthcare team. Interestingly, these clinical parameters, for both oxygen saturation and body temperature, have all demonstrated a likelihood of decompensation at a range of thresholds, making their clinical value equivocal^{2,16,17,18,19,20,21,22}.

While commonly used and believed helpful for alerting physiologic decompensation⁸, intermittent active monitoring of SpO₂ and body temperature are not without their challenges in a disease that manifests with a wide variety of signs and symptoms²³. Pulse oximeters often do not provide accurate readings of oxygen saturations below 90% and have inconsistent repeated measurements^24,25. Additionally, as SpO₂ measurements rely on photoplethysmography technology, readings are sensitive to skin pigmentation and highly susceptible to motion artifact^26,27. Therefore, people with darker skin tones may not be able to obtain accurate readings as well as patients with Parkinson’s or essential tremor. Though less sensitive to the irregularities of SpO₂, fever has only been identified as a clinical symptom for less than a third of all hospitalizations for COVID-19^28,29. Additionally, clinical definitions for fever and SpO₂ are varied. Fever definitions range between prolonged elevation above 38 °C for at least 24 h²² to any temperature above 37.3 °C ³⁰ or 39 °C²¹, while thresholds for low SpO₂ levels could be as high as 95%¹⁷ to anything below 90%^20,31. These varying definitions contribute to inconsistent findings when characterizing the case definition of COVID-19.

While both intermittent temperature and SpO₂ monitoring may be incrementally beneficial, measuring other physiological features such as heart rate or respiration rate, particularly combined with patient ambulation, may provide greater insight into physiologic changes. Davis et al. found lead times for variations in physiological features following infection of non-human primates, finding significant heart rate increase approximately two days prior to a fever³². Other studies have found that respiratory rate changes may precede decreases in oxygen saturation^33,34,35 and abnormalities of respiration have been identified as one the most important indications of clinical deterioration³⁶.

In order to acquire and clinically manage a larger set of physiologic variables, like heart rate and respiratory rate, passive continuous remote patient monitoring (CRPM) could be advantageous to intermittent self-spot checking. Additionally, the continuous monitoring captures both ambulatory and resting physiological data. Numerous studies have identified the benefits of a steady data stream over spot-checking toward more timely alerting of patient degradation. Benefits include earlier interventions, better personnel allocation, shorter hospital stays, and decreased chance of hospital readmission^{37,38,39,40,41}. Focusing on just intermittent monitoring of oxygen saturation and temperature rather than early changes in overall cardiorespiratory status through heart rate, respiratory rate, and other derived indices, it is likely that early warning signs of COVID-19 decompensation are being missed⁴².

Detection of early indicators of decompensation could be optimized through machine learning with continuous data streams. Numerous models have been trained and validated for predicting decompensation and diagnosis of COVID-19. Decompensation models have included predicting clinical outcomes, critical events, mortality, and hospitalization in the setting of COVID-19. However, each of these models relied on collection of laboratory panels, chest radiographs, patient reported symptoms, and/or computerized tomography scans^{43,44,45,46,47,48}. To our knowledge, there have been no multivariate models developed using only non-invasive CRPM to predict decompensation due to COVID-19.

Therefore, we propose a model using only data collected continuously from a non-invasive wearable device, to create a COVID-19 decompensation index (CDI), which accurately predicts COVID-19 clinical deterioration, resulting in hospitalization. Continuous minute signals were windowed using a moving 24 h window, with a 1 h step size, timestamped to the end of the 24 h. The windowed signals were processed to create a feature vector on a once per hour cadence, using only the information in the past 24 h. Each hourly feature vector formed the input to the CDI model (see Fig. 6). To collect training data for the CDI, a clinical team leveraged CRPM as part of their standard of care, requiring participants to wear a biosensor to collect electrocardiogram and accelerometer data continuously for 28 days. If a clinical team member observed a participant’s physiology deteriorating (decompensation), participants were encouraged to return to seek care. Using this labeled data as the ground truth for decompensation, various features were engineered related to participants’ activity and cardiovascular system to predict a decompensation event. CDI, a gradient boosted model, was developed and outperformed current standard of care alerting systems of oxygen saturation and temperature monitoring. The study protocol from which this work was derived was the “Personalized Analytics and Wearable Biosensor Platform for Early Detection of COVID-19 Decompensation Study" or DeCODe: Detection of COVID-19 Decompensation)⁴⁹.

Results

Participant Information

Four hundred COVID-19 positive participants were enrolled from University of Illinois Health System (UIH). Participants were adult patients (>18 years of age), and were recruited from two sources: (1) patients testing positive for COVID-19 in the outpatient setting and (2) patients who were admitted to the hospital with a diagnosis of COVID-19 and subsequently discharged to home convalescence. The cohort had a high percentage of individuals from traditionally underserved communities, including 46.2% Hispanic and 36.8% Non-Hispanic Black (see Table 1). The average age was 46 years old (±15.4), 67.2% female, with an average body mass index of 33.7 (±9.4). The most common co-morbidities were obesity (58.8%), hypertension (32.0%), and diabetes (26.2%). Participants were recruited from point of care testing centers, clinics, or the emergency department (360); however, 40 participants were enrolled post hospital discharge for COVID-19. Participants were expected to complete a total of 28 days of CRPM; however, averaged 22.2 ± 8.6 days of continuous data collection. Throughout monitoring, 25 participants were hospitalized due to complications of COVID-19. Eleven of the hospitalized participants reported to the emergency department (ED) independently while 14 were prompted to visit the ED by the study team. Treatments during hospitalization were varied, but primarily consisted of supplementary oxygen (15) and/or steroids (15). One participant required mechanical ventilation and one participant died after completing the study. After filtering for minimum data requirements, 308 participants were used to form the COVID-19 decompensation negative group, and 22 participants formed the COVID-19 decompensation positive group (“Methods”). Each COVID-19 decompensation event was scored using the WHO ordinal score⁵⁰, where decompensation scores ranged from 3 (hospitalized, no oxygen therapy) to 8 (death).

Table 1 Demographics and co-mordibities of the participants in the DeCODe phase 1 study.

Full size table

Validation and modality comparisons

The CDI model was trained using K-Fold cross validation. The overall area under the curve (AUC) of the receiver operating characteristic (ROC) of CDI was 0.84 (mean AUC across all folds was 0.85, 95% confidence interval (CI) 0.77–0.94), with an overall AUC of the precision-recall (PR) Curve of 0.43 (mean AUC across all folds was 0.48, 95% CI 0.33–0.62). All results were weighted for equal participant contribution to the ROC and PR curves. An operating threshold with a false positive rate of 7% was chosen to match the false positive rate of SpO₂ at rest at a 93% O₂ discrimination threshold. As seen in Fig. 1, the corresponding true positive rate at the 7% false positive rate is 55.7%, compared to 33.9% for SpO₂ at rest. To understand CDI performance in the context of other types of data typically analyzed in a clinical setting, we compared CDI to several univariate features and alternative models. The best univariate features were determined by the largest ROC-AUC among the feature categories of temperature, respiration rate, and heart rate. Additional models of the CDI features with SpO₂ and demographics were trained identically to the primary CDI model. Both CDI models produce the best overall ROC and PR curves, although the addition of SpO₂ measurements into CDI improves the positive predictive value (PPV) of the model. We find the model built on demographic features of age, sex, race, height, weight, and body mass index to be less predictive of decompensation events than any model built on biosensor derived data. In general, the univariate features and model built from demographic data perform worse than the CDI models. However, maximum temperature while asleep does provide strong predictive value in the lower true positive rate (TPR) region of the ROC curve.

**Fig. 1: ROC and PR curves for selected models and univariate signals.**

The performance of models built of various feature combinations are listed in Table 2. Only models utilizing features derived from the combination of biosensor derived features (ECG and acceleromety) achieve a ROC-AUC of greater than 0.8. Models utilizing features derived only from heart rate, respiration rate, actigraphy, and heart rate variability achieve ROC-AUC values from 0.69 to 0.71.

Table 2 Performance of univariate features and multivariate models.

Full size table

Tree based models like gradient boosted trees allow for more interpretable indications of what features were important for the model to reach its decisions. One way of measuring feature importance is Shapley additive explanations, or SHAP scores⁵¹. SHAP scores for the top 25 features are plotted in Fig. 2. We find low activity, high respiration rate, reduced heart rate variability, and high skin temperatures as important features for hospitalization prediction. Among the top 25 features, 20 were filtered statistical features, of which 9 were “while sleeping”, 2 were “while not sleeping”, 5 were “while walking”, and 5 were “while not walking”. We also find the interaction feature “the slope of delta heart rate to delta step count” as the most important individual feature. Intuitively, this feature can be understood as how dynamically one’s cardiorespiratory system can react to the physical demands of a given activity, where faster responses to the changes of physical demands may be indicative of better cardiorespiratory health. This feature has only mediocre discriminative value by itself, with a univariate ROC-AUC value of 0.62, compared to the second most important feature, the interquartile range of gross activity, which has a ROC-AUC of 0.77. We hypothesize that the slope of delta heart rate to delta step count is a feature indicative of overall cardiorespiratory function and helps the model contextualize the other features, thereby acting as a hospitalization risk factor, but individually is only slightly predictive of hospitalization.

**Fig. 2: Shapley additive explanations (SHAP) scores of the most important 25 features, plotted as a beeswarm plot.**

While we did not have true external validation data in this phase 1 study, we perform two additional case study analyses using two datasets not used during training to test model performance. The first dataset was composed of previously collected data from participants who did not have COVID-19. The dataset was from 161 participants, representing almost 200,000 h of continuous biosensor data. This dataset was used to confirm the false positive rates of the K-Fold models, and that it matched the false positive rate of the overall CDI model. The false positive rate was set to 7% for the COVID-19 decompensation task, and the final model achieved a 7.4% FPR on the COVID-19 negative task, compared with a mean FPR of the 22-fold models of 6.8% (95% CI 5.8–7.8%).

On the second dataset, two participants who were hospitalized but were unable to be officially adjudicated and given a WHO score were analyzed as decompensation events with unknown WHO score (though likely ≥WHO 3). Using the overall trained CDI model, the overall true positive rate for both participants was an average of 87.5%, with a true positive rate during the last two days before hospitalization of 96.0%. Adding these two participants to the overall positive dataset, the overall TPR increases from 55.7 to 58.3%.

Comparison to standard of care and remote patient monitoring

The detection capabilities of CDI were compared to a simulated standard of care, where standard of care was defined as thresholded SpO₂ and thresholded skin temperature values (oral temperature unavailable), as defined in the “Methods”. The comparison between CDI and simulated standard of care leading up to each analyzed hospitalization event is shown in Fig. 3. In the 2 days leading up to hospitalization, CDI alerts for 19 out of 22 participants, contrasted to 10 of 22 for simulated standard of care alerts. Four participants did not take any SpO₂ measurements during the two days before hospitalization; all four participants had CDI alerts during the same time period. Only one participant had a standard of care alerts without also having a CDI alert in the last 2 days before hospitalization, and two participants had no alerts of either kind. As can also be seen in Fig. 3, CDI alerted far more frequently for participants with WHO scores of four or greater than those with WHO scores of 3. Participants with WHO scores of 3 had an average time in the hospital of 2 days, and those with WHO scores of 4 or higher had an average hospitalization time of 6.3 days. Temperature alerts were rarely triggered, despite the good predictive value of the daily maximum temperature, the temperature taken at an arbitrary patient-selected time point during the day seems to have low sensitivity.

**Fig. 3: Comparison of simulated standard of care alerts to CDI alerts.**

To further examine the potential benefits of using a machine learning model like CDI, we compared the detection capabilities of CDI to a CRPM system set to alert on conventional clinical rules. These CRPM alerts are non-specific to COVID-19, with every hour considered an alert if any of the underlying CRPM rules are met (“Methods”). The CRPM alerts compared to CDI based alerts can be seen in Fig. 4. Both CRPM and CDI alerts show relatively high true positive rates leading up to the events, although CDI does have a consistently higher TPR in the two days before hospitalization. The true positive rate of CDI gradually rises starting 2 days before the events, going from around a 40% TPR to a 70% TPR on the day of the event. However, the largest difference between CRPM and CDI alerts occur in the reduction in false positive rates among negative participants. In comparison to CDI, the simulated CRPM system alerted more frequently and across more individual participants.

**Fig. 4: Comparison of a continuous remote monitoring system with clinical alerting rules to CDI.**

Discussion

While the majority of individuals infected by SARS-CoV-2 will ultimately not require hospitalization, up to 20% will². The time from symptom onset to the need for hospitalization can extend beyond 10 days⁵², and the severity of an individual’s clinical course is rarely clear at the time of diagnosis. Accordingly, effective CRPM capabilities are critical to assuring patient safety while conserving scarce resources of both hospital beds and health care professionals. We show that through the use of a multi-parameter sensor patch in the outpatient setting that we were able to develop a machine learning based CDI model that has the potential to significantly improve the lead time and accuracy of identifying individuals requiring hospitalization due to progressive COVID-19, relative to what is routinely done in current remote monitoring programs.

The CDI model moves well beyond the current standard of care that depends on one or two population-based cut-offs (e.g., oxygen saturation <93%, or temperature >38 °C), captured only 2- to 3-times a day. Instead, using 17 source signals sampled every minute, which are further processed to create 361 features each hour, earlier signals of decompensation are detectable. While the continuous physiologic data available through a wireless patch sensor adds considerable value to intermittent spot checks, we have shown that the CDI model improved overall performance with a large reduction in false positive alerts relative to the sensor data alone. It is telling that the model features of greatest value were based on the interactions between individual features. For example, the relationship between heart rate during a given amount of activity may better track overall cardiorespiratory performance and characterize the impact of progressive COVID-19 on it.

The CDI model inherits the best attributes of the underlying features used to build the model. Features derived from respiration and temperature tend to show good discriminatory values in the low false positive region, as very high respiration rates and temperatures are quite predictive of decompensation events. Features derived from actigraphy signals are not able to predict decompensation events well, as those with COVID-19 already generally have low levels of activity, but those with high levels of activity can be ruled out for COVID-19 decompensation.

The CDI model shows an improvement over the standard of care patient initiated SpO₂ and temperature checks (Fig. 3), particularly for participants with decompensation events of WHO ≥4, as 15 out of 16 participants had at least one CDI alert in the 2 days before their hospitalization. Four participants did not take any SpO₂ measurements in the 2 days leading to their hospitalization, and eight participants did not take any SpO₂ measurements during the 24 h before their hospitalization. This illustrates the importance of a monitoring system that does not require the participants to take action precisely at the time they are likely to feel the worst, but during which time their compliance is most critical. A CRPM system that passively collects biosensor data and can alert clinicians to potentially dangerous situations without the need for patient action could be an improvement over a system only utilizing SpO₂ and temperature checks. Though, as seen in Fig. 4, simply using typical clinical rules based on remote monitoring data is likely to lead to alarm fatigue due to the high false positive rate. While simple thresholds on standard vital signs can provide decent COVID-19 decompensation detection, it is at the cost of a high false alert burden across many individual participants. Using a CDI model within a remote monitoring system could allow for the best of both worlds, achieving high detection capabilities, but with a suppressed false alarm rate over simpler clinical rules. However, while early detection of patient deterioration is an extremely important dimension of an effective CRPM approach, there are other dimensions that are just as important. For a CRPM platform to be adopted by clinicians it needs to be highly reliable, cost effective, easy to use, and enable clinicians to more efficiently manage a patient population. CRPM solutions that do not address all these dimensions are unlikely to become part of patient management clinical workflow. In addition, while we believe that digital health solutions have the potential to improve equity in healthcare, unless socioeconomic and educational barriers are proactively addressed, they could actually contribute to increasing inequity⁵³.

Our work adds to a growing body of literature addressing remote monitoring in COVID-19. A recent systematic review identified 17 examples of published remote home monitoring models⁵⁴. Virtually all incorporated some aspect of symptom tracking, most included some form of daily temperature checks, and 11 of the 17 included pulse oximetry. The methods of each model and outcomes tracked were too heterogeneous to allow for any meaningful conclusions regarding the efficacy and safety of any of these models but do help inform the scope of the problem. Looking at the totality of data from remote COVID-19 patient monitoring studies, the incidence rate of decompensation requiring at least an emergency department visit varied from as high as 36% to as low as 2.6%^55,56. In one well-characterized study involving 2348 people enrolled in a twice-daily text messaging symptom tracker study, 23% required some escalation of care, with 8.6% returning to the emergency department and 42% of those individuals requiring hospitalization⁵⁷. For COVID-19 positive individuals in whom deterioration is recognized earlier, proven therapies such as monoclonal antibody cocktails that have been shown to decrease hospitalization and death by 70%, can be initiated sooner in those most likely to benefit the greatest⁵⁸. A CDI model used within a CRPM system may provide valuable lead time before hospitalization, leading to better patient outcomes.

The pandemic has also served to accelerate the utilization of digital health technologies in order to better care for individuals without the typical requirement of a visit to a brick-and-mortar healthcare facility. Smartphone apps, and SMS-based messaging programs allow people to enter their overall health and any symptoms on a daily basis that can be monitored and acted upon by healthcare professionals is one such method^57,59. Symptom tracking alone is subjective and will be inconsistent between people, and would also miss individuals with “silent hypoxia,” a phenomenon described in COVID-19⁶⁰. As noted earlier, most remote monitoring programs incorporate temperature checks, but a fever has turned out to not be the hallmark for COVID-19 infection-initially thought with some case series finding a fever at the time of initial hospitalization in less than one-third of patients, and only just over a quarter of COVID-19 positive nursing home residents^28,29. Some programs have also incorporated home pulse oximetry, which has been found in one study of 77 COVID-19 outpatients to be helpful in prompting a return to the emergency department for an asymptomatic decrease in oxygen saturation and to reassure others that they did not need to return to the emergency department when their oxygen saturation was unchanged²⁹. However, beyond the inherent limitations of intermittent self-testing with a pulse oximeter, there remain many practical issues around their reliability in large populations including their accuracy at lower oxygen levels, in settings of low pulsatile flow, and most especially racial bias leading to higher levels of unrecognized hypoxemia in Black relative to White populations^24,61. Our work specifically sought to recruit from traditionally underserved communities of color to avoid such racial bias, with the study cohort consisting of 46.2% Hispanic and 36.8% Non-Hispanic Black participants.

This work has several limitations. Perhaps the largest limitation is the relatively small number of decompensation events. Because of the small number of decompensation events, we could not use a true holdout test set, and although we took several precautions against overfitting, given the small size of the positive dataset, we cannot entirely guard against it. However, this was only the phase I pilot part of the study, and we expect additional model validation to occur during the larger phase II part of the study. An additional limitation to the predictive performance of the CDI model is that we had no information about participants’ healthy, pre-COVID-19 baselines. For example, prior research has found significant inter-individual variability in “normal" resting heart rates⁶². With up to a 70-BPM difference between individuals’ average resting heart rates, any non-individualized heart rate-based features will lack the precision truly possible. Other limitations include: the temperature sensor used in the biosensor device was a skin temperature measurement and not a true oral temperature, and that the study cohort skewed mostly female.

We show a multivariate COVID-19 Decompensation Index capable of outperforming several standard clinical monitoring modalities using only data collected from wearable biosensors. CDI shows promise as a means of extending continuous remote patient monitoring capabilities, particularly above the use of intermittent, patient driven univariate monitoring. While cross-validation and hundreds of thousands of hours of data were used to minimize the likelihood of overfitting, a larger sample size is necessary to further validate the accuracy of the model. Future validation work is currently in progress as part of the phase 2 DeCODe study where CDI will be evaluated extensively to continue optimizing the use of CRPM⁴⁹.

Methods

Data collection

Data were collected to develop and test a COVID-19 Decompensation Index from participants during the phase I portion of the Detection of COVID-19 Decompensation (DeCODe) study⁶³. DeCODe was supported by the NIH National Cancer Institute (NCT04575532). The DeCODe study is a prospective, non-randomized, open-label study, with the primary outcome to develop a machine learning model to predict COVID-19 decompensation events, with secondary outcomes evaluating the feasibility of the pinpointIQ™ continuous remote patient monitoring system. Participants were adult patients (≥18 years of age) in the University of Illinois Health System. Patients were recruited from two sources: (1) patients testing positive for COVID-19 in the outpatient setting and (2) patients who were admitted to the hospital with a diagnosis of COVID-19 and subsequently discharged to home convalescence. The enrollment target was 400 participants for the phase I study. Participants were monitored for 28 days using the physIQ pinpointIQ™ platform to capture continuous biosensor data from the VitalConnect VitalPatch^® chest patch biosensor. The biosensor is an FDA cleared device that acquires 5–7 days of continuous raw 125 Hz ECG, 50 Hz triaxial accelerometer, and 0.25 Hz skin temperature data from which physiological features are derived by the pinpointIQ™ platform. These derived features drove the development of the CDI model.

Participants responded to symptom questionnaires and performed manual finger pulse oximetry measurements. The finger pulse oximeter sensor provided was the Proactive, Protekt^® Finger Pulse Oximeter 20110. Results of both were captured by the pinpointIQ™ smartphone app. Prompts to respond to the symptom questionnaire and to perform pulse oximeter measurements were pushed to the study smartphone twice daily. Each pulse oximetry measurement prompt included two measurements. The prompts for the symptom questionnaire and pulse oximetry measurements are shown below.

1.
Symptom questionnaire:
- How are you feeling since the last time you completed the survey?
  - OPTIONS: “Better”; “Worse”; “No Change”.
2.
SpO₂ measurements:
- Record your SpO₂ at rest:
  - Options: “100%”; “99%”; “98%”; “97%”; “96%”; “95%”; “94%”; “93%”; “92% or Below”.
- Record your SpO₂ while walking:
  - Options: “100%”; “99%”; “98%”; “97%”; “96%”; “95%”; “94%”; “93%”; “92% or Below”.

Once enrolled, participants were provided with a study kit containing a smartphone, finger pulse oximeter, a supply of biosensors, and instructions on the study procedures to follow and how to uses the devices.

Exclusion criteria

Individuals were excluded with known allergic reactions to components of the biosensor hydrocolloid gel adhesives; or with cognitive or physical limitations that could have limited their ability to fully follow study procedures. Individuals that did not speak or read English or Spanish were also excluded.

Ethical approval

This study was approved by the University of Illinois, Chicago Institutional Review Board. All participants provided written informed consent prior to enrolment in the study.

COVID-19 decompensation event adjudication

A “clinical event” was defined as an escalation of care from home-based remote monitoring to a higher level of care. For example, if during monitoring the clinical user identifies that the patient is worsening and they need acute care, this is defined as a clinical event. Health record documentation was gathered from care facilities on any clinical event and was used for final adjudication as a COVID-19-related event. The adjudication consisted of review of the electronic health record of the participant experiencing the clinical event and two emergency department physicians independently deciding on whether the event was a “COVID-19 clinical event” or “non-COVID-19 clinical event”. If the two opinions did not agree, the case was reviewed by a third emergency department physician and final decision was made. Finally, a COVID-19 clinical event was defined to be a “Decompensation Event” if a hospitalized patient reached a maximum WHO Ordinal Scale for Clinical Improvement (WHO OSCI) score of 3 or more during their hospitalization. A WHO Score of 3 corresponds to “Hospitalization, no oxygen therapy”⁵⁰. The COVID-19 Decompensation events served as the positive cases for the development of CDI.

Ground truth formatting and analysis methods

Clinical outcomes were formatted for scoring of a classifier by the following steps. All participants who did not experience a COVID-19 event of any kind based on the WHO Ordinal Scale⁵⁰ and who had at least 12 h of continuous biosensor data formed the negative group (308 participants). The entirety of the data collected from enrollment to completion or withdrawal from each event-negative participant was analyzed. All participants who experienced only a COVID-19 event(s) with a WHO score less than 3 (16 participants) were removed from analysis. WHO scores below 3 are not hospitalized and thus are not decompensation events, but conversely do not fit into the event negative group as many sought medical treatment, though they were not sick enough to be admitted to the hospital. Data from participants with WHO scores of 3 or above (25 participants) which had enough data for analysis (22 participants) were analyzed up until their first event with a WHO score of 3 or above. Two participants had more than one event with a WHO score of 3 or above; these additional events occurred two days after being discharged from the hospital from their first event and were not analyzed. We defined the positive time window for each event as up to two weeks before their hospitalization time, ending at the time of hospitalization. Any WHO scores less than 3 that occurred during the positive time window were ignored. The participant clinical flowchart can be seen in Fig. 5.

**Fig. 5: The analysis flowchart from 400 enrolled to 330 ultimately analyzed participants.**

Each hour of data from participants without events and each hour of data from the positive time window of participants with events were treated as ground truth targets, with target values of 0 for the negative class, and values of 1 for the positive class. Any hour of ground truth target which did not have a corresponding CDI value (for example, during the first 12 h of monitoring) was not considered in the CDI performance analysis. To account for the variance in the amount of data collected by each participant and to ensure reported performance was not skewed by participants with more data than others, all confusion matrix calculations were weighted so that the contribution from each participant was equal. Weighting the confusion matrix in this manner results in a maximum true negative of 308, and a maximum true positive of 22.

Feature extraction

The VitalConnect VitalPatch^® device records raw ECG, 3-axis Accelerometer, and skin temperature, which is processed by the physIQ platform to produce 17 source signals at a once per minute sampling rate, listed in Table 3. The minute level signals were further processed to create a 361-length feature vector using a 24 h analysis windows with a 1 h slide. Feature extraction for each window would occur if at least 12 out of the maximum 24 h of data was available (that is, if at least 770-min samples were available in a given 24 h period). Provided the data requirements were met, the 17 source signals would produce 361 features at a hourly cadence, timestamped to the end of the 24 h window. Figure 6 shows the overall processing steps from minute signals to CDI probability. The feature extraction steps to produce the 361 features for the CDI model are defined below.

Table 3 Minute signals used as the source signals for feature extraction in the CDI model.

Full size table

**Fig. 6: Processing steps from wearable features to CDI outputs.**

Two additional minute features, here named “Breaths Per Beat” and “HRV Normalized by HR,” were created by dividing minute respiration rate by minute heart rate, and by dividing the minute time domain heart rate variability by heart rate, respectively. A set of six statistical operations (median, mean, standard deviation, 1st percentile, 99th percentile, and interquartile range) were applied to the minute signals as listed in the Statistical Feature Group in Table 4. The 1st and 99th percentile were used in lieu of minimum and maximum to avoid any erroneous true minimum or maximum values. Statistical features were also calculated on a subset of signals while filtered for certain activities. The filtered statistical features were filtered such that the operations of median, mean, standard deviation, 1st percentile, and 99th percentile were calculated during periods of walking, not walking, sleeping, and not sleeping. Minutes were classified as walking or sleeping if more than 50% of the minute was classified as walking or sleeping.

Table 4 Feature extraction transforms from 17 source signals to 361 features.

Full size table

Inspired by Mishra, et al.⁶⁴, who found that the ratio of heart rate and step count was a predictive feature of COVID-19 infection, we developed a set of features to capture interactions between minute step counts and vital signs. The first set of features, called the Weighted Average Features, in Table 4 are features found by taking the weighted average of input signals with step counts as the weighting variable, according to the following weighting formula in Equation (1).

$${{weightedSignal}}=\frac{\sum {{Signal}}\cdot {{stepCount}}}{\sum {{stepCount}}}$$

(1)

The second set of features, called the interaction and delta interaction features in Table 4 are features found by calculating the slope and intercept of a linear regression of step count against heart rate, time domain heart rate variability (HRV), respiration rate, and breaths per beat, with step count as the independent variable. Step counts and the corresponding signal values when step counts were zero were not used to calculate the linear regression parameters. Additionally, the same linear regression procedure was repeated to calculate the linear regression slope on the first order differences of step count and the first order difference of the other signals.

Sleep quality and sleep apnea have been found to be correlated with COVID-19 outcomes^64,65; to capture some sleep disturbance information, we calculated two sleep disturbance features in the CDI model. These were calculated by determining the number of awakenings, computed by counting the number of transitions from sleep to awake state, and the number of awakenings per hour of sleep, where a given minute was classified as sleeping if more than 50% of the minute was the sleep state.

One data quality feature was calculated by summing over an ECG signal quality index to calculate the percent of good quality data with respect to the total possible amount of data per day.

Other features not derived from biosensor data like demographics, survey responses, and SpO₂ were interpolated with zero-order interpolation and timestamped to the end of each hour. Survey responses and SpO₂ responses were interpolated for up to 24 h past the time of their original measurement.

Model training

In this work, we chose to use gradient boosted decision trees⁶⁶ as our classifier, as gradient boosted trees achieve high levels of performance and can provide greater insight into their decisions than other black box machine learning models, which is a desired attribute in the medical field. The principal difficulty in modeling COVID-19 decompensation was the large class imbalance, and the relative rarity of pre-hospitalization data. Of the 330 participants ultimately analyzed in this study, only 22 of them had analyzable events. The median amount of time from enrollment to COVID-19 decompensation event was 4 days, which was far smaller than the median number of days (26 days) of data collected by each negative participant. As a result, the total positive data was 1.4% of the total data. Given the small amount of positive data available, we were concerned with the possibility of overfitting on the dataset and took several steps to safeguard against overfitting and check that it did not occur.

We separated our modeling steps into (1) feature and hyperparameter selection, (2) K-Fold training of the CDI model, and (3) validation of K-Fold trained models on additional datasets. To help avoid overfitting on the decompensation detection task, we performed hyperparameter selection using a related but easier task of differentiating participants with a COVID-19 diagnosis from those without a COVID-19 diagnosis, using a previously collected dataset of roughly equal size to the DeCODe study, with the hypothesis that similar features and hyperparameters for the COVID-19 vs non-COVID-19 task would transfer to the task of COVID-19 decompensation detection. After selection of the hyperparameters, we used K-Fold model training and validation to produce performance estimates for decompensation classification, then trained a single model on all the training data. Finally, to validate the model we turned back to the non-COVID-19 data used in step 1, and though that data only contained examples negative for COVID-19 decompensation, it allows for confirmation that the false positive rates match the expected false positive rates found in the K-Fold model validation procedure.

To reduce the likelihood of overfitting on the COVID-19 decompensation detection task, we performed feature selection and XGBoost hyperparameter selection on the related task of classifying COVID-19 positive vs COVID-19 negative participants. The COVID-19 positive dataset was the entirety of the DeCODe Phase 1 dataset, and the COVID-19 negative dataset was a dataset of previously collected data from 161 participants, consisting of almost 200,000 h of continuous biosensor data. Feature selection was performed by training an XGBoost model and ranking the features by SHAP importance⁵¹. We selected the top 50 features based on SHAP importance and selected the rest of the model hyperparameters using a 5-fold grid search, based on the number of participants.

We applied a modified K-Fold validation approach, with K equal to the number of positive participants (22), such that each fold had one positive participant, and 1/22nd of the negative participants. On each iteration, 21 out of the 22 folds were used to train the model, with the remaining fold used for validation. For each training iteration, 2 days of data before events from participants with only WHO scores of two were added to the positive training data to help provide additional data. All features were z-scored based on the mean and standard deviation parameters calculated for each training iteration. To help combat class imbalance, samples during training were weighted with increasing importance leading up to the date of the COVID-19 decompensation. This weighting was done based on the hypothesis that in each positive window, the certainty that the COVID-19 event is happening grows as the time of actual event grows closer. Conversely, negative participants’ data were weighted with importance as decreasing from the date of enrollment, as we wanted the model to focus more on time periods where their disease severity is likely worse than toward the end of their data collection, when they likely would have largely recovered from the disease. The final model was trained using all 22 folds.

In an effort to further guard against overfitting, we performed an additional validation of the false positive rate of the K-Fold models and the final model trained on all the training data on the COVID-19 negative dataset used to select the model hyperparameters. Using this procedure allows verification that firstly the false positive rate of the final model is consistent with the false positive rates produced by each of the 22 models trained in the K-Fold validation, and secondly that the false positive rate on the COVID-19 negative dataset is consistent with the false positive rate achieved on the COVID-19 decompensation task.

Furthermore, we used two hospitalized participants data as an additional small positive dataset. These two participants could not be adjudicated and given a WHO score, as their medical records were inaccessible to the UIH medical system, but were likely hospitalized as a result of COVID-19.

Experiments

We conducted several additional experiments to compare the CDI model to available standard and care and remote patient monitoring without using machine learning.

Comparisons to the CDI model were done in one of two ways, either a comparison to a univariate feature analysis, or in a multivariate feature analysis by building a new XGBoost model. The univariate feature comparison was created by using the feature of interest directly in the ROC analysis.

The multivariate feature analyses follow the same modeling steps as described above in “Model training”. However, instead of using the 50 best features without restriction, the model would use the 50 best with certain types of features removed. For example, a model only using actigraphy features would select the top 50 actigraphy features from the full list of ranked features used for CDI. The same hyperparameters used for the CDI model were used in all other multivariate model comparisons.

Simulated standard of care was defined as patient initiated SpO₂ and temperature readings, which would alert if either are above typical clinical thresholds. An SpO₂ alert was considered triggered if the measured SpO₂ value at rest was less than or equal to 93%. Patient initiated temperature alerts were simulated by thresholding the skin temperature value occurring at the same time as the SpO₂ measurement. Since skin temperature is lower overall than oral or core body temperature, standard clinical oral temperature thresholds were not valid thresholds. Instead, we considered a skin temperature reading to be an alert if the value was greater 36.11 °C, which is the 90th percentile of the temperature readings in the study.

We wanted to compare CDI to continuous remote patient monitoring to better understand the differences between a machine learning model derived from the same information, compared to a set of clinical rules. The CRPM rules were computed each hour, and an alert was considered triggered if any of the following conditions were met:

Tachypnea rule: alert if the average respiration rate while not walking is greater than 26 breaths per minute.
Tachycardia rule: alert if heart rate is greater than 100 beats per minute while not walking for more than 40 min per hour
Atrial fibrillation rule: alert if AFib is detected for more than 40 min per hour.
Bradycardia rule: alert if heart rate is lower than 45 beats per minute for more than 48 min per hour or if heart rate is lower than 52 beats per minute for more than 48 min per hour while awake.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data that support the findings of this study are available from the Digital Health Technologies Data Hub as well as the NIH RADx data hub (https://rapier.ll.mit.edu/studies). Users will be required to electronically sign a data use agreement. Each dataset will have its own DOI. Data will be available for general research use. Aggregate data analyzed in this study are available from the corresponding author on reasonable request.

Code availability

The code used to train and validate the model, as well as perform the analyses, are available at the Digital Health Technologies Data Hub. They are also available from the corresponding author on reasonable request. All analyses were performed using Python 3.8, with xgboost==1.2.1, and shap==0.37.0.

References

COVID-19 Map - Johns Hopkins Coronavirus Resource Center. https://coronavirus.jhu.edu/map.html (2020).
Wu, Z. & McGoogan, J. M. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72314 cases from the chinese center for disease control and prevention. JAMA 323, 1239–1242 (2020).
Article CAS PubMed Google Scholar
Grasselli, G. et al. Baseline characteristics and outcomes of 1591 patients infected with SARS-CoV-2 admitted to ICUs of the Lombardy Region, Italy. JAMA 323, 1574–1581 (2020).
Article CAS PubMed PubMed Central Google Scholar
Zhou, F. et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet 395, 1054–1062 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sen-Crowe, B., Sutherland, M., McKenney, M. & Elkbuli, A. A closer look into global hospital beds capacity and resource shortages during the COVID-19 pandemic. J. Surg. Res. 260, 56–63 (2021).
Article CAS PubMed Google Scholar
O’Keefe, J. B., Tong, E. J., Taylor, T. H. J., O’Keefe, G. A. D. & Tong, D. C. Use of a telemedicine risk assessment tool to predict the risk of hospitalization of 496 outpatients with COVID-19: retrospective analysis. JMIR Public Health surveill. 7, e25075 (2021).
Article PubMed PubMed Central Google Scholar
Massaroni, C., Nicolò, A., Schena, E. & Sacchetti, M. Remote respiratory monitoring in the time of COVID-19. Front. Physiol. 11, 635 (2020).
Article PubMed PubMed Central Google Scholar
O'Carroll, O. et al. Remote monitoring of oxygen saturation in individuals with COVID-19 pneumonia. Eur. Respir. J. 56, 2001492 (2020).
Article CAS PubMed PubMed Central Google Scholar
Watson, A. R., Wah, R. & Thamman, R. The value of remote monitoring for the COVID-19 pandemic. Telemed. e-Health 26, 1110–1112 (2020).
Article Google Scholar
Aziz, S. et al. Managing ICU surge during the COVID-19 crisis: rapid guidelines. Intensive Care Med. 46, 1303–1325 (2020).
Article CAS PubMed Google Scholar
Greenhalgh, T., Koh, GCH. & Car, J. Covid-19: a remote assessment in primary care. BMJ 368, m1182 (2020).
Article PubMed Google Scholar
Portnoy, J., Waller, M. & Elliott, T. Telemedicine in the era of COVID-19. J. Allergy Clin. Immunol Pract. 8, 1489–1491 (2020).
Article PubMed PubMed Central Google Scholar
Aalam, A. A. et al. Remote patient monitoring for ED discharges in the COVID-19 pandemic. Emerg. Med. J. 38, 229–231 (2021).
Article PubMed Google Scholar
Burke, R. M. et al. Active monitoring of persons exposed to patients with confirmed COVID-19—United States, January-February 2020. Morb. Mortal. Wee. Rep. 69, 245–246 (2020).
Article CAS Google Scholar
Bahl, S. et al. Telemedicine technologies for confronting covid-19 pandemic: a review. J. Ind. Integr. Manage. 5, 547–561 (2020).
Article Google Scholar
Bertsimas, D. et al. COVID-19 mortality risk assessment: an international multi-center study. PLoS ONE 15, 1–13 (2020).
Article Google Scholar
Fuentes-Antrás, J. et al. A snapshot of covid-19 infection in patients with solid tumors. Int. J. Cancer 148, 2389–2397 (2021).
Article CAS Google Scholar
Cascella, M., Rajnik, M., Cuomo, A., Dulebohn, S. C. & Di Napoli, R. Features, Evaluation and Treatment Coronavirus (COVID-19) (StatPearls Publishing, 2020).
Chawla, R. et al. A Retrospective observational study to determine the early predictors of in-hospital mortality at admission with COVID-19. Indian. J. Crit. Care Med. 24, 1174–1179 (2020).
Article PubMed PubMed Central CAS Google Scholar
Mejía, F. et al. Oxygen saturation as a predictor of mortality in hospitalized adult patients with covid-19 in a public hospital in lima, peru. PLoS ONE 15, 1–12 (2021).
Google Scholar
Gul, M. H., Htun, Z. M. & Inayat, A. Role of fever and ambient temperature in COVID-19. Exp. Rev. Respir. Med. 15, 171–173 (2021).
Article CAS Google Scholar
Ng, DHL. et al. Fever patterns, cytokine profiles, and outcomes in COVID-19. Open Forum Infect. Dis. 7, ofaa375 (2020).
Article PubMed PubMed Central CAS Google Scholar
Pereira, N. L. et al. COVID-19: understanding inter-individual variability and implications for precision medicine. Mayo Clin. Proc. 96, 446–463 (2021).
Article CAS PubMed Google Scholar
Luks, A. M. & Swenson, E. R. Pulse oximetry for monitoring patients with COVID-19 at home potential pitfalls and practical guidance. Ann. Am. Thorac. Soc. 17, 1040–1046 (2020).
Article PubMed PubMed Central Google Scholar
Hidalgo, D. C., Olusanya, O. & Harlan, E. Critical care trainees call for pulse oximetry reform. Lancet Respir.y Med. 9, e37 (2021).
Article Google Scholar
Feiner, JR., Severinghaus, JW. & Bickler, PE. Dark skin decreases the accuracy of pulse oximeters at low oxygen saturation: The effects of oximeter probe type and gender. Anesth. Analg. 105, S18–S23 (2007).
Article PubMed Google Scholar
Clarke, G. W., Chan, A. D. & Adler, A. Effects of motion artifact on the blood oxygen saturation estimate in pulse oximetry. In IEEE MeMeA 2014—IEEE International Symposium on Medical Measurements and Applications, Proceedings (IEEE Computer Society, 2014).
Richardson, S. et al. Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City Area. JAMA 323, 2052–2059 (2020).
Article CAS PubMed PubMed Central Google Scholar
Rudolph, J. L. et al. Temperature in nursing home residents systematically tested for SARS-CoV-2. J. Am. Med. Dir. Ass. 21, 895–899.e1 (2020).
Google Scholar
Wu, C. et al. Risk factors associated with acute respiratory distress syndrome and death in patients with Coronavirus disease 2019 pneumonia in Wuhan, China. JAMA Intern. Med. 180, 934 (2020).
Article CAS PubMed Google Scholar
Petrilli, CM. et al. Factors associated with hospital admission and critical illness among 5279 people with coronavirus disease 2019 in New York City: prospective cohort study. BMJ 369, m1966 (2020).
Article PubMed PubMed Central Google Scholar
Davis, S. et al. Detecting pathogen exposure during the non-symptomatic incubation period using physiological data: proof of concept in non-human primates. Front. Physiol. 12, 1173 (2021).
Article Google Scholar
Addison, P. S. et al. Pulse oximetry-derived respiratory rate in general care floor patients. J. Clin. Monit. Comput. 29, 113–120 (2015).
Article PubMed Google Scholar
Barker, S., Tremper, K. & Gamel, D. Clinical comparison of transcutaneous Po2 and pulse oximetry in the operating Room. Anesth. Analg. 65, 805–808 (1986).
Article CAS PubMed Google Scholar
Bergese, S. D. et al. Multicenter study validating accuracy of a continuous respiratory rate measurement derived from pulse oximetry: A comparison with capnography. Anesth. Analg. 124, 1153–1159 (2017).
Article PubMed PubMed Central Google Scholar
Mochizuki, K. et al. Importance of respiratory rate for the prediction of clinical deterioration after emergency department discharge: a single-center, case-control study. Acute Med. Surg. 4, 172–178 (2017).
Article PubMed Google Scholar
Downey, C., Randell, R., Brown, J. & Jayne, DG. Continuous versus intermittent vital signs monitoring using a wearable, wireless patch in patients admitted to surgical wards: Pilot cluster randomized controlled trial. J. Med. Intern. Res. 20, e10802 (2018).
Google Scholar
Schein, R. M., Hazday, N., Pena, M., Ruben, B. H. & Sprung, C. L. Clinical antecedents to in-hospital cardiopulmonary arrest. Chest 98, 1388–1392 (1990).
Article CAS PubMed Google Scholar
Jones, D., Mitchell, I., Hillman, K. & Story, D. Defining clinical deterioration. Resuscitation 84, 1029–1034 (2013).
Article PubMed Google Scholar
Chen, L. et al. Dynamic and personalized risk forecast in step-down units: Implications for monitoring paradigms. Ann. Am. Thorac. Soc. 14, 384–391 (2017).
Article PubMed PubMed Central Google Scholar
Maas, M. B. Evaluating the prognostic utility of intermittent vs continuous electroencephalography in comatose survivors of cardiac arrest. JAMA Network open 3, e203743 (2020).
Article PubMed Google Scholar
Madjid, M., Safavi-Naeini, P., Solomon, S. D. & Vardeny, O. Potential effects of coronaviruses on the cardiovascular system: a review. JAMA Cardiol. 5, 831–840 (2020).
Article PubMed Google Scholar
Burdick, H. et al. Prediction of respiratory decompensation in Covid-19 patients using machine learning: The READY trial. Comput. Biol. Med. 124, 103949 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kwon, Y. J. F. et al. Combining initial radiographs and clinical variables improves deep learning prognostication in patients with COVID-19 from the emergency department. Radiol. Artif. Intell. 3, e200098 (2021).
Article PubMed Google Scholar
Razavian, N. et al. A validated, real-time prediction model for favorable outcomes in hospitalized COVID-19 patients. npj Digit. Med. 3, 1–13 (2020).
Article Google Scholar
Vaid, A. et al. Machine learning to predict mortality and critical events in a cohort of patients with COVID-19 in New York City: Model development and validation. J. Med. Intern. Res. 22, e24018 (2020).
Google Scholar
Shashikumar, SP. et al. Development and prospective validation of a deep learning algorithm for predicting need for Mechanical Ventilation. Chest 159, 2264–2273 (2020).
Article PubMed PubMed Central CAS Google Scholar
Quer, G. et al. Wearable sensor data and self-reported symptoms for COVID-19 detection. Nat. Med. 27, 73–77 (2021).
Article CAS PubMed Google Scholar
Detection of COVID-19 Decompensation—Full Text View—ClinicalTrials.gov. https://clinicaltrials.gov/ct2/show/NCT04575532?term=physIQ&cond=decompensation+covid-19&draw=2&rank=1 (2021).
WHO R&D Blueprint novel Coronavirus COVID-19 Therapeutic Trial Synopsis (2020).
Lundberg, S. M. & Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Guyon, I. et al. (eds.) Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Inc., 2017).
Faes, C. et al. Time between symptom onset, hospitalisation and recovery or death: Statistical analysis of belgian covid-19 patients. Int. J. Environ. Res. Public Health 17, 1–18 (2020).
Article CAS Google Scholar
Radin, J. M., Quer, G., Jalili, M., Hamideh, D. & Steinhubl, S. R. The hopes and hazards of using personal health technologies in the diagnosis and prognosis of infections. Lancet Digit. Health 3, e455–e461 (2021).
Article PubMed Google Scholar
Vindrola-Padros, C. et al. Remote home monitoring (virtual wards) for confirmed or suspected COVID-19 patients: a rapid systematic review. EClinicalMedicine. 37, 100965 (2021).
Article PubMed PubMed Central Google Scholar
Shah, S. et al. Novel use of home pulse oximetry monitoring in COVID-19 patients discharged from the emergency department identifies need for hospitalization. Acad. Emerg. Med. 27, 681–692 (2020).
Article PubMed Google Scholar
Ford, D. et al. Leveraging health system telehealth and informatics infrastructure to create a continuum of services for COVID-19 screening, testing, and treatment. J. Am. Med. Inform. Assoc. 27, 1871–1877 (2020).
Article PubMed PubMed Central Google Scholar
Morgan, A. U. et al. Remote monitoring of patients with covid-19: Design, implementation, and outcomes of the first 3,000 patients in COVID Watch. NEJM Catalyst Innovations in Care Delivery https://catalyst.nejm.org/doi/full/10.1056/CAT.20.0342 (2020).
Weinreich, D. M. et al. REGN-COV2, a Neutralizing Antibody Cocktail, in Outpatients with Covid-19. N. Eng. J. Med. 384, 238–251 (2021).
Article CAS Google Scholar
Kricke, G. et al. Rapid implementation of an outpatient Covid-19 monitoring program. NEJM Catalyst Innovations in Care Delivery https://catalyst.nejm.org/doi/full/10.1056/CAT.20.0214 (2020).
Tobin, M. J., Laghi, F. & Jubran, A. Why COVID-19 silent hypoxemia is baffling to physicians. Am. J. Respir. Crit. Care Med. 202, 356–360 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sjoding, M. W., Dickson, R. P., Iwashyna, T. J., Gay, S. E. & Valley, T. S. Racial bias in pulse oximetry measurement. N. Eng. J. Med. 383, 2477–2478 (2020).
Article Google Scholar
Quer, G., Gouda, P., Galarnyk, M., Topol, EJ. & Steinhubl, SR. Inter- and intraindividual variability in daily resting heart rate and its associations with age, sex, sleep, BMI, and time of year: retrospective, longitudinal cohort study of 92,457 adults. PLoS ONE 15, e0227709 (2020).
Article CAS PubMed PubMed Central Google Scholar
Larimer, K. et al. Personalized analytics and wearable biosensor platform for early detection of COVID-19 decompensation (DeCODe: detection of COVID-19 decompensation): protocol for development of COVID-19 decompensation index. JMIR Res. Protoc. 10, e27271 (2021).
Article PubMed PubMed Central Google Scholar
Mishra, T. et al. Pre-symptomatic detection of COVID-19 from smartwatch data. Nat. Biomed. Eng. 4, 1208–1220 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cade, B. E., Dashti, H. S., Hassan, S. M., Redline, S. & Karlson, E. W. Sleep apnea and COVID-19 mortality and hospitalization. Am. J. Respir. Crit. Care Med. 202, 1462–1463 (2020).
Article CAS PubMed PubMed Central Google Scholar
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, 785-794 (ACM, New York, NY, USA, 2016).

Download references

Acknowledgements

This study was funded by the National Cancer Institute of the National Institutes of Health (75N91020C00040) in support of the HHS and NIH Digital Health Solutions for COVID-19 initiative.

Author information

Authors and Affiliations

physIQ Inc., 200 W Jackson Blvd Suite 550, Chicago, IL, USA
Dylan M. Richards, MacKenzie J. Tweardy, Steven R. Steinhubl, Karen A. Larimer & Stephan W. Wegerich
University of Illinois Health, Chicago, IL, USA
David W. Chestek & Terry L. Vanden Hoek

Authors

Dylan M. Richards
View author publications
You can also search for this author in PubMed Google Scholar
MacKenzie J. Tweardy
View author publications
You can also search for this author in PubMed Google Scholar
Steven R. Steinhubl
View author publications
You can also search for this author in PubMed Google Scholar
David W. Chestek
View author publications
You can also search for this author in PubMed Google Scholar
Terry L. Vanden Hoek
View author publications
You can also search for this author in PubMed Google Scholar
Karen A. Larimer
View author publications
You can also search for this author in PubMed Google Scholar
Stephan W. Wegerich
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.M.R. developed the CDI algorithm. D.M.R. and M.J.T. consolidated data and performed data analyses. S.W.W. and S.R.S. advised on the development of CDI, and the interpretation and presentation of analysis results. D.M.R., M.J.T., and S.R.S. drafted the manuscript. K.A.L., S.W.W., D.W.C., and T.L.V.H. conceived of and designed the study. All authors contributed to revisions and finalization of the submitted manuscript.

Corresponding author

Correspondence to Stephan W. Wegerich.

Ethics declarations

Competing interests

D.M.R., M.J.T., K.A.L., S.W.W., and S.R.S are employees of physIQ. The remaining authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Richards, D.M., Tweardy, M.J., Steinhubl, S.R. et al. Wearable sensor derived decompensation index for continuous remote monitoring of COVID-19 diagnosed patients. npj Digit. Med. 4, 155 (2021). https://doi.org/10.1038/s41746-021-00527-z

Download citation

Received: 10 May 2021
Accepted: 07 October 2021
Published: 08 November 2021
DOI: https://doi.org/10.1038/s41746-021-00527-z

This article is cited by

Ultra-low power, high-data rate, fully on-chip radio frequency on-off keying receiver for internet-of-things applications
- Aasish Boora
- Bharatha Kumar Thangarasu
- Kiat Seng Yeo
Communications Engineering (2024)
Discovery of associative patterns between workplace sound level and physiological wellbeing using wearable devices and empirical Bayes modeling
- Karthik Srinivasan
- Faiz Currim
- Sudha Ram
npj Digital Medicine (2023)

Subjects

Abstract

Similar content being viewed by others

Observational study on wearable biosensors and machine learning-based remote monitoring of COVID-19 patients

Early adverse physiological event detection using commercial wearables: challenges and opportunities

Use of artificial intelligence to develop predictive algorithms of cough and PCR-confirmed COVID-19 infections based on inputs from clinical-grade wearable sensors

Introduction

Results

Participant Information

Validation and modality comparisons

Comparison to standard of care and remote patient monitoring

Discussion

Methods

Data collection

Exclusion criteria

Ethical approval

COVID-19 decompensation event adjudication

Ground truth formatting and analysis methods

Feature extraction

Model training

Experiments

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Reporting summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Ultra-low power, high-data rate, fully on-chip radio frequency on-off keying receiver for internet-of-things applications

Discovery of associative patterns between workplace sound level and physiological wellbeing using wearable devices and empirical Bayes modeling

Search

Quick links