Skip to main content

The completeness, accuracy and impact on alerts, of wearable vital signs monitoring in hospitalised patients

Abstract

Background

Use of wearable vital signs sensors (WVSSs) to monitor hospitalised patients is growing but uncertainty exists about how such sensors should be adopted into existing practice. The aim of this observational study was to determine the completeness of data capture and accuracy of measurements recorded by a suite of WVSSs. The implications of using such measurements to derive early warning scores was also assessed.

Methods

Adult inpatients with Covid-19 wore four WVSSs recording heart rate/respiratory rate (HR/RR), oxygen saturation (SpO2), axillary temperature and blood pressure (BP). Wearable vitals were paired with traditional vitals (measured by nurses) recorded concurrently. The accuracy of the wearable vitals was assessed using traditional vitals as the reference. National early warning (NEWS2) scores were calculated using wearable and traditional vitals.

Results

Forty-eight patients were monitored for 204 days with the sensors. Median sensor wear was 3.9(IQR:1.7–5.9), 3.9(IQR:1.6–5.9) and 3.8(IQR:0.9–5.9) days for HR/RR, temperature and SpO2 respectively. The BP cuff was worn for median 1.9(IQR:0.9–3.8) days in 33 patients. Length of hospital stay was 8(IQR:6–13) days. Completeness of data capture was 84% for HR/RR, 98% for temperature, 72% for SpO2 and 36% for BP.

There were 1633 HR, 1614 RR, 1412 temperature, 1294 SpO2 and 59 BP wearable-traditional measurement pairs. 59.7% of HR pairs were within ± 5 bpm, 38.5% of RR pairs within ± 3breaths/min, 24.4% of temperature pairs within ± 0.3℃, 32.9% of SpO2 pairs within ± 2% and 39.0% of BP pairs within ± 10 mmHg. Agreement between wearable and traditional RRs was poor at high RRs.

In a ward setting, 613 NEWS2 scores were calculated using wearable-traditional HR, RR, temperature and SpO2 pairs. The median NEWS2traditional was 1(IQR:1–2) and the median NEWS2wearable was 4(IQR:3–6). Using traditional NEWS2 alerts as a reference, 86% (225/262) of wearable NEWS2 5 + alerts and 89% (82/92) of wearable NEWS2 7 + alerts were false positives.

Conclusions

Agreement between vital signs recorded by wearable sensors and concurrent traditional vitals is poor. In this context, data from wearable sensors should not be used in existing track and trigger systems.

Trial registration

The COSMIC-19 study was registered with clinicaltrials.gov (registration: NCT04581031, date of registration: Oct 6th 2020).

Peer Review reports

Background

Wearable vital signs sensors (WVSSs) are wireless, non-invasive devices worn by patients which permit near-continuous recording of heart rate (HR), respiratory rate (RR), oxygen saturation (SpO2) and blood pressure (BP). In low resource settings and healthcare economies struggling with staff shortages, such sensors could automate repetitive manual vital signs measurements thereby freeing time to care. The near-continuous recording theoretically allows no deterioration to go unnoticed and permits advanced analytics to predict and detect important patient centred outcomes and potentially to personalise care. Patients and their relatives may feel a sense of security in knowing that they are always monitored. With these benefits in mind, there has been growing interest [1] in the use of WVSSs to monitor patients in hospital but whether such sensors improve patient outcomes is still to be conclusively established. Recent, single-centre studies have suggested that monitoring with WVSSs can reduce unplanned intensive care unit (ICU) admissions [2] and adverse outcomes [3] but these findings were not replicated in earlier systematic reviews and meta-analyses [4, 5].

Uncertainty concerning the accuracy of the vital signs recorded by WVSSs may partly explain why it has been difficult to establish their impact on clinical outcomes. Numerous studies have assessed the accuracy of vitals recorded by WVSSs but many have focussed on short timescales or have compared WVSSs to ICU standards of monitoring, sometimes in optimised environments [6, 7]. A true picture of their performance requires assessment in hospital wards, ideally following patients through the course of a hospital admission and whilst subject to the challenges and limitations of typical hospital environments.

Most hospitals use some form of track and trigger system to detect and respond to the deteriorating patient [8]. Such systems advocate a protocolised approach to monitoring patients’ vital signs and to alerting when deranged vitals are discovered. In such systems an early warning score (EWS) is often calculated in which patients are assigned a numerical score for each vital sign, with larger scores indicating greater deviation from accepted norms. The EWS is created by summing the component score from each vital with the total used to drive clinical escalation decisions in protocolised track and trigger systems.

Clinical staff do not always follow track and trigger protocols [9] precisely and WVSSs which could automate EWS completion could be valuable. However, studies which assess the impact of incorporating vital signs data from WVSSs into such systems are uncommon [10]. Instead, many authors have suggested developing new approaches to the deteriorating patient which can accommodate data from WVSSs. These have included incorporating alerting from a WVSS to prompt manual calculation of an EWS [11], adjusting EWS systems to reflect wearable data [12, 13] or the development of novel deterioration indices utilising wearable data [14]. A benefit of the EWS system is its simplicity and widespread adoption. New approaches would add complexity. It is therefore prudent to assess what would happen if vital signs from WVSSs were simply used in existing EWS systems.

In this context, we conducted the COSMIC-19 (continuous signs monitoring in Covid-19) study which evaluated a suite of WVSSs in hospitalised patients with Covid-19. We assessed the sensors in a ward environment and endeavoured to continue the assessment even when patients deteriorated. Our primary aim was to determine the completeness and accuracy of the data recorded by the WVSSs in comparison to traditional vital signs. We considered traditional vital signs to be those recorded by healthcare professionals on the ward and in ICU using existing equipment and techniques. Our secondary aim was to assess how National Early Warning Score 2 (NEWS2) scores [15] would differ if calculated using vitals obtained by WVSSs instead of concurrent traditional vitals measurements.

Methods

The COSMIC-19 study was registered with clinicaltrials.gov (registration: NCT04581031, date of registration: Oct 6th 2020). It was approved by Yorkshire and the Humber – Bradford Leeds Research Ethics Committee, UK (reference 20/YH/0156). It was funded by the Innovate Manchester Advanced Therapy Centre Hub, Innovate UK, (project ID: 6239) with additional support from the Christie Hospital Charitable Fund.

Symptomatic, adult inpatients (16 years or older) with suspected or confirmed Covid-19, who were suitable for escalation to ICU and receiving supplemental oxygen when screened were eligible for enrolment. Patients were recruited within 72 h of hospital admission. We excluded patients unable to give informed consent, who were anticipated to die within 24 h or those with a contraindication to wearing the WVSSs. We recruited a convenience sample of 30–60 participants, informed by the number of WVSSs available for the research. No formal sample size calculations were performed. Some participants were admitted to ICU during the study and monitoring with WVSSs continued unless they were sedated or mechanically ventilated.

Patients were recruited at Manchester Royal Infirmary, an academic, tertiary, metropolitan hospital in the UK with over 150,000 emergency department presentations annually.

Wearable vital signs sensors

We investigated four WVSSs which measured heart rate/respiratory rate (HR/RR), oxygen saturation (SpO2), axillary temperature and systolic blood pressure (SBP). The four sensors were purchased from Isansys Ltd [16] and were used as part of the Isansys Patient Status Engine (PSE), a digital platform for vital signs capture and reporting which is a regulated medical device. Isansys Ltd had no input in the design or conduct of the research. The wearables are summarised in Table 1.

Table 1 A summary of the wearable health monitor devices deployed in this study

Participants wore the four WVSSs for up to 20 days or until they were discharged, whichever was earlier. They were free to discontinue any sensor at any time. The research team monitored the participants for WVSS disconnection via a remote dashboard which also allowed researchers to monitor WVSS battery levels. Each participant’s data was reviewed at least once every 24 h. In the event of a disconnection or a low battery, the researchers visited the participant to aid with battery/wearable replacement, wearable re-application or removal if participants chose to discontinue a sensor. The research team maintained a log of all visits and the actions taken during the visits.

Reference vital signs measurements

Participants received usual care during monitoring with the WVSSs, including traditional vital signs measurements recorded by nurses.

On the ward, according to our institution’s policy, nurses recorded vital signs by direct observation of RR, and measurement of SBP (via arm cuff), SpO2 (via finger probe) and HR (via SpO2 trace) using the GE Carescape V100 Dinamap vital signs monitor (General Electric Medical Systems Ltd, USA). HR measurements were verified by manual palpation of the radial pulse/cardiac auscultation if required. Temperature was recorded using a Coviden Gen 3 tympanic membrane thermometer (Medtronic Ltd, UK). These traditional vitals measurements were recorded at intervals varying from 1 to 12 hourly in the hospital electronic track and trigger system (Patientrack, Alcidion Ltd, Aus).

In ICU, participants were monitored using a SpaceLabs Xprezzion bedside monitor (SpaceLabs Healthcare Ltd, USA) which continuously recorded HR (via single lead electrocardiogram) and SpO2 (via finger probe), and intermittent SBP (via arm cuff). Invasive arterial blood pressure measurements were not included in the analysis. RR and temperature were recorded intermittently by the same methods used on the ward. ICU nurses verified and documented vital signs once every hour from these readings in the ICU electronic patient record (Intellispace Critical Care and Anaesthesia, Philips NV, NL).

Data capture

An electronic case report form (DataTrial Ltd, UK) was used to capture demographics, diagnostic information, sensor application/removal and outcomes. Traditional vital signs were downloaded from the hospital electronic track and trigger system and the ICU electronic patient record. This was in accordance with local data protection legislation and in accordance with the consent provided by participants. No traditional vital signs measurements were excluded from the analysis except for two HR measurements (0 bpm) and one temperature measurement (5.9℃) which we considered to be data entry errors. Wearable vital signs were captured via the Isansys PSE. Error codes were removed from the wearable data but no vital signs measurements were excluded.

Data analysis

The completeness of vital sign capture was reported as the percentage of time, in minutes, for which a vital sign was recorded by a WVSS as a proportion of the total time for which the sensor was worn. The number of participants who removed each sensor prematurely and their reasons for doing so was summarised. A Kaplan Meier analysis was conducted where the event of interest was a temporary gap in wearable sensor data of varying durations. Participants were censored if they permanently removed the WVSS for any reason.

For each WVSS, the accuracy of the vital signs was assessed using traditional vital signs as a reference standard. For each traditional measurement, a corresponding wearable vital signs measurement was determined by taking the median of all wearable measurements within the preceding five minutes (for HR, RR, temperature and SpO2) or 15 min (for SBP). These timescales and the approach were chosen to align with existing published work [10, 19,20,21,22,23,24]. A Bland Altman analysis [25] was performed to determine the mean absolute difference (bias) and 95% limits of agreement (LoA) for each vital. The repeated nature of measurements was accounted for using the methods described by Zou et al. [26]. In keeping with previous literature [6], we defined ± 5 bpm, ± 3breaths/min, ± 2%, ± 0.3℃ and ± 10 mmHg a priori as clinically acceptable agreement between HR, RR, SpO2, temperature and SBP respectively. The correlation between wearable and traditional measurements was also determined, accounting for the repeated nature of measurements [27].

To assess how monitoring patients with WVSSs would impact clinical alerts, a partial NEWS2 score was calculated using wearable sensor measurements and compared to the NEWS2 calculated from traditional vitals. NEWS2 is the established track and trigger system to assess illness severity and risk of deterioration for hospitalised patients in the UK. This portion of the analysis was limited to vital signs recorded on the ward as EWS are not typically calculated in ICU.

The calculated NEWS2 scores in this study were partial because we did not include level of consciousness and air/oxygen scores as in the full NEWS2 calculation. There are two SpO2 scales in NEWS2 with scale 2 reserved for patients with chronic hypercapnic respiratory failure [28]. This did not apply to any participant in this study and we used SpO2 scale 1 for all NEWS2 calculations. A modified Clarke Error Grid analysis [29] was also performed for each individual NEWS2 component. This quantifies how differences between corresponding wearable and traditional measurements would impact the NEWS2 component.

All analysis was conducted using R (version 3.6) [30]. Continuous variables were assessed for normality and are presented as mean (standard deviation) or median [interquartile range]. Categorical variables are presented as the population size and the percentage (of available data) for each class.

Results

Study participants

Figure 1 summarises screening and recruitment. 179 eligible patients were identified and 48 took part. Most (43/48) were recruited between July 2020 and March 2021, during the first and second UK waves of the coronavirus pandemic. Five were recruited between July 2021 and February 2022, during the delta and omicron waves in the UK [31]. In 47 participants, SARS-CoV-2 infection was confirmed on nasopharyngeal swab (lateral flow or polymerase chain reaction), one participant had symptoms consistent with Covid-19 and high clinical suspicion of infection but without a positive screening result.

Fig. 1
figure 1

Screening and recruitment to the study

Table 2 summarises the demographics and clinical characteristics of study participants, stratified according to whether they were admitted to ICU. 32/48 (66.7%) were male and 32/48 (66.7%) were from non-Caucasian ethnicities.

Table 2 Demographics and clinical characteristics of patients recruited into the study

Completeness of data capture from wearable sensors

The median time from admission to wearable sensor application was 27 (IQR:22–46) hours. The median length of hospital stay for each participant was 8 (IQR:6–13) days. The total number of patient-days of sensor wear was 202, 200, 204 and 82 days for the HR/RR, temperature, SpO2 and BP WVSSs respectively (Table 3). This represented a median duration of wear of approximately 4 days per participant for the HR/RR, temperature and SpO2 sensors. The duration of wear for the BP cuff was median 1.9 days per participant. This was due to connectivity difficulties and because many participants either declined the sensor or requested early removal. The duration of wearable application was similar when calculated from the data recorded by each wearable or from the sensor log maintained by the research team (Appendix 1).

Table 3 Duration of wear (first to last valid wearable vital signs measurement), completeness of data capture and reasons for wearable removal. The blood pressure cuff was not applied to 15 participants at their request

The Kaplan Meier analysis of gaps of varying duration in wearable sensor data is displayed in Fig. 2. The event of interest was the first occurrence, in each participant, of a temporary gap in sensor data of varying durations. The analysis was limited for SBP due to the intermittent nature of measurements (maximum frequency of wearable recordings: 1–2 hourly). Short gaps in data capture were common, but longer, clinically significant gaps in data (4 h or more) were less frequent. Many participants had no such gaps in the first 24 h of wear. Appendix 2 summarises the median survival time without gaps of varying durations and the percentage of patients without such gaps in 24 h. 87.9% of participants had no gaps in HR/RR longer than 4 h in the first 24 h of wear. For SpO2, temperature and SBP there were no such gaps in 81.5%, 97.5% and 65.5% respectively.

Fig. 2
figure 2

Kaplan Meier analysis of gaps in WVSS data. The event of interest was a temporary gap in wearable sensor data of varying durations. Participants were censored if they permanently removed the WVSS for any reason. HR/RR = heart rate/respiratory rate, SpO2 = oxygen saturation, SBP = systolic blood pressure, Temp = temperature

Accuracy of wearable sensor measurements

After creating pairs of wearable and traditional vital signs measurements (within the 5/15 min epochs, see “Methods”), there were 1633, 1614, 1412, 1294 and 59 pairs of HR, RR, temperature, SpO2 and SBP measurements respectively.

Figure 3 displays the Bland Altman plots for HR, RR, temperature and SpO2 stratified according to whether the participant was on a ward or in ICU. No SBP measurement pairs were recorded in ICU and the Bland Altman plot for SBP is available in appendix 3. Correlation coefficients and associated scatterplots for each vital sign are displayed in appendix 4. There was constant variation between wearable and traditional vital signs measurements across the measurement range for HR and SpO2. RR measurements showed a systematic difference at high mean RRs in ICU, with traditional RR measurements being higher than wearable measurements. Temperature measurements also showed a systematic difference at low mean temperatures with traditional measurements being higher than wearable measurements. There were insufficient SBP measurement pairs to comment on the variation in measurements.

Fig. 3
figure 3

Bland Altman plots corrected for repeated measures. HR = heart rate, RR = respiratory rate, SpO2 = oxygen saturation, Temp = temperature. The mean absolute difference (black line) and 95% limits of agreement (dashed lines) plus 95% confidence intervals (dotted lines) are for the entire population of measurement pairs (ward plus ICU measurements)

Table 4 displays the mean absolute difference (bias) and limits of agreement for each vital sign according to whether the participant was on a ward or in ICU. Not all participants wore all sensors and only participants with at least two measurement pairs are included in the calculations of bias and limits of agreement. Overall, 59.7% HR pairs were within ± 5 bpm, 38.5% of RR pairs were within ± 3breaths/min, 24.4% of temperature pairs were within ± 0.3℃, 32.9% of SpO2 pairs were within ± 2% and 39.0% of SBP pairs were within ± 10 mmHg. There was better agreement between WVSS measurements and reference standards for HR measurements on ICU whereas agreement was better for all other vital signs on the ward.

Table 4 Bland Altman metrics for each wearable vital sign measurement compared to traditional vital signs measurements recorded on the ward or in ICU as the reference standard

Impact of wearable vital signs measurements on alerts

Amongst the pairs of traditional and wearable vital signs measurements, there were only 31 instances in 13 participants when HR, RR, temperature, SpO2 and SBP were simultaneously available to calculate a partial NEWS2 (see “Methods”). Appendix 5 summarises the differences in partial NEWS2 scores calculated from these five vital signs and the impact on the rate of NEWS2 5 + or 7 + alerts by each method. This small number of NEWS2 scores reflects that many participants did not wear the BP cuff or chose to remove it early.

In contrast, there were 613 instances in 41 participants when HR, RR, temperature and SpO2 were simultaneously available to calculate a partial NEWS2. The median NEWS2 by traditional methods was 1 [IQR: 1–2] and by wearable methods was 4 [IQR: 3–6]. Table 5 summarises the number of times when a NEWS2 5 + or 7 + alert would have been generated by each method. At our institution, NEWS2 5 + would typically alert the ward based medical team, whilst 7 + would typically generate an ICU response.

Table 5 Confusion matrix for NEWS2 scores calculated from 4 vital signs (heart rate, respiratory rate, temperature and SpO2) using paired traditional and wearable vital signs recorded on the ward. A positive NEWS2 score is considered a score of 5 + or 7 + respectively

Appendix 6 displays the differences in NEWS2 component scores for each pair of vital signs measurements. The corresponding modified Clarke Error Grids for each vital sign are displayed in appendix 7. The greatest agreement in NEWS2 component scores was observed for HR (83.3%) and SBP (76.3%). NEWS2 agreement for RR, Temp and SpO2 was poor at 52.7%, 43.3% and 28.6% respectively.

Discussion

In this observational study of hospitalised patients with Covid-19 we monitored 48 individuals using WVSSs during their admission. Our primary aim was to determine the completeness and accuracy of vitals recorded by the WVSSs in comparison to traditional vital signs recorded using standard techniques and equipment on the ward and in ICU. Our secondary aim was to assess how NEWS2 scores on the ward would differ if calculated using vitals obtained by WVSSs instead of concurrent traditional vitals measurements.

We found that wearable patch/wrist-based sensors can achieve comprehensive data capture in an inpatient setting with modest (once daily) intervention to maintain the devices. In contrast, an automated, arm-worn, blood pressure cuff was poorly tolerated and unacceptable to most participants. The accuracy of measurements from the WVSSs varied when compared with ward and ICU reference standards but was generally poor with only HR measurements achieving clinically acceptable agreement for more than 50% of the time. The implication of this poor agreement was illustrated by the NEWS2 scores calculated using wearable vitals which were typically higher than those calculated using traditional vitals. When compared with traditional vital signs, NEWS2 alerts (5 + and 7 +) derived from WVSSs were false positives in most cases.

Studies in which WVSSs have been assessed in ward-based settings and for prolonged periods (> 48 h) have found completeness of data capture by wearable sensors ranging from 76–96% for patch-based, chest HR/RR sensors [32, 33] and from 50–68% for wrist-worn pulse oximeters [34, 35]. Our results are similar, and variation may be attributed to different patient populations and different levels of experience amongst patients, researchers and clinical teams in using and maintaining the sensors. Isolation measures due to Covid-19 in our study, may have also limited opportunities for sensor maintenance. Even the lowest rate of data capture in our study (SpO2, 68.6%) equates to continuous vitals for 16 out of every 24 h, far exceeding what could be captured by traditional methods. 50–60% of patients wore the patch and wrist-based sensors until hospital discharge or until they were sedated/ventilated on ICU, suggesting that sensor wear was acceptable to many if not all patients.

Our survival analysis found that there would be no gaps in the data of over 4 h duration for 87.9%, 81.5% and 97.5% of patients using the HR/RR, SpO2 and temperature sensors respectively in the first 24 h of wear. As four hours is often the interval between nurse measured vital signs [36] this suggests that the time nursing staff would need to spend troubleshooting/reapplying such devices would be acceptable. In high-risk surgical populations in the Netherlands, Breteler and colleagues [23, 24] found even greater durability of data capture with wearable sensors. Our results extend this finding to a UK setting with medical inpatients who were subject to isolation restrictions.

In contrast to the patch/wrist-based sensors, capture of BP measurements by an automated, arm-worn cuff in our study was poor. Studies of similar BP sensors have found data capture rates of 44–63% [34, 37] and our rates were lower than this due to technical difficulties and patient discomfort. Over 30% of participants refused to wear the cuff from the outset. This limits the conclusions which we can draw about the A&D TM2441 device as a useful wearable monitor but highlights the importance of considering if such devices are acceptable to patients as an automated, wearable monitor. Inflation of a BP cuff can be uncomfortable, and it may be more tolerable when recorded with a nurse present.

We observed wide limits of agreement between traditional and wearable vitals. Other validation studies using the same WVSSs in post-operative surgical patients [32], in those presenting with acute exacerbation of chronic obstructive pulmonary disease [38], and in labouring women [39] have found narrower limits of agreement for all vital signs but have typically monitored patients for a shorter duration of time, often with direct observation by a researcher [21, 40]. In keeping with previous work, we found that a patch-based, chest sensor underestimated high RRs in critically ill patients [6], that an axillary skin temperature sensor underestimated body temperature when compared to a tympanic reference [21] and that unsupervised, wearable SpO2 measurements were typically lower than supervised measurements made using similar sensing technology [32, 38].

The reasons for imperfect agreement between wearable vitals and nurse recorded vitals are myriad. It is recognised that vitals recorded by healthcare professionals are impacted by poor measurement technique, value bias in recording and a Hawthorne effect during measurement [41]. Arguably therefore, WVSSs may offer a truer reflection of a patient’s ongoing, non-observed physiological state. However, systematic errors in wearable vital signs measurements may also play an important role. In our study, the temperature differences we observed may be because estimates of core temperature recorded by an axillary skin sensor are different to estimates recorded by tympanic thermometers. Similarly, RRs determined by a wearable sensor based on an algorithm utilising R-R variation may have been subject to systematic error in Covid-19 patients in whom a relative bradyarrhythmia has been observed [42] (potential for error due to Nyqist sampling limit). Finally, movement artefact during SpO2 measurement is arguably more common during continuous monitoring than during brief, supervised assessment of SpO2 by a nurse. Previous work has demonstrated that movement artefact leads to underestimates of SpO2 [43, 44]. Acknowledging these inherent differences, some authors have suggested that WVSSs should not be compared to nurse recorded “spot” vitals [24]. We disagree, because the comparison serves to illustrate the impact that using WVSSs could have on existing patient deterioration alerting mechanisms in hospitals.

In our study, NEWS2 scores derived from HR, RR, temperature and SpO2 measurements were higher when calculated from WVSSs than traditional vitals. Differences in RR, SpO2 and temperature NEWS2 component scores were responsible for most of the discrepancy. Similarly, Weenk and colleagues found that two WVSS systems studied on surgical and medical wards [10] both returned higher modified early warning scores than traditional nurse recorded vitals. Our findings suggest that vital signs obtained from WVSSs should not directly replace traditional vital signs in track and trigger systems utilising NEWS2 as this would generate unacceptable levels of false alerts for clinical teams. More work is also needed to confirm the validity of wearable vital signs measurements in settings of illness (very high RR), where a patient’s physiology may be quite different from healthy volunteers.

We propose that a different approach is needed to adopt WVSSs into care pathways for the deteriorating patient. This could include scheduled reviews of wearable sensor trends by healthcare providers [45] as opposed to automated alerts, re-development of EWS thresholds for specific use with wearable sensor data [12, 13] or development of novel, broadly applicable deterioration indices using the granular data which wearable sensors provide or some of the additional metrics which they record. Examples include the presence or absence of micro-events [46] within the continuous data, trend information [47] and heart rate variability [48].

We acknowledge several limitations in our work. Firstly, we studied a population with Covid-19, with a preponderance of Black and Asian participants and a high rate of admission to ICU [49]. This may limit the generalisability of our findings to wider populations without respiratory illness, to those who are less unwell and to those with differing skin pigmentation which may alter pulse oximetry readings [50]. Secondly, our clinical teams were blinded to wearable sensor data and not involved in sensor maintenance which may mean that future deployment of WVSSs in broader inpatient groups could achieve better data capture. Thirdly, the variation in number of vital sign pairs analysed for each participant may have introduced bias in our validation results, although we sought to account for this by adjusting for repeated measures throughout our analysis. Finally, the precise time that traditional vital signs were measured was unknown, in that there may be a delay between measurements and recording in the electronic patient record by our nursing staff. Whilst this could impact our validation results, the guidance at our institution is that recording of vital signs should be performed promptly, making it reasonable to assume that measurements were made in the preceding five minutes.

Conclusion

In hospitalised patients with Covid-19, a suite of patch-based/wrist worn wearable vital signs sensors achieved comprehensive data capture over prolonged periods of inpatient monitoring. However, the validity of the recorded, wearable vital signs was poor when compared to vitals recorded by nurses on the ward and in ICU using standard techniques.

NEWS2 scores calculated using the wearable sensor data would have been higher and would have generated frequent false alerts in this patient population. Novel approaches are therefore needed to utilise wearable sensor data in track and trigger systems that seek to identify and respond to the deteriorating patient.

Data availability

The COSMIC-19 dataset is available on request subject to appropriate information governance and data sharing agreements. Interested parties should contact the corresponding author via the email address provided.

References

  1. Michard F, Sessler DI. Ward monitoring 3.0. Br J Anaesth. 2018;121:999–1001 Available from: http://www.ncbi.nlm.nih.gov/pubmed/30336872. Elsevier. [cited 2018 Nov 4].

    Article  CAS  PubMed  Google Scholar 

  2. Eddahchouri Y, Peelen RV, Koeneman M, Touw HRW, van Goor H, Bredie SJH. Effect of continuous wireless vital sign monitoring on unplanned ICU admissions and rapid response team calls: a before-and-after study. Br J Anaesth. 2022;128:857–63. Available from: https://pubmed.ncbi.nlm.nih.gov/35282866/. [cited 2024 Nov 17].

    Article  PubMed  Google Scholar 

  3. Rowland BA, Motamedi V, Michard F, Saha AK, Khanna AK. Impact of continuous and wireless monitoring of vital signs on clinical outcomes: a propensity-matched observational study of surgical ward patients. Br J Anaesth. 2024;132:519–27. Available from: http://www.bjanaesthesia.org.uk/article/S0007091223006669/fulltext. Elsevier Ltd. [cited 2024 Nov 17].

    Article  PubMed  Google Scholar 

  4. Areia C, Biggs C, Santos M, et al. The impact of wearable continuous vital sign monitoring on deterioration detection and clinical outcomes in hospitalised patients: a systematic review and meta-analysis. Crit Care. 2021;25:1–17. Available from: https://biomedcentral-ccforum.publicaciones.saludcastillayleon.es/articles/10.1186/s13054-021-03766-4. BioMed Central. [cited 2021 Oct 24].

    Article  Google Scholar 

  5. Cardona-Morrell M, Prgomet M, Turner RM, Nicholson M, Hillman K. Effectiveness of continuous or intermittent vital signs monitoring in preventing adverse events on general wards: a systematic review and meta-analysis. Int J Clin Pract. 2016;70:806–24. Available from: https://pubmed.ncbi.nlm.nih.gov/27582503/. [cited 2024 Nov 17].

    Article  CAS  PubMed  Google Scholar 

  6. Chan PY, Ryan NP, Chen D, McNeil J, Hopper I. Novel wearable and contactless heart rate, respiratory rate, and oxygen saturation monitoring devices: a systematic review and meta-analysis. Anaesthesia. 2022;77:1268–80. Available from: https://pubmed.ncbi.nlm.nih.gov/35947876/. [cited 2024 Apr 23].

    Article  CAS  PubMed  Google Scholar 

  7. Patel V, Orchanian-Cheff A, Wu R. Evaluating the validity and utility of wearable technology for continuously monitoring patients in a hospital setting: systematic review. JMIR Mhealth Uhealth. 2021;9(8):e17411. Available from: https://pubmed.ncbi.nlm.nih.gov/34406121/. [cited 2024 Apr 23].

  8. Gao H, McDonnell A, Harrison DA, et al. Systematic review and evaluation of physiological track and trigger warning systems for identifying at-risk patients on the ward. Intensive Care Med. 2007;33:667–79. Available from: https://link.springer.com/article/10.1007/s00134-007-0532-3. Springer. [cited 2022 Feb 21].

    Article  PubMed  Google Scholar 

  9. Eddahchouri Y, Koeneman M, Plokker M, et al. Low compliance to a vital sign safety protocol on general hospital wards: a retrospective cohort study. Int J Nurs Stud. 2021;115:103849. Available from: https://pubmed.ncbi.nlm.nih.gov/33517078/. [cited 2024 Jul 3].

  10. Weenk M, Koeneman M, van de Belt TH, Engelen LJLPG, van Goor H, Bredie SJH. Wireless and continuous monitoring of vital signs in patients at the general ward. Resuscitation. 2019;136:47–53. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.resuscitation.2019.01.017. European Resuscitation Council, American Heart Association, Inc., and International Liaison Committee on Resuscitation.~Published by Elsevier Ireland Ltd. [cited 2021 Nov 17].

    Article  PubMed  Google Scholar 

  11. Stellpflug C, Pierson L, Roloff D, et al. Continuous physiological monitoring improves patient outcomes. Am J Nurs. 2021;121:40–6. Available from: https://pubmed.ncbi.nlm.nih.gov/33755624/. [cited 2021 Nov 28].

    Article  PubMed  Google Scholar 

  12. van Ede ES, Scheerhoorn J, Bonomi AG, Buise MP, Bouwman RA, Nienhuijs SW. Continuous remote monitoring in post-bariatric surgery patients: development of an early warning protocol. Surg Obes Relat Dis. 2022;18:1298–303. Available from: https://pubmed.ncbi.nlm.nih.gov/35850957/. [cited 2024 Jul 3].

    Article  PubMed  Google Scholar 

  13. van der Stam JA, Mestrom EHJ, Nienhuijs SW, et al. A wearable patch based remote early warning score (REWS) in major abdominal cancer surgery patients. Eur J Surg Oncol. 2023;49:278–84. W.B. Saunders.

    Article  PubMed  Google Scholar 

  14. Watkinson PJ, Pimentel MA, Clifton L, et al. Early detection of physiological deterioration in post-surgical patients using wearable technology combined with an integrated monitoring system: a pre- and post-interventional study. medRxiv. 2020;2020.12.01.20240770. Available from: https://www.medrxiv.org/content/10.1101/2020.12.01.20240770v1. Cold Spring Harbor Laboratory Press. [cited 2021 Nov 17].

  15. National Early Warning Score (NEWS) 2 | RCP London. Available from: https://www.rcplondon.ac.uk/projects/outputs/national-early-warning-score-news-2. [cited 2021 Jun 6].

  16. Welcome to Isansys. Available from: https://www.isansys.com/. [cited 2023 Jun 24].

  17. WristOx2® Model 3150 with USB - Nonin. Available from: https://www.nonin.com/products/3150-usb/. [cited 2024 Apr 23].

  18. TM-2441 | Professional | Medical | Products | A&D. Available from: https://www.aandd.jp/products/medical/professional/2441.html. [cited 2024 Apr 23].

  19. Van Der Stam JA, Mestrom EHJ, Scheerhoorn J, et al. Accuracy of vital parameters measured by a wearable patch following major abdominal cancer surgery. Eur J Surg Oncol. 2022;48:917–23. Available from: http://www.ejso.com/article/S0748798321007848/fulltext. Elsevier. [cited 2022 Apr 14].

    Article  PubMed  Google Scholar 

  20. Cheng SM, Chan JJI, Tan CW, Lu E, Sultana R, Sng BL. Use of wireless respiratory rate sensor monitoring during opioid patient-controlled analgesia after gynaecological surgery: a prospective cohort study. Indian J Anaesth. 2021;65:146–52. Available from: https://journals.lww.com/ijaweb/Fulltext/2021/02000/Use_of_wireless_respiratory_rate_sensor_monitoring.10.aspx. Wolters Kluwer Medknow Publications. [cited 2022 Apr 18].

    Article  PubMed  PubMed Central  Google Scholar 

  21. Downey C, Ng S, Jayne D, Wong D. Reliability of a wearable wireless patch for continuous remote monitoring of vital signs in patients recovering from major surgery: a clinical validation study from the TRaCINg trial. BMJ Open. 2019;9:e031150. Available from: https://bmjopen.bmj.com/content/9/8/e031150. British Medical Journal Publishing Group. [cited 2021 Nov 17].

    Article  PubMed  PubMed Central  Google Scholar 

  22. Kant N, Peters GM, Voorthuis BJ, et al. Continuous vital sign monitoring using a wearable patch sensor in obese patients: a validation study in a clinical setting. J Clin Monit Comput. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10877-021-00785-y. [cited 2022 Apr 14].

  23. Breteler MJM, Huizinga E, Van Loon K, et al. Reliability of wireless monitoring using a wearable patch sensor in high-risk surgical patients at a step-down unit in the Netherlands: a clinical validation study. BMJ Open. 2018;8:e020162. Available from: https://pubmed.ncbi.nlm.nih.gov/29487076/. [cited 2022 Feb 21].

    Article  PubMed  Google Scholar 

  24. Breteler MJMM, KleinJan EJ, Dohmen DAJJ, et al. Vital signs monitoring with wearable sensors in high-risk surgical patients a clinical validation study. Anesthesiology. 2020;132:424–39. Available from: http://journals.lww.com/10.1097/ALN.0000000000003029. American Society of Anesthesiologists. [cited 2020 Jan 21].

    Article  PubMed  Google Scholar 

  25. Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8:135–60. https://journals.sagepub.com/doi/abs/10.1177/096228029900800204. Sage PublicationsSage CA: Thousand Oaks, CA. [cited 2022 Jan 3].

    Article  CAS  PubMed  Google Scholar 

  26. Zou GY. Confidence interval estimation for the Bland-Altman limits of agreement with multiple observations per individual. Stat Methods Med Res. 2013;22:630–42. Available from: https://pubmed.ncbi.nlm.nih.gov/21705434/. [cited 2022 Jun 7].

    Article  CAS  PubMed  Google Scholar 

  27. Bakdash JZ, Marusich LR. Repeated measures correlation. Front Psychol. 2017;8:252904. Frontiers Media S.A.

    Article  Google Scholar 

  28. NEWS2: Additional implementation guidance | RCP London. Available from: https://www.rcp.ac.uk/improving-care/resources/news2-additional-implementation-guidance/. [cited 2024 Nov 17].

  29. Clarke WL, Cox D, Gonder-Frederick LA, Carter W, Pohl SL. Evaluating clinical accuracy of systems for self-monitoring of blood glucose. Diabetes Care. 1987;10:622–8. Available from: https://diabetesjournals.publicaciones.saludcastillayleon.es/care/article-pdf/10/5/622/515265/10-5-622.pdf. American Diabetes Association. [cited 2021 Dec 28].

    Article  CAS  PubMed  Google Scholar 

  30. Team RC. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2019. Available from: http://www.r-project.org/.

  31. Cases in the UK | Coronavirus in the UK. Available from: https://coronavirus.data.gov.uk/details/cases. [cited 2023 Jun 24].

  32. Haahr-Raunkjaer C, Skovbye M, Rasmussen SM, et al. Agreement between standard and continuous wireless vital sign measurements after major abdominal surgery: a clinical comparison study. Physiol Meas. 2022;43:11. Available from: https://pubmed.ncbi.nlm.nih.gov/36322987/. Physiol Meas. [cited 2023 Apr 11].

  33. Elvekjaer M, Aasvang EK, Olsen RM, et al. Physiological abnormalities in patients admitted with acute exacerbation of COPD: an observational study with continuous monitoring. J Clin Monit Comput. 2020;34:1051–60. Available from: https://link.springer.com/article/10.1007/s10877-019-00415-8. Springer. [cited 2021 Nov 18].

    Article  PubMed  Google Scholar 

  34. Mølgaard J, Rasmussen SS, Eiberg J, Sørensen HBD, Meyhoff CS, Aasvang EK. Continuous wireless pre- and postoperative vital sign monitoring reveal new, severe desaturations after vascular surgery. Acta Anaesthesiol Scand. 2023;67:19–28. Available from: https://onlinelibrary.wiley.com/doi/full/10.1111/aas.14158. John Wiley & Sons, Ltd. [cited 2023 Apr 12].

    Article  PubMed  Google Scholar 

  35. Rasmussen PS, Aasvang EK, Olsen RM, et al. Continuous peripheral perfusion index in patients admitted to hospital wards – an observational study. Acta Anaesthesiol Scand. 2021;65:257–65. Available from: https://onlinelibrary.wiley.com/doi/full/10.1111/aas.13711. John Wiley & Sons, Ltd. [cited 2024 Apr 23].

    Article  PubMed  Google Scholar 

  36. Dall’Ora C, Griffiths P, Hope J, et al. How long do nursing staff take to measure and record patients’ vital signs observations in hospital? A time-and-motion study. Int J Nurs Stud. 2021;118:103921. Available from: https://www.sciencedirect.com/science/article/pii/S0020748921000614. [cited 2025 Feb 23].

  37. Haahr-Raunkjaer C, Mølgaard J, Elvekjaer M, et al. Continuous monitoring of vital sign abnormalities; association to clinical complications in 500 postoperative patients. Acta Anaesthesiol Scand. 2022;66(5):552–62. Available from: https://onlinelibrary.wiley.com/doi/10.1111/aas.14048. John Wiley and Sons Inc. [cited 2025 Feb 23].

  38. Elvekjaer M, Carlsson CJ, Rasmussen SM, et al. Agreement between wireless and standard measurements of vital signs in acute exacerbation of chronic obstructive pulmonary disease: a clinical validation study. Physiol Meas. 2021;42:055006. Available from: https://iopscience.iop.org/article/10.1088/1361-6579/ac010c. IOP Publishing Ltd. [cited 2021 Nov 27].

    Article  Google Scholar 

  39. Koppel R, Debnath S, Zanos TP, et al. Efficacy of continuous monitoring of maternal temperature during labor using wireless axillary sensors. J Clin Monit Comput. 2022;36:103–7. Available from: https://link.springer.com/article/10.1007/s10877-020-00625-5. Springer Science and Business Media B.V. [cited 2022 Apr 18].

    Article  PubMed  Google Scholar 

  40. ZsuzsannaBalla H, Theodorsson E, Ström JO. Evaluation of commercial, wireless dermal thermometers for surrogate measurements of core temperature. Scand J Clin Lab Invest. 2019;79:1–6. Available from: https://pubmed.ncbi.nlm.nih.gov/30882250/. [cited 2023 Mar 28].

    Article  Google Scholar 

  41. Kallioinen N, Hill A, Christofidis MJ, Horswill MS, Watson MO. Quantitative systematic review: sources of inaccuracy in manually measured adult respiratory rate data. J Adv Nurs. 2021;77:98. Available from: /pmc/articles/PMC7756810/. Wiley-Blackwell. [cited 2024 May 14].

    Article  PubMed  Google Scholar 

  42. Jung L-Y, Kim J-M, Ryu S, Lee C-S. Relative bradycardia in patients with COVID-19. Int J Arrhythmia. 2022;23:1–5. Available from: https://biomedcentral-arrhythmia.publicaciones.saludcastillayleon.es/articles/10.1186/s42444-022-00073-z. BioMed Central. [cited 2024 Apr 23].

    Article  Google Scholar 

  43. Barker SJ, Shah NK. The effects of motion on the performance of pulse oximeters in volunteers (Revised publication). Anesthesiology. 1997;86:101–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1097/00000542-199701000-00014. American Society of Anesthesiologists. [cited 2024 Dec 2].

    Article  CAS  PubMed  Google Scholar 

  44. Clarke GWJ, Chan ADC, Adler A. Effects of motion artifact on the blood oxygen saturation estimate in pulse oximetry.In: IEEE MeMeA 2014 - IEEE International Symposium on Medical Measurements and Applications, Proceedings. IEEE Computer Society; 2014:1–4. Available at: https://ieeexplore.ieee.org/document/6860071. [cited 2025 Feb 23].

  45. Leenen JPL, Rasing HJM, Van Dijk JD, Kalkman CJ, Schoonhoven L, Patijn GA. Feasibility of wireless continuous monitoring of vital signs without using alarms on a general surgical ward: a mixed methods study. PLoS One. 2022;17(3):e0265435. Public Library of Science.

  46. Duus CL, Aasvang EK, Olsen RM, et al. Continuous vital sign monitoring after major abdominal surgery—quantification of micro events. Acta Anaesthesiol Scand. 2018;62:1200–8. Available from: https://onlinelibrary.wiley.com/doi/full/10.1111/aas.13173. [cited 2021 Nov 20].

    Article  CAS  PubMed  Google Scholar 

  47. van Rossum MC, Vlaskamp LB, Posthuma LM, et al. Adaptive threshold-based alarm strategies for continuous vital signs monitoring. J Clin Monit Comput. 2022;36:407-417. Available from: https://link.springer.com/article/10.1007/s10877-021-00666-4. Springer Science and Business Media B.V. [cited 2022 Apr 14].

  48. Jansen C, Chatterjee DA, Thomsen KL, et al. Significant reduction in heart rate variability is a feature of acute decompensation of cirrhosis and predicts 90-day mortality. Aliment Pharmacol Ther. 2019;50:568–79. Available from: https://onlinelibrary.wiley.com/doi/full/10.1111/apt.15365. Blackwell Publishing Ltd. [cited 2021 Nov 28].

    Article  CAS  PubMed  Google Scholar 

  49. Docherty AB, Harrison EM, Green CA, et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: prospective observational cohort study. BMJ. 2020;369:m1985 Available from: https://www.bmj.com/content/369/bmj.m1985. British Medical Journal Publishing Group. [cited 2023 Aug 14].

  50. Cabanas AM, Fuentes-Guajardo M, Latorre K, León D, Martín-Escudero P. Skin pigmentation influence on pulse oximetry accuracy: a systematic review and bibliometric analysis. Sensors (Basel). 2022;22:3402. Available from: https://pmc.ncbi.nlm.nih.gov/articles/PMC9102088/. MDPI. [cited 2024 Dec 1].

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The authors acknowledge Manchester University NHS Foundation Trust’s Clinical Data Science Unit who supported extraction of traditional vital signs measurements from the electronic patient record.

Funding

Funded by the Innovate Manchester Advanced Therapy Centre Hub, Innovate UK, (project ID: 6239) with additional support from the Christie Hospital Charitable Fund.

Author information

Authors and Affiliations

Authors

Contributions

AJW, GBK, AM and FCT conceptualised the study. AJW, GBK and FCT administered the project. AJW and AJP conducted data curation and analysis. AJW wrote the main manuscript text and prepared all tables and figures. GPM, NP and FCT offered advice and supervision. All authors reviewed the manuscript.

Corresponding author

Correspondence to Anthony J. Wilson.

Ethics declarations

Ethics approval and consent to participate

The COSMIC-19 study was registered with clinicaltrials.gov (registration: NCT04581031, registered 4th May 2020). It was approved by Yorkshire and the Humber – Bradford Leeds Research Ethics Committee, UK (reference 20/YH/0156). All participants gave written consent to participate.

Consent for publication

Not applicable.

Competing interests

Prof Thistlethwaite declares that she is the director of the Innovate Manchester Advanced Therapy Centre Hub, Innovate UK, which provided funding for this work. All other authors have no competing interests to declare.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wilson, A.J., Parker, A.J., Kitchen, G.B. et al. The completeness, accuracy and impact on alerts, of wearable vital signs monitoring in hospitalised patients. BMC Digit Health 3, 13 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s44247-025-00151-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s44247-025-00151-x

Keywords