Knowledge Discovery in Ubiquitous and Personal Sleep Tracking: Scoping Review

doi:10.2196/42750

Review

Graduate School of Engineering, Kyoto University of Advanced Science, Kyoto, Japan

*all authors contributed equally

Corresponding Author:

Nhung Huyen Hoang, BA

Graduate School of Engineering

Kyoto University of Advanced Science

Ukyo Ward, Yamanouchi Gotandacho, 18

Kyoto, 615-0096

Japan

Phone: 81 0754067000

Email: 2021mm12@kuas.ac.jp

Background: Over the past few decades, there has been a rapid increase in the number of wearable sleep trackers and mobile apps in the consumer market. Consumer sleep tracking technologies allow users to track sleep quality in naturalistic environments. In addition to tracking sleep per se, some sleep tracking technologies also support users in collecting information on their daily habits and sleep environments and reflecting on how those factors may contribute to sleep quality. However, the relationship between sleep and contextual factors may be too complex to be identified through visual inspection and reflection. Advanced analytical methods are needed to discover new insights into the rapidly growing volume of personal sleep tracking data.

Objective: This review aimed to summarize and analyze the existing literature that applies formal analytical methods to discover insights in the context of personal informatics. Guided by the problem-constraints-system framework for literature review in computer science, we framed 4 main questions regarding general research trends, sleep quality metrics, contextual factors considered, knowledge discovery methods, significant findings, challenges, and opportunities of the interested topic.

Methods: Web of Science, Scopus, ACM Digital Library, IEEE Xplore, ScienceDirect, Springer, Fitbit Research Library, and Fitabase were searched to identify publications that met the inclusion criteria. After full-text screening, 14 publications were included.

Results: The research on knowledge discovery in sleep tracking is limited. More than half of the studies (8/14, 57%) were conducted in the United States, followed by Japan (3/14, 21%). Only a few of the publications (5/14, 36%) were journal articles, whereas the remaining were conference proceeding papers. The most used sleep metrics were subjective sleep quality (4/14, 29%), sleep efficiency (4/14, 29%), sleep onset latency (4/14, 29%), and time at lights off (3/14, 21%). Ratio parameters such as deep sleep ratio and rapid eye movement ratio were not used in any of the reviewed studies. A dominant number of the studies applied simple correlation analysis (3/14, 21%), regression analysis (3/14, 21%), and statistical tests or inferences (3/14, 21%) to discover the links between sleep and other aspects of life. Only a few studies used machine learning and data mining for sleep quality prediction (1/14, 7%) or anomaly detection (2/14, 14%). Exercise, digital device use, caffeine and alcohol consumption, places visited before sleep, and sleep environments were important contextual factors substantially correlated to various dimensions of sleep quality.

Conclusions: This scoping review shows that knowledge discovery methods have great potential for extracting hidden insights from a flux of self-tracking data and are considered more effective than simple visual inspection. Future research should address the challenges related to collecting high-quality data, extracting hidden knowledge from data while accommodating within-individual and between-individual variations, and translating the discovered knowledge into actionable insights.

JMIR Mhealth Uhealth 2023;11:e42750

doi:10.2196/42750

Keywords

sleep tracking (4); knowledge discovery (5); data mining (124); personal informatics (8); self-experimentation (3); sleep health (8); scoping review (332); mobile phone (3539)

In tandem with the advent of consumer wearable technologies, there has been a growing interest in using consumer sleep tracking technologies for personal health management. Being aware of the importance of having a good night’s sleep, many individual users are routinely monitoring their sleep [Grandner MA, Lujan MR, Ghani SB. Sleep-tracking technology in scientific research: looking to the future. Sleep. May 14, 2021;44(5):zsab071. [FREE Full text] [CrossRef] [Medline]1-Liu W, Ploderer B, Hoang T. In bed with technology: challenges and opportunities for sleep tracking. Presented at: OzCHI '15: The Annual Meeting of the Australian Special Interest Group for Computer Human Interaction; Dec 7-10, 2015, 2015; Parkville, VIC, Australia. [CrossRef]3], and sleep tracking has been a popular topic, especially in the quantified-self community. Consumer sleep tracking technologies are largely divided into 2 types: smartphone dependent and smartphone independent. Smartphone-dependent sleep tracking technologies leverage the integrated sensors of a smartphone (eg, accelerometer, gyroscope, and microphone) to measure body movements and ambient sound, based on which a user’s sleep states can be estimated. Smartphone-independent sleep tracking technologies use independent hardware with multiple sensing modalities, such as accelerometers and photoplethysmography. These devices often come in the form of a wristband (eg, Fitbit [Fitbit Inc] and Apple Watch [Apple]), headband (eg, SleepSheperd [Sleep Shepherd] and Neuroon [Vandrico Inc]), or finger ring (eg, Oura [Oura Health Oy]), and they rely on proprietary sleep staging algorithms to calculate sleep metrics based on measurable physiological signals [Kolla BP, Mansukhani S, Mansukhani MP. Consumer sleep tracking devices: a review of mechanisms, validity and utility. Expert Rev Med Devices. May 2016;13(5):497-506. [CrossRef] [Medline]4]. The accuracy of these consumer technologies has been significantly improved over the years. Recent models have proven to be reasonably accurate, especially in measuring the time of sleep onset and offset, total sleep duration, and sleep efficiency (SE) [Menghini L, Cellini N, Goldstone A, Baker FC, de Zambotti M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. Feb 12, 2021;44(2):zsaa170. [FREE Full text] [CrossRef] [Medline]5-Liang Z, Chapa Martell MA. Validity of consumer activity wristbands and wearable EEG for measuring overall sleep parameters and sleep structure in free-living conditions. J Healthc Inform Res. Jun 20, 2018;2(1-2):152-178. [FREE Full text] [CrossRef] [Medline]7]. A recent study comparing 7 consumer sleep tracking devices with polysomnography (the gold standard of sleep measurement) demonstrated that their validity could outperform medical-grade actigraphy [Chinoy ED, Cuellar JA, Huwa KE, Jameson JT, Watson CH, Bessman SC, et al. Performance of seven consumer sleep-tracking devices compared with polysomnography. Sleep. May 14, 2021;44(5):zsaa291. [FREE Full text] [CrossRef] [Medline]6]. In the past decade, sleep tracking has become one of the most studied topics in the research field of personal informatics. At the intersection of ubiquitous computing, human-computer interaction, and sleep science, researchers from multiple disciplines have made joint efforts to investigate the validity of existing consumer sleep tracking devices [Menghini L, Cellini N, Goldstone A, Baker FC, de Zambotti M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. Feb 12, 2021;44(2):zsaa170. [FREE Full text] [CrossRef] [Medline]5-Liang Z, Chapa-Martell MA. Combining numerical and visual approaches in validating sleep data quality of consumer wearable wristbands. Presented at: IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops); Mar 11-15, 2019, 2019; Kyoto, Japan. [CrossRef]10], develop accurate sleep staging algorithms tailored to consumer sleep tracking devices [Roberts DM, Schade MM, Mathew GM, Gartenberg D, Buxton OM. Detecting sleep using heart rate and motion data from multisensor consumer-grade wearables, relative to wrist actigraphy and polysomnography. Sleep. Jul 13, 2020;43(7):zsaa045. [FREE Full text] [CrossRef] [Medline]11-Sirithummarak P, Liang Z. Investigating the effect of feature distribution shift on the performance of sleep stage classification with consumer sleep trackers. Presented at: IEEE 10th Global Conference on Consumer Electronics (GCCE); Oct 12-15, 2021, 2021; Kyoto, Japan. [CrossRef]15], develop smartphone apps for visualizing personal sleep data [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16-Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19], develop artificial intelligence–based sleep coaching systems that help people improve sleep hygiene [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], and understand the challenges for sleep tracking technologies to eventually improve sleep health [Liu W, Ploderer B, Hoang T. In bed with technology: challenges and opportunities for sleep tracking. Presented at: OzCHI '15: The Annual Meeting of the Australian Special Interest Group for Computer Human Interaction; Dec 7-10, 2015, 2015; Parkville, VIC, Australia. [CrossRef]3,Ploderer B, Rodgers S, Liang Z. What’s keeping teens up at night? Reflecting on sleep and technology habits with teens. Pers Ubiquitous Comput. Jan 28, 2022;27(2):249-270. [CrossRef]21-Liang Z, Ploderer B. Sleep tracking in the real world: a qualitative study into barriers for improving sleep. Presented at: OzCHI '16: The 28th Australian Conference on Human-Computer Interaction; Nov 29-Dec 2, 2016, 2016; Launceston, Tasmania, Australia. [CrossRef]23]. At a higher level, sleep tracking studies have mostly been guided by general self-tracking frameworks, such as the lived informatics model [Epstein DA, An P, Fogarty J, Munson SA. A lived informatics model of personal informatics. Presented at: UbiComp '15: The 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 7-11, 2015, 2015; Osaka, Japan. [CrossRef]24] and the Prevetiver Health care on Individual Level framework [Liang Z, Chapa-Martell M. Framing self-quantification for individual-level preventive health care. Presented at: HEALTHINF '15: 8th International Conference on Health Informatics; January 12-15, 2015, 2015;336-343; Lisbon, Portugal. URL: https://www.scitepress.org/Papers/2015/52025/52025.pdf [CrossRef]25]. Both frameworks emphasize the iterative exploration and analysis of the self-tracking data to gain insight and drive behavioral changes.

One of the known challenges in sleep tracking is how to empower layperson users to make sense of their sleep data and to identify lifestyle or environmental factors that they can modify for better sleep [Liang Z, Ploderer B. How does Fitbit measure brainwaves: a qualitative study into the credibility of sleep-tracking technologies. Proc ACM Interact Mob Wearable Ubiquitous Technol. Mar 18, 2020;4(1):1-29. [FREE Full text] [CrossRef]22]. In the field of health informatics, health data analytics could be divided into multiple levels according to their analytical capabilities [Burke J. The world of health analytics. In: Kudyba SP, editor. Healthcare Informatics: Improving Efficiency through Technology, Analytics, and Management. Boca Raton, FL, USA. CRC Press; 2016;161-180.26]. Depending on its outcomes, health analytics can be descriptive, diagnostic, predictive, or prescriptive in nature [Banerjee A, Bandyopadhyay T, Acharya P. Data analytics: hyped up aspirations or true potential? Vikalpa. Oct 01, 2013;38(4):1-12. [CrossRef]27]. At the lowest level lies descriptive analytics, which answers the question, what has happened? This level of analytics describes data as is without applying complex calculations and exploration. Common techniques at this level, such as standard reports and alerts, focus on categorizing, characterizing, aggregating, and classifying data to understand the past and current states. Existing sleep tracking analytics is mostly centered on this level, which aims to help users gain a nice-to-know validation of their subjective perception of sleep. At the second level is diagnostic analytics, which focuses on possible antecedents and answers the question, why did it happen? This level of analytics requires extensive exploration and directed analysis and inference based on existing data to identify the potential problems and their probable causes. At the third level lies the predictive analytics, which focuses on the possible consequences and answers the question, what is likely to happen next? Sleep tracking technologies at this level should be able to predict users’ sleep quality in the near or far future by examining their historical self-tracking data, detecting patterns, and then leveraging the patterns to forecast. The highest level of analytics is prescriptive analytics, which answers the question, what should be done about it? It uses domain knowledge in medicine and health science in addition to data to generate recommendations for health interventions (eg, recommendations for better sleep).

Table 1 provides a mapping of the 4 levels of the analytics framework by Burke [Burke J. The world of health analytics. In: Kudyba SP, editor. Healthcare Informatics: Improving Efficiency through Technology, Analytics, and Management. Boca Raton, FL, USA. CRC Press; 2016;161-180.26] for sleep tracking. The descriptive analytics would focus on answering questions such as “How many hours did I sleep last night?” “How many awakenings did I have last night?” and “What is the average deep sleep ratio during the past one month?” So far, sleep tracking has predominantly centered on such descriptive analytics, typically by visualizing data with charts and tables on a dashboard. This type of application could be meaningful in understanding users’ current sleep patterns. However, with a flux of multiple models of sensor data, simple data visualization may miss important patterns that are not easily observable through visual inspection. As the complexity of sleep tracking data increases, it becomes necessary to examine the data in a more structured manner using advanced analytics. For example, diagnostic analytics could help answer questions such as “Was my sleep normal?” or “Why did I have so many awakenings last night?” Predictive and prescriptive analytics could answer questions such as “How will my sleep quality be in five years if I keep going to bed at 2:00 am?” or “Will I sleep better tonight if I work out 10 minutes longer in the morning?” To achieve advanced analytics, it is necessary to combine different streams of contextual information into a sleep analysis. Although many consumers’ sleep tracking systems support the simultaneous collection of multiple streams of contextual information, these data are often visualized separately and rarely integrated with sleep analysis. Currently, there is a paucity of advanced analytics in sleep tracking research.

Knowledge discovery is the process of finding meaningful patterns from data, and data mining is a central step within a knowledge discovery process. Popular data mining techniques, such as association rules mining and anomaly detection, are widely used in many application domains to detect hidden patterns in large data sets [Burke J. The world of health analytics. In: Kudyba SP, editor. Healthcare Informatics: Improving Efficiency through Technology, Analytics, and Management. Boca Raton, FL, USA. CRC Press; 2016;161-180.26]. In this paper, we present a scoping review on the application of knowledge discovery methods in sleep tracking. Such a review is useful for technical researchers interested in applying a wide range of machine learning and data mining techniques to the personal health domain as well as for sleep scientists who want to leverage the latest wearable technology combined with a data-driven approach for personalized and nonpharmaceutical interventions. Previous reviews on consumer sleep tracking technologies have dominantly focused on the utility and validity of these devices, especially in terms of their strengths and limitations relative to more widely accepted devices [Kolla BP, Mansukhani S, Mansukhani MP. Consumer sleep tracking devices: a review of mechanisms, validity and utility. Expert Rev Med Devices. May 2016;13(5):497-506. [CrossRef] [Medline]4,de Zambotti M, Cellini N, Goldstone A, Colrain IM, Baker FC. Wearable sleep technology in clinical and research settings. Med Sci Sports Exerc. Jul 2019;51(7):1538-1557. [FREE Full text] [CrossRef] [Medline]28,Van de Water AT, Holmes A, Hurley DA. Objective measurements of sleep for non-laboratory settings as alternatives to polysomnography--a systematic review. J Sleep Res. Mar 2011;20(1 Pt 2):183-200. [CrossRef] [Medline]29]. To the best of our knowledge, this scoping review is the first to focus on the advanced data analytics related to sleep tracking. An advanced data-driven approach has the potential to discover meaningful patterns or hidden correlations that could be used to guide behavioral change for better sleep. On the basis of the scoping review, we highlight the research opportunities for data-driven sleep computing.

Table 1. Level of analytics and its mapping to sleep tracking.

Analytics level	Questions answered	Mapping to sleep tracking
Descriptive	What has happened?	“How many hours did I sleep last night?”
Diagnostic	Why did it happen?	“Why did I have so many awakenings last night?”
Predictive	What is likely to happen next?	“How will my sleep quality be in five years if I keep going to bed at 2:00 am?”
Prescriptive	What should be done about it?	“Will I sleep better tonight if I work out 10 minutes longer in the morning?”

Research Questions

This scoping review was guided by the problem-constraints-system framework, which is widely used for conducting a literature review in computer science. The problem-constraints-system framework focuses on 3 different aspects of a research topic in computing research: a specific problem of interest (P); systems, applications, or algorithms (S) for tackling the problem; and constraints (C). After iterative brainstorming, we proposed the following 4 research questions (RQs) to anchor the entire review process:

RQ1: What is the general research trend of knowledge discovery in sleep tracking?
RQ2: What sleep quality metrics and contextual factors were considered, and how were they measured?
RQ3: What knowledge discovery methods or algorithms were applied? What knowledge was discovered?
RQ4: What challenges exist? What are the opportunities for future research?

Search Strategy and Query String

In this review, we focused on the application of data mining to identify the relationships between sleep and contextual factors with consumer wearable devices. Therefore, search results must contain all 4 pertinent aspects: sleep metrics, contextual factors, wearable devices, and knowledge discovery. Automated searches were conducted in a number of databases, including the Web of Science, Scopus, ACM Digital Library, IEEE Xplore, ScienceDirect, Springer, Fitbit Research Library, and Fitabase. As our review was centered on the computing aspect rather than the clinical aspect of sleep tracking, we used databases that focus more on engineering and computer science publications, including the ACM Digital Library, IEEE Xplore, ScienceDirect, and Springer. PubMed was not used because we were not interested in cohort studies or medical assessments. We also included gray literature such as the Fitbit Research Library and Fitabase because of their relevance to our topics of interest. The query strings were slightly different for each database but always contained 4 main keyword combinations: “sleep,” “lifestyle” AND “contextual,” “data mining” OR “knowledge discovery,” and “wearable device.” Synonyms words and words of related concepts (eg, “self-tracking,” “sensors,” “machine learning,” “correlation,” and “statistics”) were also listed to avoid missing out on related publications. The search strategy varied for each search engine, as each of them had different rules and options. The query string was truncated in some databases (eg, Web of Science, ScienceDirect, and IEEE Xplore) either because long query strings resulted in many irrelevant entries or because of word limit. We included several types of publications, including journal articles, conference proceeding papers, workshop presentations, patents, and gray literature, to gain a broad scope of the research topic. Editorial articles, theses, and dissertations were also excluded.

Study Selection

The study selection process is shown in the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flowchart in Figure 1. We applied the following exclusion criteria to filter out irrelevant papers retrieved from the databases: (1) studies not related to human sleep (eg, animal studies and sleep mode of sensor system), (2) clinical studies aiming at treating sleep disorders, (3) hypothesis-driven laboratory-based studies using invasive sleep tests (eg, polysomnography), (4) validation studies focusing on comparing wearable devices with medical devices, (5) studies focusing on estimating sleep architecture based on concurrent physiological signals, and (6) papers not written in English. We retrieved the returned entries from each database and imported them into the web-based review management software Rayyan (Rayyan). All databases provided tools to export the returned entries in a single file, except for the Fitbit Research Library and Fitabase. We exported only 400 of the 2072 papers from the ScienceDirect because the rest of the papers were not relevant to our review topic. For the ScienceDirect database, the returned items were first listed using the “relevant order” tool provided by the database. We used an a prior condition to include the first 400 items. For the remaining items, we separated them into groups of 200 items and randomly checked 50 items in each group. We found that from the 400th item onward, the studies were off the topic according to our exclusion criteria. Thus, we eliminated these items because they did not help address our central RQs. For example, the keyword “contextual” led to the retrieval of papers in civil engineering on checking bedroom quality with air condition, ambient light, and noise exposure. Those were excluded based on the exclusion criterion 1. The keyword “wearable device” led to the retrieval of papers in electrical and mechanical engineering on hardware design for sensors, batteries, and ergonomic design. Those were excluded based on the exclusion criterion 4. For the Fitbit Research Library, 2 papers were missing because they were either retracted or no longer available. Entries in Fitabase were screened for duplicity before manually adding to Rayyan because Fitabase does not support automatic export. As many of the Fitabase entries were duplicates of publications from other databases, only 37 papers were added to Rayyan.

**Figure 1.** PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flowchart of article screening process.

In total, 1999 papers were imported to Rayyan. Screening for duplication and written language was performed automatically in Rayyan. After the rapid screening step, the first author performed title and abstract screening and suggested the inclusion of 341 papers. The second author repeated the title and abstract screening of the 341 papers while paying special attention to the exclusion criteria 3 and 5. Any conflicts were resolved through discussion with the first author. After the screening and eligibility checks, 29 papers remained. Both authors read the full text of the 29 papers in detail and finally selected 14 papers for the scoping review.

General Research Trend (RQ1)

Ubiquitous and personal sleep tracking has become an active research area since approximately 2011, when a few pioneer studies were published in the human-computer interaction community [Bauer JS, Consolvo S, Greenstein B, Schooler J, Wu E, Watson NF, et al. ShutEye: encouraging awareness of healthy sleep recommendations with a mobile, peripheral display. Presented at: CHI '12: CHI Conference on Human Factors in Computing Systems; May 5-10, 2012, 2012; Austin, TX, USA. [CrossRef]17,Kay M, Choe EK, Shepherd J, Greenstein B, Watson N, Consolvo S, et al. Lullaby: a capture and access system for understanding the sleep environment. Presented at: Ubicomp '12: The 2012 ACM Conference on Ubiquitous Computing; Sep 5-8, 2012, 2012; Pittsburgh, PE, USA. [CrossRef]18,Choe EK, Consolvo S, Watson NF, Kientz JA. Opportunities for computing technologies to support healthy sleep behaviors. Presented at: CHI '11: the SIGCHI Conference on Human Factors in Computing Systems; May 7-12, 2011, 2011;3053-3062; Vancouver, Canada. URL: https://dl.acm.org/doi/10.1145/1978942.1979395 [CrossRef]30]. Since then, a large number of studies on sleep tracking have been published, but most of them are dominantly centered on the validation of existing sleep tracking devices or systems [Menghini L, Cellini N, Goldstone A, Baker FC, de Zambotti M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. Feb 12, 2021;44(2):zsaa170. [FREE Full text] [CrossRef] [Medline]5-Liang Z, Chapa-Martell MA. Combining numerical and visual approaches in validating sleep data quality of consumer wearable wristbands. Presented at: IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops); Mar 11-15, 2019, 2019; Kyoto, Japan. [CrossRef]10], on the development of new sleep staging algorithms [Roberts DM, Schade MM, Mathew GM, Gartenberg D, Buxton OM. Detecting sleep using heart rate and motion data from multisensor consumer-grade wearables, relative to wrist actigraphy and polysomnography. Sleep. Jul 13, 2020;43(7):zsaa045. [FREE Full text] [CrossRef] [Medline]11-Sirithummarak P, Liang Z. Investigating the effect of feature distribution shift on the performance of sleep stage classification with consumer sleep trackers. Presented at: IEEE 10th Global Conference on Consumer Electronics (GCCE); Oct 12-15, 2021, 2021; Kyoto, Japan. [CrossRef]15], or on the investigation into users’ experience with the technologies [Liu W, Ploderer B, Hoang T. In bed with technology: challenges and opportunities for sleep tracking. Presented at: OzCHI '15: The Annual Meeting of the Australian Special Interest Group for Computer Human Interaction; Dec 7-10, 2015, 2015; Parkville, VIC, Australia. [CrossRef]3,Ploderer B, Rodgers S, Liang Z. What’s keeping teens up at night? Reflecting on sleep and technology habits with teens. Pers Ubiquitous Comput. Jan 28, 2022;27(2):249-270. [CrossRef]21-Liang Z, Ploderer B. Sleep tracking in the real world: a qualitative study into barriers for improving sleep. Presented at: OzCHI '16: The 28th Australian Conference on Human-Computer Interaction; Nov 29-Dec 2, 2016, 2016; Launceston, Tasmania, Australia. [CrossRef]23]. In contrast, studies that focus on knowledge discovery in sleep tracking data are scarce and ad hoc. The screening process identified only 14 relevant studies, of which more than half (8/14, 57%) were conducted in the United States, followed by Japan (3/14, 21%). The other publications were from China, Korea, Finland, and Australia. Only 5 (36%) of the 14 publications were journal articles, and the rest were conference proceeding papers. Chronologically, studies by Jayarajah et al [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31] and Gelman and Hill [Gelman A, Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge, UK. Cambridge University Press; 2006.32] were one of the earliest studies in this field. The authors developed a binary tree model to predict good and poor sleep based on app use activity and social time during the day. Since 2015, the topic of knowledge discovery in sleep tracking has begun to attract more attention along with the advances in consumer sleep tracking technologies. As a result, the number of publications has increased slightly in the subsequent years, but the total amount is still limited.

A common objective of these studies was to help users gain insights into how their sleep quality was associated with other aspects of their daily lives. Some of the specific motivations are as follows:

Identify aspects of daily life that demonstrate significant associations with personal sleep quality from self-tracking data [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20,Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]
Highlight the potential for sleep metrics from wearable devices to provide novel insights into data generated from a large cohort [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34-Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]
Detect aberrant sleep patterns or typical events during sleep by considering individuals’ sleep baselines [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]
Guide users in designing self-experiments to identify personal modifiable lifestyle factors for better sleep health [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]
Develop a recommender system that provides both general and personalized recommendations for better sleep health [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]

The studies reviewed in this paper demonstrated a tendency to analyze sleep tracking data along with a flux of contextual factors. These factors were used as independent variables for predicting sleep quality and, to a lesser degree, for identifying antecedent events that affect sleep quality. In the scheme of traditional sleep science studies, only a limited number of independent variables were considered, and the confounding effect of noninterested factors needed to be controlled through a rigid experiment design. In comparison, data collection experiments in ubiquitous sleep tracking studies are often conducted in a naturalistic environment, making it challenging to control for confounding factors. Therefore, advanced data analysis techniques are required to control the effects of confounding factors during the analysis. Another line of research effort is the personalized detection of aberrant sleep. Although sleep quality assessment may sound straightforward using clinical standards [Ohayon M, Wickwire EM, Hirshkowitz M, Albert SM, Avidan A, Daly FJ, et al. National Sleep Foundation's sleep quality recommendations: first report. Sleep Health. Feb 2017;3(1):6-19. [CrossRef] [Medline]39], it remains challenging if personal differences in sleep needs should be taken into consideration. Studies in this direction are limited, and we identified only 2 relevant studies [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40].

Quantification and Measurement of Sleep and Contextual Factors (RQ2)

Human sleep can be quantified along multiple dimensions, such as sleep duration, sleep continuity, sleep timing, and subjective perception of the sleep event [Buysse DJ. Sleep health: can we define it? Does it matter? Sleep. Jan 01, 2014;37(1):9-17. [FREE Full text] [CrossRef] [Medline]41]. We found that not all studies used the same set of metrics to characterize sleep quality. Even the same sleep metric may be capsulated in different terminologies across studies. To facilitate cross-study comparisons, we mapped the sleep metrics in each study to standard clinical terms, whenever possible. Original sleep metrics that have no corresponding clinical terms were simply left as they were. As presented in Table 2, the most used sleep metrics among the reviewed studies were subjective sleep quality (4 studies), sleep efficiency (SE, 4 studies), sleep onset latency (SOL; 4 studies), and time at lights off (3 studies). Ratio parameters such as deep sleep ratio and rapid eye movement ratio were not used in any of the studies. Moreover, the cutoff between good and poor sleep varied from study to study. A few studies have adopted clinical cutoffs, particularly for the Pittsburgh Sleep Quality Index (5) [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31], SE (85%) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], wake after sleep onset (30 minutes) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], and total sleep time (7-9 hours) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]. The rest chose to use heuristic cutoffs that did not comply with the clinical guidelines.

Table 2. Sleep quality metrics, measurement methods, and cutoff of good or poor sleep.

Clinical term, original term, and measurement method			Cutoff
Subjective sleep quality
	Pittsburgh Sleep Quality Index [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]	Good: ≤5 and poor: >5 [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31]; good: ≤7 and poor: >7 [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]
	Sleep rating (SleepAsAndroid app) [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]	N/A^a
	Leeds Sleep Evaluation Questionnaire [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]	N/A
	Sleep rating (SleepApp) [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]	N/A
SE^b
	SE (Fitbit) [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]	Good: ≥95% and poor: <95%
	Efficiency (MS Band) [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]	N/A
	SE (Polar) [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]	Good: >mean (SD) and poor: <mean (SD)
	SE (Garmin) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]	Good: ≥85% and poor: <85%
SOL^c (minutes)
	Minutes to fall asleep (Fitbit) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]	N/A
	SOL (SleepAsAndroid app) [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]	N/A
	Sleep latency (Garmin) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]	Good: ≤15, average: 15-30, and poor: >30
	Time to fall asleep (MS Band) [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]	N/A
Time at lights off
	Bedtime (Fitbit) [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35]	Normal bedtime: median of a participant’s bedtimes; deviation categories were 1-30 minutes, 30-60 minutes, 1-2 hours, 2-3 hours, and ≥3 hours
	Bedtime (Fitbit) [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]	N/A
	Bedtime as estimated by the time of the last network signal [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]	N/A
Wake after sleep onset (minutes)
	Awake minutes (Garmin) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]	Good: ≤20 and poor: >20
	Minutes awake (Fitbit) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]	Good: ≤30 and poor: >30
Number of awakenings >5 minutes
	Awakenings >5 minutes (Garmin) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]	Good: ≤1 and poor: >1
	Number of wakeups (MS Band) [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]	N/A
Time in bed (minutes)
	Time in bed (MS Band) [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34]	N/A
Total sleep time (minutes)
	Minutes asleep (Fitbit) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]	Good: 420-540 and poor: <420 or >540
Original metrics (no corresponding medical term)
	Sleep ratio=the ratio of the sleep minutes with normal heart rate versus total sleep time (Fitbit) [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]	Good: >90%, normal: 60%-90%, and bad: <60%
	Number of awakenings per hour [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]	N/A
	Awakening count including restlessness (Fitbit) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]	N/A
	Permutation entropy of Fitbit measured sleep state time series [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]	N/A

^aN/A: not applicable.

^bSE: sleep efficiency.

^cSOL: sleep onset latency.

Sleep quality metrics were measured using 3 methods: questionnaire based, app based, and wearable based. The Pittsburgh Sleep Quality Index was the most widely used questionnaire to measure the subjective perception of sleep duration, continuity, efficiency, and satisfaction [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]. The SleepAsAndroid app was used to collect sleep data based on user movement patterns in the study by Daskalova et al [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], and the SleepApp was used to collect users’ subjective sleep quality ratings in the study by Ravichandran et al [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]. Although sleep tracking apps such as SleepAsAndroid were widely downloaded and used by millions of users, their ability to distinguish between quiet awakenings, deep sleep, or empty beds was still limited [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]. In a related vein, studies by Jayarajah et al [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31] and Faust et al [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36] have defined sleep time as the longest period during which there was no activity on users’ smartphones. However, the authors acknowledged that this method was not reliable because not everyone had the habit of using smartphones until they fell asleep. Most studies (9/14, 64%) used commercial wearable devices such as Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33,Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44], Microsoft Band (MS Band) [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], Garmin [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], and Polar [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]. Although all 3 methods are noninvasive, easy to use, and allow longitudinal collection of sleep data, each method has some limitations. The questionnaire-based method is subject to memory recall bias. Sleep data collected by sleep tracking apps and wearable sleep trackers provide an objective description of sleep quality and sleep structure. However, they may also be prone to measurement errors because of hardware and software limitations [Liang Z, Chapa Martell MA. Validity of consumer activity wristbands and wearable EEG for measuring overall sleep parameters and sleep structure in free-living conditions. J Healthc Inform Res. Jun 20, 2018;2(1-2):152-178. [FREE Full text] [CrossRef] [Medline]7,de Zambotti M, Cellini N, Goldstone A, Colrain IM, Baker FC. Wearable sleep technology in clinical and research settings. Med Sci Sports Exerc. Jul 2019;51(7):1538-1557. [FREE Full text] [CrossRef] [Medline]28]. They also require users to place the smartphone nearby or to wear the device continuously, which may cause discomfort during long-term use. Moreover, despite being small and convenient, consumer wearable trackers cannot provide hypnogram information that is as detailed as medical devices.

Along with sleep quality metrics, researchers have considered a wide range of contextual factors in their studies. As presented in Table 3, the contextual factors of interest include the sleep environment [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20,Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], daily activities [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20,Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40,Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], physiological states [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44], and mental states [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]. It was a common practice to collect demographic information (eg, age, gender, and BMI) and medical history using questionnaires at the beginning of a sleep tracking study [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]. Other contextual factors were recorded during the data collection experiments. Researchers have been curious about how web activities of university students could be coupled with their sleep quality [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]. These studies developed their own tools to track users’ web-based behavior and linked them to sleep quality metrics. Using consumer wearable trackers such as Fitbit, MS Band, and Garmin, researchers were able to expand their list of factors to include, for example, bedtime, steps, distance, hours of exercise, and calories burned [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20,Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40,Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Biosignals such as heart rate during sleep and daytime activities were included in the studies by Faust et al [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35], Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], and Choi et al [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]. Several studies have also manually collected input features such as coffee, alcohol, mood, and stress [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]. These factors may have a significant effect on the circadian cycle of hormone secretion and thus may provide useful information for sleep quality prediction. However, collecting these data are nontrivial, as users tend to forget to log the data on a daily basis. How to collect these data more efficiently and how to reduce the risk of missing data remain challenging.

Table 3. Contextual factors and measurement methods.

Category and contextual factor			Data collection method
Physiological factors
	Age	Self-report [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]
	Sex	Self-report [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Body weight	Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and self-report [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	BMI	Self-report [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]
	Body temperature	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
	Heart rate	Fitbit [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] and MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Menstrual cycle	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
	Calorie in and out	Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Activity calorie	Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
Psychological factors
	Stress	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
	Mood	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Tiredness	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
	Dream	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]
	Sleep quality the previous night	SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20] and MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Cognitive performance	Keystroke time [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34] and click time [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34]
Behavioral factors
	Steps	Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37] and MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Distance walked	Fitbit [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]
	Active time	Fitbit [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]
	Exercise	SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], Polar [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40], Garmin [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Coffee	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Alcohol	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Tobacco	Self-report [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Electronic device use	App use time (total and different app categories) [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31], diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], Bing search logs [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19], and campus network [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]
	Nap	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20], and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
	Location	Campus Wi-Fi [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31] and Cortana [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Social activity	GruMon (location estimation based on Wi-Fi signals) [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31], diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], and Twitter [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]
	Mealtime	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], smartphone camera [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19], and campus smart card [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]
	Waketime	SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]
	Bedtime	Fitbit [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37], IoT^a sensor [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42], and SleepApp [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]
Environmental factors
	Ambient temperature	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and IoT sensor [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]
	Ambient humidity	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and IoT sensor [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]
	Ambient light	Diary [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16] and SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]
	Ambient noise	SleepAsAndroid [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]
	Day of week	MS Band [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]

^aIoT: internet of things.

We found that 13 (93%) of the 14 reviewed studies conducted their own data collection experiments [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] in free-living conditions. In these studies, sleep data were recorded in participants’ usual sleep environments (eg, homes and caregiving facilities), whereas contextual factors were recorded while participants were at schools, universities, workplaces, or sports centers. In contrast, only 1 study used an existing data set [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35], which can be accessed over the web [Nethealth Data. University of Notre Dame. URL: http://sites.nd.edu/nethealth/data-2/ [accessed 2023-06-19] 45]. Data sharing is not yet a common practice in the field, and the number of public data sets is limited.

Knowledge Discovery in Sleep Tracking (RQ3)

Data Preprocessing

All the reviewed studies used data sets that were collected in free-living environments. Data collection “in the wild” increases the ecological validity of the studies but at the sacrifice of data quality. The first step in the knowledge discovery process is to deal with missing, wrong, and duplicate data. Although there are many methods for data cleaning in the literature, the methods adopted in the reviewed publications are extremely simple and straightforward. Most of the studies (5/14, 36%) simply excluded records that contain missing values (eg, sleep logs with missing fields or sleep duration of 0 minutes) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37] or excluded users who do not contribute sufficient data (eg, fewer than 30 sleep records) [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]. One study excluded a certain data source (eg, search engine interactions originating from mobile devices) to avoid causing distortion to the data distribution [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34].

Similarly, data out of logical ranges were removed. Users aged <10 years or >100 years, with weight <22.7 kg or >112.5 kg, or with height <127 cm or >457.2 cm were excluded in the study by Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Extremely short (<0.5 or 4 hours) and long (>12 hours) sleep records were removed in the studies by Althoff et al [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34] and Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Sleep entries with bedtime between 7:00 AM and 7:00 PM were removed in the study by Liang et al [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]. Exercise time is another criterion for filtering out potentially erroneous data records. For example, exercise events <5 or >180 minutes were removed in the study by Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Exercises with calorie consumption per hour <50 or >2000 calories or with a duration of <10 minutes were excluded in the study by Liu et al [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]. In addition, data records with steps <1000 and those with a sedentary time of 0 minute were removed in the study by Liang et al [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16].

Time stamps are another focus in data preprocessing. The data collection in the study by Ravichandran et al [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19] primarily relied on users’ manual input in SleepApp and thus had a higher risk of human errors. Consequently, all logs with aberrant timestamps were removed. In addition, 12 hours were added to or subtracted from the recorded bedtime or wake-up time where the users might have forgotten to toggle the AM and PM switch on the app. Dealing with timestamps involves not only data cleaning but also temporal matching among multiple data sources [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42] as well as data type conversion (eg, 18:30 to 1830) [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37].

Other types of data preprocessing included selecting users with larger variations in sleep and exercise [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38], removing redundant entries [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19], and resampling the raw data (eg, the photoplethysmography-derived heart rate time series was aggregated every 3 minutes to achieve a constant sampling rate [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]).

Some knowledge discovery processes that rely on machine learning or data mining techniques may require a feature engineering process instead of directly using the cleaned data as input. For example, several studies have involved the construction of secondary features from the cleaned data [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]. Features were normalized to have a mean and SD equal to 0 and 1 [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42] or normalized over another feature (eg, exercise intensity features were normalized by dividing the basal metabolic rate [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]). Dimension reduction (eg, principal component analysis) was applied to reduce the number of input features to avoid the adverse effect of the “curse of dimension” [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31].

Data Mining

The selection of the data mining method depends on the purpose of the studies and, to a lesser degree, on the size of the available data set. Table 4 provides a summary of the data mining methods and the specific techniques or algorithms used in the reviewed studies. We also listed the independent variables (or input) and dependent variables (or output) of the constructed models. Correlation analysis, regression analysis, and rule induction are the most used methods for finding meaningful associations between contextual factors and sleep quality metrics. In total, 3 correlation analysis techniques, Pearson correlation, Spearman correlation, and repeated measure correlation, were applied to examine the strength of the pairwise linear relationships between sleep and contextual factors [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]. Similarly, various regression analysis methods have been used, ranging from simple linear regression to linear mixed effects regression to piecewise fixed effects regression [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34,Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Least square estimation was the most popular technique for parameter estimation in regression analysis and was used in the studies by Althoff et al [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34] and Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. The study by Faust et al [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35] provided no information but is highly likely to use the same technique. It is worth noting that the Pearson correlation coefficient is equivalent to the standardized slope of a simple linear regression line.

Table 4. Summary of the data mining methods used in the reviewed studies.

Data mining method and techniques or algorithms			Data size		Independent variable		Dependent variable
Correlation analysis
	Pearson correlation [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]	Preliminary study: 24 users over 20 days; final study: 19 users over 21 days		TST^a and contextual factors		SOL^b, NAWK^c, and sleep rating
	Spearman correlation [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]	12 users over 2 weeks		Contextual factors		TST, WASO^d, NAWK, SOL, and SE^e
	Repeated measure correlation [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]	10 users over 2 weeks		Bedtime, TIB^f, and contextual factors		SE, SOL, NAWK, restlessness, TIB, and LSEQ^g
Regression analysis
	Piecewise fixed effects regression [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34]	31,793 users over 18 months; all American users		Time of day, time after waking up, and sleep duration		Cognitive performance
	Simple linear regression [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]	Approximately 20,000 users over 4 months		Contextual factors		SOL, NAWK, and SE
	Linear mixed effects model [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35]	557 users over 1 year		Bedtime regularity		Resting heart rate
Rule induction
	A priori algorithm [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]	1 user over 180 days; 4 users over 2 weeks		Contextual factors		SE
	Learn from Examples using Rough Sets [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]	280 users over 1 month; only the data of males were used		Contextual factors		Sleep ratio
	Event mining (+causal inference) [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]	1 user over 800 days		Contextual factors		SOL, WASO, NAWK, and SE
Causal inference
	Stratified propensity score analysis [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]	Approximately 20,000 users over 4 months		Contextual factors		SOL, NAWK, and SE
	Bayesian network analysis [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]	5200 users over 6 months		Contextual factors and bedtime		Contextual factors and bedtime
Time series analysis
	Anomaly detection [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]	1 user over 35 days		Fitbit measured intraday time series, TST, WASO, NAWK, and bedtime		Permutation entropy of sleep time series
	SAX^h-based motif matching and principle optimization [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]	100 users over 10 weeks		Heart rate time series data		PSQIⁱ
Statistical test
	Unpaired 2-samples Wilcoxon test [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]	271 users over 8 months		Contextual factors		Statistical differences between good and poor sleep
Decision tree
	J4.8 Classifier [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31]	400 users over 15 months		Contextual factors		PSQI

^aTST: total sleep time.

^bSOL: sleep onset latency.

^cNAWK: number of awakenings.

^dWASO: wake after sleep onset.

^eSE: sleep efficiency.

^fTIB: time in bed.

^gLSEQ: Leeds Sleep Evaluation Questionnaire.

^hSAX: Symbolic Aggregate Approximation.

ⁱPSQI: Pittsburgh Sleep Quality Index.

Although correlation analysis and regression analysis (except piecewise constant approximation) capture the relationships between sleep and contextual factors in the entire sampling range, the piecewise regression in the study by Althoff et al [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34] shared some resemblance with rule induction methods that capture the relationship between sleep and contextual factors within a constrained range. However, in contrast to piecewise regression, which quantifies the covariance between 2 variables in each partitioned segment, rule induction methods focus on extracting frequent patterns in the data sets that characterize the co-occurrence of 2 variables when their values fall into the corresponding ranges specified in a rule. Moreover, rule induction methods are usually robust to missing data. As the rule induction methods were originally developed to analyze categorical data, numerical data need to be converted to categorical data through a discretization step before rule induction methods can be applied. There are 4 discretization methods used in the reviewed studies: equal size discretization [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44], equal frequency discretization [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33], k-means clustering discretization [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33], and discretization with heuristically defined cutoffs [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]. Association rules mining is a popular rule induction method that has been widely used in traditional medical and health informatics applications; however, it has only been used in 1 study among all the studies we reviewed [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]. In that study, the a priori algorithm was applied for rule induction. The quality of the induced association rules was validated by higher local correlation coefficients (ie, the Pearson correlation coefficient when the variables fall into the ranges specified in a rule) than the global correlation coefficients (ie, the Pearson correlation coefficient between 2 variables within the entire sampling range). To better handle the potential inconsistency (eg, conflicting records) in the data set, Rock-Hyun et al [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] applied another rule induction algorithm named learning from examples using rough sets. The global covering algorithm computes the lower and upper approximations of all the target sleep quality metrics (eg, sleep ratio=“good”) if the input data set contains conflicting records. The quality of the induced rules was assessed based on the predictive accuracy of the target sleep metrics. Although association rules mining and learning from examples using rough sets capture only the parallel co-occurrence of 2 items (ie, when the values of 2 variables fall into the corresponding ranges specified by a rule), event mining can also capture the co-occurrence of 2 items with a time lag (ie, the temporal sequence when the 2 items occur) [Jalali L. Interactive Event-driven Knowledge Discovery from Data Streams. Irvine, CA, USA. University of California; 2016.46]. To a certain degree, event mining resembles sequential pattern mining [Wright AP, Wright AT, McCoy AB, Sittig DF. The use of sequential pattern mining to predict next prescribed medications. J Biomed Inform. Feb 2015;53:73-80. [FREE Full text] [CrossRef] [Medline]47]; however, this characteristic was not used in the study by Upadhyay et al [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42].

Causal inference is a powerful approach to reduce potential bias in the identified relationships between sleep and contextual factors because of observed confounding factors. This method is likely to outperform simple correlation analysis or rule induction–based methods. In the study by Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], stratified propensity score analysis was performed to isolate the effects of potential confounding factors. A similar technique was used in the study by Upadhyay [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42] to enhance the quality of the induced rules by accommodating confounding factors. In addition, the Bayesian network was applied to explore the relationship between sleep schedules and behavioral factors [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36].

Statistical tests and decision trees were also used in the existing literature, but only in 1 study each. The unpaired 2-samples Wilcoxon test was applied to identify significant differences in a set of selected contextual factors between good and poor sleepers [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]. Despite its simplicity, this method does not generate quantitative relationships between the contextual factors and sleep quality. In contrast, decision trees were used in the study by Jayarajah et al [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31] to predict sleep quality using contextual factors as input features.

In addition to the abovementioned methods, time series analysis was also used but only in 2 studies [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]. Dimension reduction and anomaly detection were combined to identify aberrant sleep recordings while counting in personal sleep baseline in the study by Liang et al [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]. Feng and Narayanan [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38] introduced a method to discover motifs in heart rate time series, which were signal patterns that appeared most frequently during sleep [Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38]. They then used these motifs as features to predict sleep quality. In contrast, although the study by Upadhyay et al [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42] adopted a streaming data perspective, it did not incorporate any formal time series analysis technique [Ohayon M, Wickwire EM, Hirshkowitz M, Albert SM, Avidan A, Daly FJ, et al. National Sleep Foundation's sleep quality recommendations: first report. Sleep Health. Feb 2017;3(1):6-19. [CrossRef] [Medline]39].

Knowledge Discovered

The knowledge discovery process in the reviewed studies identified interesting associations both at the cohort level and the individual level. First, significant associations were found among the sleep quality metrics. Late bedtime was associated with a higher permutation entropy of the Fitbit measured sleep time series (indicating a higher chance of aberrancy) [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37]. Bedtime deviation was correlated to longer SOL [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]. In addition, sleep duration was positively associated with subjective sleep satisfaction [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19,Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20].

Regarding the relationship between sleep quality and contextual factors, exercise was the most identified association factor [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40], but the relationship between exercise and sleep was complex [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. First, not all exercise features have a predictive power of sleep quality. For example, Liu et al [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40] found that exercise duration, relative calories consumption, and exercise timing could be used as predictors of sleep quality, but exercise intensity was not significantly associated with sleep quality. Second, exercise may be positively associated with some sleep quality metrics but negatively associated with others. For example, exercise before bed may be linked to shorter SOL and higher SE [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], and exercise seems to improve SOL the most among all the sleep quality metrics [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]. However, the results diverged as different types of exercises were considered. Taking more steps meant fewer awakenings, whereas running and burning more calories were correlated with more awakenings [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. Moreover, longer exercise duration (eg, >100 minutes) may be associated with good sleep for some users but poor sleep for others [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40]. Furthermore, confounding factors may modulate the relationship between sleep and exercise. With causal inference, it was found that pleasant ambient temperature at bedtime significantly strengthened the relationship between exercise and sleep, whereas having a poor sleep the previous night detracted from the beneficial effects of exercise [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42].

Digital device use is another important factor that correlates with sleep quality and sleep schedule. No web searches before bed correlated with shorter SOL and higher SE, whereas web searches before bed correlated with more awakenings [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. The association between app use and sleep quality was modulated by the frequency and timing of use [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31]. In particular, low social app use was associated with good sleep, and more app use correlated with good sleep if used for >4 hours before sleep. Reading and gaming app use within 1 hour before bedtime correlated with poor sleep. A strong connection between internet surfing habits and bedtime was identified by Guo et al [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]. Video lovers tended to go to bed later than game fans. Going to bed late, in turn, has negative consequences. Deviations from usual bedtime may result in a higher resting heart rate during sleep [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35]. Students who went to bed late were more likely to have a poor academic performance [Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36]. Late bedtime (in relation to one’s circadian cycle) reduced cognitive performance the next day, whereas early bedtime did not have the same negative effect [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34]. In contrast, having a sufficient sleep was important for maintaining normal cognitive performance [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34].

As expected, caffeine and alcohol consumption were significant association factors. Consuming caffeine late in the day was a universal negative factor in subjective sleep ratings [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]. Alcohol consumption was positively correlated with SE and wake-up freshness and was negatively correlated with wake after sleep onset and number of awakenings for some users but not all users [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16].

Not just personal activities or substance consumption associated with sleep, but places visited before bedtime and social life also play a role. One study found that students who spent more time with friends had better sleep quality than those who stayed alone on campus most of the time [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31]. In addition, if students spent most of their time outside campus, good quality sleep was found for those who spent <15% of their time being alone. Another study tracked users’ location during the day and stated that users took longer to fall asleep if they visited food-related or bank-related locations close to bedtime [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. In addition, sleep quality may vary depending on the environment in which sleep takes place. The most common relationship identified for many users by Daskalova et al [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20] was the pair of noisiness and number of awakenings [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]. Temperature was positively associated with all sleep metrics except for SOL [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42].

Challenges and Opportunities (RQ4)

Knowledge discovery in sleep tracking is the process of extracting nonobvious hidden knowledge from self-tracking sleep data and other available contextual information. Preparing a data set of a sufficient size is the first step in this process. Almost all the reviewed studies (13/14, 93%) conducted original data collection experiments using noninvasive wearable and mobile sensors. The existing literature highlighted several challenges of data collection in sleep tracking. First, the absence of an objective, quantifiable, and universal definition of good sleep places a big challenge in annotating the collected sleep data [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19]. Without a well-annotated data set, it is not feasible to apply supervised data mining techniques, and the absence of ground truth impedes the unbiased evaluation of the knowledge discovery process. Second, some contextual factors are considered difficult to quantify. These factors include digital device use, caffeine and alcohol consumption, and social interaction [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16], to name but a few. The timing of data logging may also influence the results [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20]. For example, users were advised to avoid using digital devices 2 hours before bedtime but had to log on to their smartphones to submit daily data at the end of the day. Third, existing passive sensing methods may have strong limitations [Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34]. For example, some studies (2/14, 14%) assume that users check their smartphone right before bedtime and immediately after waking up [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36] and thus may miss out on users who have no such habits. Consumer wearables may have limited accuracy in measuring sleep stages and other factors [Liang Z, Chapa Martell MA. Validity of consumer activity wristbands and wearable EEG for measuring overall sleep parameters and sleep structure in free-living conditions. J Healthc Inform Res. Jun 20, 2018;2(1-2):152-178. [FREE Full text] [CrossRef] [Medline]7], but the issue of data quality was not considered in the reviewed studies [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35].

Moreover, interpreting the knowledge discovery outcome is not always straightforward. Correlation analysis essentially captures the covariance of 2 variables. Users with regular sleep and daily life routines may end up with no significant correlations found because of the lack of variability in their data. However, users may misinterpret this as having no relationship [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16]. Rule induction methods usually generate a large number of rules, but not all of them are useful. Long rules with too many factors in the antecedent, despite of being explainable, provide no actionable insights because of their complexity (eg, “IF 17.85 < BMI < 25.21 AND Smoking is Yes AND 61.81 < Normal_Avg_HR < 79.0 AND 0 < Normal_Awake <19.0 AND 1.5 < Normal_Really_Awake < 24.0 AND 1.5 < High_Asleep < 606.5 AND 1.5 < High_Awake < 155.0 AND 0.5 < High_Really_Awake < 175.5 THEN Sleep Quality Status is ‘Bad’ with support 8”) [Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44]. In contrast, short association rules may be more comprehensible (eg, “minutes very active={33; 38}=> good sleep or steps={18,658; 20,263}=> good sleep” [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33]). However, heuristic discretization without a semantic meaning may impede understandability [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20].

Despite the challenges, the reviewed studies highlighted several opportunities for future research. In total, 3 studies suggested considering more contextual factors in addition to the ones already studied, such as emotion, diet, productivity, and chronotypes [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]. Acknowledging that the correlations at a cohort level may be weak [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31], argues for an individual-centric approach to identifying the most important contextual factors for each user. Along the same line [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43], it was pointed out that building predictive models within similar user groups is more practical. They proposed a hierarchical modeling scheme with a top layer containing population parameters and lower layers personalized to individual users. Similar user profiling and segmented modeling proposals were presented in the studies by Liang et al [Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33] and Farajtabar et al [Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43].

Principal Findings

Sleep tracking using consumer wearable devices and mobile apps has attracted remarkable attention from the research community. However, sleep tracking studies have focused on developing sleep tracking technologies for accurately measuring sleep per se, and little attention has been directed to the extraction of patterns and insights from these data. To the best of our knowledge, this scoping review is the first to map the existing literature from a knowledge discovery perspective in sleep tracking.

Our analysis results showed that the number of publications on the topic of interest has slightly increased over the years but is still low, probably because the data-driven scheme has not been fully embraced in personal informatics. Nonetheless, we found that the existing literature covered all 4 levels of analytics, as presented in Table 1. Most of the 14 studies that we reviewed applied simple correlation analysis, regression analysis, and rule induction methods to discover the associations between sleep and other aspects of life. Although most consumer sleep tracking technologies allow users to visually inspect their sleep data (which is descriptive in nature), the reviewed studies demonstrated the feasibility of diagnostic analysis with a flux of sleep and contextual data. Although correlation does not necessarily indicate causality, a combination of association analysis and causal inference—as was done in the study by Upadhyay et al [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]—may help users narrow down the scope of possible modifiable factors that are likely to affect their sleep quality. Machine learning and data mining techniques were only used in a few studies for anomaly detection (which is diagnostic) [Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]37] or sleep quality prediction (which is predictive) [Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43]. In total, 2 studies developed computational models to generate personalized recommendations for better sleep and showed promise in prescriptive analysis of sleep tracking [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20,Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42]. The most used sleep metrics among the reviewed studies were subjective sleep quality, SE, SOL, and time at lights off. Exercise, digital device use, places visited during the day and before bedtime, and sleep environment are the major factors that significantly correlate with various dimensions of sleep quality.

Taken together, there are a few key challenges that are relevant to the findings. On the one hand, it is nontrivial to collect high-quality data in naturalistic settings. Challenges include how to motivate users to overcome tracking fatigue, how to enhance the reliability of consumer wearables and apps, and how to quantify and automate the collection of contextual information to represent the current challenges surrounding the collection of sleep tracking data sets. On the other hand, how to extract hidden knowledge from data, how to accommodate commonness and individuality, and how to interpret data mining results are topics for future studies.

Nuance in Handling Within-Individual Variation

The selection of appropriate data mining methods relies on a correct understanding of the nature of the sleep tracking data set. Researchers often conduct longitudinal data collection experiments that involve the collection of multiple measurements of the same variables (eg, sleep quality, exercise, and ambient light) from each individual user. The data form a hierarchical or clustered structure when aggregated at the cohort level. Caution must be exercised when applying traditional analytic and modeling techniques developed for single-level data, as hierarchical data are likely to violate the assumption of independent errors of those techniques. In particular, although a hierarchical data set offers the benefit of a larger amount of data, the within-individual variation at the individual level needs to be addressed carefully through multilevel analysis and modeling.

In a multilevel analysis framework, the repeated measures are clustered within the level of an individual, and each individual is treated as a cluster unit. Depending on whether the analysis of one cluster involves pooling the data of other clusters, there are 3 approaches to analyzing a hierarchical data set: complete pooling, no pooling, and partial pooling. Complete pooling completely ignores the variation between individual users and treats all samples as being drawn from the same population. A dominant portion of the studies reviewed in this work [Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]16,Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]31,Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]33,Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]34,Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]36,Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]38,Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40,Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]43,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] adopted this approach. Nonetheless, this approach is undesirable, as it violates the assumption of independence. The results could have been distorted when the between-individual variation was high. At the other end of the spectrum lies the no pooling approach, where the analysis of the relationship between sleep and contextual factors was performed only on the data of each individual user without considering data from other users. Daskalova et al [Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]20] Upadhyay et al [Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]42] embraced an N-of-1 design and correspondingly adopted the no pooling approach to analyze the collected data. At the surface, this approach is plausible for fully handling the within-individual variation. However, it bears the risk of overstating the variation between individual users because of potential overfitting when the number of samples from individual users is small. Partial pooling or multilevel modeling compromises between pooled and unpooled estimates, with the relative weights of pooling determined by the sample size of each individual user and the variation within and between individuals. Multilevel modeling automatically adjusts the degree of pooling with a “soft constraint,” which ensures strong pooling for users with fewer records and weak pooling for users with abundant records in the data set [Gelman A, Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge, UK. Cambridge University Press; 2006.32]. We found that only Ravichandran et al [Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]19] and Faust et al [Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]35] used the multilevel modeling approach and explicitly considered the within-individual variation.

Most reviewed studies (8/14, 57%) seem to have relied on the undue assumption of an independent and identically distributed data set. As a result, some studies (2/14, 14%) found mixed or even conflicting results on the relationships between sleep and contextual factors at the cohort level [Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]40,Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]44] and, consequently, generated no valuable insights. Studies using an N-of-1 design are interesting exceptions. In these studies, analysis was conducted on each user’s data, thus eliminating the effect at the cohort level. However, the robustness and generalizability of the findings are questionable. Even in studies with an N-of-1 design, it may still be helpful to partially pool some samples from the population to increase the reliability of the model parameter estimates. As such, Gelman et al [Bloom BS. Bloom's Human Characteristics and School Learning. New York, NY, USA. McGraw-Hill; 1976.48] suggested always using multilevel modeling (ie, “random effects”) as a rule of thumb, for example, linear mixed effect model over simple linear regression model and generalized linear mixed model trees [Fokkema M, Edbrooke-Childs J, Wolpert M. Generalized linear mixed-model (GLMM) trees: a flexible decision-tree method for multilevel and longitudinal data. Psychother Res. Mar 2021;31(3):313-325. [CrossRef] [Medline]49] over the J4.8 classifier.

Limitations of the Study

There is room for improvement in several aspects of this study. First, because of the limitations inherent in scoping reviews, this study is exploratory and primarily qualitative in nature. Limited by the review methodology, we were unable to generate a quantitative “summary of findings, ” as required for systematic reviews or meta-analyses. Second, our method is nonstandard in a sense that we performed prescreening on the items identified in the ScienceDirect database before importing all entries into Rayyan. Although we made an effort to ensure that the removed items were not relevant, we cannot rule out the possibility of missing publications that should have been included. Third, we did not conduct critical appraisal on the quality of the selected papers or perform a risk of bias assessment, which may have led to potential bias in the selection and interpretation of the papers. Despite these limitations, this review provides a well-scoped summary of existing research and could lay the groundwork for future systematic reviews. The research gaps that we identified can be used to inform future research agendas.

Conclusions

This scoping review built an understanding of the scope and nature of existing literature on knowledge discovery in ubiquitous and personal sleep tracking. To the best of our knowledge, this is the first review that exclusively focused on the knowledge discovery aspect of self-tracking in the realm of sleep health. In total, 14 studies were included in the review based on the exclusion criteria. We found that the existing literature covered all 4 levels of the analytics framework in health informatics. However, half (7/14, 50%) of the studies have only applied simple correlation analysis and regression analysis, aiming to discover significant associations between sleep and available contextual information. Machine learning and data mining techniques have not yet been widely used, probably because of the lack of large and quality data sets. Exercise, digital device use, places visited during the day and before bedtime, and sleep environment were the most identified factors associated with sleep quality. We identified key challenges surrounding the collection of high-quality sleep tracking data sets with consumer-grade sensors and in naturalistic settings as well as the extraction of hidden knowledge that could be translated into actionable insights and personalized behavior interventions. We highlight that future research should develop data analytics techniques and prediction models that properly handle the within-individual variation and between-individual variation in sleep tracking data sets. We hope that this scoping review could lay the groundwork for future research on ubiquitous and personal sleep tracking.

Acknowledgments

This study was sponsored by a Japan Society for the Promotion of Science (JSPS) KAKENHI Grant-in-Aid for Early-Career Scientists (grant 21K17670). The funder had no role in the study design, data collection and analysis, or manuscript preparation.

Authors' Contributions

ZL contributed to study conception. ZL and NHH contributed to study design, analysis and interpretation of results, and manuscript drafting and revision. NHH collected the data. All authors have reviewed the results and approved the final version of the manuscript.

Conflicts of Interest

None declared.

Grandner MA, Lujan MR, Ghani SB. Sleep-tracking technology in scientific research: looking to the future. Sleep. May 14, 2021;44(5):zsab071. [FREE Full text] [CrossRef] [Medline]
Choi YK, Demiris G, Lin SY, Iribarren SJ, Landis CA, Thompson HJ, et al. Smartphone applications to support sleep self-management: review and evaluation. J Clin Sleep Med. Oct 15, 2018;14(10):1783-1790. [FREE Full text] [CrossRef] [Medline]
Liu W, Ploderer B, Hoang T. In bed with technology: challenges and opportunities for sleep tracking. Presented at: OzCHI '15: The Annual Meeting of the Australian Special Interest Group for Computer Human Interaction; Dec 7-10, 2015, 2015; Parkville, VIC, Australia. [CrossRef]
Kolla BP, Mansukhani S, Mansukhani MP. Consumer sleep tracking devices: a review of mechanisms, validity and utility. Expert Rev Med Devices. May 2016;13(5):497-506. [CrossRef] [Medline]
Menghini L, Cellini N, Goldstone A, Baker FC, de Zambotti M. A standardized framework for testing the performance of sleep-tracking technology: step-by-step guidelines and open-source code. Sleep. Feb 12, 2021;44(2):zsaa170. [FREE Full text] [CrossRef] [Medline]
Chinoy ED, Cuellar JA, Huwa KE, Jameson JT, Watson CH, Bessman SC, et al. Performance of seven consumer sleep-tracking devices compared with polysomnography. Sleep. May 14, 2021;44(5):zsaa291. [FREE Full text] [CrossRef] [Medline]
Liang Z, Chapa Martell MA. Validity of consumer activity wristbands and wearable EEG for measuring overall sleep parameters and sleep structure in free-living conditions. J Healthc Inform Res. Jun 20, 2018;2(1-2):152-178. [FREE Full text] [CrossRef] [Medline]
Liang Z, Chapa-Martell MA. Accuracy of Fitbit wristbands in measuring sleep stage transitions and the effect of user-specific factors. JMIR Mhealth Uhealth. Jun 06, 2019;7(6):e13384. [FREE Full text] [CrossRef] [Medline]
Liang Z, Nishimura T. Are wearable EEG devices more accurate than fitness wristbands for home sleep tracking? Comparison of consumer sleep trackers with clinical devices. Presented at: IEEE 6th Global Conference on Consumer Electronics (GCCE); Oct 24-27, 2017, 2017; Nagoya, Japan. [CrossRef]
Liang Z, Chapa-Martell MA. Combining numerical and visual approaches in validating sleep data quality of consumer wearable wristbands. Presented at: IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops); Mar 11-15, 2019, 2019; Kyoto, Japan. [CrossRef]
Roberts DM, Schade MM, Mathew GM, Gartenberg D, Buxton OM. Detecting sleep using heart rate and motion data from multisensor consumer-grade wearables, relative to wrist actigraphy and polysomnography. Sleep. Jul 13, 2020;43(7):zsaa045. [FREE Full text] [CrossRef] [Medline]
Walch O, Huang Y, Forger D, Goldstein C. Sleep stage prediction with raw acceleration and photoplethysmography heart rate data derived from a consumer wearable device. Sleep. Dec 24, 2019;42(12):zsz180. [FREE Full text] [CrossRef] [Medline]
Liang Z, Chapa-Martell MA. A multi-level classification approach for sleep stage prediction with processed data derived from consumer wearable activity trackers. Front Digit Health. May 28, 2021;3:665946. [FREE Full text] [CrossRef] [Medline]
Liang Z, Chapa-Martell MA. Achieving accurate ubiquitous sleep sensing with consumer wearable activity wristbands using multi-class imbalanced classification. Presented at: IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech); Aug 5-8, 2019, 2019; Fukuoka, Japan. [CrossRef]
Sirithummarak P, Liang Z. Investigating the effect of feature distribution shift on the performance of sleep stage classification with consumer sleep trackers. Presented at: IEEE 10th Global Conference on Consumer Electronics (GCCE); Oct 12-15, 2021, 2021; Kyoto, Japan. [CrossRef]
Liang Z, Ploderer B, Liu W, Nagata Y, Bailey J, Kulik L, et al. SleepExplorer: a visualization tool to make sense of correlations between personal sleep data and contextual factors. Pers Ubiquitous Comput. Sep 16, 2016;20(6):985-1000. [CrossRef]
Bauer JS, Consolvo S, Greenstein B, Schooler J, Wu E, Watson NF, et al. ShutEye: encouraging awareness of healthy sleep recommendations with a mobile, peripheral display. Presented at: CHI '12: CHI Conference on Human Factors in Computing Systems; May 5-10, 2012, 2012; Austin, TX, USA. [CrossRef]
Kay M, Choe EK, Shepherd J, Greenstein B, Watson N, Consolvo S, et al. Lullaby: a capture and access system for understanding the sleep environment. Presented at: Ubicomp '12: The 2012 ACM Conference on Ubiquitous Computing; Sep 5-8, 2012, 2012; Pittsburgh, PE, USA. [CrossRef]
Ravichandran R, Patel SN, Kientz JA. SleepApp: providing contextualized and actionable sleep feedback. Presented at: PervasiveHealth '20: 14th EAI International Conference on Pervasive Computing Technologies for Healthcare; May 18-20, 2020, 2020; Atlanta, GA, USA. [CrossRef]
Daskalova N, Metaxa-Kakavouli D, Tran A, Nugent N, Boergers J, McGeary J, et al. SleepCoacher: a personalized automated self-experimentation system for sleep recommendations. Presented at: UIST '16: The 29th Annual ACM Symposium on User Interface Software and Technology; Oct 16-19, 2016, 2016; Tokyo, Japan. [CrossRef]
Ploderer B, Rodgers S, Liang Z. What’s keeping teens up at night? Reflecting on sleep and technology habits with teens. Pers Ubiquitous Comput. Jan 28, 2022;27(2):249-270. [CrossRef]
Liang Z, Ploderer B. How does Fitbit measure brainwaves: a qualitative study into the credibility of sleep-tracking technologies. Proc ACM Interact Mob Wearable Ubiquitous Technol. Mar 18, 2020;4(1):1-29. [FREE Full text] [CrossRef]
Liang Z, Ploderer B. Sleep tracking in the real world: a qualitative study into barriers for improving sleep. Presented at: OzCHI '16: The 28th Australian Conference on Human-Computer Interaction; Nov 29-Dec 2, 2016, 2016; Launceston, Tasmania, Australia. [CrossRef]
Epstein DA, An P, Fogarty J, Munson SA. A lived informatics model of personal informatics. Presented at: UbiComp '15: The 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 7-11, 2015, 2015; Osaka, Japan. [CrossRef]
Liang Z, Chapa-Martell M. Framing self-quantification for individual-level preventive health care. Presented at: HEALTHINF '15: 8th International Conference on Health Informatics; January 12-15, 2015, 2015;336-343; Lisbon, Portugal. URL: https://www.scitepress.org/Papers/2015/52025/52025.pdf [CrossRef]
Burke J. The world of health analytics. In: Kudyba SP, editor. Healthcare Informatics: Improving Efficiency through Technology, Analytics, and Management. Boca Raton, FL, USA. CRC Press; 2016;161-180.
Banerjee A, Bandyopadhyay T, Acharya P. Data analytics: hyped up aspirations or true potential? Vikalpa. Oct 01, 2013;38(4):1-12. [CrossRef]
de Zambotti M, Cellini N, Goldstone A, Colrain IM, Baker FC. Wearable sleep technology in clinical and research settings. Med Sci Sports Exerc. Jul 2019;51(7):1538-1557. [FREE Full text] [CrossRef] [Medline]
Van de Water AT, Holmes A, Hurley DA. Objective measurements of sleep for non-laboratory settings as alternatives to polysomnography--a systematic review. J Sleep Res. Mar 2011;20(1 Pt 2):183-200. [CrossRef] [Medline]
Choe EK, Consolvo S, Watson NF, Kientz JA. Opportunities for computing technologies to support healthy sleep behaviors. Presented at: CHI '11: the SIGCHI Conference on Human Factors in Computing Systems; May 7-12, 2011, 2011;3053-3062; Vancouver, Canada. URL: https://dl.acm.org/doi/10.1145/1978942.1979395 [CrossRef]
Jayarajah K, Radhakrishnan M, Hoi SC, Misra A. Candy crushing your sleep. Presented at: UbiComp/ISWC '15: 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing and 2015 ACM International Symposium on Wearable Computers; September 7-11, 2015, 2015;753-762; Osaka, Japan. URL: https://dl.acm.org/doi/10.1145/2800835.2804393 [CrossRef]
Gelman A, Hill J. Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge, UK. Cambridge University Press; 2006.
Liang Z, Chapa Martell MA, Nishimura T. Mining hidden correlations between sleep and lifestyle factors from quantified-self data. Presented at: UbiComp '16: 2016 ACM International Joint Conference on Pervasive and Ubiquitous Computing; Sep 12-16, 2016, 2016;547-552; Heidelberg, Germany. URL: https://dl.acm.org/doi/10.1145/2968219.2968319 [CrossRef]
Althoff T, Horvitz EJ, White RW, Zeitzer J. Harnessing the web for population-scale physiological sensing: a case study of sleep and performance. In: Proceedings of the 26th International Conference on World Wide Web. Presented at: WWW '17; Apr 3-7, 2017, 2017;113-122; Perth, Australia. URL: https://dl.acm.org/doi/10.1145/3038912.3052637 [CrossRef]
Faust L, Feldman K, Mattingly SM, Hachen D, Chawla NV. Deviations from normal bedtimes are associated with short-term increases in resting heart rate. NPJ Digit Med. Mar 23, 2020;3:39. [FREE Full text] [CrossRef] [Medline]
Guo T, Li L, Zhang D, Xia F. Surf or sleep?: understanding the influence of bedtime patterns on campus. SIGKDD Explor Newsl. Dec 30, 2021;23(2):3-12. [FREE Full text] [CrossRef]
Liang Z, Chapa-Martell MA, Nishimura T. A personalized approach for detecting unusual sleep from time series sleep-tracking data. In: Proceedings of the 2016 IEEE International Conference on Healthcare Informatics. Presented at: ICHI '16; Oct 4-7, 2016, 2016;18-23; Chicago, IL, USA. URL: https://ieeexplore.ieee.org/document/7776322 [CrossRef]
Feng T, Narayanan SS. Discovering optimal variable-length time series motifs in large-scale wearable recordings of human bio-behavioral signals. Presented at: ICASSP '19: 2019 IEEE International Conference on Acoustics, Speech and Signal Processing; May 12-17, 2019, 2019;7615-7619; Brighton, UK. URL: https://ieeexplore.ieee.org/abstract/document/8682427 [CrossRef]
Ohayon M, Wickwire EM, Hirshkowitz M, Albert SM, Avidan A, Daly FJ, et al. National Sleep Foundation's sleep quality recommendations: first report. Sleep Health. Feb 2017;3(1):6-19. [CrossRef] [Medline]
Liu X, Tamminen S, Korhonen T, Röning J. How physical exercise level affects sleep quality? Analyzing big data collected from wearables. Procedia Comput Sci. 2019;155:242-249. [FREE Full text] [CrossRef]
Buysse DJ. Sleep health: can we define it? Does it matter? Sleep. Jan 01, 2014;37(1):9-17. [FREE Full text] [CrossRef] [Medline]
Upadhyay D, Pandey V, Nag N, Jain R. Personalized user modelling for sleep insight. Presented at: HuMA'20: 1st International Workshop on Human-centric Multimedia Analysis; Oct 12, 2020, 2020;13-20; Seattle, WA, USA. URL: https://dl.acm.org/doi/10.1145/3422852.3423478 [CrossRef]
Farajtabar M, Kıcıman E, Nathan G, White RW. Modeling behaviors and lifestyle with online and social data for predicting and analyzing sleep and exercise quality. Int J Data Sci Anal. 2019;8(4):367-383. [FREE Full text] [CrossRef]
Kang WS, Choi RS, Son CS. Explainable sleep quality evaluation model using machine learning approach. Presented at: MFI '17: 2017 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems; Nov 16-18, 2017, 2017;542-546; Daegu, South Korea. URL: https://ieeexplore.ieee.org/document/8170377 [CrossRef]
Nethealth Data. University of Notre Dame. URL: http://sites.nd.edu/nethealth/data-2/ [accessed 2023-06-19]
Jalali L. Interactive Event-driven Knowledge Discovery from Data Streams. Irvine, CA, USA. University of California; 2016.
Wright AP, Wright AT, McCoy AB, Sittig DF. The use of sequential pattern mining to predict next prescribed medications. J Biomed Inform. Feb 2015;53:73-80. [FREE Full text] [CrossRef] [Medline]
Bloom BS. Bloom's Human Characteristics and School Learning. New York, NY, USA. McGraw-Hill; 1976.
Fokkema M, Edbrooke-Childs J, Wolpert M. Generalized linear mixed-model (GLMM) trees: a flexible decision-tree method for multilevel and longitudinal data. Psychother Res. Mar 2021;31(3):313-325. [CrossRef] [Medline]

‎

PRISMA: Preferred Reporting Items for Systematic Reviews and Meta-Analyses

RQ: research question

SE: sleep efficiency

SOL: sleep onset latency

Edited by L Buis; submitted 16.09.22; peer-reviewed by SM Ayyoubzadeh, M Ribeiro; comments to author 20.12.22; revised version received 03.02.23; accepted 05.06.23; published 28.06.23.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR mHealth and uHealth, is properly cited. The complete bibliographic information, a link to the original publication on https://mhealth.jmir.org/, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Knowledge Discovery in Ubiquitous and Personal Sleep Tracking: Scoping Review