Background: The Carrot Rewards app was developed as part of an innovative public-private partnership to reward Canadians with loyalty points, exchangeable for retail goods, travel rewards, and groceries for engaging in healthy behaviors such as walking.
Objective: This study examined whether a multicomponent intervention including goal setting, graded tasks, biofeedback, and very small incentives tied to daily step goal achievement (assessed by built-in smartphone accelerometers) could increase physical activity in two Canadian provinces, British Columbia (BC) and Newfoundland and Labrador (NL).
Methods: This 12-week, quasi-experimental (single group pre-post) study included 78,882 participants; 44.39% (35,014/78,882) enrolled in the Carrot Rewards “Steps” walking program during the recruitment period (June 13–July 10, 2016). During the 2-week baseline (or “run-in”) period, we calculated participants’ mean steps per day. Thereafter, participants earned incentives in the form of loyalty points (worth Can $0.04 ) every day they reached their personalized daily step goal (ie, baseline mean+1000 steps=first daily step goal level). Participants earned additional points (Can $0.40) for meeting their step goal 10+ nonconsecutive times in a 14-day period (called a “Step Up Challenge”). Participants could earn up to Can $5.00 during the 12-week evaluation period. Upon meeting the 10-day contingency, participants could increase their daily goal by 500 steps, aiming to gradually increase the daily step number by 3000. Only participants with ≥5 valid days (days with step counts: 1000-40,000) during the baseline period were included in the analysis (n=32,229).The primary study outcome was mean steps per day (by week), analyzed using linear mixed-effects models.
Results: The mean age of 32,229 participants with valid baseline data was 33.7 (SD 11.6) years; 66.11% (21,306/32,229) were female. The mean daily step count at baseline was 6511.22. Over half of users (16,336/32,229, 50.69%) were categorized as “physically inactive,” accumulating <5000 daily steps at baseline. Results from mixed-effects models revealed statistically significant increases in mean daily step counts when comparing baseline with each study week (P<.001). Compared with baseline, participants walked 115.70 more steps (95% CI 74.59 to 156.81; P<.001) at study week 12. BC and NL users classified as “high engagers” (app engagement above sample median; 15,511/32,229, 48.13%) walked 738.70 (95% CI 673.81 to 803.54; P<.001) and 346.00 (95% CI 239.26 to 452.74; P<.001) more steps, respectively. Physically inactive, high engagers (7022/32,229, 21.08%) averaged an increase of 1224.66 steps per day (95% CI 1160.69 to 1288.63; P<.001). Effect sizes were modest.
Conclusions: Providing very small but immediate rewards for personalized daily step goal achievement as part of a multicomponent intervention increased daily step counts on a population scale, especially for physically inactive individuals and individuals who engaged more with the walking program. Positive effects in both BC and NL provide evidence of replicability.
The health benefits of regular physical activity are unquestionable. Regular moderate-intensity physical activity, brisk walking, for example, reduces the risk of several noncommunicable diseases, such as type 2 diabetes [, ]. Regular physical activity has also been shown to improve cognition [ ], prevent and manage depression [ ], and prevent or delay the onset of dementia [ ]. Furthermore, a recent analysis of objectively measured physical activity (n=5562 American adults) determined that participation in moderate-intensity physical activity was associated with substantial reduction in mortality risk [ ]. For women, even modest participation in low-intensity physical activity, for example, slower walking without “huffing and puffing,” was linked with lower mortality risk [ ]. Unfortunately, physical inactivity remains a global pandemic [ , ]. Conservative estimates suggest that this pandemic cost the global economy US $53.8 billion in direct health care expenses in 2013 [ ]. In Canada, as in most higher-income countries, the public sector bears the largest proportion of health care expenditures attributable to physical inactivity [ ].
Behavioral economics, a branch of economics complimented by insights from psychology , has stimulated interest in using financial health incentives to promote physical activity [ ]. Financial health incentives are defined as rewards with monetary value contingent on achievement of prespecified health behaviors or outcomes [ ], such as rewarding people to walk more [ ] or to lose weight [ ]. One way timely financial incentives might work, according to behavioral economics, is by leveraging people’s predictable tendency to act in favor of their immediate self-interest, a principal referred to as “present bias” [ ]. In the case of physical activity, the likelihood that someone will be more physically active should increase if a financial incentive is at stake—and the more immediate the incentive, the stronger the nudge, according to this theoretical perspective [ ].
Evidence supporting the use of financial health incentives is growing, with 2 systematic reviews [, ] and 1 meta-analysis [ ] finding that incentives generally increase physical activity in the short-term (≤3 months) and while they are still in place (ie, before they are withdrawn). However, evidence regarding sustained physical activity increases (ie, after incentives are removed) is more mixed, with some randomized controlled trials (RCTs) reporting postintervention benefits [ - ] and others not [ - ]. Finkelstein et al (2016) conducted the largest (N=800) of these trials and found that physical activity was higher among incentive group participants at 6 months, but this effect was not sustained 6 months after incentive removal [ ]. The authors suggest that study design (eg, intervention duration), sample characteristics (eg, baseline physical activity), and incentive features (eg, generic, not tailored, physical activity goals) may have moderated postintervention responses. Discrepant findings and a still limited number of studies suggest that more research is needed to elucidate conditions under which incentives are more likely to drive postintervention changes.
In some cases, however, offering incentives for longer periods may be suitable, as Finkelsetin et al (2016) suggest—until a time when physical activity motives are internalized (“I walk because it makes me feel good”) or until clinically meaningful health outcomes are achieved . While acknowledging that more research is needed [ ], the 3 RCTs that have tested physical activity incentives for ≥6 months have reported significant, positive effects [ , , ]. However, the cost of longer term incentive programs may be prohibitive, especially if offered on a population scale. Therefore, at the same time research continues to examine conditions under which incentives drive sustained, long-term changes, efforts to increase efficiency, and thus scalability, of incentive interventions are also needed. The incentive magnitude typically used to promote physical activity in RCT settings (ie, US $1-US $2 per day) [ , , , , , ] may be simply too high for third-party payers and real-world implementation.
To reduce the cost of incentives and realistically operate within fixed government or insurer budgets, several incentive program features or reinforcement properties can be manipulated (eg, size, immediacy, probability, timing, type of incentive) [, , , ]. For example, by shortening the time between behavior and reward so that rewards are delivered immediately after desired responses, the reward size needed to stimulate physical activity may decrease [ ]. Smartphone technology presents an opportunity to provide incentives immediately upon physical activity goal completion (eg, steps per day). Built-in smartphone accelerometers now make it easier to track physical activity (ie, since the Apple Inc. iOS Health Kit app launched in 2014) [ ]; furthermore, previously unavailable moment-by-moment physical activity data can now be used to set and personalize physical activity goals and provide immediate feedback in the form of rewards (eg, rewards automatically transmitted to Web-based accounts). Also, loyalty points (ie, points given by retailers to promote customer loyalty) have emerged as a promising new incentive type (vs cash, vouchers, or charity donations) [ - ]. Research shows that consumers tend to overvalue the points they collect (eg, although US $1 cash may have stimulated physical activity in the past, US $0.50 in loyalty points may produce the same effect) [ ], possibly lowering the reward size needed to stimulate physical activity. These intervention features (using smartphones to track and reward physical activity with loyalty points) may appeal to governments and insurers looking to deploy financial health incentives more efficiently.
In Canada, such features are now available via the Carrot Rewards app, a new mHealth initiative that rewards Canadians with loyalty points (eg, retail goods, travel, groceries) to engage in healthy behaviors (eg, visiting flu shot clinic, walking) [, , ]. This study’s purpose was to examine whether the Carrot Rewards “Steps” walking program, which utilizes very small incentives (Can $ 0.04 in loyalty points) tied to daily step goal achievements could stimulate physical activity in two Canadian provinces.
Carrot Insights Inc. is a private company that developed the free Carrot Rewards app with support from the Public Health Agency of Canada. The British Columbia (BC) Ministry of Health was the company’s founding provincial Ministry partner. Newfoundland and Labrador (NL) was the second Canadian province to offer the app to its residents. Carrot Rewards was made available for BC and NL residents on the Apple iTunes and Google Play app stores on March 3 and June 13, 2016, respectively, in both English and French (Canada’s official languages). Upon downloading the app, the users were asked to enter their age, gender, postal code, and loyalty program card number to complete registration (users without loyalty cards were directed to an easy sign-up page). To register successfully, users must have entered a valid BC or NL postal code and have been ≥13 years (age cutoff of participating loyalty programs). The walking program was not initially available in BC, but was introduced the day the app launched in NL. Carrot Insights Inc. partnered with 4 major Canadian loyalty programs to offer a variety of popular incentives (ie, points could be redeemed for groceries, travel, movies, or gas). While BC users could earn points via any of the 4 participating loyalty programs, NL users could earn points only for the 2 loyalty programs with a regional presence (ie, movies and travel). In addition to the 4 participating loyalty programs, Carrot Insights Inc. also partnered with 4 Canadian health charities (ie, Heart and Stroke Foundation of Canada, Diabetes Canada, Young Men’s Christian Association Canada, and the BC Healthy Living Alliance), primarily for the purpose of reviewing and approving health education content offered in the app. The Behavioural Research Ethics Board of the University of British Columbia approved this study (UBC BREB Number H17-02814).
The marketing assets of the 4 loyalty programs and 1 charity partner were leveraged so that in the first few weeks, partners could heavily promote the app in both provinces (ie, in BC, partners sent 1.64 million emails to their loyalty members; in NL, the number of emails is unknown). The users were not automatically enrolled in the walking program, but were rather asked to opt-in. Study recruitment was open for approximately 1 month from June 13 to July 10, 2016. To participate, users had to agree to allow the app to access step data tracked and stored in their smartphones and were rewarded Can $0.60 in loyalty points for doing so.
Study Participants and Design
Registered users from BC (n=65,414) and NL (n=13,468) were eligible to participate in the walking program. However, only those with iPhone version 5S or higher could participate (ie, the Health Kit app, step data aggregator, is supported and preinstalled on these devices). Android smartphone users could also participate, but they were required to download the Health Kit equivalent (ie, Google Fit app) first. Only those who enabled the walking program on their smartphones (ie, allowed the app to access their data) received the intervention. From June 13 to July 10, 2016, 78,882 users from two Canadian provinces (BC and NL) were eligible to participate in the walking program, and 44.39% (35,014/78,882) ultimately activated it on their smartphones during the recruitment period. To examine the effect of this multicomponent intervention on objectively measured daily step counts, a 12-week quasi-experimental (single group pre-post) study design was employed. Testing the walking program simultaneously in 2 provinces provided a direct replication condition.
This intervention was theoretically based on principles from behavioral economics and self-determination theory. While behavioral economics describes how incentives exploit “present bias” to stimulate behaviors , self-determination theory focuses on the extent to which behaviors are controlled by external agents (eg, physicians) or contingencies (eg, incentives) and can be sustained [ ]. A more thorough review of how these theories complement each other in a financial health incentive context is presented elsewhere [ ]. Briefly, timely in-app notifications (“Congrats! You have achieved your 6600 daily step goal!”), very small incentives (not to be overly controlling and to protect autonomy), and a personalized approach to goal setting (realistic daily step goals, so users experience success early) were deployed to maintain fidelity to both behavioral economics and self-determination theory. As well, a range of behavior change techniques [ ] are embedded in the app, including goal setting, self-monitoring, and biofeedback (ie, feedback using an external monitoring device), and graded tasks (ie, set at “easy” and then their difficulty increased).
For a personalized walking goal to be generated (ie, steps per day), users must have accumulated at least 5 valid days during the initial 14-day baseline or “run-in” period. A valid day was defined as any day with step counts from 1000 to 40,000, as these numbers were considered reasonable, not outliers . Days with step counts <1000 were considered days smartphones were not worn, and days with step counts above 40,000 were deemed suspiciously high (eg, technology bug) and were excluded. For users with at least 5 valid days, a daily step count average was calculated for the baseline period, and 1000 steps were added to set the first daily step goal (rounded to the nearest 100 steps). If users did not have a sufficient number of valid days (ie, ≤4 days) during the baseline period, a generic 5000 daily step goal was provided and they were excluded from analysis. The approximate the number of steps taken daily by the average Canadian adult is 5000, as measured by a popular smartphone-based activity tracking app [ ].
After the 14-day baseline period, users could begin to earn incentives for reaching or exceeding their individualized daily step goals; a progress wheel illustrated progress for the day (seefor walking program screenshots). Incentives for daily achievements were worth Can $0.04 in loyalty points. After 2 weeks of earning daily rewards in the form of points, users could then begin to earn bonus rewards worth Can $0.40 in points for reaching their daily goal ≥10 nonconsecutive times within a 14-day period, called a “Step Up Challenge.” Incentives for longer term (eg, biweekly) physical activity goals, in addition to daily goals only, have worked well in past studies [ ]. Users were automatically enrolled in the first “Step Up Challenge,” but thereafter always had to accept the challenge when it became available. A bar graph to illustrate “Step Up Challenge” progress was also made available upon tapping “Accept” in the app (see ). For users who successfully completed the “Step Up Challenge,” a new higher daily step goal was provided (ie, 500 steps more than the previous goal). For unsuccessful users, the previous goal persisted. Over the 3-month evaluation period, participants could earn a total of Can $5.00 in points (Can $0.60 for activating the walking program, Can $2.80 for daily step goal achievements, and Can $1.60 for successfully completing 4 “Step Up Challenges”).
The primary outcome variable was mean daily step counts as measured by either built-in smartphone accelerometers, for example, iPhone 5S or higher for 53.63% (42,304/78,882) of users, Android devices for 37.48% (29,565/78,882) of users, or any Fitbit device for 7.18% (5664/78,882) of users. Recent validation studies found that the iPhone step counting feature (version 6 or newer), as well as those for Android smartphones (eg, HTC, Motorola) and Fitbit trackers (eg, hip-worn Zip, wrist-worn Flex) were accurate in laboratory and field conditions [- ]. However, Duncan et al (2018) did determine that steps were underestimated by the iPhone step counting feature in their free-living condition by approximately 1340 steps per day [ ]. According to the study authors, this likely reflects not carrying the iPhone continually throughout the day rather than inaccuracy in the step counting feature; they suggest that if adherence can be optimized, smartphones may be suitable for physical activity evaluations.
The majority of demographic variables used to describe the study sample were self-reported (eg, age, gender, province). Median personal income was inferred by linking user postal codes with census data (ie, 2011 National Household Survey) at the local health area level (89) in BC and regional health authority level (4) in NL.
Three different analytical approaches were used to account for missing data and to test the sensitivity of our assumptions with the analytical sample: (1) The “any” data approach included participants with valid baseline data (≥5 days in acceptable range during the 14-day baseline period) and at least 1 other valid week (ie, at least 4 valid days in a 7-day week) from study week 1 to 12 (32,229/35,019, 92.03% of those enabling the walking program met these criteria); (2) the “completer” approach included just participants with valid data at baseline and study week 12 (19,964/32,229, 61.94%); and (3) the “imputed” approach included participants with valid baseline data, but no valid data at study week 12 (29,261/32,229, 90.79%). Then, we imputed participants’ “Pseudo study week 12” by carrying forward their baseline values. Therefore, among those included in the analysis (n=32,229), 61.94% (19,964/32,229) had complete datasets (completers). No differences were observed in demographic characteristics between completers and noncompleters (see). Since the 3 different analytic approaches yielded very similar results, given the public health nature of the intervention and that completers did not differ from noncompleters on key demographic characteristics, analyses using the “any” data approach are presented.
Statistical analysis was performed using R 126.96.36.199 Mavericks build (7202) Rstudio Version 1.0.136 (RStudio, Boston, MA, USA). Study week was treated as a categorical variable (baseline=0, study week 1=1, ..., study week 12=12) to allow for the nonlinear trajectory of daily step counts. Also, the estimate for each study week helped refine the program to maintain user engagement. Mixed-effects models were performed to examine whether there were significant changes in mean daily step counts between baseline and study week 12. We fitted a simple linear mixed-effects model that included study week as the independent variable (baseline data were used as the reference), followed by an adjusted model with random intercepts to account for measurements nesting within individuals and by controlling for age, gender, median personal income, and province as covariates. Analyses were performed on the entire sample, and participants were stratified by physical activity status as defined by Tudor-Locke et al  (ie, physically inactive: baseline mean steps per day<5000; physically active: baseline mean steps per day≥5000) and by province (ie, BC and NL).
As suggested by previous studies , we examined whether participants’ engagement levels had a moderating effect on intervention outcome. Two additional variables, engagement and study week × engagement, were tested in all models. Engagement was a variable dichotomizing all participants into 2 categories, “high” or “low” engagers, based on the median percentage of days when a “Step Up Challenge” was accepted. The interaction term allows the difference between high and low engagers to differ at baseline and study week 12, while controlling for their baseline values and other covariates. Cohen f2 for local effect sizes of mean daily step counts within mixed-effects models were calculated, with f2≥0.02, f2≥0.15, and f2≥0.35 representing small, medium, and large effect sizes, respectively [ ]. Least-square means along with P values were obtained from mixed-effects models for comparing mean daily step counts between subgroups. All data were expressed in least-square means with 95% CIs. Statistical significance levels were set at P<.05.
|Canadian population |
|Age in years, mean (SD)||33.8 (11.4)||33.5 (11.9)||40.6 (median)|
|Gender (% female)||66.1||66.1||50.4|
|Province (% British Columbia)||72.1||70.3||13.2|
|Median personal income (Can $1000/year), mean (SD)||29.7 (4.1)||29.6 (4.0)||33.9|
|Steps per day, baseline mean (SD)||6665.6 (4220.7)||6157.5 (4388.9)||N/Ac|
|Engagementd (% high)||59.4||19.8||N/A|
aParticipants with valid data at baseline and study week 12.
bParticipants with valid data at baseline, but not at study week 12.
cN/A: not applicable.
dA variable dichotomizing participants into 2 categories, “high” or “low” engagers, based on the median percentage of days when a “Step Up Challenge” was accepted.
The mean age of the 32,229 participants with valid baseline data was 33.7 (SD 11.6) years; 66.11% (21,306/32,229) were female (). Participants from BC made up 71.41% (23,016/32,229) of the study sample owing to the province’s larger population and to the app launching 3 months prior to its launch in NL. The mean personal median income was Can $29,650, slightly lower than that of 2014 BC and NL means of Can $31,610 and Can $30,450, respectively [ ]. The mean daily step count at baseline was 6511.22 steps per day. Just over half of users 50.69% (16,336/32,229) were categorized as “physically inactive,” having accumulated <5000 daily steps at baseline. Assuming age, income, and province were held constant, male participants walked 2297.50 steps more steps per day at baseline compared with females (P<.001), and participants from NL walked 992.95 fewer steps per day than those from BC (P<.001).
The trends of daily step counts for the total group and the physically inactive subgroup over the 12-week intervention period are illustrated in. The difference between baseline and the 12-week evaluation period average for the total group (5.01%) and physically inactive participants (21.14%) are also illustrated. Error bars show 95% CIs. For the total, some behavioral decay was observed in later weeks as the weekly steps per day average dropped below the 12-week intervention mean (6864.77 steps) in study weeks 9 (6772.68 steps) through 12 (6626.92 steps). The average increase in daily step counts over the 12-week intervention period was 353.56 steps, which represents a 5.01% difference from baseline. Among physically inactive users, an average increase of 861.12 steps per day was observed, representing a 21.14% difference from baseline. There was no evidence of behavioral decay in this subgroup as weekly steps per day persisted at or above the intervention mean (4621.76 steps) in study weeks 9 (4622.22 steps) to 12 (4634.83 steps).
|Analysis||Baseline least-square meansa (95% CIs)||Week 12 least-square meansa (95% CIs)||Differences (Week 12 – baseline) least-square meansa (95% CIs)||Cohen f2b|
|Total sample analysis||6511.22 (6242.24 to 6780.19)||6626.92 (6357.34 to 6896.50)||115.70 (74.59 to 156.81)||0.0059|
|Physical activity status|
|Physically inactive||3760.64 (3543.31 to 3977.96)||4634.83 (4416.56 to 4853.09)||874.19 (827.98 to 920.40)||0.0234|
|Physically active||8778.01 (8392.20 to 9163.81)||8297.19 (7910.38 to 8684.00)||−480.82 (−545.17 to −416.46)||0.0073|
|British Columbia||7064.82 (6796.12 to 7333.52)||7282.83 (7013.40 to 7552.26)||218.01 (169.56 to 266.46)||0.0061|
|Newfoundland and Labrador||6071.87 (5790.32 to 6353.43)||5938.22 (5654.07 to 6222.37)||−133.66c (−155.98 to −3.37)||0.0087|
|Low engager||6229.27 (5958.75 to 6497.80)||5738.52 (5466.67 to 6010.37)||−490.75 (−551.21 to −428.29||0.0073|
|High engager||6780.37 (6509.93 to 7050.81)||7411.27 (7140.55 to 7681.99)||630.90 (575.43 to 686.36)||N/Ad|
|Physically inactive, low engager||3650.63 (3432.54 to 3868.72)||4132.55 (3 911.07 to 4354.04)||481.92 (414.62 to 549.22)||0.0055|
|Physically inactive, high engager||3838.79 (3628.93 to 4068.65)||5073.45 (4853.31 to 5293.59)||1224.66 (1160.69 to 1288.63)||N/A|
|Physically active, low engager||8588.00 (8200.39 to 8975.61)||7258.26 (6866.58 to 7649.93)||−1329.74 (−1427.93 to −1231.56)||0.0096|
|Physically active, high engager||8928.63 (8540.95 to 9316.31)||9131.09 (8742.81 to 9519.36)||202.26 (117.20 to 287.72)||N/A|
|British Columbia, low engager||6771.26 (6500.00 to 7042.52)||6353.11 (6078.918 to 6627.31)||−418.15 (−491.12 to −345.17)||0.0071|
|British Columbia, high engager||7316.68 (7044.75 to 7588.61)||8055.38 (7783.15 to 8327.61)||738.70 (673.81 to 803.54)||N/A|
|Newfoundland and Labrador, low engager||5769.62 (5480.24 to 6059.00)||5120.22 (4822.23 to 5418.21)||−649.40 (−763.50 to −535.30)||0.0074|
|Newfoundland and Labrador, high engager||6369.92 (6072.63 to 6667.21)||6715.92 (6417.20 to 7014.64)||346.00 (239.26 to 452.74)||N/A|
aLeast-square means adjusted for age, median personal income, gender, and province.
bCohen f2≥0.02, ≥0.15, and ≥0.35 representing small, medium, and large effect sizes, respectively. For the engagement subgroup analysis only, Cohen f2 was calculated for the pre-post difference in steps between the low and high engagement groups (high engagement as the referent group).
cThe difference between baseline and week 12 were statistically significant at P<.001 for total sample and all subgroup analyses, except for Province Newfoundland and Labrador (P<.001).
dN/A: not applicable.
Total Sample Analysis
The results from mixed-effects models revealed statistically significant increases in mean daily step counts when comparing baseline with each study week (P<.001). Changes in mean daily step count from baseline to study week 12 expressed in least-square means are presented in. Overall, compared with baseline, participants walked 115.70 more steps (95% CI 74.59 to 156.81; P<.001) at study week 12. The Cohen f2 value was 0.0059 (P<.001), indicating the effect was modest. Adjusting for demographic variables (ie, age, gender, province, and median personal income) had little effect on the estimated difference between study week 12 and baseline.
The intervention effect was more pronounced in physically inactive users than in physically active users. As with the total sample analysis, the mean daily steps were significantly higher for physically inactive users at each study week than at baseline (P<.001), with an observed increase of 874.19 steps per day at study week 12 (; 95% CI 827.98 to 920.40, P<.001). Cohen f2 statistic indicated that the effect was small (0.0234, P<.001). At study week 12, compared with baseline, a highly significant decrease of 480.82 steps per day was observed among physically active participants ( ; 95% CI −545.17 to −416.46, P<.001, Cohen f2=0.0073, P<.001). Participants from NL did not respond as well as participants from BC. At study week 12, compared with baseline, a highly significant increase of 218.01 was observed in BC ( ; 95% CI 169.56 to 266.46, P<.001, Cohen f2=0.0061, P<.001), while a highly significant decrease of 133.66 steps per day was observed in NL ( ; 95% CI −155.98 to −3.37, P<.001, Cohen f2=0.0087, P<.001).
Participant engagement showed a significant moderating effect on the intervention outcome in all models (P<.001). Therefore, we also conducted subgroup analysis by participants’ engagement levels. As shown in, all subgroups except physically active low engagers showed significant increase in step counts from baseline to study week 12. The difference from baseline to study week 12 for high (15,511/32,229; 48.13%) and low engagers (16,718/32,229; 51.87%) was +630.90 and −489.75 steps per day, respectively (P<.001). As well, users classified as high engagers in BC and NL walked 738.70 and 346.00 more steps per day, respectively (P<.001). Among users classified as high engagers and physically inactive (7,022/32,229; 21.08%), an average increase of 1224.66 steps per day was observed (P<.001).
In this large quasi-experimental study examining the impact of a multicomponent intervention on objectively measured daily step count, a small but significant effect overall was observed (5% average daily step count increase over 12 weeks vs baseline) with a more pronounced effect (21% increase) among physically inactive users (representing over half of the total sample). Notably, this effect was evident irrespective of age, gender, or median personal income. While the overall effect was small (ie, 116 steps per day), these results underscore the potential public health impact of using modest incentives (Can $ 0.04 per day) to stimulate physical activity, particularly among higher risk, physically inactive populations. When considering the clinical significance of this study’s results, it is likely that health benefits (eg, better glucose control)  might be reserved for 51% of the analytic sample that increased their daily step counts by 874 steps per day (the physically inactive). Health economic implications of initiatives like this may be important, especially considering that a mere 1% reduction in the number of Canadians classified as physically inactive would yield annual health care savings of Can $2.1 billion [ ]. The combination of immediate rewards in the form of loyalty points tied to smartphone-assessed physical activity outcomes may prove an efficient way of delivering financial health incentives while still producing a measurable effect.
Other reinforcement-based methods of increasing health behaviors have included using deposit contracts (ie, participants wager their own money) , chance-based designs (ie, 1 in 3 chance of earning Can $3 vs just Can $1 per day) [ ] and loss-framing (ie, incentive given up front and then taken away if goal unmet) [ ]. While deposit contract, chance-based and loss-framed designs may be effective, they may also limit enrollment (in the case of deposits) and may be less palatable to governments or insurers looking to deploy such programs (eg, raising concerns about gambling or punishing citizens or employees for not meeting health goals) [ ]. This study provides evidence that even very small incentives, as modest as Can $0.04 per day, can be implemented as part of a multicomponent intervention and on a population scale to increase walking and other ambulatory behaviors effectively.
Behavioral decay (ie, steps per day decline) was noted as time passed, with weekly steps per day averages dropping below the intervention mean in later weeks. While this was observed in the total sample (driven by the 480.82 daily step count reduction among physically active users), step counts persisted throughout the 12-week evaluation period in the physically inactive subgroup. At study week 12, for example, physically inactive participants were walking 874.19 more steps per day on average (vs baseline). This is consistent with incentives for physical activity literature that suggests that physically inactive adults are more sensitive to incentive interventions and more likely to sustain the behavior for longer periods . Similarly, larger intervention effect sizes are observed among insufficiently active individuals in Web-based physical activity interventions [ ]. Why daily step counts decreased among physically active participants remains unclear. Seasonal effects may partly explain the drop (the evaluation period began in warmer spring and summer seasons and ended in the colder fall). Smartphone (ie, accelerometer) wear time may also explain the decrease. Physically active users, being generally less sensitive to physical activity incentives, may have carried their smartphones less and less (and recorded fewer and fewer steps) as the intervention progressed.
Regarding provincial differences, NL users did not respond as well as BC users (−133.66 steps per day vs +218.01 steps per day at study week 12, respectively). This could be due to a number of factors. The most important factor may have to do with the walking program’s availability to all NL participants right away (upon downloading the app), while BC users who were still engaging with the app 3 months after it launched could activate the walking program (self-selection bias). Additionally, these provinces are on opposite Canadian coasts, with distinct climates and chronic disease risk profiles. Regarding climate, in the final 3 weeks of the evaluation period (when the provincial step count disparity was greatest, ending on October 17, 2016), residents of St. John’s, NL, experienced more “cold days” (ie, below our operational 13.0°C threshold) than their Vancouver, BC, counterparts; 43% (10/23) versus 13% (3/23) of days were “cold”; St. John’s and Vancouver are the largest cities in NL and BC, respectively). Regarding chronic disease risk, while BC has the lowest self-reported adult overweight and obesity rate in Canada (48.0%), NL has the highest (67.5%). Notably, while NL users in general experienced a 133.66 step per day decrease (with low engagers experiencing an even greater 649.40 steps per day drop), a 346.00 step per day increase was observed at study week 12 (vs baseline) among high engagers (3846/9209, 37.85% of the provincial sample). App engagement therefore appears to have boosted intervention effectiveness, regardless of province, suggesting potential effect replication in other jurisdictions. This aligns with broader evidence that greater engagement with a physical activity app or website is associated with increased intervention efficacy . Developing innovative strategies to increase and maintain engagement is a priority (eg, machine learning informed push notifications when “Step Up Challenge” was not accepted within 3 days, rewards for just accepting challenges, small team-based challenges).
The results of this population-level study should be interpreted with caution because there are a number of limitations to consider. First, neither the randomization of participants into intervention and control groups was logistically feasible within this quasi-experimental design nor was the identification of a nonequivalent control group (ie, a group not randomly assigned to receive or not receive the intervention) . For this reason, internal validity (ie, the extent to which causality can be established) may be limited. To improve internal validity as much as possible in this real-world setting, we sought to define a time period that reflected the counter-factual (ie, outcome if the intervention had not been implemented) [ ]. To do this, a preintervention time period clearly differentiated from the intervention was introduced. An immediate increase in daily step count compared with baseline was expected, and this is what was observed. This increase, however, may have occurred because participants simply started carrying their smartphones more (the most likely alternative explanation or rival hypothesis) to get credit for the steps they were taking. Disentangling “wear time” from increased actual daily step count is difficult, however, a limitation cited in more carefully controlled RCTs [ ]. Additionally, more smartphone accelerometer validation studies are likely required in free-living conditions and with different demographic groups to increase confidence in results. Analysis-phase strategies were employed to improve internal validity as well, including (a) testing the sensitivity of assumptions made with 3 different analytic samples to handle missing data and (b) fitting an adjusted mixed-effects model to account for measurements nesting within individuals and controlling for key demographics. As well, an increase in steps in high, but not low, engagers provides further support for the main conclusion that this multicomponent intervention, when utilized above a threshold level, appears to have yielded daily step count improvements. That behavioral decay was noted in weeks 9-12 for the total sample, but not for the physically inactive subgroup (the group more likely to respond to an incentive-based intervention with realistic and personalized goals) also suggests that the intervention achieved its intended effect of stimulating physical activity among the least active. While traditional RCTs strongly prioritize internal validity, this quasi-experimental design seeks to achieve greater balance between internal and external validity in real-world conditions to facilitate real-world implementation. A second limitation was that participants were followed for only 12 weeks, so longitudinal work is required to elucidate longer term effects. Third, this analysis addressed only the earliest Carrot Rewards app adopters and includes just Canadian provinces, so results may not be generalizable to newer users or other countries. Next, only 44.39% (35,014/78,882) of eligible users who could enable the walking program and earn additional incentives did so during the 4-week recruitment period. How those who activated the program during the recruitment period compare with those who did not remains unknown. While on a population scale this recruitment rate is impressive, there is room to improve. The less than ideal recruitment rate may be because health app users in general discontinue use within days or weeks of first download [ ] or a too-short recruitment period. Lastly, at what intensity any extra walking may have occurred is unknown. The association between physical activity and key health outcomes (eg, cardiovascular disease risk factor reduction) is stronger with higher intensity physical activities [ ].
To increase internal validity in this quasi-experimental environment, future studies might incorporate interrupted time series, stepped-wedge, intervention removal, or designs with a nonequivalent control group . Future work might also compare different ways of setting and graduating daily step goals (eg, static vs adaptive goal setting) and include longitudinal analyses examining longer term (at least 6 months) impacts, as well as associated cost-effectiveness studies. For example, an adaptive goal setting feature was introduced in the app in February 2017 (after the study period), when step goals began to be recalculated every 2-4 weeks to encourage engagement (as opposed to the “set it and forget it” approach initially adopted). Examining alternative methods to promote sustained physical activity should continue to be a priority for researchers and others in this field (eg, moving from small, regularly scheduled incentives, to large, more irregular, and less predictable ones). To increase the chances of behavior maintenance, exploring opportunities for enhanced engagement that also promote social interaction and support could be a particular focus of future work (eg, encouraging social networking).
Until recently, financial health incentive programs have shown promise, but little potential for scalability given rewards’ cost. This study adds to the understanding of how incentives can be delivered in ways that are not prohibitively costly. Providing immediate rewards for personalized daily step goal achievement as part of a multicomponent intervention appears to have increased daily step counts on a population scale, especially for higher risk, physically inactive individuals. Positive effects in both BC and NL provide evidence of replicability.
The Carrot Rewards initiative has been made possible in part through funding from the Public Health Agency of Canada. The views expressed herein do not necessarily represent the views of the Public Health Agency of Canada, the BC Ministry of Health, or the Ministry of Children, Seniors, and Social Development in NL. The authors also thank staff from the BC Ministry of Heath, the Ministry of Children, Seniors, and Social Development in NL, the BC Healthy Living Alliance, the Heart and Stroke Foundation of Canada, Diabetes Canada, and Young Men’s Christian Association Canada for providing expert health content advice over the course of this project.
Conflicts of Interest
MM reports grant support from the Canadian Institutes of Health Research, the University Health Network, Green Shield Canada Inc, as well as in-kind research support from Cookson James Loyalty Inc Furthermore, he reports consulting income from Carrot Insights Inc and stock options in Carrot Insights Inc. LW is a Carrot Insights Inc employee and also reports stock options in Carrot Insights Inc. GF is supported by a Canadian Institutes of Health Research-Public Health Agency of Canada Chair in Applied Public Health. The authors with no financial relation to Carrot Insights Inc conducted the analyses (GF, EL). The other coauthors report no conflicts of interest.
- Lee I, Shiroma EJ, Lobelo F, Puska P, Blair SN, Katzmarzyk PT, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet 2012 Jul 21;380(9838):219-229 [FREE Full text] [CrossRef] [Medline]
- Warburton DE, Charlesworth S, Ivey A, Nettlefold L, Bredin SS. A systematic review of the evidence for Canada's Physical Activity Guidelines for Adults. Int J Behav Nutr Phys Act 2010 May 11;7:39 [FREE Full text] [CrossRef] [Medline]
- Khan NA, Hillman CH. The relation of childhood physical activity and aerobic fitness to brain function and cognition: a review. Pediatr Exerc Sci 2014 May;26(2):138-146. [CrossRef] [Medline]
- Mammen G, Faulkner G. Physical activity and the prevention of depression: a systematic review of prospective studies. Am J Prev Med 2013 Nov;45(5):649-657. [CrossRef] [Medline]
- Blondell SJ, Hammersley-Mather R, Veerman JL. Does physical activity prevent cognitive decline and dementia?: A systematic review and meta-analysis of longitudinal studies. BMC Public Health 2014;14:510 [FREE Full text] [CrossRef] [Medline]
- Borgundvaag E, Janssen I. Objectively Measured Physical Activity and Mortality Risk Among American Adults. Am J Prev Med 2017 Jan;52(1):e25-e31. [CrossRef] [Medline]
- Kohl HW, Craig CL, Lambert EV, Inoue S, Alkandari JR, Leetongin G, et al. The pandemic of physical inactivity: global action for public health. Lancet 2012 Jul 21;380(9838):294-305. [CrossRef] [Medline]
- Sallis JF, Bull F, Guthold R, Heath GW, Inoue S, Kelly P, et al. Progress in physical activity over the Olympic quadrennium. Lancet 2016 Sep 24;388(10051):1325-1336. [CrossRef] [Medline]
- Ding D, Lawson KD, Kolbe-Alexander TL, Finkelstein EA, Katzmarzyk PT, van MW, et al. The economic burden of physical inactivity: a global analysis of major non-communicable diseases. Lancet 2016 Sep 24;388(10051):1311-1324. [CrossRef] [Medline]
- Camerer C, Loewenstein G. Behavioral Economics: Past, Present, Future. Advances in Behavioral Economics. Princeton: Princeton University Press; 2003.
- Loewenstein G, Asch DA, Volpp KG. Behavioral economics holds potential to deliver better results for patients, insurers, and employers. Health Aff (Millwood) 2013 Jul;32(7):1244-1250 [FREE Full text] [CrossRef] [Medline]
- Adams J, Giles EL, McColl E, Sniehotta FF. Carrots, sticks and health behaviours: a framework for documenting the complexity of financial incentive interventions to change health behaviours. Health Psychol Rev 2014;8(3):286-295. [CrossRef] [Medline]
- Strohacker K, Galarraga O, Williams DM. The impact of incentives on exercise behavior: a systematic review of randomized controlled trials. Ann Behav Med 2014 Aug;48(1):92-99 [FREE Full text] [CrossRef] [Medline]
- Burns RJ, Donovan AS, Ackermann RT, Finch EA, Rothman AJ, Jeffery RW. A theoretically grounded systematic review of material incentives for weight loss: implications for interventions. Ann Behav Med 2012 Dec;44(3):375-388. [CrossRef] [Medline]
- Adams MA, Hurley JC, Todd M, Bhuiyan N, Jarrett CL, Tucker WJ, et al. Erratum to: Adaptive goal setting and financial incentives: a 2 × 2 factorial randomized controlled trial to increase adults' physical activity. BMC Public Health 2017 Dec 06;17(1):303 [FREE Full text] [CrossRef] [Medline]
- Barte JCM, Wendel-Vos GCW. A Systematic Review of Financial Incentives for Physical Activity: The Effects on Physical Activity and Related Outcomes. Behav Med 2017;43(2):79-90. [CrossRef] [Medline]
- Mitchell MS, Goodman JM, Alter DA, John LK, Oh PI, Pakosh MT, et al. Financial incentives for exercise adherence in adults: systematic review and meta-analysis. Am J Prev Med 2013 Nov;45(5):658-667. [CrossRef] [Medline]
- Charness G. Incentives to exercise. Econometrica 2009;77(3):909-931.
- Kullgren JT, Harkins KA, Bellamy SL, Gonzales A, Tao Y, Zhu J, et al. A mixed-methods randomized controlled trial of financial incentives and peer networks to promote walking among older adults. Health Educ Behav 2014 Oct;41(1 Suppl):43S-50S [FREE Full text] [CrossRef] [Medline]
- Patel MS, Asch DA, Rosin R, Small DS, Bellamy SL, Eberbach K, et al. Individual Versus Team-Based Financial Incentives to Increase Physical Activity: A Randomized, Controlled Trial. J Gen Intern Med 2016 Mar 14. [CrossRef] [Medline]
- Andrade LF, Barry D, Litt MD, Petry NM. Maintaining high activity levels in sedentary adults with a reinforcement-thinning schedule. J Appl Behav Anal 2014;47(3):523-536 [FREE Full text] [CrossRef] [Medline]
- Pope L, Harvey J. The efficacy of incentives to motivate continued fitness-center attendance in college first-year students: a randomized controlled trial. J Am Coll Health 2014;62(2):81-90. [CrossRef] [Medline]
- Patel MS, Asch DA, Volpp KG. Framing Financial Incentives to Increase Physical Activity Among Overweight and Obese Adults. Ann Intern Med 2016 Dec 18;165(8):600. [CrossRef] [Medline]
- Finkelstein EA, Haaland BA, Bilger M, Sahasranaman A, Sloan RA, Nang EEK, et al. Effectiveness of activity trackers with and without incentives to increase physical activity (TRIPPA): a randomised controlled trial. Lancet Diabetes Endocrinol 2016 Dec;4(12):983-995. [CrossRef] [Medline]
- Mantzari E, Vogt F, Shemilt I, Wei Y, Higgins JPT, Marteau TM. Personal financial incentives for changing habitual health-related behaviors: A systematic review and meta-analysis. Prev Med 2015 Jun;75:75-85 [FREE Full text] [CrossRef] [Medline]
- Jeffery RW, French SA. Preventing weight gain in adults: the pound of prevention study. Am J Public Health 1999 May;89(5):747-751. [Medline]
- Shin DW, Yun JM, Shin J, Kwon H, Min HY, Joh H, et al. Enhancing physical activity and reducing obesity through smartcare and financial incentives: A pilot randomized trial. Obesity (Silver Spring) 2017 Feb;25(2):302-310. [CrossRef] [Medline]
- Kranker K. The Efficacy of Using Financial Incentives to Change Unhealthy Behaviors Among a Rural Chronically Ill and Uninsured Population. Am J Health Promot 2018 Feb;32(2):301-311. [CrossRef] [Medline]
- Mitchell MS, Goodman JM, Alter DA, Oh PI, Faulkner GEJ. Development of the Health Incentive Program Questionnaire (HIP-Q) in a cardiac rehabilitation population. Transl Behav Med 2015 Dec;5(4):443-459 [FREE Full text] [CrossRef] [Medline]
- Haff N, Patel MS, Lim R, Zhu J, Troxel AB, Asch DA, et al. The role of behavioral economic incentive design and demographic characteristics in financial incentive-based approaches to changing health behaviors: a meta-analysis. Am J Health Promot 2015;29(5):314-323. [CrossRef] [Medline]
- Apple. www.apple.com. 2014. Apple Unveils iOS 8, the Biggest Release Since the Launch of the App Store URL: https://www.apple.com/ca/newsroom/2014/06/02Apple-Unveils-iOS-8-the-Biggest-Release-Since-the-Launch-of-the-App-Store/ [accessed 2018-08-29] [WebCite Cache]
- Liu S, Hodgson C, Zbib AM, Payne AYM, Nolan RP. The effectiveness of loyalty rewards to promote the use of an Internet-based heart health program. J Med Internet Res 2014 Jul 02;16(7):e163 [FREE Full text] [CrossRef] [Medline]
- Goyal S, Morita PP, Picton P, Seto E, Zbib A, Cafazzo JA. Uptake of a Consumer-Focused mHealth Application for the Assessment and Prevention of Heart Disease: The <30 Days Study. JMIR Mhealth Uhealth 2016;4(1):e32 [FREE Full text] [CrossRef] [Medline]
- Mitchell M, White L, Oh P, Alter D, Leahey T, Kwan M, et al. Uptake of an Incentive-Based mHealth App: Process Evaluation of the Carrot Rewards App. JMIR Mhealth Uhealth 2017 May 30;5(5):e70 [FREE Full text] [CrossRef] [Medline]
- Van Osselaer SM, Alba J, Manchanda P. Irrelevant information and mediated intertemporal choice. Journal of Consumer Psychology 2004;14(3):257-270.
- Government of Canada. www.canada.ca. 2015. National Healthy Living Platform: Carrot Rewards targets lifestyle improvements URL: https://www.canada.ca/en/news/archive/2015/07/national-healthy-living-platform-carrot-rewards-targets-lifestyle-improvements.html [accessed 2018-08-29] [WebCite Cache]
- Dale LP, White L, Mitchell M, Faulkner G. Smartphone app uses loyalty point incentives and push notifications to encourage influenza vaccine uptake. Vaccine 2018 Apr 23. [CrossRef] [Medline]
- Deci E, Ryan R. Handbook of Self-Determination Research. Rochester, NY: University of Rochester Press; 2002.
- Mitchell M, Faulkner G. A Nudge at All? The Jury Is Still Out on Financial Health Incentives. Healthcare Papers 2013;12(4).
- Michie S, Richardson M, Johnston M, Abraham C, Francis J, Hardeman W, et al. The behavior change technique taxonomy (v1) of 93 hierarchically clustered techniques: building an international consensus for the reporting of behavior change interventions. Ann Behav Med 2013 Aug;46(1):81-95. [CrossRef] [Medline]
- Bassett DR, Wyatt HR, Thompson H, Peters JC, Hill JO. Pedometer-measured physical activity and health behaviors in U.S. adults. Med Sci Sports Exerc 2010 Oct;42(10):1819-1825 [FREE Full text] [CrossRef] [Medline]
- Althoff T, Sosič R, Hicks JL, King AC, Delp SL, Leskovec J. Large-scale physical activity data reveal worldwide activity inequality. Nature 2017 Dec 20;547(7663):336-339 [FREE Full text] [CrossRef] [Medline]
- Duncan MJ, Wunderlich K, Zhao Y, Faulkner G. Walk this way: validity evidence of iphone health application step count in laboratory and free-living conditions. J Sports Sci 2017 Nov 28:1-10. [CrossRef] [Medline]
- Hekler EB, Buman MP, Grieco L, Rosenberger M, Winter SJ, Haskell W, et al. Validation of Physical Activity Tracking via Android Smartphones Compared to ActiGraph Accelerometer: Laboratory-Based and Free-Living Validation Studies. JMIR Mhealth Uhealth 2015 Apr 15;3(2):e36 [FREE Full text] [CrossRef] [Medline]
- Evenson KR, Goto MM, Furberg RD. Systematic review of the validity and reliability of consumer-wearable activity trackers. Int J Behav Nutr Phys Act 2015;12(1):159 [FREE Full text] [CrossRef] [Medline]
- Tudor-Locke C, Craig CL, Thyfault JP, Spence JC. A step-defined sedentary lifestyle index: <5000 steps/day. Appl Physiol Nutr Metab 2013 Feb;38(2):100-114 [FREE Full text] [CrossRef] [Medline]
- Vandelanotte C, Müller AM, Short CE, Hingle M, Nathan N, Williams SL, et al. Past, Present, and Future of eHealth and mHealth Research to Improve Physical Activity and Dietary Behaviors. J Nutr Educ Behav 2016 Mar;48(3):219-228.e1. [CrossRef] [Medline]
- Selya AS, Rose JS, Dierker LC, Hedeker D, Mermelstein RJ. A Practical Guide to Calculating Cohen's f(2), a Measure of Local Effect Size, from PROC MIXED. Front Psychol 2012;3:111 [FREE Full text] [CrossRef] [Medline]
- Table17-10-0005-01 Population estimates on July 1st, by age and sex and Table 11-10-0008-01 Tax filers and dependants with income by total income, sex and age database on the Internet. 2017 URL: https://www150.statcan.gc.ca/t1/tbl1/en/tv.action?pid=1110000801&pickMembers%5B0%5D=1.35 [accessed 2018-06-29] [WebCite Cache]
- Krueger H, Turner D, Krueger J, Ready AE. The economic benefits of risk factor reduction in Canada: tobacco smoking, excess weight and physical inactivity. Can J Public Health 2014;105(1):e69-e78. [Medline]
- Washington W, McMullen D, Devoto A. A Matched Deposit Contract Intervention to Increase Physical Activity in Underactive and Sedentary Adults. Translational Issues is Psychology Science 2016;2(2).
- Leahey TM, Subak LL, Fava J, Schembri M, Thomas G, Xu X, et al. Benefits of adding small financial incentives or optional group meetings to a web-based statewide obesity initiative. Obesity (Silver Spring) 2015 Jan;23(1):70-76 [FREE Full text] [CrossRef] [Medline]
- Promberger M, Brown RCH, Ashcroft RE, Marteau TM. Acceptability of financial incentives to improve health outcomes in UK and US samples. J Med Ethics 2011 Nov;37(11):682-687 [FREE Full text] [CrossRef] [Medline]
- Davies CA, Spence JC, Vandelanotte C, Caperchione CM, Mummery WK. Meta-analysis of internet-delivered interventions to increase physical activity levels. Int J Behav Nutr Phys Act 2012;9:52 [FREE Full text] [CrossRef] [Medline]
- Schoeppe S, Alley S, Van LW, Bray NA, Williams SL, Duncan MJ, et al. Efficacy of interventions that use apps to improve diet, physical activity and sedentary behaviour: a systematic review. Int J Behav Nutr Phys Act 2016 Dec 07;13(1):127 [FREE Full text] [CrossRef] [Medline]
- Handley MA, Lyles CR, McCulloch C, Cattamanchi A. Selecting and Improving Quasi-Experimental Designs in Effectiveness and Implementation Research. Annu Rev Public Health 2018 Apr 01;39:5-25. [CrossRef] [Medline]
|BC: British Columbia|
|NL: Newfoundland and Labrador|
|RCT: randomized controlled trials|
Edited by G Eysenbach; submitted 25.01.18; peer-reviewed by E Mantzari, K De Cocker; comments to author 08.03.18; revised version received 03.05.18; accepted 29.06.18; published 20.09.18Copyright
©Marc Mitchell, Lauren White, Erica Lau, Tricia Leahey, Marc A Adams, Guy Faulkner. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 20.09.2018.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR mhealth and uhealth, is properly cited. The complete bibliographic information, a link to the original publication on http://mhealth.jmir.org/, as well as this copyright and license information must be included.