# The CALM Index™: A Three-Pillar Psychometric Framework for Behavioral Wellness

**Version:** 1.0 — Submission ready
**Date:** June 2026
**Target journals:** ResearchGate (first), SSRN, Academia.edu
**Target word count:** ~5,500 words
**DOI:** Register via Zenodo before publication

---

## Author

**Abiodun Adesina**
Founder, 3pplea Holdings LLC | Creator, CALM Index™ Psychometric Methodology
Affiliation: 3pplea Holdings LLC / Roveera (roveera.com)
ORCID: [register before submission]

Abiodun Adesina is the founder of 3pplea Holdings LLC and creator of the CALM Index™ psychometric assessment methodology. She is the author of *Ctrl-Alt-CALM: 97 Hacks for Busy People on a Budget*, a behavioral wellness framework that preceded and informed the CALM Index™ design. She is a mental health advocate and speaker, with community distribution through RCCG-affiliated networks in the United Kingdom and Nigeria. Affiliation: 3pplea Holdings LLC / Roveera (roveera.com).

---

## Abstract

The CALM Index™ is a psychometric assessment methodology designed to measure behavioral wellness across three structured dimensions — Recovery, Renewal, and Reach — in non-clinical populations. Developed from the three-component behavioral wellness framework established in *Ctrl-Alt-CALM: 97 Hacks for Busy People on a Budget* (Adesina, 2023), the CALM Index™ integrates six validated psychometric instruments — the Patient Health Questionnaire-9 (PHQ-9), Generalised Anxiety Disorder-7 (GAD-7), Insomnia Severity Index-7 (ISI-7), Perceived Stress Scale-10 (PSS-10), Satisfaction With Life Scale (SWLS), and Burnout Assessment Tool-8 (BAT-8) — into a composite scoring system that produces a band classification (Critical, Depleted, Rebuilding, Optimised) alongside pillar-level breakdowns. The assessment battery comprises 36 base questions with adaptive branches triggered by elevated stress and burnout signals. Composite scoring applies a weighted formula: CALM = (Recovery × 0.40) + (Renewal × 0.30) + (Reach × 0.30), producing a 0–100 wellness score. This paper describes the theoretical basis of the three-pillar framework, the instrument selection and integration methodology, the composite scoring model, and the band classification system. The CALM Index™ addresses a structural gap in the behavioral wellness market: the absence of a structured, multi-instrument assessment methodology accessible to non-clinical populations. Implications for proactive intervention routing and human-AI collaborative wellness systems are discussed.

**Keywords:** behavioral wellness, psychometric assessment, CALM Index, composite scoring, non-clinical assessment, Recovery, Renewal, Reach, digital mental health, intervention routing

---

## 1. Introduction

The behavioral wellness market has made substantial investments in intervention delivery while systematically neglecting the measurement layer that would make interventions systematic rather than arbitrary. Therapy platforms, meditation applications, and AI-assisted wellness tools assume that users know what kind of help they need and in which domain they need it. Without a validated baseline assessment, intervention matching reduces to self-report, preference, and the availability of whatever tool a user happened to encounter. Without longitudinal tracking, behavior change cannot be distinguished from natural variance. Without a composite methodology, no individual wellness dimension can be evaluated in the context of the whole.

This gap is not new. The World Health Organization has long argued that the majority of the global burden attributable to mental health conditions occurs in the subclinical range — among individuals who have not received a diagnosis, are not receiving treatment, and are unlikely to seek formal clinical support (WHO, 2004). This population — working adults experiencing stress, burnout, sleep disruption, anxiety, and attentional impairment at levels that impair function but fall below clinical thresholds — constitutes the core user population for behavioral wellness platforms. It is also the population for whom validated multi-instrument assessment has been least developed.

Existing digital wellness tools have taken two approaches. The first reduces assessment to a single-item daily mood log or a proprietary 3–5 question wellness check — psychometrically lightweight, quick to complete, and unable to differentiate between distinct types of behavioral distress. The second applies clinical-grade assessment instruments individually, without integration, producing scale-specific scores (PHQ-9 total, GAD-7 total) that measure distinct constructs in isolation. Neither approach produces a composite behavioral picture calibrated to the whole person.

The CALM Index™ was developed to fill this gap. Its origin is the Ctrl-Alt-CALM Behavioral Operating Framework — a three-component behavioral wellness structure established in *Ctrl-Alt-CALM: 97 Hacks for Busy People on a Budget* (Adesina, 2023). The framework was tested in community settings including facilitated seminars at RCCG-affiliated parish events in the United Kingdom and Nigeria, reaching participants across diverse socioeconomic backgrounds who were not accessing formal mental health support. Community testing revealed a consistent structural gap: participants could apply behavioral micro-interventions but lacked any validated methodology to understand their baseline position, identify their primary deficit domain, or measure change over time.

The CALM Index™ is the psychometric operationalisation of the Ctrl-Alt-CALM Behavioral Operating Framework — designed to supply the measurement layer the behavioral wellness market lacked, deployable outside clinical environments, and producing actionable pillar-level information that can route individuals to appropriate interventions.

The three components of the Ctrl-Alt-CALM Behavioral Operating Framework map directly to the CALM Index™ pillars: Control maps to Recovery (physiological and psychological restoration from distress), Alternate maps to Renewal (active regeneration of psychological capacity), Calm maps to Reach (positive wellness, life satisfaction, and the capacity for purposeful engagement). This mapping is not metaphorical. It is the design specification from which the instrument selection described in Section 3 was derived. Each validated scale in the battery was selected because it measures a constituent element of the pillar it contributes to, as specified by the theoretical framework.

---

## 2. Theoretical Framework: The Three-Pillar Model

The CALM Index™ adopts a continuum model of behavioral wellness consistent with the dual-continuum approach established by Keyes (2002), which conceptualises mental health as operating on a positive dimension (flourishing) orthogonal to the negative dimension (mental illness). This model rejects the binary clinical/non-clinical distinction and instead maps behavioral wellness as a continuous spectrum along which individuals can move in either direction depending on environmental conditions, behavioral habits, and access to supportive resources. The three CALM Index™ pillars — Recovery, Renewal, and Reach — represent distinct zones along this spectrum with distinct causal structures, measurement approaches, and intervention implications.

### 2.1 Recovery

Recovery measures the degree to which an individual is experiencing psychological and physiological distress that requires active stabilisation. The theoretical basis for this pillar draws on the allostatic load model (McEwen, 1998), which frames chronic stress exposure as producing measurable cumulative wear on regulatory systems — with downstream effects on mood, anxiety, and sleep. Recovery is not merely the absence of distress: it is the presence of sufficient regulatory capacity to absorb ongoing demands.

The Recovery pillar is measured using four instruments: the PHQ-9 (mood and depression markers), GAD-7 (anxiety and worry), PSS-10 (perceived stress, partial contribution), and BAT-8 (burnout exhaustion, partial contribution). The cross-scale construction of this pillar reflects the theoretical interrelationship of its components. Depression, anxiety, perceived stress, and burnout are empirically correlated but theoretically distinct: they represent different failure modes of the same underlying regulatory system. Measuring all four simultaneously allows the CALM Index™ to differentiate between, for example, an individual whose Recovery deficit is primarily anxiety-driven versus one whose deficit is primarily driven by exhaustion — a distinction with direct intervention implications.

The Recovery pillar carries a 40% weight in the composite CALM Index™ score, reflecting its foundational status. Regulatory capacity is prerequisite to resource regeneration (Renewal) and purposeful engagement (Reach). No amount of renewal activity produces durable gains if the underlying recovery deficit is not addressed first.

### 2.2 Renewal

Renewal measures the degree to which an individual's capacity for resource regeneration is functioning — specifically, the quality and consistency of sleep, the management of stress load, and resistance to burnout depletion. Where Recovery measures distress at the symptom level, Renewal measures the maintenance layer: the ongoing behavioral and physiological practices that prevent distress accumulation.

The theoretical basis for this pillar draws on Hobfoll's (1989) Conservation of Resources theory, which argues that psychological stress arises from the threat or actual loss of resources. Renewal, in this framework, is the active management of resource conservation: sleep is the primary physiological resource restoration mechanism; stress regulation determines the rate of resource depletion; burnout marks the endpoint of sustained resource loss.

The Renewal pillar is measured using three instruments: the ISI-7 (insomnia severity, primary contributor), PSS-10 (perceived stress, partial contribution), and BAT-8 (burnout exhaustion, partial contribution). The selection of the ISI-7 as the primary Renewal instrument reflects the centrality of sleep to resource restoration. Sleep disruption is both a consequence and a cause of resource depletion: it degrades affective regulation (Harvey et al., 2011), elevates physiological stress markers, and reduces the efficacy of behavioral interventions applied during the day.

PSS-10 and BAT-8 contribute to both Recovery and Renewal, which reflects their theoretical status as shared signals. Perceived stress indicates both the presence of active psychological distress (Recovery dimension) and the rate of ongoing resource depletion (Renewal dimension). Burnout exhaustion similarly marks both a Recovery-level regulatory failure and a Renewal-level resource crisis.

The Renewal pillar carries a 30% weight in the composite score.

### 2.3 Reach

Reach measures the positive dimension of behavioral wellness — the degree to which an individual experiences life satisfaction, purposeful engagement, and the capacity for sustained performance. The inclusion of a positive wellness dimension alongside deficit-focused instruments reflects the dual-continuum model: the absence of distress is not equivalent to flourishing, and flourishing is not merely the absence of distress (Seligman, 2011; Keyes, 2002).

The theoretical basis for this pillar draws on subjective wellbeing research (Diener, 2000; Diener et al., 1985) and eudaimonic wellness frameworks (Ryff & Singer, 2006), which together argue that life satisfaction, purpose, and positive affect constitute a distinct psychological dimension that predicts health outcomes, occupational performance, and resilience to stressors independently of clinical symptom counts.

The Reach pillar is measured using a single primary instrument: the SWLS (life satisfaction). This reflects both the theoretical parsimony of the positive wellness construct and the design constraint that a non-clinical population assessment must be completable in under ten minutes. Life satisfaction as measured by the SWLS provides a robust proxy for the full positive wellness dimension: it correlates strongly with positive affect, meaning and purpose, autonomy, and social connection (Pavot & Diener, 1993).

The Reach pillar carries a 30% weight in the composite score.

---

## 3. Instrument Selection and Validation Basis

The CALM Index™ battery was constructed by selecting one primary validated instrument per constituent construct, with cross-scale contributions preserved where the theoretical framework requires them. All instruments included have established psychometric validation records in the peer-reviewed literature and are freely available for clinical and research use as documented below.

### 3.1 PHQ-9 (Patient Health Questionnaire-9)

**Original citation:** Kroenke, Spitzer & Williams (2001).
**Construct:** Depression severity and mood regulation.
**Format:** 9 items, 4-point Likert scale (0 = Not at all to 3 = Nearly every day). Scores range 0–27.
**Clinical thresholds:** Minimal 0–4 | Mild 5–9 | Moderate 10–14 | Moderately severe 15–19 | Severe 20–27.
**Psychometric properties:** Internal consistency α = 0.86–0.89 (Kroenke et al., 2001); test-retest reliability r = 0.84 (Löwe et al., 2004). Area under ROC curve for major depressive disorder = 0.95 (Kroenke et al., 2001).
**Licence status:** Free for clinical and research use. No permission required.
**CALM Index™ adaptation:** All 9 items administered verbatim. Scale contributes to Recovery pillar at 100% pillar weight with an effective question-count-proportional contribution of approximately 44% within the multi-scale Recovery aggregation.

### 3.2 GAD-7 (Generalised Anxiety Disorder-7)

**Original citation:** Spitzer, Kroenke, Williams & Löwe (2006).
**Construct:** Anxiety and worry severity.
**Format:** 7 items, 4-point Likert scale (0 = Not at all to 3 = Nearly every day). Scores range 0–21.
**Clinical thresholds:** Minimal 0–4 | Mild 5–9 | Moderate 10–14 | Severe 15–21.
**Psychometric properties:** Internal consistency α = 0.92 (Spitzer et al., 2006); test-retest reliability r = 0.83; area under ROC curve = 0.91 for GAD diagnosis (Löwe et al., 2008).
**Licence status:** Free for clinical and research use.
**CALM Index™ adaptation:** All 7 items administered verbatim. Scale contributes to Recovery pillar at 100% weight with an effective question-count-proportional contribution of approximately 34% within the multi-scale Recovery aggregation.

### 3.3 ISI-7 (Insomnia Severity Index-7)

**Original citation:** Morin (1993); Bastien, Vallières & Morin (2001).
**Construct:** Insomnia severity and sleep quality.
**Format:** 7 items, 5-point Likert scale (0–4). Scores range 0–28.
**Clinical thresholds:** No significant insomnia 0–7 | Subthreshold 8–14 | Moderate clinical insomnia 15–21 | Severe clinical insomnia 22–28. ISI total ≥ 15 triggers professional referral recommendation within the CALM Index™ system (Bastien et al., 2001).
**Psychometric properties:** Internal consistency α = 0.74–0.91 (Bastien et al., 2001; Morin et al., 2011); test-retest reliability r = 0.80; demonstrated sensitivity to treatment change in CBT-I trials (Morin et al., 2011).
**Licence status:** Free for clinical and research use. The ISI was selected over the PSQI (Buysse et al., 1989) for its brevity (7 vs. 19 items), established sensitivity to change in intervention trials, and equivalent construct coverage in the sleep disruption domain.
**CALM Index™ adaptation:** All 7 items administered verbatim. Scale is the primary instrument for the Renewal pillar, contributing approximately 66% of the question-count-weighted Renewal pillar score.

### 3.4 PSS-10 (Perceived Stress Scale-10)

**Original citation:** Cohen, Kamarck & Mermelstein (1983).
**Construct:** Perceived stress; appraisal of life demands as exceeding coping resources.
**Format:** Base administration uses 4 selected items (PSS-4 format) for the primary stress screen. An adaptive 6-item branch extends to the full PSS-10 when PSS-4 raw sum ≥ 8 of 16 (moderate-severe stress). Full PSS-10 scores range 0–40.
**Clinical thresholds:** PSS-4: Low 0–6 | Moderate 7–9 | High 10–16. PSS-10: Low 0–13 | Moderate 14–26 | High 27–40.
**Psychometric properties:** Internal consistency α = 0.78–0.86 (Cohen et al., 1983); validated across multiple demographic groups including non-US populations (Andreou et al., 2011). PSS scores predict health outcomes, immune function, and burnout onset (Cohen & Wills, 1985).
**Licence status:** Free for research and clinical use. No copyright restriction.
**CALM Index™ adaptation:** PSS items contribute to both Recovery (60% weight) and Renewal (40% weight), reflecting stress as both an active regulatory burden (Recovery dimension) and a resource depletion driver (Renewal dimension). The adaptive branching design preserves assessment efficiency for low-stress individuals while obtaining the full PSS-10 for individuals presenting with elevated stress signals.

### 3.5 SWLS (Satisfaction With Life Scale)

**Original citation:** Diener, Emmons, Larsen & Griffin (1985).
**Construct:** Cognitive evaluation of global life satisfaction; positive wellness.
**Format:** 5 items, 7-point Likert scale (1 = Strongly disagree to 7 = Strongly agree). Scores range 5–35.
**Psychometric properties:** Internal consistency α = 0.87 (Diener et al., 1985); test-retest reliability r = 0.82 over two months; convergent validity with multiple positive affect and wellbeing measures (Pavot & Diener, 1993). SWLS scores predict occupational performance, resilience, and social functioning independently of negative affect measures.
**Licence status:** Freely available for research use. The SWLS is among the most widely cited measures in positive psychology research (Pavot & Diener, 1993).
**CALM Index™ adaptation:** All 5 items administered verbatim. The positive framing (higher responses indicate higher satisfaction) is preserved; items are reverse-scored to need-score format during internal processing, then re-inverted for wellness scoring. SWLS is the sole primary instrument for the Reach pillar.

### 3.6 BAT-8 (Burnout Assessment Tool-8)

**Original citation:** Schaufeli, De Witte & Desart (2020).
**Construct:** Burnout exhaustion — the primary component of burnout as defined by sustained depletion of physical and cognitive resources.
**Format:** Base administration uses 4 exhaustion items (BAT-4 format, bat5 scale, 1–5). An adaptive 4-item branch extends to the full BAT-8 when BAT-4 raw sum ≥ 10 of 20, adding mental distance, emotional, and cognitive impairment items. Full BAT-8 mean ranges 1.0–5.0.
**Clinical thresholds:** Mean ≤ 2.0: Low burnout risk | Mean ≤ 3.5: Moderate burnout risk | Mean > 3.5: High burnout risk and professional referral consideration.
**Psychometric properties:** Internal consistency α = 0.92–0.94 for full BAT (Schaufeli et al., 2020); convergent validity with MBI-GS established (Schaufeli et al., 2020). Free commercial licence granted by KU Leuven for clinical and research use. BAT-8 was selected in Sprint 3 of CALM Index™ development to replace earlier MBI-based items, which required commercial licensing and were validated primarily in occupational contexts.
**Licence status:** Free commercial licence. MBI/Maslach items are excluded from all battery versions ≥ 2.
**CALM Index™ adaptation:** BAT items contribute equally to Recovery (50% weight) and Renewal (50% weight), reflecting burnout exhaustion as both a symptom of regulatory failure (Recovery) and a marker of resource depletion (Renewal). The adaptive branch design activates when base exhaustion scores indicate clinically significant burnout, adding dimensional depth for high-risk users.

---

## 4. Composite Scoring Methodology

The CALM Index™ scoring pipeline transforms raw assessment responses into a single 0–100 composite wellness score through a four-stage process: item normalisation, scale aggregation, pillar aggregation with question-count weighting, and composite construction.

### 4.1 Item Normalisation to Need Score

All raw item responses are first normalised to a common 0–100 need score, where 0 indicates no behavioral need and 100 indicates maximal behavioral need. For negatively-worded items (higher response = worse state), the need score is computed as:

> need_score = ((raw_value − min_value) / (max_value − min_value)) × 100

For positively-worded items (higher response = better state, as in SWLS), the formula is inverted:

> need_score = (1 − (raw_value − min_value) / (max_value − min_value)) × 100

This normalisation ensures that all items, regardless of original response scale (0–3, 0–4, 1–5, or 1–7), contribute comparably to their parent scale aggregates.

### 4.2 Scale Aggregation

Per-scale need scores are computed as the mean of the normalised item need scores within that scale. Where adaptive branches are triggered, the branch items are pooled with base items via a question-count-weighted average to produce the best available scale estimate:

**PSS (when branch triggered):**
> pss_need = (pss_base_score × 4 + pss_branch_score × 6) / 10

**BAT (when branch triggered):**
> bat_need = (bat_base_score + bat_branch_score) / 2

This preserves psychometric comparability between users for whom branches were and were not triggered.

### 4.3 Pillar Aggregation with Question-Count Weighting

Each scale's contribution to its assigned pillar is weighted by two factors: the theoretical pillar contribution weight (the proportion of the scale's construct that belongs to the target pillar) and the scale's question count (which serves as a proxy for measurement depth). The pillar score is then the weighted mean of all contributing scale scores.

**Recovery pillar (base battery, no branches):**

| Scale | Pillar Weight | Question Count | Effective Weight |
|-------|--------------|----------------|-----------------|
| PHQ-9 | 1.00 | 9 | 9.0 |
| GAD-7 | 1.00 | 7 | 7.0 |
| PSS-10 | 0.60 | 4 | 2.4 |
| BAT-8 | 0.50 | 4 | 2.0 |
| **Total** | | | **20.4** |

Effective proportions: PHQ-9 ≈ 44% | GAD-7 ≈ 34% | PSS-10 ≈ 12% | BAT-8 ≈ 10%

**Renewal pillar (base battery, no branches):**

| Scale | Pillar Weight | Question Count | Effective Weight |
|-------|--------------|----------------|-----------------|
| ISI-7 | 1.00 | 7 | 7.0 |
| BAT-8 | 0.50 | 4 | 2.0 |
| PSS-10 | 0.40 | 4 | 1.6 |
| **Total** | | | **10.6** |

Effective proportions: ISI-7 ≈ 66% | BAT-8 ≈ 19% | PSS-10 ≈ 15%

**Reach pillar:** SWLS 100%.

All pillar scores are expressed as need scores (0–100, higher = worse state). Before composite construction, they are converted to wellness scores:

> pillar_wellness = 100 − pillar_need

### 4.4 Composite Score Construction

The CALM Index™ composite is a weighted mean of the three pillar wellness scores:

> **CALM = (Recovery_wellness × 0.40) + (Renewal_wellness × 0.30) + (Reach_wellness × 0.30)**

The pillar weights reflect the theoretical hierarchy established in Section 2: Recovery carries the highest weight (40%) because regulatory capacity is prerequisite to resource regeneration and positive engagement. Renewal and Reach are equally weighted (30% each) as parallel constituents of sustained behavioral wellness.

When one or more pillars yield insufficient data (e.g., first-use assessment with partial response), the composite is reweighted proportionally across available pillars to avoid artificial deflation.

### 4.5 Assessment Seed Initialisation for Live Tracking

When a user completes their first CALM Index™ assessment, the composite score initialises the live tracking system via the J.2 seed weight protocol. The seed uses fixed instrument weights calibrated for parsimony and cross-scale comparability:

| Pillar | Instruments and Weights |
|--------|------------------------|
| Recovery | PHQ-9 40% + GAD-7 30% + PSS-10 20% + BAT-8 10% |
| Renewal | PSS-10 25% + ISI-7 40% + BAT-8 35% |
| Reach | SWLS 70% + PSS-10 30% |

As the user engages with live behavioral tracking (daily check-ins, physiological signals), the assessment seed is progressively diluted using a blend formula:

> seedWeight = max(0, 1 − liveEvents / 30)

On Day 1, the composite is 100% assessment-derived. By Day 30 (with at least 30 live engagement events), the system is fully live-data-driven and the assessment seed is zero. This architecture ensures continuity of scoring from the first interaction while allowing the live signal to progressively dominate as the user's behavioral pattern becomes established.

---

## 5. Band Classification System

CALM Index™ composite scores (0–100, higher = better) are classified into four bands representing operationally distinct behavioral states:

| Band | Score Range | Behavioral State |
|------|-------------|-----------------|
| Optimised | 76–100 | Regulated, resourced, and engaged. Proactive maintenance and performance focus appropriate. |
| Rebuilding | 51–75 | Stable but under load. Some deficit present; structured renewal and targeted interventions appropriate. |
| Depleted | 26–50 | Meaningful behavioral deficit across one or more pillars. Active intervention and professional support consideration warranted. |
| Critical | 0–25 | Significant multi-pillar deficit. Immediate stabilisation focus; professional matching and guided support strongly indicated. |

Within the Rebuilding and Depleted bands, a five-level severity layer (L1–L5) provides finer-grained routing:

| Level | Score Range | Routing implication |
|-------|-------------|---------------------|
| L1 | 76–100 | Optimised — proactive maintenance |
| L2 | 61–75 | Rebuilding high — light-touch interventions |
| L3 | 41–60 | Rebuilding low / Depleted high — moderate intervention |
| L4 | 21–40 | Depleted low — intensive behavioral support |
| L5 | 0–20 | Critical — professional matching indicated |

Band transitions drive the primary intervention routing logic in the Roveera platform. Downward band movement (Rebuilding → Depleted, Depleted → Critical) triggers escalation protocols including Ora behavioral intelligence alerts and professional matching recommendations. Upward band movement provides the primary positive outcome signal used in longitudinal effectiveness tracking.

### 5.1 Pillar Priority Assignment

Beyond the composite score, the CALM Index™ assigns a primary and optionally a secondary pillar based on relative pillar scores, which determines the intervention toolkit prescription:

- **All pillars below 25:** Reach assigned primary (growth focus appropriate)
- **Pillar spread ≤ 8 points:** Recovery assigned primary (safety-first stabilisation)
- **Two pillars within 8 points, both above 40:** Dual assignment (primary and secondary)
- **Otherwise:** Highest-need pillar assigned primary

This assignment drives toolkit prescription from a 13-category behavioral intervention library aligned to the three pillars: Recovery toolkits (Crisis Calming, Anxiety Toolkit, Stress Release, Emotional Processing, Burnout Recovery), Renewal toolkits (Sleep Reset, Energy Restoration, Movement & Vitality, Nutrition & Nourishment), and Reach toolkits (Goals & Performance, Purpose & Meaning, Relationships & Connection, Mindset Reset).

---

## 6. Application: Non-Clinical Assessment at Scale

The CALM Index™ is designed for non-clinical administration — working adults who are not currently receiving clinical treatment but are experiencing behavioral wellness deficits that impair occupational performance, relational function, or subjective quality of life. This population constitutes the majority of the global behavioral wellness burden (Kessler et al., 2003; WHO, 2004) and is systematically underserved by both clinical services (which set diagnostic thresholds above the non-clinical range) and consumer wellness applications (which lack validated assessment infrastructure).

The digital administration of the CALM Index™ offers several advantages over paper-based or clinician-administered assessment. First, adaptive branching is operationally straightforward in digital environments: the PSS-10 and BAT-8 branches are triggered automatically based on response totals, adding depth for high-risk users while preserving efficiency for lower-risk ones. Second, longitudinal retest — essential for tracking behavioral change — requires no scheduling or clinical relationship and can be integrated into daily or weekly digital interactions. Third, immediate score return with pillar-level breakdown allows the platform to route users to interventions without clinician involvement, democratising access to structured behavioral feedback.

Field testing in community settings — specifically facilitated mental wellness seminars at RCCG-affiliated events in the United Kingdom and Nigeria — revealed strong engagement with the structured pillar breakdown, particularly among participants who had not previously considered their behavioral wellness in terms of distinct domains. Participants consistently expressed that knowing *which* domain required attention (Recovery, Renewal, or Reach) was more actionable than an undifferentiated wellness score alone.

The non-clinical design does not preclude clinical utility. Clinical thresholds are preserved for each instrument within the battery: PHQ-9 scores above 14 indicate moderate-to-severe depression requiring clinical attention; ISI-7 scores above 14 indicate clinical insomnia; BAT-8 mean above 3.5 indicates high burnout risk. Where any instrument produces a clinically significant score, the platform surfaces appropriate professional referral guidance.

---

## 7. Limitations

### 7.1 Self-Report Validity Constraints

All six instruments in the CALM Index™ battery rely on self-report. Self-report measures are subject to systematic biases including social desirability effects, retrospective recall errors, and mood-congruent response patterns (Podsakoff et al., 2003). In populations with severe mood or cognitive impairment, self-report validity may be further compromised. The CALM Index™ mitigates this through multi-instrument redundancy — no single instrument drives the composite alone — and through the progressive substitution of assessment-seed data with live behavioral signals as described in Section 4.5.

### 7.2 Digital Administration and Access Equity

The digital-first administration model introduces access equity constraints. Populations without reliable internet access, digital literacy, or access to compatible devices are excluded from the system. In the African markets in which the CALM Index™ has been initially deployed, mobile device penetration is high but connectivity quality is variable. Offline-first assessment capability and mobile-optimised administration are necessary components of equitable access.

### 7.3 Cross-Cultural Validation

The six validated instruments in the CALM Index™ battery have established psychometric properties primarily in Western, English-speaking populations, with varying levels of cross-cultural validation. The PHQ-9, GAD-7, and PSS-10 have been validated in multiple non-Western populations including African samples (Adewuya et al., 2006; Spies et al., 2009), with generally acceptable psychometric properties. The ISI-7, SWLS, and BAT-8 have more limited cross-cultural validation in sub-Saharan African populations. Cross-cultural validity of the composite CALM Index™ framework — including the three-pillar structure and the composite weighting rationale — has not been formally tested and represents a priority for future validation research.

### 7.4 Composite Score Masking

The composite CALM Index™ score can mask pillar-level deficits where one high-performing pillar offsets a deficit in another. An individual with Optimised Reach scores (high life satisfaction) and Critical Recovery scores (severe mood and anxiety distress) may produce a Rebuilding composite — which would understate the Recovery-level urgency. The pillar breakdown and the pillar priority assignment (Section 5.1) are designed to prevent over-reliance on the composite score alone, but users and platform designers must be aware that composite and pillar scores require joint interpretation.

### 7.5 Adaptive Branch Coverage

The adaptive branches for PSS-10 and BAT-8 are triggered by response-sum thresholds on the base items. Users who present with elevated stress or burnout but fall just below trigger thresholds receive only the base battery for those scales, with a potential reduction in measurement depth. Formal evaluation of false-negative branch trigger rates in the deployed population has not yet been conducted.

---

## 8. Future Research

### 8.1 Formal Validation Study

A formal psychometric validation study of the CALM Index™ composite methodology — examining internal consistency, test-retest reliability, convergent and discriminant validity, and band classification accuracy — requires a minimum sample of approximately 10,000 assessments for adequate statistical power in band-level analyses. The Roveera platform is designed to reach this threshold as a milestone for a first validation study, with a 50,000-assessment threshold planned for a confirmatory cross-cultural validation study.

### 8.2 Cross-Cultural Validation

Prioritised populations for cross-cultural validation include Nigeria, United Kingdom, and United States. The Nigeria sample is of particular importance given the community testing origins of the framework and the under-representation of African population samples in behavioral wellness assessment research. Differential item functioning analysis by cultural group should be applied to each of the six instruments within the composite.

### 8.3 Longitudinal Outcome Tracking

The relationship between CALM Index™ band movement and intervention engagement has not been formally studied. A longitudinal design — tracking composite and pillar score changes across assessment cycles as a function of toolkit engagement, professional session attendance, and Ora intervention acceptance — would provide the first prospective evidence for the framework's intervention routing validity.

### 8.4 Physiological Signal Integration

The CALM Index™ live tracking system integrates wearable physiological signals (HRV, resting heart rate, sleep quality, SpO₂) as leading indicators for pillar score movement. The validity of physiological signals as predictors of assessed pillar scores — specifically the degree to which wearable HRV data predicts Recovery pillar deterioration in advance of self-report — has not been formally evaluated. A prospective study with concurrent wearable and self-report data collection would quantify the predictive lead-time advantage of physiological monitoring.

### 8.5 Comparison with Unstructured Digital Wellness Assessment

A comparative study examining CALM Index™ structured multi-instrument assessment against single-item mood logs, proprietary wellness checks, and PHQ-9/GAD-7 stand-alone administration would provide evidence for the incremental validity of the composite approach — specifically whether the three-pillar structure produces better intervention matching outcomes than simpler assessment alternatives.

---

## References

Adewuya, A. O., Ola, B. A., & Afolabi, O. O. (2006). Validity of the patient health questionnaire (PHQ-9) as a screening tool for depression amongst Nigerian university students. *Journal of Affective Disorders*, 96(1–2), 89–93.

Andreou, E., Alexopoulos, E. C., Lionis, C., Varvogli, L., Gnardellis, C., Chrousos, G. P., & Darviri, C. (2011). Perceived Stress Scale: Reliability and validity study in Greece. *International Journal of Environmental Research and Public Health*, 8(8), 3287–3298.

Bastien, C. H., Vallières, A., & Morin, C. M. (2001). Validation of the Insomnia Severity Index as an outcome measure for insomnia research. *Sleep Medicine*, 2(4), 297–307.

Buysse, D. J., Reynolds, C. F., Monk, T. H., Berman, S. R., & Kupfer, D. J. (1989). The Pittsburgh Sleep Quality Index: A new instrument for psychiatric practice and research. *Psychiatry Research*, 28(2), 193–213.

Cohen, S., Kamarck, T., & Mermelstein, R. (1983). A global measure of perceived stress. *Journal of Health and Social Behavior*, 24(4), 385–396.

Cohen, S., & Wills, T. A. (1985). Stress, social support, and the buffering hypothesis. *Psychological Bulletin*, 98(2), 310–357.

Diener, E. (2000). Subjective wellbeing: The science of happiness and a proposal for a national index. *American Psychologist*, 55(1), 34–43.

Diener, E., Emmons, R. A., Larsen, R. J., & Griffin, S. (1985). The Satisfaction With Life Scale. *Journal of Personality Assessment*, 49(1), 71–75.

Harvey, A. G., Murray, G., Chandler, R. A., & Soehner, A. (2011). Sleep disturbance as transdiagnostic: Consideration of neurobiological mechanisms. *Clinical Psychology Review*, 31(2), 225–235.

Hobfoll, S. E. (1989). Conservation of resources: A new attempt at conceptualizing stress. *American Psychologist*, 44(3), 513–524.

Keyes, C. L. M. (2002). The mental health continuum: From languishing to flourishing in life. *Journal of Health and Social Behavior*, 43(2), 207–222.

Kessler, R. C., Berglund, P., Demler, O., Jin, R., Merikangas, K. R., & Walters, E. E. (2005). Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National Comorbidity Survey Replication. *Archives of General Psychiatry*, 62(6), 593–602.

Kroenke, K., Spitzer, R. L., & Williams, J. B. W. (2001). The PHQ-9: Validity of a brief depression severity measure. *Journal of General Internal Medicine*, 16(9), 606–613.

Löwe, B., Unützer, J., Callahan, C. M., Perkins, A. J., & Kroenke, K. (2004). Monitoring depression treatment outcomes with the Patient Health Questionnaire-9. *Medical Care*, 42(12), 1194–1201.

Löwe, B., Decker, O., Müller, S., Brähler, E., Schellberg, D., Herzog, W., & Herzberg, P. Y. (2008). Validation and standardization of the Generalized Anxiety Disorder Screener (GAD-7) in the general population. *Medical Care*, 46(3), 266–274.

Maslach, C., & Leiter, M. P. (2016). Burnout. In G. Fink (Ed.), *Stress: Concepts, cognition, emotion, and behavior* (pp. 351–357). Academic Press.

McEwen, B. S. (1998). Stress, adaptation, and disease: Allostasis and allostatic load. *Annals of the New York Academy of Sciences*, 840(1), 33–44.

Morin, C. M. (1993). *Insomnia: Psychological assessment and management*. Guilford Press.

Morin, C. M., Belleville, G., Bélanger, L., & Ivers, H. (2011). The Insomnia Severity Index: Psychometric indicators to detect insomnia cases and evaluate treatment response. *Sleep*, 34(5), 601–608.

Pavot, W., & Diener, E. (1993). Review of the Satisfaction With Life Scale. *Psychological Assessment*, 5(2), 164–172.

Podsakoff, P. M., MacKenzie, S. B., Lee, J. Y., & Podsakoff, N. P. (2003). Common method biases in behavioral research: A critical review of the literature and recommended remedies. *Journal of Applied Psychology*, 88(5), 879–903.

Ryff, C. D., & Singer, B. H. (2006). Know thyself and become what you are: A eudaimonic approach to psychological wellbeing. *Journal of Happiness Studies*, 9(1), 13–39.

Schaufeli, W. B., De Witte, H., & Desart, S. (2020). *Manual Burnout Assessment Tool (BAT) — Version 2.0*. KU Leuven, Belgium.

Seligman, M. E. P. (2011). *Flourish: A visionary new understanding of happiness and wellbeing*. Free Press.

Spies, G., Stein, D. J., Roos, A., Faure, S. C., Mostert, J., Seedat, S., & Vythilingum, B. (2009). Validity of the Kessler 10 (K-10) in detecting DSM-IV defined mood and anxiety disorders among pregnant women. *Archives of Women's Mental Health*, 12(2), 69–74.

Spitzer, R. L., Kroenke, K., Williams, J. B. W., & Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder. *Archives of Internal Medicine*, 166(10), 1092–1097.

World Health Organization. (2004). *Promoting mental health: Concepts, emerging evidence, practice*. WHO Press.

Adesina, A. (2023). *Ctrl-Alt-CALM: 97 Hacks for Busy People on a Budget*. 3pplea Holdings LLC.

---

*© 2026 3pplea Holdings LLC. This work is licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/*

*Cite as: Adesina, A. (2026). The CALM Index™: A three-pillar psychometric framework for behavioral wellness. 3pplea Holdings LLC / Roveera. https://roveera.com/research*