CMS Depression Screening Measure Adds Behavioral Health Layer to MA Star Ratings for 2027

From modeling Star Rating bonus revenue across a dozen MA contracts over the past three years, adding a behavioral health measure to a program where mental health utilization data has historically been unreliable introduces a level of actuarial uncertainty that most plans are not yet equipped to quantify. The CMS CY 2027 final rule (CMS-4207-F), published April 2, 2026, and effective June 1, adds a Depression Screening and Follow-Up measure to the Part C Star Ratings framework. The measure tracks the percentage of MA members screened for clinical depression using a standardized instrument and, for those who screen positive, whether they received documented follow-up care within 30 days. CMS will display the screening and follow-up rates separately but average the two rates when scoring the measure. For the roughly 33 million beneficiaries enrolled in Medicare Advantage plans, this is the first time behavioral health performance will directly influence the quality bonus payments that determine plan financial viability.

10-15%

Expected positive depression screen rate among MA members, based on USPSTF data and Medicare population prevalence

$18.56B

Projected 10-year transfer to MA plan sponsors from Star Ratings restructuring, per CMS regulatory impact analysis

Star Ratings measures eliminated in the CY 2027 final rule, concentrating scoring on clinical outcomes

What the Depression Screening Measure Requires

The new Part C Depression Screening and Follow-Up measure adapts the existing HEDIS Depression Screening and Follow-Up for Adolescents and Adults (DSF-E) specification, built on NQF #0418, for the Medicare Advantage population. The measure has two distinct components that CMS will track and display separately before combining them into a single score.

The screening component requires that MA plan members receive depression screening using a validated, standardized instrument during the measurement year. Acceptable instruments include the Patient Health Questionnaire-2 (PHQ-2) as an initial screen and the Patient Health Questionnaire-9 (PHQ-9) for more detailed assessment. The PHQ-2 asks two questions about the frequency of depressed mood and anhedonia over the prior two weeks, scored 0 to 6. A score of 3 or above triggers the need for further evaluation, typically via the PHQ-9, which expands the assessment to nine items covering the full DSM-5 criteria for major depressive disorder.

The follow-up component requires that any member who screens positive receive documented follow-up care within 30 days of the positive screen. Follow-up can include pharmacotherapy (antidepressant medication), referral to a behavioral health provider, additional evaluation using a more detailed instrument, or a documented follow-up plan in the medical record. A PHQ-2 positive result followed by a negative PHQ-9 on the same day qualifies as evidence of follow-up, meaning the screening and follow-up can technically occur in a single visit.

CMS will average the two component rates when calculating the final measure score. This averaging methodology has significant actuarial implications: a plan that achieves 90% screening rates but only 60% follow-up rates would score 75%, potentially below the threshold needed for a favorable star rating on this measure. The follow-up rate will likely be the binding constraint for most plans, given current behavioral health provider shortages and the 30-day documentation window.

The Measurement Timeline and Star Ratings Impact

The depression screening measure begins with the 2027 measurement year, meaning data collection starts January 1, 2027. Performance on this measure will first appear in the 2029 Star Ratings, which determine quality bonus payments for Contract Year 2031. This timeline gives plans roughly seven months from the rule's June 2026 effective date to build the operational infrastructure needed before data collection begins.

Milestone	Date	Actuarial Implication
CY 2027 final rule effective	June 1, 2026	Plans must begin operational planning for screening workflows
Measurement year begins	January 1, 2027	Depression screening data collection starts; baseline year for benchmarking
Measurement year ends	December 31, 2027	Full year of screening and follow-up data available for actuarial analysis
2029 Star Ratings published	October 2028	First Star Ratings incorporating depression screening performance
Quality bonus payments	Contract Year 2031	Bonus revenue impact from 2029 Stars reaches plan financials

The timeline creates a modeling challenge: plans must invest in screening infrastructure and provider network capacity now, but the financial return through quality bonus payments will not materialize for nearly five years. Health plan actuaries building investment cases for screening programs need to quantify the interim costs against the projected bonus revenue, discounted back to present value, with wide confidence intervals given the absence of historical performance data on this measure.

Why This Measure Matters: The Broader Star Ratings Restructuring

The depression screening measure does not arrive in isolation. The CY 2027 final rule simultaneously eliminates 13 Star Ratings measures and restructures the entire scoring methodology, making the new measure's introduction more consequential than its individual weight would suggest.

CMS is removing three measures effective with the 2028 Star Ratings: Statin Therapy for Patients with Cardiovascular Disease (C19), and two Call Center Foreign Language Interpreter and TTY Availability measures (C33 and D01). An additional 10 measures disappear for the 2029 Star Ratings, the same cycle where the depression screening measure first appears. Those 10 include Special Needs Plan Care Management (C07), Customer Service (C24), Rating of Health Care Quality (C25), Complaints about Health Plan (C28), Members Choosing to Leave the Plan (C29 and D03), Plan Makes Timely Decisions about Appeals (C31), Reviewing Appeals Decisions (C32), Complaints about Drug Plan (D02), and Medicare Plan Finder Price Accuracy (D07).

Most of the removed measures are administrative and process-oriented categories where plans showed high performance with little variation. Their removal concentrates scoring weight on clinical quality measures, patient experience surveys (CAHPS), and health outcomes (HOS). CMS estimates this restructuring shifts roughly 65% of the overall Star Rating weight to clinical outcomes and patient experience, up from approximately 50% under the prior framework.

The fiscal impact is substantial. CMS projects the Star Ratings restructuring will transfer approximately $18.56 billion to MA plan sponsors over the 2027-2036 period, representing about 0.21% of total Medicare payments to private health plans over that decade. The agency estimated that roughly 63% of contracts would experience no overall rating change, but the remaining 37% face rating movement in either direction. Press Ganey analysis suggests as much as 25% of MA contracts could lose half a star when the new methodology is applied to current performance data, with an estimated $1.3 billion in quality bonus payment dollars at risk based on modeling against 2026 Stars results.

Depression Prevalence in the MA Population: What Actuaries Need to Model

The actuarial challenge begins with establishing a credible baseline for the expected positive screen rate. Depression prevalence data for the Medicare-eligible population varies significantly depending on the measurement instrument, clinical setting, and population segment.

The U.S. Preventive Services Task Force, whose 2023 Grade B recommendation for universal adult depression screening underpins this CMS measure, reviewed evidence showing that 10% to 15% of primary care patients screen positive for depressive symptoms on validated instruments. For the Medicare population specifically, estimates run higher. Research published in JAMA and the American Journal of Geriatric Psychiatry places the prevalence of depressive symptoms among adults 65 and older at 20% to 30%, though clinically significant major depressive disorder affects a narrower 7% to 12% band.

The gap between symptom prevalence and clinical diagnosis is itself an actuarial variable. Plans that implement aggressive screening programs may surface a substantially higher positive rate than plans relying on passive detection. A plan that screens 95% of its enrolled population will inevitably identify more positive cases than a plan at 70% screening, even if the underlying depression prevalence is identical. This creates a tension in the measure design: high screening rates, which CMS rewards, mechanically increase the denominator for the follow-up component.

From tracking behavioral health utilization across MA populations, we have observed that only about 4% of Medicare and Medicare Advantage enrollees received behavioral health services in 2024, according to ASPE data. Less than half of adults 65 and older reported that their primary care provider asked about their mental health. This suggests a massive gap between the number of members who would screen positive under universal screening and the number currently receiving any behavioral health intervention.

Stratifying Expected Screen-Positive Rates

Actuaries building models for this measure should stratify the expected positive rate by at least three dimensions:

Plan type and enrollment composition. D-SNP plans serving dual-eligible populations will likely see positive screen rates of 25% or higher, given the well-documented correlation between poverty, disability, and depression. Standard MA plans with younger, healthier enrollees may see rates closer to 10%. C-SNP plans, particularly those with chronic cardiovascular and diabetes conditions, occupy a middle ground where comorbid depression prevalence runs 15% to 25%.

Geographic provider density. Plans in counties designated as Mental Health Professional Shortage Areas by HRSA face a dual challenge: screening rates may lag because of fewer provider touchpoints, and follow-up rates will be mechanically depressed by the scarcity of behavioral health professionals available for referrals within the 30-day window. As of 2025, over 160 million Americans live in designated mental health shortage areas.

Screening modality and setting. Plans that deploy screening in annual wellness visits will capture a different population than those using telephonic outreach or digital screening tools. The AWV channel likely catches members already engaged with the healthcare system, while telephonic outreach can reach isolated members with potentially higher depression rates but lower follow-up completion rates.

The Follow-Up Rate: Where Plans Will Struggle

The 30-day follow-up requirement is where the actuarial uncertainty concentrates. Screening is a relatively standardized process; a two-question PHQ-2 can be administered by a medical assistant in under two minutes. Follow-up, by contrast, requires either pharmacotherapy, a behavioral health referral, or a documented clinical plan, each of which depends on provider availability, member engagement, and care coordination infrastructure that most MA plans have not built for scale.

Consider the math. If a plan with 100,000 members achieves an 85% screening rate, that is 85,000 members screened. At a 12% positive rate, roughly 10,200 members need documented follow-up within 30 days. If the plan's behavioral health network has capacity for 500 new patient appointments per month (a generous assumption for many markets), clearing 10,200 follow-ups over a 12-month measurement year requires either front-loading early in the year or distributing screening evenly across the calendar, with each month generating roughly 850 follow-up needs against 500 available slots.

The capacity gap gets worse for plans operating in provider-shortage areas. HRSA data shows that designated Mental Health Professional Shortage Areas would need approximately 8,000 additional practitioners nationally to meet existing demand, before accounting for the incremental volume this measure will generate. Plans cannot simply contract their way out of this constraint on a seven-month timeline.

Several operational strategies can improve the follow-up rate without expanding the behavioral health network:

Same-day PHQ-9 resolution. A member who scores positive on the PHQ-2 but negative on a full PHQ-9 administered during the same visit satisfies the follow-up requirement with no additional appointment needed. Training primary care staff to administer the PHQ-9 immediately after a positive PHQ-2 eliminates the referral bottleneck for a meaningful subset of positive screens.
Pharmacotherapy initiation by primary care. A primary care provider who prescribes an antidepressant at the same visit as the positive screen satisfies the follow-up requirement. This avoids the behavioral health referral entirely, though it requires clinical protocols, medication management pathways, and monitoring that not all primary care practices have in place.
Telehealth behavioral health visits. Virtual visits expand the available follow-up capacity beyond the geographic constraints of in-person behavioral health networks. For Medicare beneficiaries, CMS has maintained expanded telehealth access through 2026, including behavioral health visits from the patient's home.
Collaborative care models. The Psychiatric Collaborative Care Model (CoCM), which integrates a behavioral health care manager and consulting psychiatrist into primary care workflows, allows primary care practices to manage mild-to-moderate depression without external referral. CoCM codes (99492, 99493, 99494) are reimbursable under Medicare Part B.

Actuarial Modeling Framework: Screening Program Costs

Plan actuaries need to build a cost model that captures three expense layers: the direct cost of administering screenings, the incremental utilization generated by positive screens and follow-up, and the indirect costs of the care coordination infrastructure.

Layer 1: Direct Screening Costs

The PHQ-2 and PHQ-9 are typically administered as part of a broader primary care encounter, most often the Annual Wellness Visit (AWV). The marginal cost of adding a two-question PHQ-2 to an existing AWV is minimal, perhaps $2 to $5 per screen in staff time and documentation. The more meaningful cost is in the outreach required to achieve high screening rates: contacting members who have not scheduled an AWV, coordinating with provider offices to ensure the screening is documented in a format that populates the HEDIS data feed, and maintaining gap-in-care lists that flag unscreened members for clinical outreach.

For a plan targeting 90% screening rates, the outreach cost alone may run $15 to $25 PMPY (per member per year) when accounting for telephonic, mail, and digital contact attempts plus the care coordinator staff managing the outreach campaigns. Plans with strong existing AWV completion rates (above 70%) will have lower marginal outreach costs because the screening piggybacks on a visit that was already happening.

Layer 2: Incremental Utilization from Follow-Up

This is the largest and most uncertain cost component. Members who screen positive and receive follow-up will generate behavioral health utilization that, in many cases, was not occurring prior to the screening program. The actuarial question is whether this represents genuinely new utilization or accelerated detection of conditions that would have surfaced through other channels.

Patterns we have observed across health plan data suggest that 40% to 60% of members who screen positive through structured screening programs were not previously receiving any behavioral health services. For these members, the positive screen creates a new treatment episode. The remaining 40% to 60% are members already engaged in some form of behavioral health treatment, for whom the positive screen triggers additional documentation rather than new service utilization.

For the genuinely new treatment episodes, the cost trajectory depends on the intervention pathway:

Intervention Pathway	Estimated Annual Cost Per Member	Expected Share of Positive Screens
Same-day PHQ-9 resolution (negative)	$0 incremental	25-35% of positive PHQ-2 screens
Antidepressant only (PCP-managed)	$400-$800	20-30%
Outpatient psychotherapy (8-12 sessions)	$1,200-$2,400	15-25%
Collaborative care model	$600-$1,100	10-15%
Intensive outpatient or psychiatry referral	$3,000-$6,000	5-10%

Using a blended cost estimate across these pathways for the subset of positive screens that generate new utilization, the incremental PMPM cost ranges from $0.80 to $2.50 depending on the plan's enrollment composition, positive screen rate, and existing behavioral health service penetration. For a plan with 100,000 members, that translates to $960,000 to $3,000,000 in annual incremental behavioral health spending.

Layer 3: Infrastructure and Care Coordination

Beyond direct clinical costs, plans need care coordination staff to manage the 30-day follow-up window, data systems to track screening and follow-up completion rates in near real-time, and reporting infrastructure to populate the HEDIS data feed. Plans that do not already have a behavioral health care management team will need to build one. Depending on plan size, this infrastructure layer can add $0.30 to $0.75 PMPM in administrative expense.

Bonus Revenue Modeling: Quantifying the Financial Return

The depression screening measure matters actuarially because of its influence on the overall Star Rating, which determines eligibility for the 5% quality bonus payment (QBP). For plans at the critical 3.5-to-4.0 star threshold, every measure's contribution to the overall score is consequential.

Under the restructured methodology, the quality bonus payment system continues to use the 4.0-star threshold: plans at or above 4.0 stars receive a 5% bonus on their benchmark payment, while plans below 4.0 receive no bonus. In 2025, CMS distributed over $12.7 billion in quality bonus payments. With approximately 64% of MA enrollees in plans rated 4.0 stars or higher, the revenue at stake for any individual contract ranges from $5 million to $200 million annually depending on enrollment size and county-level benchmarks.

The sensitivity analysis for a marginal contract requires estimating the depression screening measure's weight in the overall Star Rating. Under the restructured methodology with 13 fewer measures, each remaining measure carries proportionally more weight. If clinical quality measures collectively account for roughly 65% of the overall score and the depression screening measure is one of approximately 15 clinical measures, its individual contribution is roughly 4% to 5% of the total score.

For a contract hovering at 3.75 stars, a strong performance on the depression screening measure (scoring 4 or 5 stars) could contribute 0.15 to 0.20 stars to the overall rating, while a poor performance (scoring 2 or 3 stars) could subtract the same amount. Given that the 4.0-star threshold is binary, this swing can be worth the entire quality bonus payment for contracts in the critical zone.

The return-on-investment calculation for screening program investment becomes clearer in this context. If a 100,000-member plan spends $2 million annually on the screening program (including outreach, follow-up utilization, and care coordination) and the depression screening measure's contribution is the difference between a 3.75 and a 4.0 overall rating, the quality bonus payment (5% of an approximate $1,000 monthly benchmark) could yield $60 million annually. Even if the depression measure is only one of several factors contributing to the rating improvement, the expected value of the investment is strongly positive for plans near the threshold.

Data Scarcity: The Core Modeling Challenge

The fundamental actuarial problem with the depression screening measure is that most plans have no historical performance data to calibrate their models. This is not a measure where plans scored 95% on a predecessor metric and can project forward with confidence. Most MA plans have not systematically tracked depression screening and follow-up rates at the contract level because the Star Ratings program never required it.

There are partial analogues. The commercial and Medicaid HEDIS DSF-E measure provides some industry benchmarking data, but the Medicare population differs meaningfully in depression presentation, comorbidity burden, and healthcare utilization patterns. NCQA benchmarks for the commercial DSF-E measure show national performance in the 50th percentile at roughly 60% to 65%, but these figures reflect a younger, commercially insured population with different provider access patterns.

Plans that participated in the Medicare Part B Quality Payment Program (QPP) may have provider-level data on NQF #0418 (Screening for Depression and Follow-Up Plan), but aggregating provider-level QPP data to a plan-level Star Ratings measure requires mapping individual clinician performance to the enrolled population, an exercise most plans have not undertaken.

The data scarcity means that actuarial confidence intervals around projected depression screening scores will be substantially wider than for established measures. Where a plan might forecast its diabetes care or blood pressure control score within plus or minus 0.2 stars based on years of historical performance data, the depression screening measure may have confidence intervals of plus or minus 1.0 star or more in the first measurement year.

Operational Readiness: The Seven-Month Window

Plans have from June 2026 to January 2027 to operationalize the depression screening measure. This compressed timeline requires parallel workstreams across clinical operations, provider network, data systems, and actuarial modeling.

Clinical workflow integration. Screening must be embedded in AWV protocols and potentially in other primary care encounter types. Plans need to decide whether to mandate screening at every qualifying visit or target specific high-risk populations first. The former maximizes the screening rate; the latter concentrates resources on the members most likely to screen positive and need follow-up.

Provider education and contracting. Primary care providers need training on PHQ-2/PHQ-9 administration and documentation standards that satisfy HEDIS data collection requirements. If plans intend to pursue same-day PHQ-9 resolution as a follow-up strategy, providers need explicit protocols and the administrative support to document the full screening-to-resolution sequence in a single encounter note.

Behavioral health network adequacy. Plans must assess whether their current behavioral health provider network can absorb the incremental follow-up volume. If network capacity is insufficient, options include expanding telehealth behavioral health contracts, implementing collaborative care models in high-volume primary care practices, or contracting with digital behavioral health platforms that can provide asynchronous or on-demand counseling.

Data pipeline and monitoring. The measure requires near real-time tracking of screening rates and follow-up completion within the 30-day window. Plans that rely on annual HEDIS chart reviews for quality measurement will need to build or acquire real-time dashboards that flag members approaching the 30-day follow-up deadline. This is a data engineering challenge as much as a clinical one.

The Health Equity Dimension

CMS's decision not to implement the Health Equity Index (HEI) reward in the CY 2027 rule, reverting instead to the historical Reward Factor, does not eliminate the equity dimension of the depression screening measure. Depression prevalence is not uniformly distributed across the MA population. Low-income members, dual-eligible populations, racial and ethnic minorities, and members in rural areas all show higher depression prevalence and lower treatment rates in the literature.

Plans serving high-proportion dual-eligible populations through D-SNPs face a compounding challenge: their members are more likely to screen positive, more likely to face barriers to follow-up care (transportation, provider availability, cost-sharing sensitivity), and less likely to have established behavioral health treatment relationships. The same plan characteristics that predict a higher positive screen rate also predict a lower follow-up completion rate, creating a structural disadvantage in the measure scoring.

CMS chose to revert to the historical Reward Factor partly in response to comments that the proposed HEI would disadvantage smaller, regional plans unable to meet enrollment thresholds. But the depression screening measure itself may create a de facto equity test: plans cannot score well on a measure where 15% to 25% of their population screens positive and follow-up infrastructure is inadequate. D-SNP plans and plans with high proportions of low-income subsidized members will need to invest disproportionately in behavioral health capacity to achieve parity with standard MA plans serving healthier, more affluent populations.

Why This Matters for Health Plan Actuaries

The depression screening measure changes the actuarial modeling landscape for MA plans in four specific ways that require immediate attention.

First, bid pricing must now incorporate behavioral health screening costs. Plans preparing June 2027 bids for Contract Year 2029 (the first bid cycle with a full year of depression screening data available) need to include the incremental screening, follow-up, and care coordination costs in their Part C medical expense projections. For plans that have historically treated behavioral health as a low-utilization, low-priority cost category, this represents a structural upward adjustment to medical expense trend.

Second, Star Ratings forecasting models need a behavioral health module. Actuaries who maintain stochastic models for projecting Star Ratings distributions across scenarios need to add the depression screening measure with appropriately wide confidence intervals. Given the absence of historical performance data, the first-year model should use a uniform or weakly informative prior rather than attempting to calibrate to commercial HEDIS data that may not transfer to the Medicare population.

Third, provider contracting must align incentives with screening performance. Many MA plans use value-based payment arrangements with primary care providers that include Star Ratings performance bonuses. Those arrangements need to be updated to include depression screening and follow-up metrics before the 2027 measurement year begins. Provider payment models that do not incentivize screening will produce screening rates that reflect the provider's existing practice patterns rather than the plan's quality targets.

Fourth, the measure creates a new form of correlation risk in Star Ratings portfolios. Plans that underperform on depression screening are also likely to underperform on related measures such as medication adherence (if untreated depression reduces adherence to other chronic disease medications) and patient experience surveys (if untreated depression correlates with lower CAHPS scores). The depression screening measure may reveal a latent behavioral health factor that affects multiple Star Ratings dimensions simultaneously.