What separates churned patients from active ones
Across 1,000 synthetic Medicare patients with 10,000+ encounters, three signals predict churn reliably. The gap in visit recency alone is striking.
Churn is defined as: 6+ months inactive, OR 60%+ of recent care out-of-network, OR zero PCP visits in the last 6 months. The overall churn rate is 8% (80 of 1,000 patients). The patterns are consistent enough across the population to catch reliably.
Random Forest on 15 behavioral features
Feature engineering in PySpark on Databricks, modeling in scikit-learn. Recent encounter volume is the top predictor, accounting for nearly half the model's explanatory power.
Three tiers, three responses
Score all ACO members monthly. The 74 high-risk patients account for nearly all preventable churn, so that's where care coordinator capacity goes first.
Immediate outreach: assign care coordinator, PCP visit within 2 weeks, address access barriers
Proactive monitoring: preventive care reminders, PCP touchpoint within 60 days
Standard wellness programs, quarterly monitoring for escalation
End-to-end pipeline
Built to run on Databricks at scale, using the same architecture common in production ACO analytics environments.
From model to production
Now: Score all patients, flag top 74, assign care coordinators, schedule wellness visits within 2 weeks.
Next quarter: Build a PCP onboarding program targeting new members (under 6 months), reduce out-of-network leakage with proactive referral management.
6 to 12 months: Automate monthly scoring pipeline, build an operational dashboard for care managers, A/B test intervention strategies, track ROI on prevented churns.