Skip to main content
Special Populations · RHTP-09.TD1

Population Identification Methodology

By Syam Adusumilli · 15 min read
In a Hurry? Read the executive summary.

Who counts as a member of a special population determines who receives targeted services, how resources allocate, and whether transformation reaches those most in need. This question seems technical but is fundamentally political. Every definition includes some people and excludes others. The elderly veteran living off-reservation who self-identifies as American Indian but lacks tribal enrollment faces different system access than the enrolled member living on tribal land. Both are “rural tribal veterans.” Programs may serve one or neither.

This technical document provides the methodological framework for identifying and quantifying the sixteen special populations examined across Series 9. The framework serves RHTP planners who must translate universal program language into population-specific implementation. It surfaces the definitional conflicts that arise when federal categories meet lived experience, and it acknowledges the fundamental limitation: administrative categories cannot capture human complexity.

The document emerges from methodological challenges encountered across Series 9 articles rather than abstract framework development. Each population article revealed specific identification problems. Rural farmworkers who move across state lines monthly cannot be counted through any static administrative system. Justice-involved individuals reenter communities without documentation of their status. Frontier populations exist only through geographic definitions that vary by agency. The framework below consolidates these challenges into usable guidance.

Why Definitions Matter for Transformation
#

RHTP applications universally promise to serve “rural populations” and “underserved communities.” These phrases carry no operational meaning until translated into specific identification criteria. A state that commits to serving tribal populations must decide whether that means IHS-eligible individuals, Census-identified American Indians and Alaska Natives, or anyone residing on tribal land regardless of tribal membership. Each definition produces different counts, different service delivery requirements, and different coordination partners.

Definitional choices shape resource allocation in ways that policy language obscures. Consider two states with identical RHTP awards:

State A defines rural elderly as Census-identified residents 65 and older in nonmetropolitan counties. This produces a clean count from American Community Survey data, enables straightforward needs assessment, and creates clear accountability metrics.

State B defines rural elderly as Medicare beneficiaries 65 and older residing in areas meeting HRSA shortage area criteria. This produces a smaller count, focuses on areas with documented access barriers, but excludes elderly populations in technically non-shortage areas who nonetheless face significant access challenges.

Neither definition is wrong. Both are incomplete. The choice between them is a policy decision disguised as a technical one. States making these choices rarely acknowledge the implications, and RHTP guidance provides no framework for consistent decision-making across states.

The identification challenges vary systematically by population type:

Demographic populations (elderly, children) present fewest challenges because Census data provides reliable counts by age and geography. Definitions are standardized and data is current.

Geographic populations (frontier, Appalachian, Black Belt) depend entirely on which classification system applies. A county may be frontier under USDA definitions but not under Census rural classifications. Different definitions produce different counts.

Condition-based populations (SUD, SMI, complex medical conditions) rely on surveys, treatment data, or claims analysis that systematically undercount. Those not in treatment are invisible. Those without insurance generate no claims.

Circumstance-based populations (farmworkers, justice-involved, veterans) exist through administrative systems that capture only portions of actual populations. Farmworkers who avoid documentation are uncounted. Veterans who never register with VA are invisible to VA data.

Legally-defined populations (tribal members) require navigating multiple overlapping definitions with different federal authorities and different service implications.

Identification Approaches by Population
#

The table below establishes primary identification methods for each Series 9 population, documents data sources, and identifies known limitations. These methods represent current best practice rather than perfect solutions.

PopulationPrimary Identification MethodSecondary MethodsData SourcesKey Limitations
Rural Elderly (65+)Census age by rural geographyMedicare enrollment, SSA dataACS, Census, CMSAccurate; definition of “rural” varies
Tribal/IndigenousTribal enrollment, Census AIAN categoryIHS user population, Urban Indian Organization registrationBIA, Census, IHSSignificant undercount of non-enrolled; enrollment criteria vary by tribe
FrontierUSDA Frontier and Remote Area codes (FAR 3-4)Census population density, RUCA codesUSDA ERS, CensusAccurate for defined criteria; multiple frontier definitions exist
FarmworkersAgricultural census employer reports, NCFH estimatesDOL H-2A visa data, Migrant Health Center utilizationUSDA, DOL, NCFH, HRSASevere undercount of undocumented workers; seasonal variation
Persistent PovertyCensus poverty rate by county over multiple decadesARC distressed designation, USDA persistent poverty countiesCensus, ARC, USDACounty-level masks within-county variation; 30-year lag in designation criteria
Post-IndustrialEconomic transition indicators (manufacturing decline, mining closures)BLS industry data, ARC designationsBLS, Census, ARCNo standard definition; identification varies by study
Black Belt/DeltaCensus race composition, historical geographyARC Delta Regional Authority designationCensus, ARCGeographic boundaries contested; demographic composition changing
AppalachianAppalachian Regional Commission county designationNone; ARC designation is authoritativeARCClear definition; includes economically diverse counties
BorderGeographic proximity to international boundariesDHS/CBP data, state designationsCensus, DHSDistance criteria vary (10-100 miles from border)
VeteransVA registration, Census veteran identificationDOD separation data, state veterans affairsVA, Census, DODAccurate for VA-registered; significant non-registered population
ChildrenCensus age by rural geographySchool enrollment, Medicaid child enrollmentACS, Census, Education, CMSAccurate; pediatric access measures less developed
Justice-InvolvedBJS incarceration and community supervision dataState DOC data, county jail recordsBJS, state DOCCaptures currently supervised; poor tracking post-supervision
SUDSAMHSA survey estimates, treatment admissionsMedicaid claims, ED visit dataNSDUH, TEDS, CMSPrevalence estimated; treatment data misses untreated
SMISAMHSA survey estimates, state mental health authority dataMedicaid disability enrollment, SSI/SSDINSDUH, state MHA, SSAPrevalence estimated; severe undercount in rural areas
Complex ConditionsMedicare/Medicaid claims, disease registriesHospital discharge data, specialty referral patternsCMS, HCUP, disease registriesMisses uninsured; condition-specific data quality varies
Autism/IDDCDC ADDM surveillance, Medicaid DD waiver enrollmentSpecial education (IDEA) data, SSI childhood disabilityCDC, CMS, Education, SSARural underdiagnosis distorts prevalence; waiver waitlists obscure actual counts

Critical Methodological Notes
#

Census Undercounts

The decennial Census and American Community Survey systematically undercount specific populations. The Census Bureau estimates net undercounts of:

  • American Indian and Alaska Native populations on reservations: approximately 5.6%
  • Hispanic populations: approximately 4.9%
  • Children under age 5: approximately 2.8%
  • Rural populations overall: approximately 1.5%

These undercounts compound when populations overlap. An American Indian child on a rural reservation faces multiplicative undercount risk. State planners using Census data as denominators should adjust for known undercounts when available.

Administrative Data Limitations

Administrative data captures only populations touching specific systems. This creates systematic biases:

Treatment data identifies those receiving services, not those needing them. SUD prevalence estimates based on treatment admissions miss the majority of individuals with substance use disorders who never enter treatment. Rural treatment gaps mean rural prevalence estimates from treatment data are particularly unreliable.

Claims data identifies insured populations. The uninsured generate no claims. In states without Medicaid expansion, low-income adults may have conditions never documented in claims data. Complex condition identification from claims systematically excludes rural uninsured populations.

Enrollment data identifies those who register. Veterans must actively register with VA to appear in VA data. Rural veterans, particularly younger veterans and those without service-connected disabilities, register at lower rates than urban veterans.

Survey Data Challenges

SAMHSA’s National Survey on Drug Use and Health provides the primary prevalence estimates for SUD and SMI. Survey methodology creates rural-specific limitations:

Sample sizes in rural areas are small, producing large confidence intervals around rural prevalence estimates. State-level rural estimates may be based on fewer than 100 respondents.

Response rates differ between rural and urban populations. Whether rural populations are more or less likely to report stigmatized conditions remains debated.

Survey timing creates point-in-time estimates that may not capture seasonal variation in migrant populations or condition patterns.

Population Size Estimates
#

The estimates below represent the best available quantification of each Series 9 population. Confidence levels reflect both data quality and definitional stability. High confidence indicates reliable data with standardized definitions. Low confidence indicates significant measurement challenges, definitional variation, or both.

PopulationEstimated Size (Rural)Confidence LevelPrimary BasisKey Qualifications
Rural Elderly (65+)9.3 millionHighCensus ACSStable definition; accurate counts
Rural Tribal1.1 millionModerateCensus, IHSUndercount likely; enrollment criteria vary
Frontier (FAR 3-4)5.2 millionHighUSDA ERSAccurate for FAR definition; other definitions yield different counts
Farmworkers (rural)1.8-2.5 millionLowNCFH estimatesUndocumented workers severely undercounted; seasonal variation
Persistent Poverty8.5 millionModerateCensus, ARCCounty-level definition; within-county variation masked
Post-Industrial10-15 millionLowDefinition-dependentNo standardized definition; range reflects definitional choices
Black Belt/Delta4.2 millionModerateCensusGeographic boundaries shift with demographic change
Appalachian26.3 million total; approximately 11 million ruralHighARCDefined boundaries; includes economically diverse areas
Border7.1 million total; approximately 2.8 million ruralHighCensusDistance definition varies (10-100 miles)
Rural Veterans4.7 millionHighVA, CensusAccurate for self-identified; VA registration subset
Rural Children9.1 millionHighCensus ACSAccurate counts
Justice-Involved1.5 million (rural residence)ModerateBJSCaptures currently supervised; post-supervision invisible
Rural SUD3.0-3.5 millionModerateNSDUH estimatesPrevalence estimated; treatment penetration low
Rural SMI1.8-2.3 millionModerateNSDUH estimatesPrevalence estimated; rural underdiagnosis likely
Complex ConditionsVariable by conditionVariableCondition-specificDefinition varies by condition; uninsured undercounted
Rural Autism/IDD1.2-1.8 millionLowCDC ADDM, waiver dataRural underdiagnosis distorts; waiver waitlists obscure

Population Size Uncertainty
#

The ranges above mask substantial uncertainty that RHTP planners must acknowledge. Consider three populations with particularly problematic estimates:

Farmworkers: The National Center for Farmworker Health estimates 2.4 million farmworkers nationally, with approximately 70% residing in rural areas during peak agricultural seasons. However, undocumented workers avoid enumeration, seasonal workers may be counted in multiple locations or none, and the distinction between farmworkers and agricultural laborers varies by data source. True rural farmworker population may be 20-40% higher than estimates suggest.

Justice-Involved: Bureau of Justice Statistics reports 5.5 million adults under correctional supervision nationally. Rural residence estimates derive from offense location, release location, and survey data with significant missing information. Post-supervision populations are essentially invisible to administrative data. A person who completed probation five years ago and now lives in a rural community with health needs related to incarceration history appears in no justice-involved population count.

Autism/IDD: CDC’s Autism and Developmental Disabilities Monitoring Network estimates 1 in 36 children has autism spectrum disorder. However, ADDM sites are primarily metropolitan, and rural areas have documented diagnostic delays and underdiagnosis. Applying national prevalence to rural populations likely overstates identified cases while understating true prevalence. The rural autism population may be simultaneously undercounted (true prevalence) and overcounted (diagnosed prevalence) depending on which measure matters for service planning.

Intersectionality Matrix
#

Real people belong to multiple population categories simultaneously. Single-population analysis misses the compound disadvantage that shapes individual experience. The intersectionality matrix below identifies high-impact population overlaps that RHTP implementation must address.

IntersectionPopulation 1Population 2Compound EffectEstimated SizeSystem Coordination Challenge
Elderly + FrontierRural ElderlyFrontierAging with no accessible services1.1-1.3 millionNo infrastructure to coordinate
Tribal + SUDTribalSUDHistorical trauma intersects addiction180,000-220,000IHS-state treatment system coordination
Veteran + SMIVeteransSMIMilitary trauma, combat-related conditions320,000-380,000VA-state mental health coordination
Black Belt + ElderlyBlack Belt/DeltaRural ElderlyHistorical discrimination compounds aging750,000-850,000Medicaid-Medicare dual eligible coordination
Farmworker + Complex ConditionsFarmworkersComplex ConditionsMobility disrupts specialty care120,000-180,000Cross-state continuity impossible
Appalachian + SUDAppalachianSUDEconomic despair, opioid crisis550,000-650,000Multi-state, multi-system coordination
Justice + SUDJustice-InvolvedSUDReentry + addiction recovery700,000-800,000Correctional-community treatment transitions
Tribal + ElderlyTribalRural ElderlyCultural context, IHS limitations140,000-180,000IHS-Medicare coordination
Frontier + ChildrenFrontierChildrenDevelopment without pediatric access680,000-750,000Telehealth-dependent services
Border + FarmworkerBorderFarmworkersDocumentation barriers, seasonal migration450,000-550,000Cross-border continuity, documentation sensitivity
Autism + FrontierAutism/IDDFrontierDiagnosis and services impossible85,000-120,000Telehealth inadequate for hands-on therapy
Persistent Poverty + SMIPersistent PovertySMIEconomic distress, treatment access280,000-350,000Medicaid coverage gaps, workforce absence

Analytical Implications
#

Compound disadvantage is not additive. An elderly veteran in frontier Montana does not face elderly challenges plus veteran challenges plus frontier challenges in simple combination. The intersection creates qualitatively different circumstances. The veteran’s VA benefits exist in name only when the nearest VA facility is 200 miles away. The elderly person’s Medicare coverage provides little when no providers accept Medicare within driving distance. Each additional population membership does not add to disadvantage proportionally; it multiplies barriers.

System coordination failures concentrate at intersections. A tribal member with SMI living in a persistent poverty county navigates IHS (tribal health), state mental health authority (SMI services), and potentially Medicaid (if expansion state) or no coverage (if non-expansion state). Each system has different eligibility criteria, different providers, different electronic records, and different geographic boundaries. The person most in need of coordinated care is least likely to receive it because multiple systems claim partial responsibility while none assumes full accountability.

Program design rarely acknowledges intersectionality. RHTP applications describe services for elderly populations and services for SUD populations as separate workstreams. The elderly person with alcohol use disorder receives neither geriatric-informed addiction treatment nor addiction-informed geriatric care. Categorical program design produces categorical service delivery that fragments care for people whose needs do not fit single categories.

State-Level Estimation Guidance
#

RHTP implementation requires states to translate national population estimates into state-specific service planning. The guidance below provides methodology for state-level population estimation.

Step 1: Establish Geographic Baseline
#

Define “rural” consistently for all population estimates. Options include:

Census Rural-Urban Continuum Codes (RUCC): Nine-category classification from metropolitan to completely rural. Most states use RUCC codes 4-9 or 7-9 depending on program requirements.

HRSA Rural Health Information Hub definition: Nonmetropolitan counties plus metropolitan census tracts meeting specific population density criteria.

State-specific definition: Some states have statutory rural definitions that may not align with federal classifications.

Whatever definition selected, apply it consistently across all population estimates. Mixing definitions produces population estimates that cannot be summed or compared.

Step 2: Apply Population-Specific Methodology
#

For demographic populations (elderly, children, veterans):

  • Pull Census ACS data for selected geography
  • Apply age, sex, or veteran status filters
  • Adjust for known undercounts where available
  • Confidence: High

For geographic populations (frontier, Appalachian, Black Belt):

  • Apply authoritative designation (USDA FAR codes, ARC counties)
  • Pull total population for designated areas
  • Note overlap with state’s rural definition
  • Confidence: High for designated populations; geographic overlap may create confusion

For tribal populations:

  • Contact state tribal liaison and individual tribal governments
  • Obtain IHS user population data for state
  • Pull Census AIAN data for rural geography
  • Reconcile differences (IHS eligible vs. self-identified vs. residing in state)
  • Confidence: Moderate; multiple definitions yield different counts

For condition-based populations (SUD, SMI, complex conditions):

  • Apply national prevalence to state rural population
  • Adjust for state-specific factors (e.g., opioid prescribing rates, treatment availability)
  • Cross-reference with treatment data as floor estimate
  • Confidence: Low to moderate; prevalence estimates imprecise, treatment data undercounts

For circumstance-based populations (farmworkers, justice-involved):

  • Obtain state-specific administrative data where available
  • Apply multipliers for undocumented/uncounted populations
  • Acknowledge uncertainty in planning documents
  • Confidence: Low; administrative data captures fraction of true population

Step 3: Assess Intersectionality
#

  • Identify populations with significant state overlap
  • Estimate intersection sizes using national ratios where state data unavailable
  • Design service delivery that addresses compound disadvantage
  • Build coordination mechanisms between systems serving overlapping populations

Step 4: Document Methodology and Limitations
#

Every state population estimate should include:

  • Definition used for each population category
  • Data sources and vintage
  • Known limitations and biases
  • Confidence level assessment
  • Comparison to alternative estimates where available

Documentation enables accountability. When state plans claim to serve a population of a certain size, documentation enables evaluation of whether services actually reached that population. Without transparent methodology, evaluation is impossible.

Federal Data Source Reference
#

Primary Federal Sources
#

SourceAgencyContentUpdate FrequencyAccessRural Utility
American Community SurveyCensusDemographics, disability, health insuranceAnnualPublicHigh
Decennial CensusCensusTotal population, demographicsEvery 10 yearsPublicHigh
Behavioral Risk Factor Surveillance SystemCDCHealth behaviors, chronic conditionsAnnualPublicModerate (state-level only)
National Survey on Drug Use and HealthSAMHSASUD and mental health prevalenceAnnualPublic (limited)Moderate (small rural sample)
Treatment Episode Data SetSAMHSASUD treatment admissionsAnnualPublicLimited (treatment population only)
Medicare Provider DataCMSProvider locations, enrollment, claimsContinuousPublic (aggregated)High
Medicaid State Drug UtilizationCMSPrescribing patternsQuarterlyPublicModerate
IHS User Population DataIHSAI/AN population receiving IHS servicesAnnualRequest requiredHigh for tribal populations
Bureau of Justice StatisticsDOJIncarceration, supervision, recidivismVariablePublicModerate
Agricultural CensusUSDAFarm operations, hired workersEvery 5 yearsPublicModerate for farmworkers
Rural-Urban Continuum CodesUSDA ERSCounty rurality classificationEvery 10 yearsPublicEssential
Frontier and Remote Area CodesUSDA ERSFrontier classificationEvery 10 yearsPublicEssential for frontier
Area Health Resources FilesHRSAHealth workforce, facilitiesAnnualPublicHigh

State Data Integration
#

Federal data provides baseline but state sources often offer greater specificity:

State health departments maintain vital statistics, disease registries, and sometimes state-specific health surveys with better rural coverage than national surveys.

State Medicaid agencies have detailed enrollment and claims data that CMS aggregates mask. State-level requests can produce rural-specific utilization patterns.

State departments of corrections maintain incarceration and supervision data with residence information that BJS aggregates cannot provide.

State labor departments may have farmworker program data beyond federal sources.

State mental health authorities have data on public mental health system utilization not captured in national surveys.

Tribal governments maintain enrollment data more accurate than Census estimates for specific tribal nations.

Methodological Recommendations
#

For State RHTP Planners
#

Use consistent definitions. Select rural classification and population definitions that apply throughout your application and implementation. Document selections explicitly.

Acknowledge uncertainty. Precision implies false accuracy. Report ranges where appropriate, identify confidence levels, and explain limitations.

Plan for intersectionality. Design services that address compound disadvantage, not just categorical populations.

Build local partnerships. Community organizations, tribal governments, and advocacy groups often have population knowledge that administrative data cannot capture.

Update estimates regularly. Population distributions change. Annual reassessment using updated sources improves targeting.

For Federal RHTP Oversight
#

Provide definitional guidance. States make different definitional choices producing non-comparable results. Standardized guidance would enable cross-state comparison.

Fund improved rural data collection. National surveys have inadequate rural sample sizes. Dedicated rural health surveys would improve prevalence estimates.

Require intersection analysis. Applications that address populations separately perpetuate categorical thinking that fails people with compound disadvantage.

Support tribal data sovereignty. Tribal nations should control how their population data is collected, analyzed, and shared.

For Researchers
#

Report rural subgroup analyses. When study populations include rural participants, report rural-specific findings even with smaller sample sizes.

Develop rural prevalence adjustments. Research establishing how national prevalence estimates should be adjusted for rural populations would substantially improve state planning.

Study intersectionality empirically. The compound disadvantage framework is theoretically sound but empirical estimation of intersection effects remains limited.

How this article connects to others in Blue Gray Matters.

Rural classification systems in 1-TD-B provide the geographic framework within which population identification methodology operates, distinguishing frontier from rural from micropolitan.
Regional variation matrix in 1-TD-C provides the geographic overlay enabling population mapping to specific health regions where demographic concentrations cluster.
Performance measurement systems in Series 5 that fail to disaggregate by population produce the invisibility that this document's identification methodology attempts to address — measurement design determines whether population-specific outcomes are visible.
RHTP-17.SYN technical
Series 17 state profiles apply the population identification methodology this document establishes to assess which diverse populations require accommodation in each state's RHTP implementation — the identification framework enables consistent cross-state assessment of population-specific implementation requirements that standard cluster analysis does not capture.

Sources cited in this article.

  1. Appalachian Regional Commission. "Data Reports: County Economic Status and Distressed Areas." ARC, 2024, arc.gov/data-reports.
  2. Bureau of Justice Statistics. "Correctional Populations in the United States." DOJ, 2024, bjs.ojp.gov/library/publications.
  3. Centers for Disease Control and Prevention. "Autism and Developmental Disabilities Monitoring Network." CDC, 2024, cdc.gov/autism/addm.
  4. Centers for Disease Control and Prevention. "BRFSS Prevalence & Trends Data." CDC, 2024, cdc.gov/brfss/brfssprevalence.
  5. Centers for Medicare & Medicaid Services. "Geographic Variation Public Use File." CMS, 2024, cms.gov/research-statistics-data-systems/statistics-trends-and-reports/medicare-geographic-variation.
  6. Health Resources and Services Administration. "Area Health Resources Files." HRSA, 2024, data.hrsa.gov/topics/health-workforce/ahrf.
  7. Indian Health Service. "IHS Profile." IHS, 2024, ihs.gov/aboutihs/thisisihs.
  8. National Center for Farmworker Health. "Farmworker Health Factsheet." NCFH, 2024, ncfh.org/resources.
  9. Substance Abuse and Mental Health Services Administration. "National Survey on Drug Use and Health." SAMHSA, 2024, samhsa.gov/data/data-we-collect/nsduh.
  10. U.S. Census Bureau. "American Community Survey Data." Census Bureau, 2024, census.gov/programs-surveys/acs/data.html.
  11. U.S. Census Bureau. "Post-Enumeration Survey: 2020 Census Coverage Estimates." Census Bureau, 2022, census.gov/newsroom/press-releases/2022/2020-census-estimates-of-undercount-and-overcount.html.
  12. U.S. Department of Agriculture Economic Research Service. "Rural-Urban Continuum Codes." USDA ERS, 2023, ers.usda.gov/data-products/rural-urban-continuum-codes.
  13. U.S. Department of Agriculture Economic Research Service. "Frontier and Remote Area Codes." USDA ERS, 2020, ers.usda.gov/data-products/frontier-and-remote-area-codes.
  14. U.S. Department of Veterans Affairs. "VA Enrollee Health Care Utilization." VA, 2024, va.gov/health/utilization.asp.