Zero partisan or demographic data used in map drawing

Congressional Maps
Drawn by Geography Alone

An open-source MCMC algorithm samples hundreds of thousands of redistricting plans, then selects Pareto-optimal maps maximising compactness and minimising county splits — without ever seeing a vote, a party registration, or a demographic.

National Map Source Code Read the Math
Weighted ReCom MCMC
Wilson's loop-erased random walk
Polsby-Popper compactness
Pareto-optimal selection
≤ 0.5% population deviation
44 states · 4.5M+ plans sampled

National Map

Best-compact plan for all 44 multi-district states. Click the buttons — or the map — to highlight competitive districts.

Showing all districts
US Congressional Redistricting National Map
Click to cycle views →

Post-Hoc Partisan Lean

Applied after map selection using VEST 2020 precinct data adjusted for 2024 statewide presidential results.
The algorithm never used this data.

Democrat-leaning
204
seats
vs
Republican-leaning
231
seats
D 204 (46.9%) 435 total seats R 231 (53.1%)

All 435 seats including 6 at-large states (AK, DE, ND, SD, VT, WY). At-large lean: D — DE, VT  |  R — AK, ND, SD, WY. Method: VEST 2020 precinct-level returns shifted by each state's 2020→2024 presidential swing.

Competitive Districts by Margin

Margin = winner's two-party lead. "Within X%" = could flip with a ≤X pp shift.

Within 2% — Toss-up
20
D-leaning: 10 R-leaning: 10
D: CA, FL, MD, MI, NH, NJ, NY, OH
R: FL, MD, MI, MO, MS, NC, OH, PA, SC, TX
Within 5% — Competitive
50
D-leaning: 25 R-leaning: 25
D: CA, CO, FL, LA, MD, MI, NH, NJ, NV, NY, OH, SC, TX, VA, WI
R: AZ, CO, FL, GA, IA, IL, IN, KS, MD, MI, MO, MS, NC, NY, OH, PA, SC, TN, TX, UT, WA
Within 8% — Lean
80
D-leaning: 41 R-leaning: 39
D: CA, CO, CT, FL, GA, IL, KY, LA, MD, MI, NH, NJ, NV, NY, OH, OR, PA, SC, TX, VA, WI
R: AL, AZ, CA, CO, FL, GA, IA, IL, IN, KS, MD, MI, MO, MS, NC, NE, NJ, NV, NY, OH, PA, SC, TN, TX, UT, WA

Blind Algorithm vs. Enacted 118th Congress — Competitive Seats

Same VEST 2020 + 2024 swing method applied to both maps. Enacted covers 429 redistricted seats (excludes 6 at-large states). Does a geography-only blind draw produce more competitive elections?

Threshold Blind Algorithm Enacted 118th Difference Blind D / R split Enacted D / R split
≤ 2% toss-ups 20 17 +3 D 10 / R 10 D 5 / R 12
≤ 5% competitive 50 45 +5 D 25 / R 25 D 26 / R 19
≤ 8% lean 80 70 +10 D 41 / R 39 D 39 / R 31

The blind algorithm yields 10 more competitive seats at ≤ 8% (80 vs 70) and a more balanced D/R split at every threshold — despite never using any partisan data. At ≤ 2%, enacted maps produce 12 R-leaning toss-ups vs only 5 D-leaning, while the blind draw is exactly 10/10.

State Maps

Select a state to view the proposed blind map alongside the enacted 118th Congress districts.

Select a state above
Click any state button to view its map
↑ Select a state above to view its district map

Racial Diversity Analysis

Post-hoc audit using 2020 Census PL 94-171 redistricting data.
This data was never seen by the algorithm.

Blind Algorithm · Mean diversity
0.521
median 0.555 · σ 0.147 · N=435
Enacted 118th Congress · Mean diversity
0.511
median 0.542 · σ 0.146 · N=437
Blind advantage (mean)
+0.010
Geography-only maps are slightly more diverse despite using no racial data
Racial diversity histogram comparison

Herfindahl diversity index = 1 − Σpᵢ² across 7 groups: Hispanic, NH White, NH Black, NH AIAN, NH Asian, NH NHPI, NH Other/Two+. 0 = fully homogeneous · ~0.86 = all 7 groups equal. Source: 2020 Census P.L. 94-171 redistricting data.

How It Works

Five pipeline stages, zero partisan input.

Pipeline Overview

  1. Build precinct graph — Census TIGER 2020 VTDs as nodes; edge weights encode inverse road-network distance
  2. MCMC sampling — Weighted ReCom (DeFord, Duchin & Solomon 2021): boundary edge selection → district merge → Wilson's LERW spanning tree → population-balanced cut
  3. Hard filters — Polsby-Popper min ≥ 0.05, max population deviation ≤ 0.5%
  4. Pareto frontier — non-dominated set: maximise PP mean, minimise county splits, cut edges, max county fragmentation
  5. Extract two maps — most compact and fewest-splits plans from the frontier
The algorithm never reads partisan registration, election results, race, income, or any demographic variable.

Full mathematical derivation: ALGORITHM.md  ·  California failure analysis: CALIFORNIA_ANALYSIS.md

Analysis

Questions this project can directly answer with data.

Yes — but the advantage is geographic, not manufactured. For the 429 seats that can be redrawn (excluding at-large states), the blind algorithm produces D 202 / R 227, while the enacted 118th Congress gives D 203 / R 226. The split is within a single seat of each other.

47.1%
Democrat seats
blind algorithm
47.3%
Democrat seats
enacted 118th
≈0
Seat-count
difference

The Republican lean reflects where Americans live. Democratic voters are extraordinarily concentrated in cities and university towns. Compact, equal-population districts naturally fill those areas with landslide D seats, leaving the surrounding suburbs and rural counties R-leaning by default. This geographic sorting effect is structural — no map-drawing methodology eliminates it. The blind algorithm reveals it clearly because it has no incentive to counteract or exploit it.

Yes, and the difference is substantial. Applying the same 2024 presidential lean estimate to both maps, the blind draw produces 80 competitive seats (≤8% margin) compared to 70 under the enacted plan — 14% more seats where the outcome is genuinely in doubt.

20
Toss-ups ≤2%
blind
17
Toss-ups ≤2%
enacted
80
Competitive ≤8%
blind
70
Competitive ≤8%
enacted

The balance matters too. At the toss-up level, the blind draw is exactly 10D / 10R. The enacted plan is skewed: 12 R-leaning toss-ups vs only 5 D-leaning — a 7-seat asymmetry at the most competitive tier. Competitive elections require both parties to defend their seats; the enacted map insulates many Republican-held seats from challenge while leaving more Democratic-held districts marginally exposed.

The clearest signal is a large gap between the number of competitive R-leaning seats geography would naturally produce versus what the enacted map actually contains. States where Republicans held redistricting control dominate the list.

FL
Blind: 5 competitive R
Enacted: 0
TX
Blind: 4 competitive R
Enacted: 1
GA
Blind: D 6 / R 8
Enacted: D 5 / R 9
OH
Blind: D 4 / R 11
Enacted: D 3 / R 12

Florida (28 seats) is the most striking case. Geography alone produces five R-leaning seats with a ≤8% margin — seats that could realistically flip. The enacted map has eliminated every one of them. All 20 Republican-held seats in Florida are now won by comfortable margins. Texas (38 seats) similarly reduces its competitive R seats from 4 to 1, while also converting two D-leaning blind-draw seats into R seats in the enacted plan (blind D=14, enacted D=12).

It is worth noting that Democratic-controlled states show analogous patterns in the opposite direction. Maryland's enacted plan converts what would be a competitive R seat under the blind draw into a safely Democratic one. The phenomenon of incumbent protection through map-drawing is bipartisan; Republicans simply controlled more high-population states during the 2020 redistricting cycle.

This is the efficiency gap — a structural feature of American political geography, not a product of gerrymandering. Democratic voters are concentrated in dense urban cores: central cities, university districts, and inner suburbs. When you draw a compact, equal-population district around any major American city, you create a seat the Democrat wins by a very large margin (D+40 or more). Those surplus Democratic votes are "wasted" — they win the seat by far more than necessary.

The Republican coalition is more evenly spread across suburbs, exurbs, and rural areas. Compact districts in those geographies produce many R+10 to R+25 wins — enough to take the seat, but not so lopsided as to waste votes. The net effect is that Democrats win some seats very efficiently (large margins, few seats) while Republicans win more seats at moderate margins, accumulating a seat-count advantage without any map-drawing at all.

Under the blind draw, the Republican seat share (52.9%) slightly exceeds the Republican two-party presidential vote share (~51% in 2024) — a ~2-point seat premium attributable purely to voter geography. The enacted plan adds a further ~1-point premium through deliberate map-drawing.

Virtually identically to the enacted plan, which is a striking result given the algorithm has no knowledge of race. Racial diversity is measured here using the Herfindahl index — the probability that two randomly chosen residents from a district belong to different racial groups (0 = fully homogeneous, ~0.86 = all seven groups equally represented).

0.521
Blind mean
diversity score
0.511
Enacted mean
diversity score
0.555
Blind median
diversity score
0.542
Enacted median
diversity score

The blind algorithm scores slightly higher on diversity at both mean and median. This suggests that when districts follow compact geographic boundaries, racial communities — which are geographically clustered — remain reasonably intact. The enacted plan's deliberate line-drawing does not materially improve diversity and in some cases fragments naturally diverse communities across multiple districts. Important caveat: the Voting Rights Act requires more than aggregate diversity. It requires that minority communities have an equal opportunity to elect representatives of their choice, which is a district-level standard the blind algorithm does not explicitly optimize for. A full VRA audit remains necessary before any plan could be considered for adoption.

The estimates use a uniform partisan swing method: VEST 2020 precinct-level presidential returns are shifted by each state's 2020→2024 two-party margin change. If a state moved 3 points toward Republicans between 2020 and 2024, every precinct in that state is shifted 3 points. The technique is standard in academic redistricting analysis and produces results consistent with post-election precinct returns where those are available.

Strengths: Uses actual precinct geography (not smoothed estimates), leverages the most granular available data, and the 2024 certified state results are the gold standard for statewide calibration.

Limitations: Real swings are not uniform — cities and suburbs may have moved differently than rural areas within the same state. The method also uses presidential votes as a proxy for congressional preferences; incumbency effects, candidate quality, and local issues cause real congressional results to differ from presidential lean by ±5–10 points in some districts. Treat margins below 5% as approximately competitive rather than precise predictions of outcomes.

Not without additional review, but the pipeline produces legally defensible starting points. Three requirements would need to be verified before any plan is considered for adoption:

1. Voting Rights Act compliance. Section 2 of the VRA prohibits maps that dilute minority voting power. A full VRA audit must confirm that minority communities retain an effective opportunity to elect their preferred candidates. The blind algorithm ignores race, so some plans may inadvertently crack majority-minority districts. The post-hoc diversity analysis shows aggregate scores similar to enacted plans, but VRA compliance is evaluated district by district, not on averages.

2. State-specific legal requirements. Many states mandate that districts preserve municipalities, counties, or named communities of interest. The blind algorithm minimizes county splits (a Pareto objective) but does not know that a particular city or historic community should stay whole. State redistricting criteria vary significantly.

3. Public input and transparency. Redistricting processes typically require public hearings and comment periods. A purely algorithmic result would need to be presented alongside the criteria used, the ensemble of alternatives considered, and an explanation of why the selected plan was chosen — all of which this pipeline provides via the Pareto frontier CSV and per-state reports.

The most realistic use case today is as an independent benchmark: a court or independent commission can compare any proposed map against the blind-draw ensemble to test whether the human-drawn plan is an outlier with respect to partisan outcomes, compactness, or community preservation.

Sample size. Each state runs 2,000–20,000 MCMC steps, producing a few thousand sampled plans. For large states (CA with 52 districts, TX with 38), the space of valid plans is astronomically large. The Pareto-selected plan is the best among what was sampled, not necessarily the global optimum. Running more steps would likely improve compactness scores, particularly for geographically complex states.

Compactness as a proxy for quality. Polsby-Popper and cut-edge minimization are reasonable mathematical surrogates for "good" districts, but they don't capture everything voters care about — historical representation patterns, language communities, economic ties between cities and their hinterlands, or the cohesion of Native American tribal territories, for example.

Single plan selection. The pipeline selects one "best compact" and one "fewest splits" plan per state. Real redistricting involves many trade-offs, and presenting the full Pareto frontier (available in the per-state pareto_frontier.csv files) would give a more honest picture of the range of defensible outcomes.

No Senate or state-legislative analysis. Congressional districts are only one layer. State legislative districts follow the same dynamics but at smaller geographic units, and the effects of partisan line-drawing are often more extreme at the state level.

West Virginia geometry. WV's mountain-ridge geography and very irregular precinct boundaries make it genuinely difficult to produce visually clean 2-district plans. Even with 20,000 MCMC steps the best plan has a Polsby-Popper score of only 0.15. This is a real geographic constraint, not an algorithmic failure, but it is worth noting when interpreting WV's map.