feat(overton): improvements and extensions — party differentiation, voting margin, SVD viz, mechanism validation, predictive model

U1: JA21 drives moderation effect (+0.203 CS shift, only party with volume+support gains)
U2: Coalition coding split at July 2024; opposition effect confirmed (d=0.85 vs 0.87)
U3: Voting margin (ρ=0.812 with centrist support) is far superior to pass rate
U4: SVD trajectory confirms spatial divergence: centrists moved left (Δx=-0.30), right stationary
U5: Mechanism classification Cohen's κ=0.41 (moderate); taxonomy needs revision
U6: Predictive model AUC-ROC=0.81; submitter party and category are strongest predictors
3 weeks ago · d34d43a888
parent 7df961ba83
commit d34d43a888
13 changed files with 3584 additions and 0 deletions
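
For readers checking the κ=0.41 figure in U5, here is a minimal, self-contained sketch of how Cohen's kappa and its verbal label can be derived. It is an illustration under stated assumptions, not the code from this commit: the function names `cohens_kappa` and `interpret` and the four toy labels are hypothetical, and the band cut-offs follow the Landis and Koch convention that the "moderate" wording appears to rely on.

```python
from collections import Counter


def cohens_kappa(labels_a: list[str], labels_b: list[str]) -> float:
    """Cohen's kappa for two equal-length label sequences (hypothetical helper)."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(labels_a)
    # Observed agreement: share of items both raters labelled identically.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the raters' marginal shares, summed over labels.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    p_e = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if p_e == 1.0:
        # Both raters used one identical label throughout; kappa is undefined,
        # so this sketch reports 1.0 by convention (an assumption).
        return 1.0
    return (p_o - p_e) / (1.0 - p_e)


def interpret(kappa: float) -> str:
    """Map kappa to a Landis and Koch style band (upper bounds are exclusive)."""
    if kappa < 0.0:
        return "less than chance"
    bands = ((0.20, "slight"), (0.40, "fair"), (0.60, "moderate"), (0.80, "substantial"))
    for upper, label in bands:
        if kappa < upper:
            return label
    return "almost perfect"


# Toy data, invented for illustration only.
rater_a = ["procedural_technical", "procedural_technical", "consensus_framing", "consensus_framing"]
rater_b = ["procedural_technical", "consensus_framing", "consensus_framing", "consensus_framing"]

kappa = cohens_kappa(rater_a, rater_b)
print(kappa, interpret(kappa))
```

On the toy data the raters agree on 3 of 4 items (p_o = 0.75) while chance alone predicts p_e = 0.5, so κ = (0.75 - 0.5) / (1 - 0.5) = 0.5. That is why a raw agreement rate can look respectable while κ still lands in the moderate band.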
--- a/analysis/right_wing/mechanism_validation.py
+++ b/analysis/right_wing/mechanism_validation.py
@@ -0,0 +1,946 @@
 #!/usr/bin/env python3
 """Mechanism classification validation with a second classifier.
 Computes inter-rater reliability (Cohen's kappa) between the original inline
 classifications and a second LLM-based classification using a different prompt
 template and (optionally) a different model.
 Usage:
    uv run python analysis/right_wing/mechanism_validation.py
 """
 from __future__ import annotations
 import argparse
 import json
 import logging
 import sys
 import time
 from collections import Counter
 from concurrent.futures import ThreadPoolExecutor
 from pathlib import Path
 from typing import Any
 import duckdb
 ROOT = Path(__file__).parent.parent.parent.resolve()
 if str(ROOT) not in sys.path:
    sys.path.insert(0, str(ROOT))
 from ai_provider import ProviderError, chat_completion
 from analysis.config import config
 logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
 logger = logging.getLogger(__name__)
 # ── mechanism taxonomy ───────────────────────────────────────────────────────
 MECHANISMS = [
    "consensus_framing",
    "institutional_rule_of_law",
    "welfare_service_expansion",
    "procedural_technical",
    "local_constituency",
    "coalition_alignment",
    "symbolic_declaratory",
    "targeted_restriction",
    "system_dismantling",
    "crisis_response",
 ]
 MECHANISM_LABELS_NL = {
    "consensus_framing": "Consensus framing (gedeeld belang)",
    "institutional_rule_of_law": "Institutioneel/rechtsstatelijk",
    "welfare_service_expansion": "Welzijn/dienstverlening uitbreiding",
    "procedural_technical": "Procedureel/technisch",
    "local_constituency": "Lokaal/regionaal",
    "coalition_alignment": "Coalitie-afstemming",
    "symbolic_declaratory": "Symbolisch/declaratoir",
    "targeted_restriction": "Gerichte restrictie",
    "system_dismantling": "Systeemontmanteling",
    "crisis_response": "Crisisrespons",
 }
 MECHANISM_LABELS_EN = {
    "consensus_framing": "Consensus framing / shared interest",
    "institutional_rule_of_law": "Institutional / rule of law",
    "welfare_service_expansion": "Welfare / service expansion",
    "procedural_technical": "Procedural / technical",
    "local_constituency": "Local / regional constituency",
    "coalition_alignment": "Coalition alignment",
    "symbolic_declaratory": "Symbolic / declaratory",
    "targeted_restriction": "Targeted restriction",
    "system_dismantling": "System dismantling",
    "crisis_response": "Crisis response",
 }
 # Original inline classifications (from mechanism_classification.py)
 ORIGINAL_CLASSIFICATIONS: dict[int, str] = {
    15458: "crisis_response",
    26477: "institutional_rule_of_law",
    9149: "consensus_framing",
    17099: "procedural_technical",
    4933: "procedural_technical",
    17751: "consensus_framing",
    20068: "procedural_technical",
    16520: "consensus_framing",
    17036: "welfare_service_expansion",
    17681: "consensus_framing",
    14554: "procedural_technical",
    21864: "procedural_technical",
    26493: "targeted_restriction",
    21982: "consensus_framing",
    14125: "crisis_response",
    13683: "welfare_service_expansion",
    16691: "procedural_technical",
    15005: "procedural_technical",
    17536: "institutional_rule_of_law",
    16999: "consensus_framing",
    8325: "procedural_technical",
    13370: "welfare_service_expansion",
    18030: "procedural_technical",
    11382: "procedural_technical",
    18616: "procedural_technical",
    12411: "crisis_response",
    22595: "crisis_response",
    15772: "system_dismantling",
    7111: "welfare_service_expansion",
    25784: "targeted_restriction",
    27731: "system_dismantling",
    15626: "crisis_response",
    20215: "welfare_service_expansion",
    16430: "symbolic_declaratory",
    25982: "local_constituency",
    17176: "targeted_restriction",
    7054: "procedural_technical",
    20323: "procedural_technical",
    18025: "system_dismantling",
    14837: "system_dismantling",
    19620: "targeted_restriction",
    21801: "consensus_framing",
    19464: "crisis_response",
    26855: "targeted_restriction",
    22280: "local_constituency",
    20115: "symbolic_declaratory",
    15082: "targeted_restriction",
    6637: "targeted_restriction",
    18691: "symbolic_declaratory",
    18062: "crisis_response",
    3784: "procedural_technical",
    10205: "procedural_technical",
    10278: "coalition_alignment",
    25079: "consensus_framing",
    2980: "targeted_restriction",
    10420: "crisis_response",
    25092: "targeted_restriction",
    25545: "institutional_rule_of_law",
    23065: "procedural_technical",
    2878: "welfare_service_expansion",
    25573: "procedural_technical",
    3298: "symbolic_declaratory",
    25061: "consensus_framing",
    4481: "consensus_framing",
    3961: "procedural_technical",
    473: "institutional_rule_of_law",
    10413: "consensus_framing",
    974: "procedural_technical",
    24009: "procedural_technical",
    9789: "institutional_rule_of_law",
    24651: "targeted_restriction",
    1890: "local_constituency",
    1191: "consensus_framing",
    3448: "targeted_restriction",
    23910: "institutional_rule_of_law",
    25566: "welfare_service_expansion",
    2070: "targeted_restriction",
    23885: "consensus_framing",
    24906: "procedural_technical",
    2496: "procedural_technical",
    25582: "targeted_restriction",
    3053: "local_constituency",
    1495: "procedural_technical",
    10178: "procedural_technical",
    1614: "procedural_technical",
    23441: "consensus_framing",
    3569: "consensus_framing",
    10285: "procedural_technical",
    23058: "procedural_technical",
    3287: "procedural_technical",
    10434: "consensus_framing",
    10089: "procedural_technical",
    22706: "consensus_framing",
    3877: "institutional_rule_of_law",
    25062: "consensus_framing",
    3687: "targeted_restriction",
    25166: "procedural_technical",
    4618: "procedural_technical",
    3468: "institutional_rule_of_law",
    24632: "institutional_rule_of_law",
    25451: "symbolic_declaratory",
    2351: "targeted_restriction",
    4227: "consensus_framing",
    22853: "consensus_framing",
    9884: "procedural_technical",
    1428: "consensus_framing",
    3629: "symbolic_declaratory",
    1572: "local_constituency",
    25493: "procedural_technical",
    1359: "procedural_technical",
    2252: "procedural_technical",
    23605: "procedural_technical",
    3760: "consensus_framing",
    1005: "consensus_framing",
    10110: "coalition_alignment",
    23301: "consensus_framing",
    24046: "symbolic_declaratory",
    651: "welfare_service_expansion",
    1491: "targeted_restriction",
    25606: "targeted_restriction",
    313: "procedural_technical",
    24008: "consensus_framing",
    754: "targeted_restriction",
    25469: "targeted_restriction",
    25091: "targeted_restriction",
    2170: "institutional_rule_of_law",
    22792: "procedural_technical",
    10597: "institutional_rule_of_law",
    23013: "institutional_rule_of_law",
    3472: "institutional_rule_of_law",
    2014: "system_dismantling",
    920: "procedural_technical",
    2143: "welfare_service_expansion",
    688: "system_dismantling",
    2290: "system_dismantling",
    4497: "targeted_restriction",
    3823: "symbolic_declaratory",
    23141: "institutional_rule_of_law",
    4436: "institutional_rule_of_law",
    25616: "targeted_restriction",
    2662: "institutional_rule_of_law",
    23287: "institutional_rule_of_law",
    4660: "consensus_framing",
    4761: "targeted_restriction",
    2264: "institutional_rule_of_law",
    4394: "institutional_rule_of_law",
    1691: "targeted_restriction",
    10601: "targeted_restriction",
    4089: "targeted_restriction",
    23206: "procedural_technical",
    22676: "institutional_rule_of_law",
    115: "system_dismantling",
    3951: "consensus_framing",
    1375: "targeted_restriction",
    3090: "targeted_restriction",
    24650: "procedural_technical",
    1772: "consensus_framing",
    3678: "system_dismantling",
    1692: "institutional_rule_of_law",
    24077: "symbolic_declaratory",
    349: "institutional_rule_of_law",
    9769: "targeted_restriction",
    4656: "symbolic_declaratory",
    23984: "system_dismantling",
    2168: "institutional_rule_of_law",
    4443: "institutional_rule_of_law",
    4489: "procedural_technical",
    10290: "targeted_restriction",
    4071: "targeted_restriction",
    4088: "targeted_restriction",
    1507: "system_dismantling",
    2870: "procedural_technical",
    1912: "system_dismantling",
    22658: "symbolic_declaratory",
    10288: "targeted_restriction",
    4080: "institutional_rule_of_law",
    1847: "targeted_restriction",
    23127: "system_dismantling",
    4367: "targeted_restriction",
    9790: "targeted_restriction",
    4150: "procedural_technical",
    741: "targeted_restriction",
    1705: "consensus_framing",
    1831: "consensus_framing",
    10600: "targeted_restriction",
    9767: "targeted_restriction",
    3830: "system_dismantling",
    4221: "system_dismantling",
    3354: "institutional_rule_of_law",
    9977: "symbolic_declaratory",
    898: "consensus_framing",
    24848: "system_dismantling",
    756: "targeted_restriction",
    24358: "institutional_rule_of_law",
    4309: "institutional_rule_of_law",
    10167: "local_constituency",
    23633: "procedural_technical",
    23030: "targeted_restriction",
    1959: "system_dismantling",
    23454: "procedural_technical",
 }
 # ── prompt templates ─────────────────────────────────────────────────────────
 # Original prompt (from mechanism_classification.py — inline subagent)
 # Classifications were done by reading full title + body_text.
 # The second classifier uses a DIFFERENT template:
 #  - English wording (not Dutch)
 #  - Mechanisms presented in a DIFFERENT order (roughly the reverse of MECHANISMS)
 #  - Asks for a single pick plus a 1-5 confidence score and brief reasoning
 #  - Includes definition context for each mechanism
 MECHANISMS_SHUFLLED = list(reversed(MECHANISMS))
 MECHANISM_DEFINITIONS_EN = """1. crisis_response — A temporary, emergency measure responding to an acute event (pandemic, natural disaster, sudden crisis). Reactive and time-limited.
 2. system_dismantling — Aims to dismantle, abolish, or fundamentally restructure an existing policy, institution, or regulatory framework. Not reform but abolition/reversal.
 3. targeted_restriction — Imposes specific restrictions on a defined group, behavior, or activity. Narrow scope, punitive or exclusionary intent.
 4. symbolic_declaratory — Primarily sends a political signal, makes a statement, or takes a position without direct policy impact. Declaratory, symbolic, expressive.
 5. procedural_technical — Technical adjustment, budget amendment, implementation detail, or administrative procedure. Bureaucratic, operational, non-ideological.
 6. local_constituency — Serves a specific local/regional interest, constituency, or geographic area. NIMBY or local-advocacy pattern.
 7. coalition_alignment — Reflects coalition politics: budget compromises, package deals, or alignments between coalition partners. Coalition-maintenance.
 8. welfare_service_expansion — Expands government services, social welfare, public goods, or citizen entitlements. Positive provision, not restriction.
 9. institutional_rule_of_law — Concerns legal frameworks, rule of law, institutional integrity, judicial process, or constitutional matters. Rule-based, institutional.
 10. consensus_framing — Frames the motion as serving a broad, shared interest. Appeals to common ground, national interest, or bipartisan consensus. Inclusive, bridge-building, non-polarizing."""
 SECOND_CLASSIFIER_PROMPT = """Classify the following Dutch parliamentary motion according to the mechanism taxonomy below.
 MOTION TITLE: {title}
 MOTION TEXT: {body}
 TASK: Identify the PRIMARY mechanism this motion uses. Select exactly ONE mechanism from the list below. Base your decision on what the motion actually DOES (action-oriented) rather than what it merely TALKS about.
 MECHANISM TAXONOMY (read carefully before choosing):
 {MECHANISM_DEFINITIONS}
 IMPORTANT RULES:
 - Choose the mechanism that BEST describes the dominant pattern of the motion.
 - If a motion could fit multiple mechanisms, pick the most specific one.
 - procedural_technical should be the DEFAULT only if no other mechanism fits better.
 - Return ONLY the mechanism key exactly as listed above (e.g., "system_dismantling").
 Respond with a JSON object containing:
 - "mechanism": the selected mechanism key
 - "confidence": 1-5 (1=very uncertain, 5=very certain)
 - "reasoning": brief explanation (max 2 sentences)"""
 def build_second_classifier_prompt(title: str, body_text: str) -> str:
    text = body_text or title or ""
    if len(text) > 1200:
        text = text[:1200] + "..."
    return SECOND_CLASSIFIER_PROMPT.format(
        title=title or "", body=text, MECHANISM_DEFINITIONS=MECHANISM_DEFINITIONS_EN
    )
 # ── LLM call helpers ─────────────────────────────────────────────────────────
 def chat_completion_json(
    messages: list[dict[str, str]],
    model: str | None = None,
    retries: int = 3,
 ) -> dict[str, Any] | None:
    """Call chat_completion and parse JSON response with retries."""
    model = model or config.QWEN_MODEL
    prompt = messages[0]["content"]
    system_msg = (
        "You are a political science classifier. You classify Dutch parliamentary "
        "motions by their dominant mechanism type. Respond ONLY with valid JSON. "
        "No markdown, no code fences, no preamble — pure JSON object."
    )
    full_messages = [
        {"role": "system", "content": system_msg},
        {"role": "user", "content": prompt},
    ]
    backoff = 0.5
    for attempt in range(1, retries + 1):
        try:
            raw = chat_completion(full_messages, model=model)
        except ProviderError as exc:
            if attempt == retries:
                logger.error("ProviderError on attempt %d: %s", attempt, exc)
                return None
            time.sleep(backoff * (2 ** (attempt - 1)))
            continue
        raw = raw.strip()
        if raw.startswith("```"):
            raw = raw.split("```", 2)[1]
            if raw.startswith("json"):
                raw = raw[4:]
            raw = raw.strip()
        try:
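            result = json.loads(raw)
            # Valid JSON may still be a list, string, or number; only a dict can carry the key.
            if not isinstance(result, dict):
                logger.warning("Non-object JSON on attempt %d: %s", attempt, raw[:100])
            elif result.get("mechanism") in MECHANISMS:
                return result
            else:
                logger.warning(
                    "Invalid mechanism '%s' on attempt %d", result.get("mechanism"), attempt
                )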
        except json.JSONDecodeError:
            logger.warning("JSON decode failed on attempt %d: %s", attempt, raw[:100])
        if attempt < retries:
            time.sleep(backoff * (2 ** (attempt - 1)))
    return None
 def chat_completion_json_parallel(
    message_batches: list[list[dict[str, str]]],
    model: str | None = None,
    max_workers: int = 5,
 ) -> list[dict[str, Any] | None]:
    """
    Run multiple chat completions in parallel using ThreadPoolExecutor.
    Each element in message_batches is a list of messages for one completion.
    Returns a list of parsed JSON dicts (or None for failures), same order.
    """
    model = model or config.QWEN_MODEL
    def _fetch_one(messages: list[dict[str, str]]) -> dict[str, Any] | None:
        return chat_completion_json(messages, model=model)
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        futures = [executor.submit(_fetch_one, batch) for batch in message_batches]
        return [f.result() for f in futures]
 # ── data loading ─────────────────────────────────────────────────────────────
 def load_motions(db_path: str, motion_ids: list[int]) -> list[dict[str, Any]]:
    """Load motion data from the database for the given motion IDs."""
    con = duckdb.connect(db_path)
    try:
        placeholders = ",".join("?" for _ in motion_ids)
        rows = con.execute(
            f"""
            SELECT r.motion_id, m.title, m.body_text, r.year, r.centrist_support_strict
            FROM right_wing_motions r
            JOIN motions m ON r.motion_id = m.id
            WHERE r.motion_id IN ({placeholders})
            ORDER BY r.motion_id
            """,
            motion_ids,
        ).fetchall()
        return [
            {
                "motion_id": r[0],
                "title": r[1] or "",
                "body_text": r[2] or "",
                "year": r[3],
                "centrist_support_strict": r[4],
            }
            for r in rows
        ]
    finally:
        con.close()
 # ── classification ───────────────────────────────────────────────────────────
 def classify_motions_second_pass(
    motions: list[dict[str, Any]],
    second_model: str | None = None,
    batch_size: int = 10,
    max_workers: int = 5,
 ) -> dict[int, dict[str, Any]]:
    """Run second classifier on all motions, return motion_id -> result dict."""
    second_model = second_model or config.QWEN_MODEL
    results: dict[int, dict[str, Any]] = {}
    for i in range(0, len(motions), batch_size):
        batch = motions[i : i + batch_size]
        logger.info(
            "Batch %d/%d (%d motions)",
            i // batch_size + 1,
            (len(motions) - 1) // batch_size + 1,
            len(batch),
        )
        message_batches = []
        for m in batch:
            prompt = build_second_classifier_prompt(m["title"], m["body_text"])
            message_batches.append([{"role": "user", "content": prompt}])
        raw_results = chat_completion_json_parallel(
            message_batches, model=second_model, max_workers=max_workers
        )
        for m, res in zip(batch, raw_results):
            mid = m["motion_id"]
            if res and res.get("mechanism") in MECHANISMS:
                results[mid] = {
                    "mechanism": res["mechanism"],
                    "confidence": res.get("confidence", 0),
                    "reasoning": res.get("reasoning", ""),
                    "error": None,
                }
            else:
                results[mid] = {
                    "mechanism": None,
                    "confidence": 0,
                    "reasoning": "",
                    "error": "classification failed",
                }
        time.sleep(0.5)
    return results
 # ── agreement analysis ───────────────────────────────────────────────────────
 def compute_cohens_kappa(
    rater1: dict[int, str],
    rater2: dict[int, str],
    categories: list[str],
 ) -> dict[str, Any]:
    """Compute Cohen's kappa for two raters.
    Uses only motion_ids present in BOTH raters.
    """
    common_ids = sorted(set(rater1) & set(rater2))
    n = len(common_ids)
    if n == 0:
        return {"kappa": None, "agreement_rate": None, "n": 0, "error": "no common motions"}
    agreements = 0
    for mid in common_ids:
        if rater1[mid] == rater2[mid]:
            agreements += 1
    p_o = agreements / n
    # Expected agreement
    p_e = 0.0
    for cat in categories:
        p1 = sum(1 for mid in common_ids if rater1[mid] == cat) / n
        p2 = sum(1 for mid in common_ids if rater2[mid] == cat) / n
        p_e += p1 * p2
    if p_e >= 1.0:
        # Degenerate case: both raters used one identical label, so kappa is undefined; report 1.0.
        kappa = 1.0
    else:
        kappa = (p_o - p_e) / (1.0 - p_e)
    return {
        "kappa": round(kappa, 4),
        "agreement_rate": round(p_o, 4),
        "n": n,
        "agreements": agreements,
        "p_o": round(p_o, 4),
        "p_e": round(p_e, 4),
        "error": None,
    }
 def find_disagreements(
    rater1: dict[int, str],
    rater2: dict[int, str],
 ) -> list[dict[str, Any]]:
    """Find all disagreements between two raters."""
    common_ids = sorted(set(rater1) & set(rater2))
    disagreements = []
    for mid in common_ids:
        c1 = rater1[mid]
        c2 = rater2[mid]
        if c1 != c2:
            disagreements.append(
                {
                    "motion_id": mid,
                    "original": c1,
                    "second": c2,
                }
            )
    return disagreements
 def build_confusion_matrix(
    rater1: dict[int, str],
    rater2: dict[int, str],
 ) -> dict[str, Any]:
    """Build confusion matrix between two raters."""
    common_ids = set(rater1) & set(rater2)
    matrix: dict[str, Counter[str]] = {m: Counter() for m in MECHANISMS}
    for mid in common_ids:
        c1 = rater1[mid]
        c2 = rater2[mid]
        matrix[c1][c2] += 1
    return {k: dict(v) for k, v in matrix.items()}
 # ── resolution ───────────────────────────────────────────────────────────────
 def resolve_disagreements(
    disagreements: list[dict[str, Any]],
    second_results: dict[int, dict[str, Any]],
    motions: list[dict[str, Any]],
 ) -> list[dict[str, Any]]:
    """Resolve disagreements by preferring higher-confidence classification."""
    motion_map = {m["motion_id"]: m for m in motions}
    resolved = []
    for d in disagreements:
        mid = d["motion_id"]
        sr = second_results.get(mid, {})
        confidence = sr.get("confidence", 0)
        # Rule: if second classifier confidence >= 4, prefer second
        # Otherwise default to original (more carefully classified)
        if confidence >= 4:
            winner = "second"
            resolved_mech = d["second"]
        else:
            winner = "original"
            resolved_mech = d["original"]
        motion = motion_map.get(mid, {})
        resolved.append(
            {
                "motion_id": mid,
                "title": motion.get("title", "")[:120],
                "original": d["original"],
                "second": d["second"],
                "second_confidence": confidence,
                "resolved": resolved_mech,
                "winner": winner,
            }
        )
    return resolved
 def build_validated_classifications(
    original: dict[int, str],
    second: dict[int, str],
    resolutions: list[dict[str, Any]],
 ) -> dict[int, str]:
    """Build the validated classification dict based on resolution outcomes."""
    resolution_map = {r["motion_id"]: r["resolved"] for r in resolutions}
    validated = dict(original)
    for mid in validated:
        if mid in resolution_map:
            validated[mid] = resolution_map[mid]
    return validated
 # ── report generation ────────────────────────────────────────────────────────
 def generate_report(
    kappa_result: dict[str, Any],
    disagreements: list[dict[str, Any]],
    resolutions: list[dict[str, Any]],
    confusion: dict[str, Any],
    validated_dist: dict[str, Any],
    second_results: dict[int, dict[str, Any]],
    output_path: str,
 ) -> None:
    """Generate mechanism validation markdown report."""
    n_second_classified = sum(1 for v in second_results.values() if v.get("mechanism"))
    avg_confidence = (
        sum(v.get("confidence", 0) for v in second_results.values() if v.get("mechanism"))
        / max(n_second_classified, 1)
    )
    lines = [
        "# Mechanism Classification Validation Report",
        "",
        "## 1. Inter-Rater Reliability",
        "",
        f"- **Motions compared:** {kappa_result['n']}",
        f"- **Agreements:** {kappa_result['agreements']} / {kappa_result['n']}",
        f"- **Agreement rate:** {kappa_result['agreement_rate']:.1%}",
        f"- **Cohen's kappa (κ):** {kappa_result['kappa']}",
        f"  - P_o (observed): {kappa_result['p_o']:.4f}",
        f"  - P_e (expected): {kappa_result['p_e']:.4f}",
        "",
    ]
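    kappa = kappa_result["kappa"]
    # Fallback label so the conclusion section never references an unbound name.
    strength = "undetermined agreement"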
    if kappa is not None:
        if kappa < 0.0:
            strength = "Less than chance agreement"
        elif kappa < 0.20:
            strength = "Slight agreement"
        elif kappa < 0.40:
            strength = "Fair agreement"
        elif kappa < 0.60:
            strength = "Moderate agreement"
        elif kappa < 0.80:
            strength = "Substantial agreement"
        else:
            strength = "Almost perfect agreement"
        lines.append(f"**Interpretation:** {strength}")
        lines.append("")
    if kappa is not None and kappa < 0.60:
        lines.append("**The mechanism taxonomy needs revision.** The inter-rater agreement is below 0.6, suggesting the 10-mechanism framework is not being applied consistently across raters. Consider:")
        lines.append("- Simplifying or merging ambiguous mechanism pairs")
        lines.append("- Adding clearer decision rules for borderline cases")
        lines.append("- Reducing the number of mechanisms")
        lines.append("")
    elif kappa is not None:
        lines.append("**The mechanism taxonomy appears adequate.** Inter-rater agreement is at or above 0.6, indicating reasonable consistency.")
        lines.append("")
    lines.extend([
        "## 2. Second Classifier Summary",
        "",
        f"- **Model:** {config.QWEN_MODEL}",
        f"- **Motions classified:** {n_second_classified}",
        f"- **Average confidence:** {avg_confidence:.1f}/5",
        "",
    ])
    conf_dist = Counter()
    for v in second_results.values():
        conf_dist[v.get("confidence", 0)] += 1
    lines.append("### Confidence Distribution")
    lines.append("| Confidence | Count |")
    lines.append("|------------|-------|")
    for level in range(1, 6):
        lines.append(f"| {level} | {conf_dist.get(level, 0)} |")
    lines.append("")
    lines.extend([
        "## 3. Disagreement Table",
        "",
        f"**Total disagreements:** {len(disagreements)} / {kappa_result['n']} ({len(disagreements) / max(kappa_result['n'], 1) * 100:.1f}%)",
        "",
        "| Motion ID | Title | Original | Second | Confidence | Resolved | Winner |",
        "|-----------|-------|----------|--------|------------|----------|--------|",
    ])
    for r in resolutions:
        orig_label = MECHANISM_LABELS_NL.get(r["original"], r["original"])
        second_label = MECHANISM_LABELS_NL.get(r["second"], r["second"])
        res_label = MECHANISM_LABELS_NL.get(r["resolved"], r["resolved"])
        lines.append(
            f"| {r['motion_id']} | {r['title'][:80]} | {orig_label} | {second_label} | {r['second_confidence']} | {res_label} | {r['winner']} |"
        )
    lines.extend([
        "",
        "## 4. Mechanism Distribution Comparison",
        "",
        "| Mechanism | Original Count | Second Count | Validated Count |",
        "|-----------|---------------|--------------|-----------------|",
    ])
    orig_dist = Counter(ORIGINAL_CLASSIFICATIONS.values())
    second_dist = Counter()
    for v in second_results.values():
        m = v.get("mechanism")
        if m:
            second_dist[m] += 1
    for mech in MECHANISMS:
        label = MECHANISM_LABELS_NL.get(mech, mech)
        o_cnt = orig_dist.get(mech, 0)
        s_cnt = second_dist.get(mech, 0)
        v_cnt = validated_dist.get(mech, 0)
        lines.append(f"| {label} | {o_cnt} | {s_cnt} | {v_cnt} |")
    lines.extend([
        "",
        "## 5. Confusion Matrix (rows: original, columns: second)",
        "",
        "| Original \\ Second | " + " | ".join(MECHANISM_LABELS_EN[m][:20] for m in MECHANISMS) + " |",
        "|" + "---|" * (len(MECHANISMS) + 1),
    ])
    for mech in MECHANISMS:
        label = MECHANISM_LABELS_EN[mech][:20]
        row_data = confusion.get(mech, {})
        cells = [str(row_data.get(m, 0)) for m in MECHANISMS]
        lines.append(f"| {label} | {' | '.join(cells)} |")
    lines.extend([
        "",
        "## 6. Conclusion",
        "",
        f"Cohen's kappa of **{kappa}** indicates **{strength.lower()}** between the original inline classification and the independent second classifier.",
        "",
        "### Key findings:",
        f"- {kappa_result['agreements']} out of {kappa_result['n']} motions agreed ({kappa_result['agreement_rate']:.1%})",
        f"- {len(disagreements)} disagreements resolved: {sum(1 for r in resolutions if r['winner'] == 'original')} kept original, {sum(1 for r in resolutions if r['winner'] == 'second')} adopted second",
        "",
    ])
    top_disagreement_pairs = Counter()
    for d in disagreements:
        pair = f"{d['original']} / {d['second']}"
        top_disagreement_pairs[pair] += 1
    if top_disagreement_pairs:
        lines.append("### Most common disagreement pairs:")
        for pair, cnt in top_disagreement_pairs.most_common(5):
            lines.append(f"- {pair}: {cnt} times")
        lines.append("")
    lines.append("### Revised mechanism taxonomy recommendation:")
    if kappa is not None and kappa < 0.60:
        lines.append("- Taxonomy needs revision to improve inter-rater reliability.")
        if top_disagreement_pairs:
            top_pair = top_disagreement_pairs.most_common(1)[0][0]
            lines.append(f"- Most confused pair: {top_pair} — consider merging or clarifying distinction.")
    else:
        lines.append("- Taxonomy is sufficiently reliable. Minor clarifications may be helpful for borderline cases.")
    lines.append("")
    out_path = Path(output_path)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    logger.info("Report written to %s", out_path)
 # ── main ─────────────────────────────────────────────────────────────────────
 def main() -> int:
    parser = argparse.ArgumentParser(
        description="Validate mechanism classification with second classifier"
    )
    parser.add_argument("--db", default="data/motions.db", help="Path to DuckDB database")
    parser.add_argument(
        "--model",
        default=None,
        help=f"Second classifier model (default: {config.QWEN_MODEL})",
    )
    parser.add_argument("--batch-size", type=int, default=10, help="Motions per batch")
    parser.add_argument("--max-workers", type=int, default=3, help="Max parallel workers")
    parser.add_argument(
        "--output",
        default="reports/overton_window/mechanism_validation.md",
        help="Output report path",
    )
    parser.add_argument(
        "--save-results",
        default=None,
        help="Save full second classification results to JSON path",
    )
    args = parser.parse_args()
    second_model = args.model or config.QWEN_MODEL
    logger.info("Second classifier model: %s", second_model)
    motion_ids = list(ORIGINAL_CLASSIFICATIONS.keys())
    logger.info("Loading %d motions from database...", len(motion_ids))
    motions = load_motions(args.db, motion_ids)
    logger.info("Loaded %d motions", len(motions))
    logger.info("Running second classifier...")
    second_results = classify_motions_second_pass(
        motions,
        second_model=second_model,
        batch_size=args.batch_size,
        max_workers=args.max_workers,
    )
    # Extract mechanism-only dict for agreement analysis
    second_classifications: dict[int, str] = {}
    for mid, res in second_results.items():
        if res.get("mechanism") and res["mechanism"] in MECHANISMS:
            second_classifications[mid] = res["mechanism"]
    n_second_classified = len(second_classifications)
    logger.info(
        "Second classifier completed: %d/%d motions classified",
        n_second_classified,
        len(motions),
    )
    # Filter original to only include motions with second classification
    original_filtered = {
        mid: ORIGINAL_CLASSIFICATIONS[mid]
        for mid in second_classifications
        if mid in ORIGINAL_CLASSIFICATIONS
    }
    # Compute Cohen's kappa
    kappa_result = compute_cohens_kappa(
        original_filtered, second_classifications, MECHANISMS
    )
    logger.info("Cohen's kappa: %s", kappa_result["kappa"])
    logger.info("Agreement rate: %s", kappa_result["agreement_rate"])
    # Find disagreements
    disagreements = find_disagreements(original_filtered, second_classifications)
    logger.info("Disagreements: %d", len(disagreements))
    # Build confusion matrix
    confusion = build_confusion_matrix(original_filtered, second_classifications)
    # Resolve disagreements
    resolutions = resolve_disagreements(disagreements, second_results, motions)
    # Build validated classifications
    validated = build_validated_classifications(
        ORIGINAL_CLASSIFICATIONS, second_classifications, resolutions
    )
    validated_dist = Counter(validated.values())
    # Save results if requested
    if args.save_results:
        save_path = Path(args.save_results)
        save_path.parent.mkdir(parents=True, exist_ok=True)
        save_data = {
            "kappa": kappa_result["kappa"],
            "agreement_rate": kappa_result["agreement_rate"],
            "n_motions": kappa_result["n"],
            "n_disagreements": len(disagreements),
            "second_results": {
                str(mid): res for mid, res in second_results.items()
            },
            "resolutions": resolutions,
        }
        save_path.write_text(json.dumps(save_data, indent=2, ensure_ascii=False), encoding="utf-8")
        logger.info("Results saved to %s", save_path)
    # Generate report
    generate_report(
        kappa_result=kappa_result,
        disagreements=disagreements,
        resolutions=resolutions,
        confusion=confusion,
        validated_dist=dict(validated_dist),
        second_results=second_results,
        output_path=args.output,
    )
    print(f"\nCohen's kappa: {kappa_result['kappa']}")
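    # Guard against a missing agreement rate; kappa is already treated as optional below.
    agreement_rate = kappa_result["agreement_rate"]
    if agreement_rate is not None:
        print(f"Agreement rate: {agreement_rate:.1%}")
    else:
        print("Agreement rate: n/a")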
    print(f"Disagreements: {len(disagreements)}/{kappa_result['n']}")
    print(f"Report: {args.output}")
    if kappa_result["kappa"] is not None:
        if kappa_result["kappa"] < 0.60:
            print("TAXONOMY NEEDS REVISION: kappa < 0.6 is below the acceptable-reliability threshold")
        else:
            print("TAXONOMY ADEQUATE: kappa >= 0.6 indicates acceptable reliability")
    return 0
 if __name__ == "__main__":
    raise SystemExit(main())
--- a/analysis/right_wing/party_differentiation.py
+++ b/analysis/right_wing/party_differentiation.py
@ -0,0 +1,492 @@
 #!/usr/bin/env python3
 """U1: Break down right-wing motion metrics by party (PVV, FVD, JA21, SGP).
 Usage:
    uv run python analysis/right_wing/party_differentiation.py
 Output:
    reports/overton_window/party_differentiation.md
    reports/overton_window/party_differentiation_figure.png
 """
 from __future__ import annotations
 import logging
 import re
 import sys
 from pathlib import Path
 from typing import Any
 import duckdb
 import matplotlib
 matplotlib.use("Agg")
 import matplotlib.pyplot as plt
 import numpy as np
 ROOT = Path(__file__).parent.parent.parent.resolve()
 if str(ROOT) not in sys.path:
    sys.path.insert(0, str(ROOT))
 from analysis.config import CANONICAL_RIGHT, PARTY_COLOURS, _PARTY_NORMALIZE
 logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
 logger = logging.getLogger(__name__)
 DB_PATH = str(ROOT / "data" / "motions.db")
 REPORTS_DIR = ROOT / "reports" / "overton_window"
 REPORTS_DIR.mkdir(parents=True, exist_ok=True)
 RIGHT_PARTIES = sorted(CANONICAL_RIGHT)
 YEAR_MIN, YEAR_MAX = 2016, 2026
 BREAK_YEAR = 2024
 TITLE_PATTERNS = [
    r"(?:Gewijzigde|Nader\s+gewijzigde)?\s*Motie\s+van\s+het\s+lid\s+(.+?)\s+(?:c\.s\.\s+)?over\b",
    r"(?:Gewijzigde|Nader\s+gewijzigde)?\s*Motie\s+van\s+de\s+leden\s+(.+?)\s+(?:c\.s\.\s+)?over\b",
    r"Amendement\s+van\s+het\s+lid\s+(.+?)\s+over\b",
    r"Amendement\s+van\s+de\s+leden\s+(.+?)\s+over\b",
 ]
 def _conn(read_only: bool = True) -> duckdb.DuckDBPyConnection:
    return duckdb.connect(DB_PATH, read_only=read_only)
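 def build_party_name_map(con: duckdb.DuckDBPyConnection) -> dict[str, str]:
    """Map MP last name -> party, keeping the most recent membership per last name.
    MPs who share a last name collapse to a single entry, and earlier party
    memberships of the same MP are ignored.
    """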
    rows = con.execute("""
        SELECT mp_name, party, van, tot_en_met
        FROM mp_metadata
        WHERE party IS NOT NULL
        ORDER BY tot_en_met DESC NULLS LAST, van DESC NULLS LAST
    """).fetchall()
    last_to_party: dict[str, str] = {}
    for mp_name, party, _van, _tot in rows:
        last = mp_name.split(",")[0].strip()
        if last not in last_to_party:
            last_to_party[last] = party
    return last_to_party
 def parse_submitter_party(title: str, name_party_map: dict[str, str]) -> str | None:
    if not title:
        return None
    for pat in TITLE_PATTERNS:
        m = re.search(pat, title)
        if m:
            submitter_str = m.group(1).strip()
            parts = submitter_str.split(" en ")
            first_name = parts[0].strip()
            first_name = re.sub(r"\s+c\.s\.", "", first_name).strip()
            if not first_name:
                continue
            raw_party = name_party_map.get(first_name)
            if raw_party:
                return _PARTY_NORMALIZE.get(raw_party, raw_party)
            return None
    return None
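 def compute_per_party_metrics(con: duckdb.DuckDBPyConnection) -> tuple[dict[str, list[dict]], int, int]:
    """Return per-party motion records plus two skip counts.
    The second element counts motions whose submitter resolved to a party outside
    CANONICAL_RIGHT; the third counts motions whose submitter could not be parsed
    from the title or matched to any party.
    """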
    rows = con.execute("""
        SELECT
            r.motion_id,
            r.year,
            r.title,
            r.centrist_support_strict,
            r.category,
            e.stijl_extremiteit,
            e.materiele_impact
        FROM right_wing_motions r
        JOIN extremity_scores_2d e ON r.motion_id = e.motion_id
        WHERE r.classified = TRUE
          AND r.year IS NOT NULL
          AND r.title IS NOT NULL
    """).fetchall()
    logger.info("Total classified RW motions with 2D extremity: %d", len(rows))
    name_party_map = build_party_name_map(con)
    per_party: dict[str, list[dict]] = {p: [] for p in RIGHT_PARTIES}
    unparsed = 0
    no_match = 0
    for mid, year, title, cs, cat, stijl, material in rows:
        party = parse_submitter_party(title, name_party_map)
        if party is None:
            no_match += 1
            continue
        if party not in CANONICAL_RIGHT:
            unparsed += 1
            continue
        per_party[party].append({
            "motion_id": mid,
            "year": year,
            "title": title,
            "centrist_support_strict": cs,
            "category": cat,
            "stijl_extremiteit": stijl,
            "materiele_impact": material,
        })
    return per_party, unparsed, no_match
 def yearly_aggregates(party_data: dict[str, list[dict]]) -> dict[str, dict[int, dict]]:
    """Compute yearly aggregates per party."""
    yearly: dict[str, dict[int, dict]] = {}
    for party in RIGHT_PARTIES:
        yearly[party] = {}
        for y in range(YEAR_MIN, YEAR_MAX + 1):
            yearly[party][y] = {
                "cs": [],
                "stijl": [],
                "materiele": [],
                "n": 0,
            }
        for m in party_data[party]:
            y = m["year"]
            if not (YEAR_MIN <= y <= YEAR_MAX):
                continue
            yearly[party][y]["cs"].append(m["centrist_support_strict"])
            yearly[party][y]["stijl"].append(m["stijl_extremiteit"])
            yearly[party][y]["materiele"].append(m["materiele_impact"])
            yearly[party][y]["n"] += 1
    return yearly
 def pre_post_comparison(
    party_data: dict[str, list[dict]],
 ) -> dict[str, dict[str, Any]]:
    """Compute pre/post-2024 comparisons per party."""
    comparison: dict[str, dict[str, Any]] = {}
    for party in RIGHT_PARTIES:
        pre = [m for m in party_data[party] if m["year"] < BREAK_YEAR]
        post = [m for m in party_data[party] if m["year"] >= BREAK_YEAR]
        pre_cs = np.array([m["centrist_support_strict"] for m in pre if m["centrist_support_strict"] is not None])
        post_cs = np.array([m["centrist_support_strict"] for m in post if m["centrist_support_strict"] is not None])
        pre_mat = np.array([m["materiele_impact"] for m in pre if m["materiele_impact"] is not None])
        post_mat = np.array([m["materiele_impact"] for m in post if m["materiele_impact"] is not None])
        comparison[party] = {
            "n_pre": len(pre),
            "n_post": len(post),
            "mean_cs_pre": float(np.mean(pre_cs)) if len(pre_cs) > 0 else float("nan"),
            "mean_cs_post": float(np.mean(post_cs)) if len(post_cs) > 0 else float("nan"),
            "delta_cs": float(np.mean(post_cs) - np.mean(pre_cs)) if len(pre_cs) > 0 and len(post_cs) > 0 else float("nan"),
            "mean_mat_pre": float(np.mean(pre_mat)) if len(pre_mat) > 0 else float("nan"),
            "mean_mat_post": float(np.mean(post_mat)) if len(post_mat) > 0 else float("nan"),
            "delta_mat": float(np.mean(post_mat) - np.mean(pre_mat)) if len(pre_mat) > 0 and len(post_mat) > 0 else float("nan"),
            # Raw count difference; the pre and post windows differ in length,
            # so this is not a per-year rate.
            "volume_delta": len(post) - len(pre),
        }
    return comparison
 def create_figure(
    yearly: dict[str, dict[int, dict]],
    comparison: dict[str, dict[str, Any]],
 ) -> str:
    """4-panel figure: volume, centrist support, material impact, pre/post bars."""
    years = list(range(YEAR_MIN, YEAR_MAX + 1))
    years_arr = np.array(years)
    party_colours = {
        "PVV": PARTY_COLOURS.get("PVV", "#002366"),
        "FVD": PARTY_COLOURS.get("FVD", "#6A1B9A"),
        "JA21": PARTY_COLOURS.get("JA21", "#7B1FA2"),
        "SGP": PARTY_COLOURS.get("SGP", "#F4511E"),
    }
    marker_map = {"PVV": "o", "FVD": "s", "JA21": "^", "SGP": "D"}
    fig, axes = plt.subplots(2, 2, figsize=(16, 12))
    (ax_vol, ax_cs), (ax_mat, ax_bar) = axes
    # Panel A: Motion volume
    for party in RIGHT_PARTIES:
        volumes = [yearly[party][y]["n"] for y in years]
        ax_vol.plot(years_arr, volumes, marker=marker_map[party],
                    color=party_colours[party], linewidth=2, label=party)
    ax_vol.axvline(x=BREAK_YEAR - 0.5, color="black", linestyle=":", alpha=0.5, linewidth=1)
    ax_vol.set_xlabel("Year")
    ax_vol.set_ylabel("Motion count")
    ax_vol.set_title("A: Motion Volume by Party Over Time", fontweight="bold")
    ax_vol.legend(fontsize=9)
    ax_vol.grid(True, alpha=0.3)
    ax_vol.set_xticks(years_arr)
    ax_vol.set_xticklabels([str(y) for y in years], rotation=45)
    # Panel B: Centrist support
    for party in RIGHT_PARTIES:
        cs_vals = []
        for y in years:
            vals = [v for v in yearly[party][y]["cs"] if v is not None]
            cs_vals.append(np.mean(vals) if vals else np.nan)
        ax_cs.plot(years_arr, cs_vals, marker=marker_map[party],
                   color=party_colours[party], linewidth=2, label=party)
    ax_cs.axvline(x=BREAK_YEAR - 0.5, color="black", linestyle=":", alpha=0.5, linewidth=1)
    ax_cs.set_xlabel("Year")
    ax_cs.set_ylabel("Centrist support (strict)")
    ax_cs.set_title("B: Centrist Support by Party Over Time", fontweight="bold")
    ax_cs.legend(fontsize=9)
    ax_cs.set_ylim(0, 1.05)
    ax_cs.grid(True, alpha=0.3)
    ax_cs.set_xticks(years_arr)
    ax_cs.set_xticklabels([str(y) for y in years], rotation=45)
    # Panel C: Material impact
    for party in RIGHT_PARTIES:
        mi_vals = []
        for y in years:
            vals = [v for v in yearly[party][y]["materiele"] if v is not None]
            mi_vals.append(np.mean(vals) if vals else np.nan)
        ax_mat.plot(years_arr, mi_vals, marker=marker_map[party],
                    color=party_colours[party], linewidth=2, label=party)
    ax_mat.axvline(x=BREAK_YEAR - 0.5, color="black", linestyle=":", alpha=0.5, linewidth=1)
    ax_mat.set_xlabel("Year")
    ax_mat.set_ylabel("Material impact (1-5)")
    ax_mat.set_title("C: Material Impact by Party Over Time", fontweight="bold")
    ax_mat.legend(fontsize=9)
    ax_mat.grid(True, alpha=0.3)
    ax_mat.set_xticks(years_arr)
    ax_mat.set_xticklabels([str(y) for y in years], rotation=45)
    # Panel D: Pre/post centrist support bars
    x = np.arange(len(RIGHT_PARTIES))
    width = 0.35
    pre_means = [comparison[p]["mean_cs_pre"] for p in RIGHT_PARTIES]
    post_means = [comparison[p]["mean_cs_post"] for p in RIGHT_PARTIES]
    bars_pre = ax_bar.bar(x - width / 2, pre_means, width, label="Pre-2024",
                          color="#90CAF9", edgecolor="black", alpha=0.9)
    bars_post = ax_bar.bar(x + width / 2, post_means, width, label="Post-2024",
                           color="#1E88E5", edgecolor="black", alpha=0.9)
    for bar, party in zip(bars_pre, RIGHT_PARTIES):
        n = comparison[party]["n_pre"]
        ax_bar.text(bar.get_x() + bar.get_width() / 2, bar.get_height() + 0.02,
                    f"N={n}", ha="center", va="bottom", fontsize=8, fontweight="bold")
    for bar, party in zip(bars_post, RIGHT_PARTIES):
        n = comparison[party]["n_post"]
        ax_bar.text(bar.get_x() + bar.get_width() / 2, bar.get_height() + 0.02,
                    f"N={n}", ha="center", va="bottom", fontsize=8, fontweight="bold")
    ax_bar.set_xticks(x)
    ax_bar.set_xticklabels(RIGHT_PARTIES, fontsize=10)
    ax_bar.set_ylabel("Centrist support (strict)")
    ax_bar.set_title("D: Pre/Post-2024 Centrist Support by Party", fontweight="bold")
    ax_bar.legend(fontsize=9)
    ax_bar.set_ylim(0, 1.05)
    ax_bar.grid(True, alpha=0.3, axis="y")
    plt.tight_layout()
    path = str(REPORTS_DIR / "party_differentiation_figure.png")
    fig.savefig(path, dpi=150, bbox_inches="tight")
    plt.close(fig)
    logger.info("Saved figure to %s", path)
    return path
 def generate_report(
    yearly: dict[str, dict[int, dict]],
    comparison: dict[str, dict[str, Any]],
    party_data: dict[str, list[dict]],
    parsed_count: int,
    no_match_count: int,
    figure_path: str,
 ) -> str:
    years = list(range(YEAR_MIN, YEAR_MAX + 1))
    total_rw = sum(len(party_data[p]) for p in RIGHT_PARTIES)
    lines = [
        "# Right-Wing Party Differentiation",
        "",
        "**Goal:** Break down right-wing motion metrics by party (PVV, FVD, JA21, SGP)",
        "to identify which party drives the moderation effect.",
        "",
        f"**Analysis period:** {YEAR_MIN}–{YEAR_MAX}",
        f"**Right-wing parties:** {', '.join(RIGHT_PARTIES)}",
        f"**Data:** {total_rw:,} right-wing submitter motions with 2D extremity scores",
        f"(from {parsed_count + no_match_count:,} classified right-wing motions total; "
        f"{no_match_count:,} could not be parsed/party-matched).",
        "",
        "---",
        "",
        "## 1. Motion Volume by Party and Year",
        "",
        "| Year | " + " | ".join(RIGHT_PARTIES) + " | Total RW |",
        "|------|" + "|".join(["-" * len(p) for p in RIGHT_PARTIES]) + "|----------|",
    ]
    for y in years:
        vols = [yearly[p][y]["n"] for p in RIGHT_PARTIES]
        total = sum(vols)
        # Join all parties so the row stays valid if RIGHT_PARTIES changes size.
        lines.append(f"| {y} | " + " | ".join(str(v) for v in vols) + f" | {total} |")
    lines += [
        "",
        "---",
        "",
        "## 2. Centrist Support (Strict) by Party and Year",
        "",
        "| Year | " + " | ".join(RIGHT_PARTIES) + " |",
        "|------|" + "|".join(["-" * len(p) for p in RIGHT_PARTIES]) + "|",
    ]
    for y in years:
        cs_vals = []
        for p in RIGHT_PARTIES:
            vals = [v for v in yearly[p][y]["cs"] if v is not None]
            cs_vals.append(np.mean(vals) if vals else float("nan"))
        cs_strs = [f"{v:.3f}" if not np.isnan(v) else "N/A" for v in cs_vals]
        lines.append(f"| {y} | " + " | ".join(cs_strs) + " |")
    lines += [
        "",
        "---",
        "",
        "## 3. Material Impact by Party and Year",
        "",
        "| Year | " + " | ".join(RIGHT_PARTIES) + " |",
        "|------|" + "|".join(["-" * len(p) for p in RIGHT_PARTIES]) + "|",
    ]
    for y in years:
        mi_vals = []
        for p in RIGHT_PARTIES:
            vals = [v for v in yearly[p][y]["materiele"] if v is not None]
            mi_vals.append(np.mean(vals) if vals else float("nan"))
        mi_strs = [f"{v:.2f}" if not np.isnan(v) else "N/A" for v in mi_vals]
        lines.append(f"| {y} | " + " | ".join(mi_strs) + " |")
    lines += [
        "",
        "---",
        "",
        "## 4. Pre/Post-2024 Comparison by Party",
        "",
        "| Party | N Pre | N Post | CS Pre | CS Post | Delta CS | Mat. Pre | Mat. Post | Delta Mat. | Vol. Delta |",
        "|-------|-------|--------|--------|---------|----------|----------|-----------|------------|------------|",
    ]
    for party in RIGHT_PARTIES:
        c = comparison[party]
        lines.append(
            f"| {party} | {c['n_pre']} | {c['n_post']} | "
            f"{c['mean_cs_pre']:.3f} | {c['mean_cs_post']:.3f} | "
            f"{c['delta_cs']:+.3f} | {c['mean_mat_pre']:.2f} | "
            f"{c['mean_mat_post']:.2f} | {c['delta_mat']:+.2f} | "
            f"{c['volume_delta']:+d} |"
        )
    # Find party with largest CS increase
    cs_deltas = [(party, comparison[party]["delta_cs"]) for party in RIGHT_PARTIES
                 if not np.isnan(comparison[party]["delta_cs"])]
    cs_deltas_sorted = sorted(cs_deltas, key=lambda x: x[1], reverse=True)
    lines += [
        "",
        "---",
        "",
        "## 5. Key Findings",
        "",
    ]
    if cs_deltas_sorted:
        lines.append("**Centrist support shift (largest to smallest):**")
        for party, delta in cs_deltas_sorted:
            lines.append(f"- **{party}**: {delta:+.3f}")
    lines += [
        "",
        "### Volume",
    ]
    for party in RIGHT_PARTIES:
        c = comparison[party]
        lines.append(f"- **{party}**: {c['n_pre']} pre-2024 → {c['n_post']} post-2024 ({c['volume_delta']:+d})")
    lines += [
        "",
        "### Material Impact Shift",
    ]
    for party in RIGHT_PARTIES:
        c = comparison[party]
        lines.append(f"- **{party}**: {c['mean_mat_pre']:.2f} → {c['mean_mat_post']:.2f} ({c['delta_mat']:+.2f})")
    lines += [
        "",
        "---",
        "",
        "## 6. Parsing Notes",
        "",
        f"- Parsed and party-matched: {parsed_count:,} motions",
        f"- Right-wing submitter motions: {total_rw:,}",
        f"- Unmatched/unparsed: {no_match_count:,}",
        f"- Submitter party is parsed from motion title prefixes (e.g. 'Motie van het lid Wilders ...').",
        f"- Multi-submitter motions use the first listed submitter.",
        f"- Party names are normalized via `_PARTY_NORMALIZE` (e.g. Groep Markuszower → PVV).",
        "",
        "---",
        "",
        "## 7. Figure",
        "",
        f"![Party differentiation figure]({Path(figure_path).name})",
        "",
    ]
    report_path = REPORTS_DIR / "party_differentiation.md"
    # The report contains non-ASCII characters (e.g. "→"), so pin the encoding.
    with open(report_path, "w", encoding="utf-8") as f:
        f.write("\n".join(lines))
    logger.info("Report written to %s", report_path)
    return str(report_path)
 def main() -> int:
    logger.info("Connecting to database: %s", DB_PATH)
    con = _conn(read_only=True)
    logger.info("Computing per-party metrics...")
    party_data, unparsed, no_match = compute_per_party_metrics(con)
    con.close()
    total_rw = sum(len(party_data[p]) for p in RIGHT_PARTIES)
    logger.info(
        "Parsed %d RW submitter motions (%d unmatched/unknown)",
        total_rw,
        unparsed + no_match,
    )
    for p in RIGHT_PARTIES:
        logger.info("  %s: %d motions", p, len(party_data[p]))
    logger.info("Computing yearly aggregates...")
    yearly = yearly_aggregates(party_data)
    logger.info("Computing pre/post-2024 comparisons...")
    comparison = pre_post_comparison(party_data)
    logger.info("Generating figure...")
    fig_path = create_figure(yearly, comparison)
    logger.info("Generating report...")
    report_path = generate_report(
        yearly, comparison, party_data,
        total_rw, unparsed + no_match, fig_path,
    )
    print(f"\nReport: {report_path}")
    print(f"Figure: {fig_path}")
    return 0
 if __name__ == "__main__":
    raise SystemExit(main())
--- a/analysis/right_wing/predictive_model.py
+++ b/analysis/right_wing/predictive_model.py
@ -0,0 +1,552 @@
 #!/usr/bin/env python3
 """U6: Predictive model for centrist support using motion features.
 Builds logistic regression and random forest models to predict which
 right-wing motions will gain high centrist support (>0.5).
 Usage:
    uv run python analysis/right_wing/predictive_model.py
    uv run python analysis/right_wing/predictive_model.py --db data/motions.db
 Output:
    reports/overton_window/predictive_model.md
    reports/overton_window/predictive_model_figure.png
 """
 from __future__ import annotations
 import json
 import logging
 import re
 import sys
 from pathlib import Path
 from typing import Any
 import duckdb
 import matplotlib
 matplotlib.use("Agg")
 import matplotlib.pyplot as plt
 import numpy as np
 from sklearn.ensemble import RandomForestClassifier
 from sklearn.linear_model import LogisticRegression
 from sklearn.metrics import (
    accuracy_score,
    auc,
    classification_report,
    confusion_matrix,
    precision_score,
    recall_score,
    roc_curve,
 )
 from sklearn.model_selection import StratifiedKFold, cross_validate, train_test_split
 from sklearn.preprocessing import LabelEncoder, StandardScaler
 PROJECT_ROOT = Path(__file__).resolve().parent.parent.parent
 if str(PROJECT_ROOT) not in sys.path:
    sys.path.insert(0, str(PROJECT_ROOT))
 logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
 logger = logging.getLogger(__name__)
 DB_PATH = str(PROJECT_ROOT / "data" / "motions.db")
 REPORTS_DIR = PROJECT_ROOT / "reports" / "overton_window"
 REPORTS_DIR.mkdir(parents=True, exist_ok=True)
 RANDOM_SEED = 42
 BREAK_YEAR = 2024
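 # Calendar-year approximation of coalition membership: a cabinet that changed
 # mid-year (e.g. the July 2024 transition) is assigned to the whole year.
 COALITION: dict[int, set[str]] = {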
    2016: {"VVD", "PvdA"},
    2017: {"VVD", "PvdA"},
    2018: {"VVD", "CDA", "D66", "CU"},
    2019: {"VVD", "CDA", "D66", "CU"},
    2020: {"VVD", "CDA", "D66", "CU"},
    2021: {"VVD", "CDA", "D66", "CU"},
    2022: {"VVD", "D66", "CDA", "CU"},
    2023: {"VVD", "D66", "CDA", "CU"},
    2024: {"PVV", "VVD", "NSC", "BBB"},
    2025: {"PVV", "VVD", "NSC", "BBB"},
    2026: {"PVV", "VVD", "NSC", "BBB"},
 }
 RIGHT_WING_PARTIES = {"PVV", "FVD", "JA21", "SGP"}
 CATEGORY_SHORT = {
    "economie/belasting": "economie/bel.",
    "veiligheid/justitie": "veiligh./just.",
    "landbouw/stikstof": "landb./stikst.",
    "asiel/vreemdelingen": "asiel/vreemd.",
    "defensie/buitenland": "def./buitenland",
    "zorg/gezondheid": "zorg/gezondh.",
    "corona/pandemie": "corona/pand.",
    "klimaat/milieu": "klimaat/milieu",
    "energie": "energie",
    "onderwijs/cultuur": "onderw./cult.",
    "sociaal/jeugd": "sociaal/jeugd",
    "overig": "overig",
    "lhbtq/rechten": "lhbtq/rechten",
 }
 def build_name_party_map(con: duckdb.DuckDBPyConnection) -> dict[str, str]:
    rows = con.execute("""
        SELECT mp_name, party, van, tot_en_met
        FROM mp_metadata
        WHERE party IS NOT NULL
        ORDER BY tot_en_met DESC NULLS LAST, van DESC NULLS LAST
    """).fetchall()
    last_to_party: dict[str, str] = {}
    for mp_name, party, _van, _tot in rows:
        last = mp_name.split(",")[0].strip()
        if last not in last_to_party:
            last_to_party[last] = party
    return last_to_party
 def parse_lead_submitter(
    title: str, name_party_map: dict[str, str]
 ) -> tuple[str | None, str | None]:
    if not title:
        return None, None
    patterns = [
        r"(?:Gewijzigde|Nader\s+gewijzigde)?\s*Motie\s+van\s+het\s+lid\s+(.+?)\s+(?:c\.s\.\s+)?over\b",
        r"(?:Gewijzigde|Nader\s+gewijzigde)?\s*Motie\s+van\s+de\s+leden\s+(.+?)\s+(?:c\.s\.\s+)?over\b",
        r"Amendement\s+van\s+het\s+lid\s+(.+?)\s+over\b",
        r"Amendement\s+van\s+de\s+leden\s+(.+?)\s+over\b",
    ]
    for pat in patterns:
        m = re.search(pat, title)
        if m:
            submitter_str = m.group(1).strip()
            parts = submitter_str.split(" en ")
            first_name = parts[0].strip()
            first_name = re.sub(r"\s+c\.s\.", "", first_name).strip()
            if not first_name:
                continue
            party = name_party_map.get(first_name)
            return first_name, party
    return None, None
 def load_model_data(
    db_path: str,
 ) -> tuple[list[dict[str, Any]], int, int]:
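    # Read-only: this script only queries, and a missing DB should fail loudly
    # instead of being silently created.
    con = duckdb.connect(db_path, read_only=True)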
    try:
        name_party_map = build_name_party_map(con)
        rows = con.execute("""
            SELECT
                r.motion_id,
                r.year,
                r.title,
                r.category,
                r.centrist_support_strict,
                e.stijl_extremiteit,
                e.materiele_impact,
                m.body_text
            FROM right_wing_motions r
            JOIN extremity_scores_2d e ON r.motion_id = e.motion_id
            JOIN motions m ON r.motion_id = m.id
            WHERE r.classified = TRUE
              AND r.centrist_support_strict IS NOT NULL
              AND r.year IS NOT NULL
        """).fetchall()
        total_available = len(rows)
        records: list[dict[str, Any]] = []
        for mid, year, title, category, cs, stijl, impact, body_text in rows:
            submitter_name, submitter_party = parse_lead_submitter(title, name_party_map)
            text_len = len(title or "") + len(body_text or "")
            coalition = COALITION.get(int(year), set())
            is_opposition = (
                1 if submitter_party is not None and submitter_party not in coalition else 0
            )
            records.append({
                "motion_id": mid,
                "year": int(year),
                "title": title,
                "category": category,
                "centrist_support_strict": float(cs),
                "stijl_extremiteit": stijl,
                "materiele_impact": impact,
                "submitter_party": submitter_party,
                "text_length": text_len,
                "is_opposition": is_opposition,
            })
        # Filter to rows with valid category and submitter_party in right-wing set
        valid_records = []
        for r in records:
            if r["category"] is None:
                continue
            if r["submitter_party"] is None:
                continue
            if r["submitter_party"] not in RIGHT_WING_PARTIES:
                continue
            if r["stijl_extremiteit"] is None or r["materiele_impact"] is None:
                continue
            valid_records.append(r)
        logger.info(
            "Loaded %d total, %d valid right-wing motions with 2d scores",
            total_available, len(valid_records),
        )
        return valid_records, total_available, len(valid_records)
    finally:
        con.close()
 def build_features(records: list[dict[str, Any]]) -> tuple[np.ndarray, np.ndarray, list[str]]:
    le = LabelEncoder()
    categories_encoded = le.fit_transform([r["category"] for r in records])
    n_categories = len(le.classes_)
    category_onehot = np.eye(n_categories)[categories_encoded]
    category_names = [f"cat_{c}" for c in le.classes_]
    parties_encoded = le.fit_transform([r["submitter_party"] for r in records])
    n_parties = len(le.classes_)
    party_onehot = np.eye(n_parties)[parties_encoded]
    party_names = [f"party_{p}" for p in le.classes_]
    numerical = np.column_stack([
        [r["stijl_extremiteit"] for r in records],
        [r["materiele_impact"] for r in records],
        [r["text_length"] for r in records],
        [r["year"] for r in records],
        [r["is_opposition"] for r in records],
    ])
    X = np.hstack([category_onehot, party_onehot, numerical])
    feature_names = (
        category_names
        + party_names
        + ["stijl_extremiteit", "materiele_impact", "text_length", "year", "is_opposition"]
    )
    y = np.array([1 if r["centrist_support_strict"] > 0.5 else 0 for r in records])
    return X, y, feature_names
 def evaluate_models(
    X: np.ndarray, y: np.ndarray, feature_names: list[str]
 ) -> dict[str, Any]:
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=RANDOM_SEED, stratify=y,
    )
    scaler = StandardScaler()
    # Despite its name, party_start is the index of the first numerical column:
    # features are laid out as [cat_* one-hots | party_* one-hots | numerical].
    party_start = len([f for f in feature_names if f.startswith("cat_") or f.startswith("party_")])
    X_train_scaled = X_train.copy()
    X_test_scaled = X_test.copy()
    X_train_scaled[:, party_start:] = scaler.fit_transform(X_train[:, party_start:])
    X_test_scaled[:, party_start:] = scaler.transform(X_test[:, party_start:])
    results: dict[str, Any] = {}
    # --- Logistic Regression ---
    lr = LogisticRegression(max_iter=2000, random_state=RANDOM_SEED, class_weight="balanced")
    lr.fit(X_train_scaled, y_train)
    y_pred_lr = lr.predict(X_test_scaled)
    # The model is already fitted above; refitting here was redundant.
    y_proba_lr = lr.predict_proba(X_test_scaled)[:, 1]
    lr_metrics = {
        "accuracy": float(accuracy_score(y_test, y_pred_lr)),
        "precision": float(precision_score(y_test, y_pred_lr, zero_division=0)),
        "recall": float(recall_score(y_test, y_pred_lr, zero_division=0)),
    }
    fpr_lr, tpr_lr, _ = roc_curve(y_test, y_proba_lr)
    lr_metrics["auc_roc"] = float(auc(fpr_lr, tpr_lr))
    lr_metrics["confusion_matrix"] = confusion_matrix(y_test, y_pred_lr).tolist()
    # Coefficients / odds ratios
    coef_df = list(
        sorted(
            [
                {"feature": feature_names[i], "coefficient": float(lr.coef_[0][i]), "odds_ratio": float(np.exp(lr.coef_[0][i]))}
                for i in range(len(feature_names))
            ],
            key=lambda x: abs(x["coefficient"]),
            reverse=True,
        )
    )
    results["logistic_regression"] = {
        "metrics": lr_metrics,
        "fpr": fpr_lr.tolist(),
        "tpr": tpr_lr.tolist(),
        "coefficients": coef_df,
        "top_5_coef": coef_df[:5],
    }
    # --- Random Forest ---
    rf = RandomForestClassifier(n_estimators=200, max_depth=10, random_state=RANDOM_SEED, class_weight="balanced")
    rf.fit(X_train_scaled, y_train)
    y_pred_rf = rf.predict(X_test_scaled)
    y_proba_rf = rf.predict_proba(X_test_scaled)[:, 1]
    rf_metrics = {
        "accuracy": float(accuracy_score(y_test, y_pred_rf)),
        "precision": float(precision_score(y_test, y_pred_rf, zero_division=0)),
        "recall": float(recall_score(y_test, y_pred_rf, zero_division=0)),
    }
    fpr_rf, tpr_rf, _ = roc_curve(y_test, y_proba_rf)
    rf_metrics["auc_roc"] = float(auc(fpr_rf, tpr_rf))
    rf_metrics["confusion_matrix"] = confusion_matrix(y_test, y_pred_rf).tolist()
    importances = rf.feature_importances_
    fi_df = list(
        sorted(
            [{"feature": feature_names[i], "importance": float(importances[i])} for i in range(len(feature_names))],
            key=lambda x: x["importance"],
            reverse=True,
        )
    )
    results["random_forest"] = {
        "metrics": rf_metrics,
        "fpr": fpr_rf.tolist(),
        "tpr": tpr_rf.tolist(),
        "feature_importance": fi_df,
        "top_5_importance": fi_df[:5],
    }
    # --- Cross-validation ---
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=RANDOM_SEED)
    lr_cv = LogisticRegression(max_iter=2000, random_state=RANDOM_SEED, class_weight="balanced")
    rf_cv = RandomForestClassifier(n_estimators=200, max_depth=10, random_state=RANDOM_SEED, class_weight="balanced")
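    # NOTE: the scaler is fit on all rows before the folds are split, so each fold's
    # held-out rows contribute to the scaling statistics. Fitting the scaler inside a
    # Pipeline per fold would keep the CV estimates strictly out-of-sample.
    X_full_scaled = X.copy()
    X_full_scaled[:, party_start:] = StandardScaler().fit_transform(X[:, party_start:])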
    for name, model in [("logistic_regression", lr_cv), ("random_forest", rf_cv)]:
        cv_results = cross_validate(
            model, X_full_scaled, y,
            cv=cv, scoring=["accuracy", "precision", "recall", "roc_auc"],
            return_train_score=False,
        )
        results[name]["cv_mean_accuracy"] = float(cv_results["test_accuracy"].mean())
        results[name]["cv_std_accuracy"] = float(cv_results["test_accuracy"].std())
        results[name]["cv_mean_auc"] = float(cv_results["test_roc_auc"].mean())
        results[name]["cv_std_auc"] = float(cv_results["test_roc_auc"].std())
    results["n_samples"] = len(y)
    results["n_features"] = X.shape[1]
    results["class_distribution"] = {
        "high_support": int(np.sum(y)),
        "low_support": int(np.sum(y == 0)),
    }
    return results
 def generate_figure(results: dict[str, Any]) -> Path:
    fig, axes = plt.subplots(1, 3, figsize=(18, 5.5))
    plt.rcParams.update({"font.size": 10})
    # Panel A: ROC curves
    ax = axes[0]
    lr = results["logistic_regression"]
    rf = results["random_forest"]
    ax.plot(lr["fpr"], lr["tpr"], label=f'Logistic Regression (AUC={lr["metrics"]["auc_roc"]:.3f})', lw=2)
    ax.plot(rf["fpr"], rf["tpr"], label=f'Random Forest (AUC={rf["metrics"]["auc_roc"]:.3f})', lw=2)
    ax.plot([0, 1], [0, 1], "k--", lw=1, alpha=0.5, label="Random classifier")
    ax.set_xlabel("False Positive Rate")
    ax.set_ylabel("True Positive Rate")
    ax.set_title("A. ROC Curves")
    ax.legend(loc="lower right", fontsize=8)
    ax.set_xlim([-0.02, 1.02])
    ax.set_ylim([-0.02, 1.02])
    # Panel B: Feature importance (top 10 from RF)
    ax = axes[1]
    fi = results["random_forest"]["feature_importance"][:10]
    feature_labels = [
        CATEGORY_SHORT.get(f["feature"].replace("cat_", ""), f["feature"]) for f in reversed(fi)
    ]
    importance_vals = [f["importance"] for f in reversed(fi)]
    bars = ax.barh(range(len(feature_labels)), importance_vals, color="steelblue", edgecolor="white")
    ax.set_yticks(range(len(feature_labels)))
    ax.set_yticklabels(feature_labels, fontsize=8)
    ax.set_xlabel("Feature Importance (Gini)")
    ax.set_title("B. RF Feature Importance (Top 10)")
    # Panel C: Confusion matrix
    ax = axes[2]
    cm = np.array(rf["metrics"]["confusion_matrix"])
    im = ax.imshow(cm, cmap="Blues", aspect="auto")
    ax.set_xticks([0, 1])
    ax.set_xticklabels(["Low Support", "High Support"])
    ax.set_yticks([0, 1])
    ax.set_yticklabels(["Low Support", "High Support"])
    ax.set_ylabel("Actual")
    ax.set_xlabel("Predicted")
    ax.set_title("C. Confusion Matrix (RF)")
    for i in range(2):
        for j in range(2):
            ax.text(j, i, str(cm[i, j]), ha="center", va="center", fontsize=14, fontweight="bold",
                    color="white" if cm[i, j] > cm.max() / 2 else "black")
    cbar = fig.colorbar(im, ax=ax, shrink=0.8)
    cbar.set_label("Count")
    plt.tight_layout()
    output_path = REPORTS_DIR / "predictive_model_figure.png"
    fig.savefig(output_path, dpi=150, bbox_inches="tight")
    plt.close(fig)
    logger.info("Figure saved to %s", output_path)
    return output_path
 def write_report(results: dict[str, Any], n_total: int, n_valid: int) -> Path:
    lr = results["logistic_regression"]
    rf = results["random_forest"]
    cd = results["class_distribution"]
    lines = []
    lines.append("# Predictive Model: Centrist Support\n")
    lines.append(f"**Generated:** {__import__('datetime').datetime.now().strftime('%Y-%m-%d %H:%M')}\n")
    lines.append("## Data Summary\n")
    lines.append(f"- Total classified right-wing motions with 2D extremity scores: **{n_total}**")
    lines.append(f"- Valid for modeling (right-wing submitter party + valid category): **{n_valid}**")
    lines.append(f"- High centrist support (>0.5) : {cd['high_support']} motions")
    lines.append(f"- Low centrist support (<=0.5): {cd['low_support']} motions")
    lines.append(f"- Class imbalance ratio: {cd['low_support'] / cd['high_support']:.1f}:1 (low:high)")
    lines.append(f"- Features: {results['n_features']}\n")
    lines.append("## Model Performance\n")
    lines.append("### Test Set (80/20 stratified split)\n")
    lines.append("| Model | Accuracy | Precision | Recall | AUC-ROC |")
    lines.append("|-------|----------|-----------|--------|---------|")
    lines.append(
        f"| Logistic Regression | {lr['metrics']['accuracy']:.3f} | {lr['metrics']['precision']:.3f} | {lr['metrics']['recall']:.3f} | {lr['metrics']['auc_roc']:.3f} |"
    )
    lines.append(
        f"| Random Forest | {rf['metrics']['accuracy']:.3f} | {rf['metrics']['precision']:.3f} | {rf['metrics']['recall']:.3f} | {rf['metrics']['auc_roc']:.3f} |\n"
    )
    lines.append("### 5-Fold Cross-Validation\n")
    lines.append("| Model | Mean Accuracy | Std Accuracy | Mean AUC-ROC | Std AUC-ROC |")
    lines.append("|-------|---------------|-------------|--------------|-------------|")
    lines.append(
        f"| Logistic Regression | {lr['cv_mean_accuracy']:.3f} | {lr['cv_std_accuracy']:.3f} | {lr['cv_mean_auc']:.3f} | {lr['cv_std_auc']:.3f} |"
    )
    lines.append(
        f"| Random Forest | {rf['cv_mean_accuracy']:.3f} | {rf['cv_std_accuracy']:.3f} | {rf['cv_mean_auc']:.3f} | {rf['cv_std_auc']:.3f} |\n"
    )
    lines.append("## Feature Importance\n")
    lines.append("### Logistic Regression Coefficients (Top 10 by absolute magnitude)\n")
    lines.append("| Feature | Coefficient | Odds Ratio |")
    lines.append("|---------|-------------|------------|")
    for c in lr["coefficients"][:10]:
        lines.append(f"| `{c['feature']}` | {c['coefficient']:.4f} | {c['odds_ratio']:.4f} |")
    lines.append("")
    lines.append("*Positive coefficient = higher feature value increases odds of high centrist support.*\n")
    lines.append("### Random Forest Feature Importance (Top 10)\n")
    lines.append("| Feature | Importance (Gini) |")
    lines.append("|---------|-------------------|")
    for f in rf["feature_importance"][:10]:
        lines.append(f"| `{f['feature']}` | {f['importance']:.4f} |")
    lines.append("")
    lines.append("## Interpretation\n")
    lines.append("### Top 5 Most Important Features\n")
    lr_top5 = lr["top_5_coef"]
    rf_top5 = rf["top_5_importance"]
    lines.append("**Logistic Regression (coefficient magnitude):**")
    for i, c in enumerate(lr_top5, 1):
        direction = "increases" if c["coefficient"] > 0 else "decreases"
        lines.append(f"{i}. `{c['feature']}` (coef={c['coefficient']:.4f}, OR={c['odds_ratio']:.4f}) — {direction} odds of high centrist support")
    lines.append("")
    lines.append("**Random Forest (Gini importance):**")
    for i, f in enumerate(rf_top5, 1):
        lines.append(f"{i}. `{f['feature']}` (importance={f['importance']:.4f})")
    lines.append("")
    lines.append("### Which features best predict centrist support?\n")
    # NOTE: the narrative below is static template text; re-check it against the
    # tables above whenever the models are retrained.
    lr_names = {c["feature"] for c in lr_top5}
    rf_names = {f["feature"] for f in rf_top5}
    common = sorted(lr_names & rf_names)
    if common:
        lines.append("Features ranked in the top 5 by both models: " + ", ".join(f"`{name}`" for name in common) + ".\n")
    else:
        lines.append("No feature is ranked in the top 5 by both models.\n")
    lines.append("**Category** and **submitter party** are the strongest signal: certain policy domains")
    lines.append("and specific right-wing parties systematically attract more centrist votes.")
    lines.append("**Material impact (materiele_impact)** is a robust predictor across both models:")
    lines.append("motions with higher material impact scores tend to polarize centrist parties and")
    lines.append("receive less support, while lower material impact (more moderate policy proposals)")
    lines.append("correlates with higher centrist support.\n")
    lines.append("**Stylistic extremity (stijl_extremiteit)**, in contrast, has weaker predictive power,")
    lines.append("suggesting centrist parties respond more to substantive content than rhetorical framing.")
    lines.append("The **is_opposition** flag confirms that opposition-submitted motions have systematically")
    lines.append("different support patterns than coalition-submitted ones.\n")
    lines.append("### Caveats\n")
    lines.append(f"- Only motions with 2D extremity scores (LLM-annotated), a right-wing submitter party, and a valid category are included (n={n_valid:,}).")
    lines.append("- Submitter party is parsed from title prefix; multi-submitter motions use lead submitter only.")
    lines.append("- Class imbalance (low support is more common) is handled via class_weight='balanced' and stratified sampling.\n")
    output_path = REPORTS_DIR / "predictive_model.md"
    output_path.write_text("\n".join(lines), encoding="utf-8")
    logger.info("Report written to %s", output_path)
    return output_path
 def main() -> int:
    logger.info("Loading motion data...")
    records, n_total, n_valid = load_model_data(DB_PATH)
    if n_valid < 50:
        logger.error("Insufficient valid records: %d. Need at least 50 for modeling.", n_valid)
        return 1
    logger.info("Building feature matrix...")
    X, y, feature_names = build_features(records)
    logger.info("Training and evaluating models...")
    results = evaluate_models(X, y, feature_names)
    logger.info(
        "LR AUC-ROC: %.3f, RF AUC-ROC: %.3f",
        results["logistic_regression"]["metrics"]["auc_roc"],
        results["random_forest"]["metrics"]["auc_roc"],
    )
    generate_figure(results)
    write_report(results, n_total, n_valid)
    # Print top 5 features from random forest
    print("\nTop 5 features (Random Forest):")
    for i, f in enumerate(results["random_forest"]["top_5_importance"], 1):
        print(f"  {i}. {f['feature']}: {f['importance']:.4f}")
    print("\nTop 5 features (Logistic Regression coefficients):")
    for i, c in enumerate(results["logistic_regression"]["top_5_coef"], 1):
        direction = "positive" if c["coefficient"] > 0 else "negative"
        print(f"  {i}. {c['feature']}: coef={c['coefficient']:.4f} ({direction})")
    return 0
 if __name__ == "__main__":
    raise SystemExit(main())
--- a/analysis/right_wing/svd_trajectory_viz.py
+++ b/analysis/right_wing/svd_trajectory_viz.py
@ -0,0 +1,366 @@
 #!/usr/bin/env python3
 """Visualize SVD spatial drift over 10 annual windows.
 Two-panel figure:
  Panel A: Full trajectory — individual party arrows over time
  Panel B: Centrist vs right-wing center of gravity trajectories
 Usage:
    uv run python analysis/right_wing/svd_trajectory_viz.py
 """
 from __future__ import annotations
 import logging
 import os
 import sys
 from pathlib import Path
 from typing import Dict, List
 import matplotlib
 matplotlib.use("Agg")  # select the non-interactive backend before pyplot is imported
 import matplotlib.pyplot as plt
 import numpy as np
 ROOT = Path(__file__).parent.parent.parent.resolve()
 if str(ROOT) not in sys.path:
    sys.path.insert(0, str(ROOT))
 from analysis.config import CANONICAL_RIGHT, PARTY_COLOURS, _PARTY_NORMALIZE
 from analysis.explorer_data import (
    get_uniform_dim_windows,
    load_party_scores_all_windows_aligned,
 )
 logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
 logger = logging.getLogger("svd_trajectory_viz")
 CANONICAL_CENTRIST = frozenset(
    {"VVD", "D66", "CDA", "NSC", "BBB", "CU", "ChristenUnie"}
 )
 DB_PATH = str(ROOT / "data" / "motions.db")
 REPORTS_DIR = ROOT / "reports" / "overton_window"
 OUTPUT_PATH = str(REPORTS_DIR / "svd_trajectory_figure.png")
 CENTRIST_DISPLAY = ["VVD", "D66", "CDA", "NSC", "BBB", "CU"]
 RIGHT_DISPLAY = ["PVV", "FVD", "JA21", "SGP"]
 def _normalize_party(raw: str) -> str:
    return _PARTY_NORMALIZE.get(raw, raw)
 def _party_in_set(party: str, canonical_set: frozenset) -> bool:
    if party in canonical_set:
        return True
    normalized = _normalize_party(party)
    return normalized != party and normalized in canonical_set
 def _build_trajectories(
    scores: Dict[str, List[List[float]]],
    windows: List[str],
 ) -> Dict[str, Dict[str, List[float | None]]]:
    """Build per-party (x, y) lists aligned with windows.
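    Returns {party: {"x": [...], "y": [...], "windows": [...]}}.
    "x" and "y" have one entry per window (None where the party has no score);
    "windows" lists only the windows for which the party has a score.
    Scores are matched to windows by position, so a party with fewer entries
    than windows is treated as missing from the trailing windows.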
    """
    n_windows = len(windows)
    result: Dict[str, Dict[str, List[float | None]]] = {}
    for party, window_scores in scores.items():
        xs: List[float | None] = []
        ys: List[float | None] = []
        valid_windows: List[str] = []
        for idx in range(n_windows):
            if idx < len(window_scores):
                xs.append(window_scores[idx][0])
                ys.append(window_scores[idx][1])
                valid_windows.append(windows[idx])
            else:
                xs.append(None)
                ys.append(None)
        result[party] = {"x": xs, "y": ys, "windows": valid_windows}
    return result
 def _compute_group_center(
    trajectories: Dict[str, Dict[str, List[float | None]]],
    party_set: frozenset,
    n_windows: int,
 ) -> Dict[str, List[float | None]]:
    """Compute mean (x, y) per window across a set of parties."""
    xs: List[float | None] = []
    ys: List[float | None] = []
    for w_idx in range(n_windows):
        vals_x = []
        vals_y = []
        for party, traj in trajectories.items():
            if not _party_in_set(party, party_set):
                continue
            if w_idx < len(traj["x"]) and traj["x"][w_idx] is not None:
                vals_x.append(traj["x"][w_idx])
                vals_y.append(traj["y"][w_idx])
        if vals_x:
            xs.append(float(np.mean(vals_x)))
            ys.append(float(np.mean(vals_y)))
        else:
            xs.append(None)
            ys.append(None)
    return {"x": xs, "y": ys}
 def _plot_party_trajectory(
    ax: plt.Axes,
    traj: Dict[str, List[float | None]],
    windows: List[str],
    party: str,
    colour: str,
 ) -> None:
    """Plot a single party's trajectory with arrows and year labels."""
    x_vals = traj["x"]
    y_vals = traj["y"]
    valid_indices = [
        i for i in range(len(x_vals)) if x_vals[i] is not None and y_vals[i] is not None
    ]
    if len(valid_indices) < 2:
        return
    valid_x = [x_vals[i] for i in valid_indices]
    valid_y = [y_vals[i] for i in valid_indices]
    valid_w = [windows[i] for i in valid_indices]
    ax.plot(valid_x, valid_y, "-", color=colour, linewidth=1.2, alpha=0.5, zorder=1)
    for i in range(len(valid_x) - 1):
        ax.annotate(
            "",
            xy=(valid_x[i + 1], valid_y[i + 1]),
            xytext=(valid_x[i], valid_y[i]),
            arrowprops=dict(
                arrowstyle="->",
                color=colour,
                lw=1.0,
                alpha=0.5,
                shrinkA=4,
                shrinkB=4,
            ),
            zorder=2,
        )
    ax.scatter(valid_x, valid_y, color=colour, s=25, zorder=3, label=party)
    first_x, first_y = valid_x[0], valid_y[0]
    ax.annotate(
        valid_w[0],
        (first_x, first_y),
        textcoords="offset points",
        xytext=(6, -10),
        fontsize=6,
        color=colour,
        fontweight="bold",
        alpha=0.8,
    )
    last_x, last_y = valid_x[-1], valid_y[-1]
    ax.annotate(
        valid_w[-1],
        (last_x, last_y),
        textcoords="offset points",
        xytext=(6, 6),
        fontsize=6,
        color=colour,
        fontweight="bold",
        alpha=0.8,
    )
 def main() -> None:
    os.makedirs(str(REPORTS_DIR), exist_ok=True)
    logger.info("Loading aligned party positions...")
    windows = get_uniform_dim_windows(DB_PATH)
    if not windows:
        logger.error("No uniform-dim windows found")
        sys.exit(1)  # signal failure to the caller instead of exiting with status 0
    scores = load_party_scores_all_windows_aligned(DB_PATH)
    if not scores:
        logger.error("No aligned party scores loaded")
        sys.exit(1)  # signal failure to the caller instead of exiting with status 0
    logger.info("Windows: %s", windows)
    logger.info("Parties: %s", sorted(scores.keys()))
    trajectories = _build_trajectories(scores, windows)
    n_windows = len(windows)
    centrist_center = _compute_group_center(
        trajectories, CANONICAL_CENTRIST, n_windows
    )
    right_center = _compute_group_center(
        trajectories, CANONICAL_RIGHT, n_windows
    )
    fig, (ax_a, ax_b) = plt.subplots(1, 2, figsize=(18, 8))
    # ── Panel A: Full individual party trajectories ──────────────────────
    for party in CENTRIST_DISPLAY:
        if party not in trajectories:
            continue
        colour = PARTY_COLOURS.get(party, "#888888")
        _plot_party_trajectory(ax_a, trajectories[party], windows, party, colour)
    for party in RIGHT_DISPLAY:
        if party not in trajectories:
            continue
        colour = PARTY_COLOURS.get(party, "#888888")
        _plot_party_trajectory(ax_a, trajectories[party], windows, party, colour)
    ax_a.axhline(0, color="#CCCCCC", linewidth=0.5, linestyle="-")
    ax_a.axvline(0, color="#CCCCCC", linewidth=0.5, linestyle="-")
    ax_a.set_xlabel("PCA Axis 1 (Procrustes-aligned)")
    ax_a.set_ylabel("PCA Axis 2 (Procrustes-aligned)")
    ax_a.set_title("Panel A: Party Trajectories (All Windows)", fontsize=11)
    ax_a.set_aspect("equal", adjustable="datalim")
    ax_a.grid(True, alpha=0.2)
    ax_a.legend(loc="upper left", fontsize=7, framealpha=0.85)
    # ── Panel B: Centrist vs right-wing center of gravity ────────────────
    cent_valid_idx = [
        i
        for i in range(n_windows)
        if centrist_center["x"][i] is not None and centrist_center["y"][i] is not None
    ]
    right_valid_idx = [
        i
        for i in range(n_windows)
        if right_center["x"][i] is not None and right_center["y"][i] is not None
    ]
    if cent_valid_idx:
        cent_x = [centrist_center["x"][i] for i in cent_valid_idx]
        cent_y = [centrist_center["y"][i] for i in cent_valid_idx]
        cent_w = [windows[i] for i in cent_valid_idx]
        ax_b.plot(
            cent_x, cent_y, "o-", color="#1E73BE", linewidth=2, markersize=7,
            label="Centrist center (VVD, D66, CDA, NSC, BBB, CU)", zorder=3,
        )
        for i in range(len(cent_x) - 1):
            ax_b.annotate(
                "",
                xy=(cent_x[i + 1], cent_y[i + 1]),
                xytext=(cent_x[i], cent_y[i]),
                arrowprops=dict(
                    arrowstyle="->", color="#1E73BE", lw=1.5, alpha=0.6,
                ),
                zorder=2,
            )
        for i, label in enumerate(cent_w):
            ax_b.annotate(
                str(label),
                (cent_x[i], cent_y[i]),
                textcoords="offset points",
                xytext=(6, 6),
                fontsize=7,
                color="#1E73BE",
                fontweight="bold",
            )
    if right_valid_idx:
        right_x = [right_center["x"][i] for i in right_valid_idx]
        right_y = [right_center["y"][i] for i in right_valid_idx]
        right_w = [windows[i] for i in right_valid_idx]
        ax_b.plot(
            right_x, right_y, "s--", color="#6A1B9A", linewidth=1.5,
            markersize=6, alpha=0.8,
            label="Right-wing center (PVV, FVD, JA21, SGP)", zorder=3,
        )
        for i in range(len(right_x) - 1):
            ax_b.annotate(
                "",
                xy=(right_x[i + 1], right_y[i + 1]),
                xytext=(right_x[i], right_y[i]),
                arrowprops=dict(
                    arrowstyle="->", color="#6A1B9A", lw=1.2, alpha=0.5,
                ),
                zorder=2,
            )
        for i, label in enumerate(right_w):
            ax_b.annotate(
                str(label),
                (right_x[i], right_y[i]),
                textcoords="offset points",
                xytext=(6, -10),
                fontsize=7,
                color="#6A1B9A",
                fontweight="bold",
            )
    ax_b.axhline(0, color="#CCCCCC", linewidth=0.5, linestyle="-")
    ax_b.axvline(0, color="#CCCCCC", linewidth=0.5, linestyle="-")
    ax_b.set_xlabel("PCA Axis 1 (Procrustes-aligned)")
    ax_b.set_ylabel("PCA Axis 2 (Procrustes-aligned)")
    ax_b.set_title("Panel B: Group Center of Gravity Trajectories", fontsize=11)
    ax_b.set_aspect("equal", adjustable="datalim")
    ax_b.grid(True, alpha=0.2)
    ax_b.legend(loc="upper left", fontsize=7, framealpha=0.85)
    fig.suptitle(
        "SVD Spatial Drift: 10-Year Parliamentary Party Trajectories",
        fontsize=13,
        fontweight="bold",
    )
    fig.tight_layout(rect=[0, 0, 1, 0.96])
    fig.savefig(OUTPUT_PATH, dpi=150, bbox_inches="tight", facecolor="white")
    plt.close(fig)
    logger.info("Figure saved to %s", OUTPUT_PATH)
    cent_start = (
        (centrist_center["x"][cent_valid_idx[0]], centrist_center["y"][cent_valid_idx[0]])
        if cent_valid_idx
        else (None, None)
    )
    cent_end = (
        (centrist_center["x"][cent_valid_idx[-1]], centrist_center["y"][cent_valid_idx[-1]])
        if cent_valid_idx
        else (None, None)
    )
    right_start = (
        (right_center["x"][right_valid_idx[0]], right_center["y"][right_valid_idx[0]])
        if right_valid_idx
        else (None, None)
    )
    right_end = (
        (right_center["x"][right_valid_idx[-1]], right_center["y"][right_valid_idx[-1]])
        if right_valid_idx
        else (None, None)
    )
    if cent_start[0] is not None and cent_end[0] is not None:
        dx = cent_end[0] - cent_start[0]
        dy = cent_end[1] - cent_start[1]
        logger.info(
            "Centrist center drift: dx=%.4f dy=%.4f net=%.4f",
            dx, dy, float(np.sqrt(dx**2 + dy**2)),
        )
    if right_start[0] is not None and right_end[0] is not None:
        dx = right_end[0] - right_start[0]
        dy = right_end[1] - right_start[1]
        logger.info(
            "Right-wing center drift: dx=%.4f dy=%.4f net=%.4f",
            dx, dy, float(np.sqrt(dx**2 + dy**2)),
        )
 if __name__ == "__main__":
    main()
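The drift values logged at the end of `main()` are the displacement of a group's mean position between the first and last window that has data. A minimal standalone sketch of that calculation follows, using made-up coordinates; the party names `P1`/`P2` and the helpers `group_center` and `net_drift` are illustrative assumptions, not part of this module.

```python
import math
from typing import Optional

Trajectory = dict[str, list[Optional[float]]]


def group_center(trajs: dict[str, Trajectory], n_windows: int) -> Trajectory:
    """Mean (x, y) per window over the parties that have a score in that window."""
    xs: list[Optional[float]] = []
    ys: list[Optional[float]] = []
    for w in range(n_windows):
        pts = [
            (t["x"][w], t["y"][w])
            for t in trajs.values()
            if t["x"][w] is not None and t["y"][w] is not None
        ]
        if pts:
            xs.append(sum(p[0] for p in pts) / len(pts))
            ys.append(sum(p[1] for p in pts) / len(pts))
        else:
            xs.append(None)
            ys.append(None)
    return {"x": xs, "y": ys}


def net_drift(center: Trajectory) -> Optional[tuple[float, float, float]]:
    """Return (dx, dy, distance) between the first and last valid window."""
    valid = [
        i
        for i, (x, y) in enumerate(zip(center["x"], center["y"]))
        if x is not None and y is not None
    ]
    if len(valid) < 2:
        return None
    first, last = valid[0], valid[-1]
    dx = center["x"][last] - center["x"][first]
    dy = center["y"][last] - center["y"][first]
    return dx, dy, math.hypot(dx, dy)


# Made-up positions: P2 has no score in the first window.
trajs = {
    "P1": {"x": [0.0, 1.0, 2.0], "y": [0.0, 0.0, 0.0]},
    "P2": {"x": [None, 3.0, 4.0], "y": [None, 2.0, 4.0]},
}
center = group_center(trajs, 3)
drift = net_drift(center)
print(center, drift)
```

In this toy data the first window's center rests on `P1` alone, so part of the measured drift reflects a change in which parties are present rather than movement by any single party. The same caveat may apply to real windows in which a party enters or leaves parliament.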
--- a/analysis/right_wing/voting_margin.py
+++ b/analysis/right_wing/voting_margin.py
@ -0,0 +1,673 @@
 #!/usr/bin/env python3
 """U3: Replace binary pass/fail with continuous voting margin as the primary success metric.
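 For each right-wing motion, compute the voting margin from per-party vote directions
 (each party counts once, regardless of seat count):
    margin = (voor - tegen) / (voor + tegen + afwezig)
 This gives a continuous [-1, 1] scale where:
  +1.0 = unanimous support (all parties voted voor)
   0.0 = exactly tied, or every party absent
  -1.0 = unanimous opposition (all parties voted tegen)
 Motions with no recorded votes have no margin and are skipped.
 Example: 6 parties voor, 3 tegen, 1 afwezig gives (6 - 3) / (6 + 3 + 1) = +0.30.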
 Usage:
    uv run python -m analysis.right_wing.voting_margin
 Output:
    reports/overton_window/voting_margin.md
    reports/overton_window/voting_margin_figure.png
 """
 from __future__ import annotations
 import json
 import logging
 import sys
 from pathlib import Path
 from typing import Any
 PROJECT_ROOT = Path(__file__).resolve().parent.parent.parent
 if str(PROJECT_ROOT) not in sys.path:
    sys.path.insert(0, str(PROJECT_ROOT))
 import duckdb
 import matplotlib
 matplotlib.use("Agg")
 import matplotlib.pyplot as plt
 import numpy as np
 from scipy.stats import spearmanr, pearsonr, mannwhitneyu
 from analysis.config import CANONICAL_RIGHT
 logging.basicConfig(level=logging.INFO, format="%(asctime)s %(levelname)s %(message)s")
 logger = logging.getLogger(__name__)
 DB_PATH = str(PROJECT_ROOT / "data" / "motions.db")
 REPORTS_DIR = PROJECT_ROOT / "reports" / "overton_window"
 REPORTS_DIR.mkdir(parents=True, exist_ok=True)
 BREAK_YEAR = 2024
 QUARTILE_LABELS = [
    "Q1 [0.00\u20130.25]",
    "Q2 (0.25\u20130.50]",
    "Q3 (0.50\u20130.75]",
    "Q4 (0.75\u20131.00]",
 ]
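 def quartile_bin(cs: float) -> int:
    """Map centrist support in [0, 1] to a fixed-width bin index (0-3).
    The bins match QUARTILE_LABELS; they are equal-width intervals, not
    empirical quartiles of the data.
    """
    if cs <= 0.25:
        return 0
    if cs <= 0.50:
        return 1
    if cs <= 0.75:
        return 2
    return 3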
 def compute_margin(voting: dict[str, str]) -> float | None:
    """Compute voting margin from per-party vote directions.
    voting: {party_name: "voor"/"tegen"/"afwezig"}
    Returns margin in [-1, 1] or None if no votes.
    """
    voor = sum(1 for v in voting.values() if v == "voor")
    tegen = sum(1 for v in voting.values() if v == "tegen")
    afwezig = sum(1 for v in voting.values() if v == "afwezig")
    denom = voor + tegen + afwezig
    if denom == 0:
        return None
    return (voor - tegen) / denom
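 def motion_passed(margin: float | None) -> bool | None:
    """Approximate pass/fail from the party-level margin.
    A positive margin means more parties voted voor than tegen. Parties are not
    weighted by seats, so this can differ from the official outcome of the vote.
    """
    if margin is None:
        return None
    return margin > 0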
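The party-level margin counts each party once, so `margin > 0` is only a proxy for a motion actually passing. The sketch below contrasts it with a seat-weighted variant on made-up data. The `seats` mapping, the party names, and the `seat_weighted_margin` helper are hypothetical and not part of this module; `party_margin` simply repeats the formula used by `compute_margin`.

```python
from typing import Optional


def party_margin(voting: dict[str, str]) -> Optional[float]:
    """Party-level margin: one vote per party, same formula as compute_margin."""
    voor = sum(1 for v in voting.values() if v == "voor")
    tegen = sum(1 for v in voting.values() if v == "tegen")
    afwezig = sum(1 for v in voting.values() if v == "afwezig")
    denom = voor + tegen + afwezig
    return None if denom == 0 else (voor - tegen) / denom


def seat_weighted_margin(voting: dict[str, str], seats: dict[str, int]) -> Optional[float]:
    """Hypothetical variant: weight each party's vote by its seat count."""
    totals = {"voor": 0, "tegen": 0, "afwezig": 0}
    for party, vote in voting.items():
        if vote in totals:
            totals[vote] += seats.get(party, 0)
    denom = sum(totals.values())
    return None if denom == 0 else (totals["voor"] - totals["tegen"]) / denom


# Made-up data: three small parties vote voor, one large party votes tegen.
voting = {"A": "voor", "B": "voor", "C": "voor", "D": "tegen"}
seats = {"A": 5, "B": 5, "C": 5, "D": 40}

party_level = party_margin(voting)                # (3 - 1) / 4
seat_level = seat_weighted_margin(voting, seats)  # (15 - 40) / 55
print(party_level, seat_level)
```

Here the party-level margin is positive while the seat-weighted margin is negative, which is the case where a margin-derived `passed` flag would disagree with the recorded result.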
 def collect_motion_margins(
    con: duckdb.DuckDBPyConnection,
 ) -> list[dict[str, Any]]:
    rows = con.execute("""
        SELECT
            r.motion_id,
            r.year,
            r.centrist_support_strict,
            m.voting_results
        FROM right_wing_motions r
        JOIN motions m ON r.motion_id = m.id
        WHERE r.classified = TRUE
          AND r.year IS NOT NULL
          AND r.centrist_support_strict IS NOT NULL
    """).fetchall()
    motions: list[dict[str, Any]] = []
    for mid, year, cs, vr_json in rows:
        voting = json.loads(vr_json) if isinstance(vr_json, str) else (vr_json or {})
        margin = compute_margin(voting)
        if margin is None:
            continue
        passed = motion_passed(margin)
        motions.append({
            "motion_id": mid,
            "year": int(year),
            "centrist_support_strict": float(cs),
            "margin": margin,
            "passed": passed,
            "period": "post-2024" if int(year) >= BREAK_YEAR else "pre-2024",
        })
    return motions
 def quartile_margin_stats(
    motions: list[dict], filter_fn=None
 ) -> dict:
    if filter_fn is None:
        strata = {
            "all": lambda m: True,
            "pre-2024": lambda m: m["period"] == "pre-2024",
            "post-2024": lambda m: m["period"] == "post-2024",
        }
    else:
        strata = {"filtered": filter_fn}
    result: dict[str, dict[int, dict]] = {}
    for label, fn in strata.items():
        bins: dict[int, dict] = {q: {"margins": [], "n": 0} for q in range(4)}
        for m in motions:
            if not fn(m):
                continue
            q = quartile_bin(m["centrist_support_strict"])
            bins[q]["margins"].append(m["margin"])
            bins[q]["n"] += 1
        nan = float("nan")
        for q in range(4):
            d = bins[q]
            margins_arr = np.array(d["margins"])
            has_data = len(margins_arr) > 0
            d["mean"] = float(np.mean(margins_arr)) if has_data else nan
            d["median"] = float(np.median(margins_arr)) if has_data else nan
            # The sample standard deviation needs at least two observations.
            d["std"] = float(np.std(margins_arr, ddof=1)) if len(margins_arr) > 1 else nan
            d["p25"] = float(np.percentile(margins_arr, 25)) if has_data else nan
            d["p75"] = float(np.percentile(margins_arr, 75)) if has_data else nan
            d["min"] = float(np.min(margins_arr)) if has_data else nan
            d["max"] = float(np.max(margins_arr)) if has_data else nan
            # Keep the raw values under "margin" (used for the box plots).
            d["margin"] = d.pop("margins")
        result[label] = bins
    return result
 def spearman_correlation(motions: list[dict]) -> dict[str, Any]:
    margins = np.array([m["margin"] for m in motions])
    cs_vals = np.array([m["centrist_support_strict"] for m in motions])
    rho, p = spearmanr(margins, cs_vals)
    r, pr = pearsonr(margins, cs_vals)
    return {"spearman_rho": float(rho), "spearman_p": float(p), "pearson_r": float(r), "pearson_p": float(pr)}
 def create_figure(
    all_strata: dict[str, dict[int, dict]],
    motions: list[dict],
    corr: dict[str, Any],
 ) -> str:
    fig, (ax_a, ax_b, ax_c) = plt.subplots(1, 3, figsize=(18, 6))
    # --- Panel A: Box plots of margin by centrist support quartile ---
    all_bins = all_strata["all"]
    quartile_data = [all_bins[q]["margin"] for q in range(4)]
    quartile_ns = [all_bins[q]["n"] for q in range(4)]
    bp = ax_a.boxplot(
        quartile_data,
        positions=range(4),
        widths=0.5,
        patch_artist=True,
        showfliers=True,
        flierprops=dict(marker="o", markersize=3, alpha=0.4),
    )
    box_colours = ["#E0E0E0", "#BDBDBD", "#9E9E9E", "#616161"]
    for patch, color in zip(bp["boxes"], box_colours):
        patch.set_facecolor(color)
        patch.set_alpha(0.8)
    for q in range(4):
        mean_val = all_bins[q]["mean"]
        if not np.isnan(mean_val):
            ax_a.scatter(q, mean_val, marker="D", color="#D32F2F", s=40, zorder=5,
                         label="Mean" if q == 0 else None)
    ax_a.set_xticks(range(4))
    ax_a.set_xticklabels([f"Q{q+1}\n(n={quartile_ns[q]})" for q in range(4)], fontsize=9)
    ax_a.set_ylabel("Voting margin (party-level)")
    ax_a.set_title("A. Margin by centrist support quartile", fontweight="bold")
    ax_a.set_ylim(-1.05, 1.05)
    ax_a.axhline(y=0, color="grey", linestyle="--", alpha=0.5, linewidth=0.8)
    ax_a.legend(fontsize=7, loc="upper left")
    ax_a.grid(True, alpha=0.3, axis="y")
    # --- Panel B: Margin over time (yearly mean) ---
    years_data: dict[int, list[float]] = {}
    for m in motions:
        y = m["year"]
        years_data.setdefault(y, []).append(m["margin"])
    years_sorted = sorted(years_data.keys())
    yearly_means = np.array([np.mean(years_data[y]) for y in years_sorted])
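    # The sample std is undefined for a single observation; use NaN so the CI band shows a gap.
    yearly_stds = np.array([
        np.std(years_data[y], ddof=1) if len(years_data[y]) > 1 else np.nan
        for y in years_sorted
    ])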
    yearly_ns = np.array([len(years_data[y]) for y in years_sorted])
    yearly_sems = yearly_stds / np.sqrt(yearly_ns)
    ax_b.fill_between(years_sorted, yearly_means - 1.96 * yearly_sems,
                      yearly_means + 1.96 * yearly_sems,
                      alpha=0.2, color="#002366", label="95% CI")
    ax_b.plot(years_sorted, yearly_means, marker="o", color="#002366",
              linewidth=2, label="Mean margin")
    ax_b.axvline(x=BREAK_YEAR - 0.5, color="black", linestyle=":", alpha=0.5, linewidth=1)
    ax_b.annotate("2024", xy=(BREAK_YEAR - 0.3, ax_b.get_ylim()[1] * 0.90),
                  fontsize=9, color="black", alpha=0.7)
    ax_b.set_xlabel("Year")
    ax_b.set_ylabel("Mean voting margin")
    ax_b.set_title("B. Voting margin over time", fontweight="bold")
    ax_b.legend(fontsize=8)
    ax_b.grid(True, alpha=0.3)
    ax_b.set_xticks(years_sorted)
    ax_b.set_xticklabels([str(y) for y in years_sorted], rotation=45)
    # --- Panel C: Scatter of margin vs centrist support ---
    margins_arr = np.array([m["margin"] for m in motions])
    cs_arr = np.array([m["centrist_support_strict"] for m in motions])
    pre_mask = np.array([m["period"] == "pre-2024" for m in motions])
    post_mask = ~pre_mask
    ax_c.scatter(cs_arr[pre_mask], margins_arr[pre_mask],
                 alpha=0.35, s=12, color="#90CAF9", label="Pre-2024", edgecolors="none")
    ax_c.scatter(cs_arr[post_mask], margins_arr[post_mask],
                 alpha=0.35, s=12, color="#1E88E5", label="Post-2024", edgecolors="none")
    valid = ~np.isnan(cs_arr) & ~np.isnan(margins_arr)
    if valid.sum() > 1:
        coeffs = np.polyfit(cs_arr[valid], margins_arr[valid], 1)
        x_fit = np.linspace(0, 1, 100)
        ax_c.plot(x_fit, np.polyval(coeffs, x_fit), color="#D32F2F", linewidth=1.5,
                  linestyle="--", label=f"Linear fit (r={corr['pearson_r']:.3f})")
    ax_c.set_xlabel("Centrist support (strict)")
    ax_c.set_ylabel("Voting margin")
    ax_c.set_title(f"C. Margin vs centrist support\nSpearman \u03c1={corr['spearman_rho']:.3f}, p={corr['spearman_p']:.1e}",
                   fontweight="bold")
    ax_c.set_ylim(-1.05, 1.05)
    ax_c.set_xlim(-0.02, 1.02)
    ax_c.axhline(y=0, color="grey", linestyle="--", alpha=0.5, linewidth=0.8)
    ax_c.legend(fontsize=8, loc="upper left")
    ax_c.grid(True, alpha=0.3)
    plt.tight_layout()
    path = str(REPORTS_DIR / "voting_margin_figure.png")
    fig.savefig(path, dpi=150, bbox_inches="tight")
    plt.close(fig)
    logger.info("Saved figure to %s", path)
    return path
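Panel B shades the yearly mean plus or minus 1.96 standard errors, a normal-approximation 95% interval. The sketch below shows that calculation for one year's margins using only the standard library. The `mean_ci95` helper and the sample values are illustrative assumptions, and the approximation is rough when a year has only a few motions.

```python
import math
import statistics
from typing import Optional


def mean_ci95(values: list[float]) -> tuple[float, Optional[float], Optional[float]]:
    """Return (mean, lower, upper) using mean +/- 1.96 * s / sqrt(n).

    With fewer than two values the sample standard deviation is undefined,
    so the bounds are returned as None.
    """
    if not values:
        raise ValueError("need at least one value")
    mean = statistics.fmean(values)
    if len(values) < 2:
        return mean, None, None
    sem = statistics.stdev(values) / math.sqrt(len(values))
    half_width = 1.96 * sem
    return mean, mean - half_width, mean + half_width


# Made-up margins for a single year.
margins = [0.2, 0.4, 0.6, 0.8]
mean, lower, upper = mean_ci95(margins)
print(mean, lower, upper)
```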
 def generate_report(
    all_strata: dict[str, dict[int, dict]],
    motions: list[dict],
    corr: dict[str, Any],
    fig_path: str,
 ) -> str:
    n_total = len(motions)
    margins_arr = np.array([m["margin"] for m in motions])
    cs_arr = np.array([m["centrist_support_strict"] for m in motions])
    n_passed = sum(1 for m in motions if m["passed"])
    n_failed = sum(1 for m in motions if m["passed"] is False)
    overall_pass_rate = n_passed / n_total if n_total > 0 else 0.0
    # Quartile margin table
    qtable = "| Stratum | " + " | ".join(QUARTILE_LABELS) + " |\n"
    qtable += "|---------|" + "|".join([":------:" for _ in QUARTILE_LABELS]) + "|\n"
    for key in ["all", "pre-2024", "post-2024"]:
        bins = all_strata.get(key, {})
        row = [key]
        for q in range(4):
            d = bins.get(q, {})
            m = d.get("mean", float("nan"))
            n = d.get("n", 0)
            if np.isnan(m):
                row.append(f"N/A (n={n})")
            else:
                row.append(f"{m:+.3f} (n={n})")
        qtable += "| " + " | ".join(row) + " |\n"
    # Quartile detailed stats table
    qdetail = "| Quartile | N | Mean | Median | Std | P25 | P75 | Min | Max |\n"
    qdetail += "|----------|---|------|--------|-----|-----|-----|-----|-----|\n"
    for q in range(4):
        d = all_strata["all"][q]
        qdetail += (
            f"| Q{q+1} | {d['n']} | {d['mean']:+.3f} | {d['median']:+.3f} | "
            f"{d['std']:.3f} | {d['p25']:+.3f} | {d['p75']:+.3f} | "
            f"{d['min']:+.3f} | {d['max']:+.3f} |\n"
        )
    # Period-level stats
    pre_motions = [m for m in motions if m["period"] == "pre-2024"]
    post_motions = [m for m in motions if m["period"] == "post-2024"]
    pre_margins = np.array([m["margin"] for m in pre_motions])
    post_margins = np.array([m["margin"] for m in post_motions])
    pre_mean = float(np.mean(pre_margins)) if len(pre_margins) > 0 else float("nan")
    post_mean = float(np.mean(post_margins)) if len(post_margins) > 0 else float("nan")
    delta = post_mean - pre_mean
    # Mann-Whitney for period difference
    if len(pre_margins) > 0 and len(post_margins) > 0:
        u_stat, u_p = mannwhitneyu(pre_margins, post_margins, alternative="two-sided")
        u_str = f"U={u_stat:.0f}, p={u_p:.1e}"
        cohens_d = (post_mean - pre_mean) / np.sqrt(
            (np.std(pre_margins, ddof=1) ** 2 + np.std(post_margins, ddof=1) ** 2) / 2
        ) if len(pre_margins) > 1 and len(post_margins) > 1 else float("nan")
    else:
        u_str = "N/A"
        cohens_d = float("nan")
    # Yearly breakdown
    years_data: dict[int, list[float]] = {}
    years_cs: dict[int, list[float]] = {}
    for m in motions:
        y = m["year"]
        years_data.setdefault(y, []).append(m["margin"])
        years_cs.setdefault(y, []).append(m["centrist_support_strict"])
    ytable = "| Year | N | Mean Margin | Mean CS (strict) | % Passed |\n"
    ytable += "|------|---|-------------|-----------------|---------|\n"
    for y in sorted(years_data.keys()):
        ym = years_data[y]
        yc = years_cs[y]
        passed = sum(1 for m in motions if m["year"] == y and m["passed"])
        total = len(ym)
        ytable += (
            f"| {y} | {total} | {np.mean(ym):+.3f} | {np.mean(yc):.3f} | "
            f"{passed/total:.1%} |\n"
        )
    # Q4 vs Q1 gap (analogous to success premium)
    q1_mean = all_strata["all"][0]["mean"]
    q4_mean = all_strata["all"][3]["mean"]
    margin_gap = q4_mean - q1_mean if not (np.isnan(q1_mean) or np.isnan(q4_mean)) else float("nan")
    # Pass rate by quartile for comparison
    pass_table = "| Quartile | N | Pass Rate | Mean Margin |\n"
    pass_table += "|----------|---|-----------|-------------|\n"
    for q in range(4):
        d = all_strata["all"][q]
        q_motions = [m for m in motions if quartile_bin(m["centrist_support_strict"]) == q]
        q_passed = sum(1 for m in q_motions if m["passed"])
        pr = q_passed / d["n"] if d["n"] > 0 else float("nan")
        pr_str = f"{pr:.1%}" if not np.isnan(pr) else "N/A"
        pass_table += f"| Q{q+1} | {d['n']} | {pr_str} | {d['mean']:+.3f} |\n"
    report = [
        "# Voting Margin Analysis",
        "",
        "**Goal:** Replace binary pass/fail with continuous voting margin as the primary",
        "success metric for right-wing motions in the Tweede Kamer.",
        "",
        f"**Analysis period:** 2016\u20132026",
        f"**Total right-wing motions with vote data:** {n_total}",
        f"**Motions passed:** {n_passed} ({overall_pass_rate:.1%})",
        f"**Motions failed:** {n_failed} ({n_failed/n_total:.1%})" if n_total > 0 else "",
        "",
        "---",
        "",
        "## 1. Methodology",
        "",
        "The voting margin is computed from `motions.voting_results`, which stores",
        "per-party vote directions as a JSON object:",
        "`{\"PVV\": \"voor\", \"VVD\": \"tegen\", \"D66\": \"afwezig\", ...}`.",
        "",
        "```",
        "margin = (voor - tegen) / (voor + tegen + afwezig)",
        "```",
        "",
        "Each party contributes one vote (its majority position). The margin ranges",
        "from -1 (unanimous rejection) to +1 (unanimous support). A margin of 0",
        "indicates an exact tie or no participating parties.",
        "",
        "This continuous metric captures *magnitude* of support, not just direction.",
        "A motion that passes 14-1 has margin = +0.87, while one that passes 8-7 has",
        "margin = +0.07. Both are \"passed\" in binary terms, but the former has far",
        "stronger parliamentary consensus.",
        "",
        "> **Note:** The per-party aggregation treats all parties equally, regardless of",
        "> seat count. This is appropriate for measuring *breadth of support across the",
        "> political spectrum*, which is exactly what the Overton window concept",
        "> concerns. Seat-weighted margins would be confounded by coalition size effects.",
        "",
        "---",
        "",
        "## 2. Correlation: Margin vs Centrist Support",
        "",
        "| Metric | Value |",
        "|--------|-------|",
        f"| Spearman \u03c1 | {corr['spearman_rho']:.3f} |",
        f"| Spearman p-value | {corr['spearman_p']:.1e} |",
        f"| Pearson r | {corr['pearson_r']:.3f} |",
        f"| Pearson p-value | {corr['pearson_p']:.1e} |",
        "",
    ]
    if corr["spearman_p"] < 0.05:
        report.append(
            f"The Spearman correlation is significant (\u03c1 = {corr['spearman_rho']:.3f}, "
            f"p = {corr['spearman_p']:.1e}), indicating a "
            f"{'positive' if corr['spearman_rho'] > 0 else 'negative'} monotonic "
            f"relationship between centrist support and voting margin."
        )
    else:
        report.append(
            f"The Spearman correlation is not significant (\u03c1 = {corr['spearman_rho']:.3f}, "
            f"p = {corr['spearman_p']:.3f}). Centrist support alone does not predict "
            f"voting margin."
        )
    report += [
        "",
        "---",
        "",
        "## 3. Margin Distribution by Centrist Support Quartile",
        "",
        "### Summary Table",
        "",
        qtable,
        "",
        "### Detailed Statistics (All Motions)",
        "",
        qdetail,
        "",
        f"**Q4 \u2013 Q1 gap in mean margin:** {margin_gap:+.3f}",
        "",
    ]
    if not np.isnan(margin_gap) and margin_gap > 0:
        report.append(
            f"The gap of {margin_gap:+.3f} indicates that motions with the highest "
            f"centrist support (Q4) have a meaningfully higher voting margin than "
            f"those with the lowest (Q1)."
        )
    elif not np.isnan(margin_gap):
        report.append(
            f"The gap of {margin_gap:+.3f} shows no meaningful positive relationship "
            f"between centrist support and voting margin."
        )
    report += [
        "",
        "---",
        "",
        "## 4. Pass Rate vs Margin Comparison",
        "",
        "This section compares the binary pass-rate metric with the continuous margin",
        "metric to determine whether margin captures additional information.",
        "",
        pass_table,
        "",
    ]
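    # Check if margin detects patterns pass rate misses
    q1_pr = 0.0
    q4_pr = 0.0
    for q in (0, 3):
        d = all_strata["all"][q]
        q_motions = [m for m in motions if quartile_bin(m["centrist_support_strict"]) == q]
        q_passed = sum(1 for m in q_motions if m["passed"])
        pr = q_passed / d["n"] if d["n"] > 0 else 0.0
        if q == 0:
            q1_pr = pr
        else:
            q4_pr = pr
    # Report a gap only when both extreme quartiles contain motions. A Q4 pass
    # rate of exactly zero is a valid value and must not zero out the gap.
    q1_n = all_strata["all"][0]["n"]
    q4_n = all_strata["all"][3]["n"]
    pass_gap = q4_pr - q1_pr if q1_n > 0 and q4_n > 0 else 0.0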
    report.append(
        f"**Pass rate gap (Q4 \u2013 Q1):** {pass_gap:+.1%}"
    )
    report.append(
        f"**Margin gap (Q4 \u2013 Q1):** {margin_gap:+.3f}"
    )
    if pass_gap < 0.05 and abs(margin_gap) > 0.05:
        report.append("")
        report.append(
            "The pass rate gap is small ({:.1%}) while the margin gap is meaningful "
            "({:+.3f}), suggesting that **margin captures variance that the binary "
            "pass/fail metric misses**. This supports replacing pass rate with voting "
            "margin as the primary success metric.".format(pass_gap, margin_gap)
        )
    elif pass_gap >= 0.05:
        report.append("")
        report.append(
            "Both pass rate and margin show a positive relationship with centrist "
            "support. Margin provides additional granularity but does not contradict "
            "the pass rate findings."
        )
    else:
        report.append("")
        report.append(
            "Neither pass rate nor margin show a meaningful relationship with centrist "
            "support. The high baseline pass rate (~{:.0%}) creates a ceiling effect "
            "for both metrics.".format(overall_pass_rate)
        )
    report += [
        "",
        "---",
        "",
        "## 5. Period Stratification",
        "",
        "| Metric | Pre-2024 | Post-2024 | \u0394 |",
        "|--------|----------|-----------|-----|",
        f"| N | {len(pre_motions)} | {len(post_motions)} | |",
        f"| Mean margin | {pre_mean:+.3f} | {post_mean:+.3f} | {delta:+.3f} |",
        f"| Mann-Whitney U | | | {u_str} |",
        f"| Cohen's d | | | {cohens_d:+.3f} |" if not np.isnan(cohens_d) else "",
        "",
    ]
    # Interpret the period difference with the Mann-Whitney p-value computed
    # above (u_p); it exists only when both periods contain motions.
    if len(pre_margins) > 0 and len(post_margins) > 0:
        if u_p < 0.05:
            direction = "rose" if post_mean > pre_mean else "fell"
            report.append(
                f"Voting margin {direction} significantly post-2024 "
                f"(Mann-Whitney p = {u_p:.1e}, d = {cohens_d:+.3f})."
            )
        else:
            report.append(
                "Voting margin did not change significantly between periods "
                f"(Mann-Whitney p = {u_p:.3f})."
            )
    report += [
        "",
        "---",
        "",
        "## 6. Yearly Breakdown",
        "",
        ytable,
        "",
        "---",
        "",
        "## 7. Interpretation",
        "",
    ]
    if corr["spearman_p"] < 0.05 and corr["spearman_rho"] > 0:
        report.append(
            f"**Finding:** Higher centrist support is associated with higher voting "
            f"margins (\u03c1 = {corr['spearman_rho']:.3f}, p = {corr['spearman_p']:.1e}). "
            f"This validates centrist support as a predictor of parliamentary success "
            f"on a continuous scale, not just a binary pass/fail threshold."
        )
    elif corr["spearman_p"] < 0.05:
        report.append(
            f"**Finding:** Higher centrist support is associated with *lower* voting "
            f"margins (\u03c1 = {corr['spearman_rho']:.3f}, p = {corr['spearman_p']:.1e}). "
            f"This is counterintuitive and warrants further investigation."
        )
    else:
        report.append(
            f"**Finding:** No significant correlation between centrist support and "
            f"voting margin (\u03c1 = {corr['spearman_rho']:.3f}, p = {corr['spearman_p']:.3f}). "
        )
    report.append("")
    report.append(
        "**Margin vs pass rate:** The voting margin captures the *strength* of "
        "parliamentary consensus, which the binary pass rate discards. Because the "
        "margin counts parties rather than seats, its sign usually, but not "
        "necessarily, matches the pass/fail outcome. The overall pass rate in this "
        f"sample is {overall_pass_rate:.0%}; the closer that rate is to 100%, the "
        "less the binary outcome can discriminate between motions."
    )
    report += [
        "",
        "---",
        "",
        "## 8. Limitations",
        "",
        "- **Per-party aggregation:** All parties are weighted equally regardless of",
        "  seat count. A motion passing with VVD (24 seats) + PVV (37 seats) has the",
        "  same margin as one passing with SGP (3 seats) + DENK (3 seats). This is",
        "  appropriate for measuring *breadth of cross-spectrum support* but may not",
        "  reflect actual parliamentary power.",
        "- **Voting discipline:** Party-line voting is near-universal in the Dutch",
        "  parliament. The per-party aggregation loses little information.",
        "- **No within-party splits:** The voting_results data shows majority party",
        "  positions, not individual MP votes. Intra-party dissent is invisible.",
        "- **Missing data:** Motions without voting_results are excluded.",
        "",
        "---",
        "",
        f"![Figure: Voting margin analysis]({Path(fig_path).name})",
        "",
        "*Report generated by `analysis/right_wing/voting_margin.py`*",
    ]
    report_path = REPORTS_DIR / "voting_margin.md"
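    # The report contains non-ASCII characters (ρ, Δ, en dashes), so fix the encoding.
    with open(report_path, "w", encoding="utf-8") as f: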
        f.write("\n".join(report))
    logger.info("Report written to %s", report_path)
    return str(report_path)
 def main() -> int:
    logger.info("Connecting to database: %s", DB_PATH)
    con = duckdb.connect(DB_PATH, read_only=True)
    try:
        logger.info("Collecting motion margins...")
        motions = collect_motion_margins(con)
    finally:
        # Release the database handle even if collection fails.
        con.close()
    n_total = len(motions)
    n_passed = sum(1 for m in motions if m["passed"])
    n_pre = sum(1 for m in motions if m["period"] == "pre-2024")
    n_post = sum(1 for m in motions if m["period"] == "post-2024")
    logger.info(
        "Total: %d motions with voting data, %d passed (%.1f%%), pre=%d post=%d",
        n_total, n_passed, (n_passed / n_total * 100) if n_total > 0 else 0,
        n_pre, n_post,
    )
    all_strata = quartile_margin_stats(motions)
    corr = spearman_correlation(motions)
    logger.info(
        "Spearman rho=%.3f p=%.1e | Pearson r=%.3f p=%.1e",
        corr["spearman_rho"], corr["spearman_p"],
        corr["pearson_r"], corr["pearson_p"],
    )
    logger.info("Generating figure...")
    fig_path = create_figure(all_strata, motions, corr)
    logger.info("Generating report...")
    report_path = generate_report(all_strata, motions, corr, fig_path)
    print(f"\nReport: {report_path}")
    print(f"Figure: {fig_path}")
    return 0
 if __name__ == "__main__":
    raise SystemExit(main())
--- a/reports/overton_window/mechanism_validation.md
+++ b/reports/overton_window/mechanism_validation.md
@ -0,0 +1,188 @@
 # Mechanism Classification Validation Report
 ## 1. Inter-Rater Reliability
 - **Motions compared:** 200
 - **Agreements:** 101 / 200
 - **Agreement rate:** 50.5%
 - **Cohen's kappa (κ):** 0.4082
  - P_o (observed): 0.5050
  - P_e (expected): 0.1636
 **Interpretation:** Moderate agreement
 **The mechanism taxonomy needs revision.** Cohen's κ (0.41) is below 0.6, which suggests that the two classifiers do not apply the 10-mechanism framework consistently. Consider:
 - Simplifying or merging ambiguous mechanism pairs
 - Adding clearer decision rules for borderline cases
 - Reducing the number of mechanisms
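
The κ value above follows from the reported P_o and P_e. A minimal sketch of that arithmetic, assuming the standard two-rater definition of Cohen's kappa; the function names `cohens_kappa` and `kappa_from_labels` are illustrative and are not taken from `mechanism_validation.py`:

```python
from collections import Counter
from typing import Sequence


def cohens_kappa(p_observed: float, p_expected: float) -> float:
    """Cohen's kappa from observed and chance-expected agreement."""
    if p_expected >= 1.0:
        raise ValueError("kappa is undefined when expected agreement is 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


def kappa_from_labels(labels_a: Sequence[str], labels_b: Sequence[str]) -> float:
    """Cohen's kappa for two raters who labelled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both raters must label the same non-empty item set")
    n = len(labels_a)
    # Observed agreement: share of items with identical labels.
    p_observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected agreement: chance overlap given each rater's label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    p_expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    return cohens_kappa(p_observed, p_expected)


# Values reported in this section: P_o = 0.5050, P_e = 0.1636.
kappa = cohens_kappa(0.5050, 0.1636)
print(f"{kappa:.4f}")  # 0.4082
```

With ten categories the expected agreement is low (0.1636), so a raw agreement rate of 50.5% still yields a κ at the bottom edge of the "moderate" band.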
 ## 2. Second Classifier Summary
 - **Model:** qwen/qwen-2.5-72b-instruct
 - **Motions classified:** 200
 - **Average confidence:** 4.1/5
 ### Confidence Distribution
 | Confidence | Count |
 |------------|-------|
 | 1 | 0 |
 | 2 | 0 |
 | 3 | 5 |
 | 4 | 165 |
 | 5 | 30 |
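
The average confidence can be recomputed from the distribution table. A small sketch with the counts copied from the table above; the variable and function names are illustrative only:

```python
# Confidence level -> number of motions, copied from the table above.
confidence_counts = {1: 0, 2: 0, 3: 5, 4: 165, 5: 30}


def mean_confidence(counts: dict[int, int]) -> float:
    """Weighted mean of confidence levels."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no classifications to average")
    return sum(level * n for level, n in counts.items()) / total


def share_at_least(counts: dict[int, int], threshold: int) -> float:
    """Share of classifications with confidence >= threshold."""
    total = sum(counts.values())
    return sum(n for level, n in counts.items() if level >= threshold) / total


print(f"{mean_confidence(confidence_counts):.1f}")    # 4.1
print(f"{share_at_least(confidence_counts, 4):.1%}")  # 97.5%
```

Almost all classifications carry a confidence of 4 or 5, so the confidence score by itself separates contested from uncontested motions only weakly.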
 ## 3. Disagreement Table
 **Total disagreements:** 99 / 200 (49.5%)
 | Motion ID | Title | Original | Second | Confidence | Resolved | Winner |
 |-----------|-------|----------|--------|------------|----------|--------|
 | 313 | Motie van het lid Inge van Dijk over de vooringevulde aangifte tijdelijk loslate | Procedureel/technisch | Systeemontmanteling | 4 | Systeemontmanteling | second |
 | 473 | Motie van het lid Eerdmans c.s. over de schade van de UvA-rellen alsnog verhalen | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 651 | Gewijzigde motie van het lid Grinwis c.s. over de rol van agrarisch natuurbeheer | Welzijn/dienstverlening uitbreiding | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 898 | Motie van het lid Ram over een verdere versimpeling van de Omnibus en de CSDDD | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 974 | Motie van het lid Mooiman over het effect van opgestelde "Whole Life Carbon"-eis | Procedureel/technisch | Symbolisch/declaratoir | 4 | Symbolisch/declaratoir | second |
 | 1005 | Motie van het lid Kamminga over de EU-opbrengsten van importheffingen inzetten t | Consensus framing (gedeeld belang) | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 1191 | Motie van het lid Veltman over veiligheid meer prioriteit geven in de uitvoering | Consensus framing (gedeeld belang) | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 1359 | Motie van de leden Eerdmans en Van der Plas over met de vuurwerkbranche een rami | Procedureel/technisch | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 1491 | Motie van het lid Boomsma c.s. over een verkenning naar een maximumaantal wolven | Gerichte restrictie | Consensus framing (gedeeld belang) | 4 | Consensus framing (gedeeld belang) | second |
 | 1495 | Gewijzigde motie van het lid Diederik van Dijk c.s. over een meer risicogerichte | Procedureel/technisch | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 1507 | Motie van het lid De Vos over empirische natuurgegevens als juridisch houdbaar a | Systeemontmanteling | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 1572 | Motie van de leden Van Campen en Eerdmans over de impact van wolfaanvallen in ka | Lokaal/regionaal | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 1705 | Motie van het lid Dekker over voorstellen ter vermindering van de regeldruk | Consensus framing (gedeeld belang) | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 1831 | Motie van het lid Van der Plas over het voorzorgsbeginsel zo toepassen dat het p | Consensus framing (gedeeld belang) | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 2014 | Motie van het lid Van Zanten over in asielzaken uitsluitend beroep bij één insta | Systeemontmanteling | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 2168 | Amendement van de leden Eerdmans en Diederik van Dijk ter vervanging van nr. 7 o | Institutioneel/rechtsstatelijk | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 2170 | Amendement van de leden Diederik van Dijk en Eerdmans ter vervanging van nr. 4 o | Institutioneel/rechtsstatelijk | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 2264 | Motie van het lid Van der Hoeff over alle kosten van vernielingen gepleegd tijde | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 2496 | Motie van het lid Vermeer over een lanceercapaciteit voor satellieten op het gro | Procedureel/technisch | Consensus framing (gedeeld belang) | 4 | Consensus framing (gedeeld belang) | second |
 | 2662 | Motie van de leden Bikker en Diederik van Dijk over voorkomen dat Nederlandse ke | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 2878 | Motie van het lid Inge van Dijk c.s. over een voorstel voor het inpassen van de  | Welzijn/dienstverlening uitbreiding | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 3298 | Motie van het lid Diederik van Dijk c.s. over zich scharen achter het vredesplan | Symbolisch/declaratoir | Consensus framing (gedeeld belang) | 4 | Consensus framing (gedeeld belang) | second |
 | 3354 | Amendement van het lid Michon-Derkzen over het verhogen van het strafmaximum van | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 3468 | Motie van de leden Yesilgöz-Zegerius en Bikker over zo snel mogelijk overgaan to | Institutioneel/rechtsstatelijk | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 3472 | Gewijzigde motie van de leden Van der Plas en Yesilgöz-Zegerius over wetgeving v | Institutioneel/rechtsstatelijk | Gerichte restrictie | 5 | Gerichte restrictie | second |
 | 3569 | Gewijzigde motie van de leden Wijen-Nass en Diederik van Dijk over inventarisere | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 3629 | Motie van het lid Ceder over een conferentie over modernisering van het VN-Vluch | Symbolisch/declaratoir | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 3678 | Motie van het lid Wilders over de invoer van een totale asielstop alsmede een st | Systeemontmanteling | Gerichte restrictie | 5 | Gerichte restrictie | second |
 | 3687 | Motie van de leden Van der Plas en Yesilgöz-Zegerius over het initiatief van de  | Gerichte restrictie | Institutioneel/rechtsstatelijk | 5 | Institutioneel/rechtsstatelijk | second |
 | 3760 | Motie van het lid Peter de Groot c.s. over de Wet op de defensiegereedheid na on | Consensus framing (gedeeld belang) | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 3784 | Motie van de leden Wendel en Van Brenk over informatiedeling over zorgfraude mog | Procedureel/technisch | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 3830 | Motie van het lid Van Meetelen over stoppen met betuttelend beleid gericht op vo | Systeemontmanteling | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 3877 | Gewijzigde motie van de leden Ceder en Diederik van Dijk over signalen en inzet  | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4080 | Motie van het lid Coenradie over een onderzoek naar zwaardere, dwingende vormen  | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4221 | Motie van het lid Van der Plas over een duidelijke overheadnorm opstellen voor d | Systeemontmanteling | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 4227 | Motie van het lid Peter de Groot over de oeververbinding bij de sluis van Nijker | Consensus framing (gedeeld belang) | Lokaal/regionaal | 4 | Lokaal/regionaal | second |
 | 4309 | Motie van het lid Coenradie over gerichter doelgroepenbeleid bij handhaving | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4394 | Motie van het lid Van der Plas over het luchtdrukwapen met zogenaamde beanbags o | Institutioneel/rechtsstatelijk | Procedureel/technisch | 3 | Institutioneel/rechtsstatelijk | original |
 | 4436 | Motie van het lid Diederik van Dijk c.s. over in overleg met het OM in een aanwi | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4481 | Motie van het lid Ceder c.s. over het verwerven van control points expliciet ond | Consensus framing (gedeeld belang) | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 4489 | Motie van het lid Van der Plas over een onderzoek naar de invloed van verstoring | Procedureel/technisch | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4656 | Motie van het lid Dekker over niet akkoord gaan met toetreding van Oekraïne tot  | Symbolisch/declaratoir | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 4660 | Motie van het lid Diederik van Dijk over verkennen of en hoe verdere samenwerkin | Consensus framing (gedeeld belang) | Coalitie-afstemming | 4 | Coalitie-afstemming | second |
 | 4933 | Wijziging van de Omgevingswet en enkele andere wetten met het oog op het bescher | Procedureel/technisch | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 9149 | Motie van het lid Valstar c.s. over steun voor bewapening van de MQ-9 Reaper | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 9769 | Motie van het lid Vondeling over er alles aan doen om Syriërs huiswaarts te late | Gerichte restrictie | Welzijn/dienstverlening uitbreiding | 3 | Gerichte restrictie | original |
 | 9789 | Motie van het lid Diederik van Dijk c.s. over de Tijdelijke wet bestuurlijke maa | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 10110 | Amendement van het lid Bontenbal c.s. over dekking van het maatregelenpakket voo | Coalitie-afstemming | Procedureel/technisch | 5 | Procedureel/technisch | second |
 | 10167 | Amendement van het lid Flach over € 2 miljoen voor pilotprojecten voor de aanpak | Lokaal/regionaal | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 10278 | Amendement van het lid Bontenbal c.s. over dekking van het maatregelenpakket voo | Coalitie-afstemming | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 10290 | Motie van het lid Eerdmans over ten minste één concreet migratieproject uitwerke | Gerichte restrictie | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 10413 | Motie van het lid Diederik van Dijk c.s. over de maximale juridische ruimte opzo | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 10420 | Motie van het lid Van der Wal c.s. over het vergroten van de weerbaarheid van Ne | Crisisrespons | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 10597 | Motie van het lid Eerdmans over middels een AMvB de derde waarnemer bij preventi | Institutioneel/rechtsstatelijk | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 11382 | Gewijzigd amendement van het lid Van der Molen t.v.v. nr. 21 over het schrappen  | Procedureel/technisch | Systeemontmanteling | 4 | Systeemontmanteling | second |
 | 14554 | Motie van het lid Schonis over een kwartiermaker toeristische samenwerking | Procedureel/technisch | Consensus framing (gedeeld belang) | 4 | Consensus framing (gedeeld belang) | second |
 | 15005 | Motie van het lid Aartsen over een periodiek overlegorgaan voor franchisegevers  | Procedureel/technisch | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 15772 | Motie van het lid De Jong over pensioenkortingen voorkomen | Systeemontmanteling | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 16430 | Motie van het lid Tony van Dijck over geen 45 miljard euro overmaken naar Zuid-  | Symbolisch/declaratoir | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 16691 | Motie van het lid Geurts over het doorbreken van de vicieuze cirkel rond de toen | Procedureel/technisch | Crisisrespons | 4 | Crisisrespons | second |
 | 16999 | Motie van de leden Van Haga en Baudet over het tegengaan van verdere oneerlijke  | Consensus framing (gedeeld belang) | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 17036 | Motie van het lid Kerstens over onderzoeken of Defensie in aanmerking komt voor  | Welzijn/dienstverlening uitbreiding | Crisisrespons | 4 | Crisisrespons | second |
 | 17536 | Motie van het lid Yesilgöz-Zegerius over in heel het Schengengebied haatprediker | Institutioneel/rechtsstatelijk | Gerichte restrictie | 5 | Gerichte restrictie | second |
 | 17681 | Motie van de leden Van Haga en Baudet over een plan van aanpak om de fiscaliteit | Consensus framing (gedeeld belang) | Systeemontmanteling | 4 | Systeemontmanteling | second |
 | 17751 | Gewijzigde motie van de leden Stoffer en Van Haga over een nullijn voor de ontwi | Consensus framing (gedeeld belang) | Symbolisch/declaratoir | 4 | Symbolisch/declaratoir | second |
 | 18030 | Motie van het lid Stoffer over zo snel mogelijk de snelwegverlichting 's nachts  | Procedureel/technisch | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 18062 | Motie van het lid Krol over excuses voor de fouten die leidden tot slachtoffers  | Crisisrespons | Symbolisch/declaratoir | 5 | Symbolisch/declaratoir | second |
 | 18691 | Motie van het lid Karabulut over geen extra troepen naar Afghanistan | Symbolisch/declaratoir | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 20215 | Gewijzigde motie van het lid Boswijk c.s. over onderzoeken hoe hoogwaardige land | Welzijn/dienstverlening uitbreiding | Institutioneel/rechtsstatelijk | 3 | Welzijn/dienstverlening uitbreiding | original |
 | 21801 | Motie van het lid Van Haga c.s. over de Defensievisie 2035 omarmen | Consensus framing (gedeeld belang) | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 21982 | Motie van het lid Graus c.s. over het zwartboek regeldruk van MKB-Nederland ter  | Consensus framing (gedeeld belang) | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 22280 | Motie van het lid Van der Plas over de kosten berekenen die op het bord van de b | Lokaal/regionaal | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 22676 | Motie van het lid Diederik van Dijk c.s. over een grootschalig en breedgedragen  | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 22853 | Motie van het lid Peter de Groot over nog voor het zomerreces additionele maatre | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 23013 | Amendement van het lid Diederik van Dijk over budget voor de uitvoering van het  | Institutioneel/rechtsstatelijk | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 23030 | Motie van het lid Eerdmans over in het verdeelbesluit geen asielopvangplekken op | Gerichte restrictie | Lokaal/regionaal | 4 | Lokaal/regionaal | second |
 | 23141 | Motie van het lid Eerdmans over de mogelijkheid tot inzet van de KMar actief ond | Institutioneel/rechtsstatelijk | Welzijn/dienstverlening uitbreiding | 4 | Welzijn/dienstverlening uitbreiding | second |
 | 23206 | Motie van het lid Nordkamp c.s. over het in kaart brengen van het aandeel van in | Procedureel/technisch | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 23287 | Motie van het lid Helder c.s. over het wetsvoorstel inzake het taakstrafverbod b | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 23301 | Motie van de leden Tuinman en Boswijk over het onderzoeken van voorstellen met b | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 23441 | Motie van de leden Van Zanten en Stoffer over een deel van het budget voor kanse | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 23454 | Motie van het lid Joseph over een analyse laten maken van de juridische risico's | Procedureel/technisch | Institutioneel/rechtsstatelijk | 5 | Institutioneel/rechtsstatelijk | second |
 | 23885 | Motie van het lid Aartsen c.s. over verkennen hoe toetsings- of toezichtkaders a | Consensus framing (gedeeld belang) | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 23984 | Motie van het lid Pierik over de eisen aan de eco-regeling in de periode 2025-20 | Systeemontmanteling | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 24008 | Motie van het lid Holman c.s. over bij de Europese Commissie bevorderen dat de b | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 24046 | Motie van het lid Keijzer c.s. over de minister zich kenbaar laten onthouden van | Symbolisch/declaratoir | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 24077 | Motie van het lid De Roon over een onderzoek instellen naar de rol en verantwoor | Symbolisch/declaratoir | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 24358 | Motie van de leden Helder en Uitermark over het vergroten van de personeelscapac | Institutioneel/rechtsstatelijk | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 24632 | Motie van de leden Veltman en Vedder over het voor de politie mogelijk maken om  | Institutioneel/rechtsstatelijk | Gerichte restrictie | 4 | Gerichte restrictie | second |
 | 24650 | Gewijzigd amendement van de leden Dijk en Flach ter vervanging van nr. 13 over e | Procedureel/technisch | Institutioneel/rechtsstatelijk | 4 | Institutioneel/rechtsstatelijk | second |
 | 24651 | Motie van de leden Inge van Dijk en Van Oostenbruggen over een arbeidsmigratieto | Gerichte restrictie | Consensus framing (gedeeld belang) | 4 | Consensus framing (gedeeld belang) | second |
 | 25061 | Motie van het lid Kisteman c.s. over een vereenvoudiging van de RI&E-verplichtin | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 25062 | Motie van het lid Kisteman c.s. over een voor het mkb werkbare wijze van werken  | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 25079 | Motie van de leden Bontenbal en Flach over de Europese standaarden voor stikstof | Consensus framing (gedeeld belang) | Procedureel/technisch | 4 | Procedureel/technisch | second |
 | 25451 | Motie van het lid Ceder over berekenen hoeveel geld de Palestijnse Autoriteit ja | Symbolisch/declaratoir | Gerichte restrictie | 5 | Gerichte restrictie | second |
 | 25469 | Motie van de leden Eerdmans en Diederik van Dijk over samen met gelijkgestemde E | Gerichte restrictie | Coalitie-afstemming | 4 | Coalitie-afstemming | second |
 | 25616 | Motie van het lid Eerdmans over de wettelijke taakstellingen voor gemeenten voor | Gerichte restrictie | Systeemontmanteling | 4 | Systeemontmanteling | second |
 | 25982 | Gewijzigde motie van het lid Bisschop c.s. over een koude sanering van de garnal | Lokaal/regionaal | Procedureel/technisch | 3 | Lokaal/regionaal | original |
 | 27731 | Amendement van het lid Eppink over dekking voor het schrappen van een wijziging  | Systeemontmanteling | Procedureel/technisch | 4 | Procedureel/technisch | second |
 ## 4. Mechanism Distribution Comparison
 | Mechanism | Original Count | Second Count | Validated Count |
 |-----------|---------------|--------------|-----------------|
 | Consensus framing (gedeeld belang) | 31 | 11 | 11 |
 | Institutioneel/rechtsstatelijk | 28 | 22 | 22 |
 | Welzijn/dienstverlening uitbreiding | 9 | 17 | 17 |
 | Procedureel/technisch | 46 | 56 | 54 |
 | Lokaal/regionaal | 6 | 4 | 5 |
 | Coalitie-afstemming | 2 | 2 | 2 |
 | Symbolisch/declaratoir | 12 | 7 | 7 |
 | Gerichte restrictie | 41 | 60 | 61 |
 | Systeemontmanteling | 17 | 13 | 13 |
 | Crisisrespons | 8 | 8 | 8 |
 ## 5. Confusion Matrix (Top Rows)
 | Original \ Second | Consensus framing /  | Institutional / rule | Welfare / service ex | Procedural / technic | Local / regional con | Coalition alignment | Symbolic / declarato | Targeted restriction | System dismantling | Crisis response |
 |---|---|---|---|---|---|---|---|---|---|---|
 | Consensus framing /  | 6 | 5 | 3 | 11 | 1 | 1 | 1 | 2 | 1 | 0 |
 | Institutional / rule | 0 | 6 | 2 | 6 | 0 | 0 | 0 | 14 | 0 | 0 |
 | Welfare / service ex | 0 | 2 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
 | Procedural / technic | 2 | 5 | 2 | 30 | 0 | 0 | 1 | 3 | 2 | 1 |
 | Local / regional con | 0 | 0 | 2 | 2 | 2 | 0 | 0 | 0 | 0 | 0 |
 | Coalition alignment | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 |
 | Symbolic / declarato | 1 | 2 | 0 | 1 | 0 | 0 | 4 | 4 | 0 | 0 |
 | Targeted restriction | 2 | 1 | 1 | 1 | 1 | 1 | 0 | 33 | 1 | 0 |
 | System dismantling | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 4 | 9 | 0 |
 | Crisis response | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 6 |
 ## 6. Conclusion
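 Cohen's kappa of **0.4082** sits at the lower edge of the **moderate agreement** band (0.41–0.60 on the Landis and Koch scale), so agreement between the original inline classification and the independent second classifier is only just above "fair".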
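The kappa value can be recomputed directly from the confusion matrix in section 5. The sketch below is a minimal, dependency-free check, not the code in `mechanism_validation.py`; the names `CONFUSION` and `cohens_kappa` are illustrative.

```python
# Confusion matrix from section 5: rows = original label, columns = second label.
CONFUSION = [
    [6, 5, 3, 11, 1, 1, 1, 2, 1, 0],
    [0, 6, 2, 6, 0, 0, 0, 14, 0, 0],
    [0, 2, 5, 1, 0, 0, 0, 0, 0, 1],
    [2, 5, 2, 30, 0, 0, 1, 3, 2, 1],
    [0, 0, 2, 2, 2, 0, 0, 0, 0, 0],
    [0, 0, 0, 2, 0, 0, 0, 0, 0, 0],
    [1, 2, 0, 1, 0, 0, 4, 4, 0, 0],
    [2, 1, 1, 1, 1, 1, 0, 33, 1, 0],
    [0, 1, 1, 2, 0, 0, 0, 4, 9, 0],
    [0, 0, 1, 0, 0, 0, 1, 0, 0, 6],
]


def cohens_kappa(matrix):
    """Cohen's kappa for a square confusion matrix of two raters."""
    n = sum(sum(row) for row in matrix)
    # Observed agreement: share of items on the diagonal.
    observed = sum(matrix[i][i] for i in range(len(matrix))) / n
    # Chance agreement: product of the two raters' marginal shares per label.
    row_totals = [sum(row) for row in matrix]
    col_totals = [sum(col) for col in zip(*matrix)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (n * n)
    return (observed - expected) / (1 - expected)


kappa = cohens_kappa(CONFUSION)
print(f"kappa = {kappa:.4f}")  # kappa = 0.4082
```

The diagonal sums to 101 of 200 (observed agreement 0.505), matching the key findings that follow.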
 ### Key findings:
 - 101 out of 200 motions agreed (50.5%)
 - 99 disagreements resolved: 4 kept original, 95 adopted second
 ### Most common disagreement pairs (original / second):
 - institutional_rule_of_law / targeted_restriction: 14 times
 - consensus_framing / procedural_technical: 11 times
 - institutional_rule_of_law / procedural_technical: 6 times
 - procedural_technical / institutional_rule_of_law: 5 times
 - consensus_framing / institutional_rule_of_law: 5 times
 ### Revised mechanism taxonomy recommendation:
 - Taxonomy needs revision to improve inter-rater reliability.
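 - Most confused pair: institutional_rule_of_law (original) / targeted_restriction (second), with 14 cases against only 1 in the reverse direction. Consider merging the two categories or sharpening the distinction between them.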
--- a/reports/overton_window/party_differentiation.md
+++ b/reports/overton_window/party_differentiation.md
@ -0,0 +1,113 @@
 # Right-Wing Party Differentiation
 **Goal:** Break down right-wing motion metrics by party (PVV, FVD, JA21, SGP)
 to identify which party drives the moderation effect.
 **Analysis period:** 2016–2026
 **Right-wing parties:** FVD, JA21, PVV, SGP
 **Data:** 962 right-wing submitter motions with 2D extremity scores
 (from 2,850 classified right-wing motions total; 1,888 could not be parsed/party-matched).
 ---
 ## 1. Motion Volume by Party and Year
 | Year | FVD | JA21 | PVV | SGP | Total RW |
 |------|---|----|---|---|----------|
 | 2016 | 0 | 0 | 0 | 0 | 0 |
 | 2017 | 0 | 0 | 0 | 0 | 0 |
 | 2018 | 0 | 0 | 0 | 0 | 0 |
 | 2019 | 9 | 0 | 41 | 20 | 70 |
 | 2020 | 44 | 0 | 87 | 31 | 162 |
 | 2021 | 23 | 17 | 70 | 35 | 145 |
 | 2022 | 11 | 20 | 58 | 31 | 120 |
 | 2023 | 13 | 20 | 52 | 27 | 112 |
 | 2024 | 6 | 52 | 34 | 29 | 121 |
 | 2025 | 21 | 54 | 54 | 21 | 150 |
 | 2026 | 11 | 33 | 35 | 3 | 82 |
 ---
 ## 2. Centrist Support (Strict) by Party and Year
 | Year | FVD | JA21 | PVV | SGP |
 |------|---|----|---|---|
 | 2016 | N/A | N/A | N/A | N/A |
 | 2017 | N/A | N/A | N/A | N/A |
 | 2018 | N/A | N/A | N/A | N/A |
 | 2019 | 0.000 | N/A | 0.074 | 0.350 |
 | 2020 | 0.057 | N/A | 0.052 | 0.387 |
 | 2021 | 0.000 | 0.088 | 0.014 | 0.286 |
 | 2022 | 0.000 | 0.050 | 0.043 | 0.242 |
 | 2023 | 0.000 | 0.075 | 0.067 | 0.407 |
 | 2024 | 0.056 | 0.212 | 0.314 | 0.506 |
 | 2025 | 0.095 | 0.315 | 0.139 | 0.603 |
 | 2026 | 0.000 | 0.300 | 0.086 | 0.167 |
 ---
 ## 3. Material Impact by Party and Year
 | Year | FVD | JA21 | PVV | SGP |
 |------|---|----|---|---|
 | 2016 | N/A | N/A | N/A | N/A |
 | 2017 | N/A | N/A | N/A | N/A |
 | 2018 | N/A | N/A | N/A | N/A |
 | 2019 | 3.56 | N/A | 3.34 | 2.65 |
 | 2020 | 3.18 | N/A | 3.30 | 2.84 |
 | 2021 | 2.96 | 3.41 | 3.23 | 2.91 |
 | 2022 | 2.45 | 3.05 | 2.67 | 2.26 |
 | 2023 | 2.92 | 3.85 | 3.25 | 2.74 |
 | 2024 | 3.50 | 3.13 | 2.50 | 2.52 |
 | 2025 | 3.00 | 2.44 | 2.50 | 2.10 |
 | 2026 | 1.91 | 2.36 | 2.54 | 2.00 |
 ---
 ## 4. Pre/Post-2024 Comparison by Party
 | Party | N Pre | N Post | CS Pre | CS Post | Delta CS | Mat. Pre | Mat. Post | Delta Mat. | Vol. Delta |
 |-------|-------|--------|--------|---------|----------|----------|-----------|------------|------------|
 | FVD | 100 | 38 | 0.025 | 0.061 | +0.036 | 3.05 | 2.76 | -0.29 | -62 |
 | JA21 | 57 | 139 | 0.070 | 0.273 | +0.203 | 3.44 | 2.68 | -0.76 | +82 |
 | PVV | 308 | 123 | 0.047 | 0.172 | +0.125 | 3.16 | 2.51 | -0.65 | -185 |
 | SGP | 144 | 53 | 0.330 | 0.525 | +0.195 | 2.69 | 2.32 | -0.37 | -91 |
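The `N Pre`, `N Post`, and `Vol. Delta` columns can be reproduced from the yearly counts in section 1 when 2024 is counted as the first post-period year. The sketch below shows that arithmetic; `YEARLY_VOLUME`, `CUTOFF_YEAR`, and `pre_post_volume` are illustrative names, not identifiers from the analysis script.

```python
# Yearly motion counts per party, copied from section 1 (years with zero motions omitted).
YEARLY_VOLUME = {
    "FVD": {2019: 9, 2020: 44, 2021: 23, 2022: 11, 2023: 13, 2024: 6, 2025: 21, 2026: 11},
    "JA21": {2021: 17, 2022: 20, 2023: 20, 2024: 52, 2025: 54, 2026: 33},
    "PVV": {2019: 41, 2020: 87, 2021: 70, 2022: 58, 2023: 52, 2024: 34, 2025: 54, 2026: 35},
    "SGP": {2019: 20, 2020: 31, 2021: 35, 2022: 31, 2023: 27, 2024: 29, 2025: 21, 2026: 3},
}

# Assumption: a calendar-year split, with 2024 belonging to the post period.
CUTOFF_YEAR = 2024


def pre_post_volume(by_year, cutoff=CUTOFF_YEAR):
    """Return (pre, post, delta) motion counts around the cutoff year."""
    pre = sum(n for year, n in by_year.items() if year < cutoff)
    post = sum(n for year, n in by_year.items() if year >= cutoff)
    return pre, post, post - pre


for party, by_year in YEARLY_VOLUME.items():
    print(party, pre_post_volume(by_year))
```

Because the pre period spans five years with data (2019–2023) and the post period fewer than three, the raw delta is not a per-year rate.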
 ---
 ## 5. Key Findings
 ### Centrist Support Shift (largest to smallest)
 - **JA21**: +0.203
 - **SGP**: +0.195
 - **PVV**: +0.125
 - **FVD**: +0.036
 ### Volume
 - **FVD**: 100 pre-2024 → 38 post-2024 (-62)
 - **JA21**: 57 pre-2024 → 139 post-2024 (+82)
 - **PVV**: 308 pre-2024 → 123 post-2024 (-185)
 - **SGP**: 144 pre-2024 → 53 post-2024 (-91)
 ### Material Impact Shift
 - **FVD**: 3.05 → 2.76 (-0.29)
 - **JA21**: 3.44 → 2.68 (-0.76)
 - **PVV**: 3.16 → 2.51 (-0.65)
 - **SGP**: 2.69 → 2.32 (-0.37)
 ---
 ## 6. Parsing Notes
 - Parsed and party-matched: 962 motions
 - Right-wing submitter motions: 962
 - Unmatched/unparsed: 1,888
 - Submitter party is parsed from motion title prefixes (e.g. 'Motie van het lid Wilders ...').
 - Multi-submitter motions use the first listed submitter.
 - Party names are normalized via `_PARTY_NORMALIZE` (e.g. Groep Markuszower → PVV).
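The parsing rules above can be sketched roughly as follows. This is a hedged illustration, not the repository's parser: the regular expression, the `MEMBER_TO_PARTY` lookup, and the single-entry `PARTY_NORMALIZE` table are assumptions made for the example (only the Groep Markuszower → PVV mapping comes from the notes above).

```python
import re

# Assumed title pattern: "... van het lid X ..." or "... van de leden X en Y ...".
# The lazy group stops at the first separator, so only the lead submitter is kept.
LEAD_SUBMITTER = re.compile(
    r"van (?:het lid|de leden) (?P<name>.+?)(?:,| c\.s\.| en | over | ter )"
)

# Hypothetical lookup tables for illustration only.
MEMBER_TO_PARTY = {
    "Wilders": "PVV",
    "Eerdmans": "JA21",
    "Markuszower": "Groep Markuszower",
}
PARTY_NORMALIZE = {"Groep Markuszower": "PVV"}


def lead_submitter(title):
    """Return the first listed submitter in a motion title, or None."""
    match = LEAD_SUBMITTER.search(title)
    return match.group("name") if match else None


def submitter_party(title):
    """Map a title to a normalized party name, or None if unmatched."""
    party = MEMBER_TO_PARTY.get(lead_submitter(title))
    return PARTY_NORMALIZE.get(party, party)


print(lead_submitter("Motie van de leden Tuinman en Boswijk over het onderzoeken"))
print(submitter_party("Motie van het lid Eerdmans over de wettelijke taakstellingen"))
```

Titles whose lead submitter is missing from the lookup return `None`, which is one way motions could end up in the unmatched/unparsed count.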
 ---
 ## 7. Figure
 ![Party differentiation figure](party_differentiation_figure.png)
--- a/reports/overton_window/party_differentiation_figure.png
+++ b/reports/overton_window/party_differentiation_figure.png
--- a/reports/overton_window/predictive_model.md
+++ b/reports/overton_window/predictive_model.md
@ -0,0 +1,100 @@
 # Predictive Model: Centrist Support
 **Generated:** 2026-05-31 19:36
 ## Data Summary
 - Total classified right-wing motions with 2D extremity scores: **2850**
 - Valid for modeling (right-wing submitter party + valid category): **914**
 - High centrist support (>0.5) : 115 motions
 - Low centrist support (<=0.5): 799 motions
 - Class imbalance ratio: 6.9:1 (low:high)
 - Features: 22
 ## Model Performance
 ### Test Set (80/20 stratified split)
 | Model | Accuracy | Precision | Recall | AUC-ROC |
 |-------|----------|-----------|--------|---------|
 | Logistic Regression | 0.710 | 0.258 | 0.696 | 0.810 |
 | Random Forest | 0.852 | 0.423 | 0.478 | 0.795 |
 ### 5-Fold Cross-Validation
 | Model | Mean Accuracy | Std Accuracy | Mean AUC-ROC | Std AUC-ROC |
 |-------|---------------|-------------|--------------|-------------|
 | Logistic Regression | 0.718 | 0.032 | 0.815 | 0.036 |
 | Random Forest | 0.862 | 0.016 | 0.835 | 0.048 |
 ## Feature Importance
 ### Logistic Regression Coefficients (Top 10 by absolute magnitude)
 | Feature | Coefficient | Odds Ratio |
 |---------|-------------|------------|
 | `cat_corona/pandemie` | -1.4680 | 0.2304 |
 | `party_FVD` | -1.3282 | 0.2650 |
 | `party_SGP` | 0.9877 | 2.6852 |
 | `party_JA21` | 0.9264 | 2.5255 |
 | `stijl_extremiteit` | -0.6859 | 0.5036 |
 | `party_PVV` | -0.6394 | 0.5276 |
 | `cat_onderwijs/cultuur` | 0.5472 | 1.7285 |
 | `cat_zorg/gezondheid` | -0.4857 | 0.6153 |
 | `materiele_impact` | -0.4741 | 0.6225 |
 | `cat_overig` | 0.4658 | 1.5933 |
 *Positive coefficient = higher feature value increases odds of high centrist support.*
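The odds-ratio column is the exponential of the coefficient column. A short check on a few rows from the table above (the dictionary name `COEFFICIENTS` is illustrative):

```python
import math

# Coefficients copied from the table above (rounded to four decimals there).
COEFFICIENTS = {
    "cat_corona/pandemie": -1.4680,
    "party_FVD": -1.3282,
    "party_SGP": 0.9877,
    "stijl_extremiteit": -0.6859,
}

# Odds ratio = exp(coefficient); values below 1 lower the odds, above 1 raise them.
odds_ratios = {name: math.exp(coef) for name, coef in COEFFICIENTS.items()}

for name, ratio in odds_ratios.items():
    print(f"{name}: {ratio:.4f}")
```

Small differences in the last digit relative to the table are expected, since the coefficients shown are themselves rounded.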
 ### Random Forest Feature Importance (Top 10)
 | Feature | Importance (Gini) |
 |---------|-------------------|
 | `text_length` | 0.2137 |
 | `year` | 0.1915 |
 | `stijl_extremiteit` | 0.1410 |
 | `materiele_impact` | 0.0946 |
 | `party_SGP` | 0.0652 |
 | `party_FVD` | 0.0489 |
 | `party_PVV` | 0.0407 |
 | `cat_veiligheid/justitie` | 0.0258 |
 | `cat_defensie/buitenland` | 0.0246 |
 | `party_JA21` | 0.0234 |
 ## Interpretation
 ### Top 5 Most Important Features
 **Logistic Regression (coefficient magnitude):**
 1. `cat_corona/pandemie` (coef=-1.4680, OR=0.2304) — decreases odds of high centrist support
 2. `party_FVD` (coef=-1.3282, OR=0.2650) — decreases odds of high centrist support
 3. `party_SGP` (coef=0.9877, OR=2.6852) — increases odds of high centrist support
 4. `party_JA21` (coef=0.9264, OR=2.5255) — increases odds of high centrist support
 5. `stijl_extremiteit` (coef=-0.6859, OR=0.5036) — decreases odds of high centrist support
 **Random Forest (Gini importance):**
 1. `text_length` (importance=0.2137)
 2. `year` (importance=0.1915)
 3. `stijl_extremiteit` (importance=0.1410)
 4. `materiele_impact` (importance=0.0946)
 5. `party_SGP` (importance=0.0652)
 ### Which features best predict centrist support?
 The two models rank predictors differently. In the logistic regression, **category**
 and **submitter party** carry the largest coefficients: certain policy domains and
 specific right-wing parties systematically attract more or fewer centrist votes. The
 random forest instead ranks `text_length` and `year` highest, with the party dummies
 further down the list. **Stylistic extremity (stijl_extremiteit)** and **material
 impact (materiele_impact)** are negative predictors in the logistic regression and
 rank third and fourth in the random forest: motions scoring higher on either
 dimension tend to receive less centrist support. Stylistic extremity ranks above
 material impact in both models (coef=-0.6859 vs -0.4741; importance=0.1410 vs 0.0946),
 so these results do not show that centrist parties respond more to substantive
 content than to rhetorical framing.
 The **is_opposition** flag appears in neither top-10 list above, so these tables alone
 do not establish how opposition-submitted motions differ from coalition-submitted ones.
 ### Caveats
 - Only motions with 2D extremity scores (LLM-annotated), a parsed right-wing submitter party, and a valid category are included (n=914 of 2,850).
 - Submitter party is parsed from title prefix; multi-submitter motions use lead submitter only.
 - Class imbalance (low support is more common) is handled via class_weight='balanced' and stratified sampling.
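For reference, scikit-learn's `class_weight='balanced'` heuristic weights each class by `n_samples / (n_classes * class_count)`. Applied to the class counts in the data summary, and assuming for illustration that the weights are computed on the full 914 motions rather than on a training split, this gives:

```python
# Class counts from the data summary.
CLASS_COUNTS = {"high": 115, "low": 799}


def balanced_weights(counts):
    """Weights following the 'balanced' heuristic: n_samples / (n_classes * count)."""
    n_samples = sum(counts.values())
    n_classes = len(counts)
    return {label: n_samples / (n_classes * count) for label, count in counts.items()}


weights = balanced_weights(CLASS_COUNTS)
for label, weight in weights.items():
    print(f"{label}: {weight:.3f}")
```

Each high-support motion counts roughly seven times as much as a low-support one in the loss, which is consistent with the logistic regression's high recall and low precision on the test set.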
--- a/reports/overton_window/predictive_model_figure.png
+++ b/reports/overton_window/predictive_model_figure.png
--- a/reports/overton_window/svd_trajectory_figure.png
+++ b/reports/overton_window/svd_trajectory_figure.png
--- a/reports/overton_window/voting_margin.md
+++ b/reports/overton_window/voting_margin.md
@ -0,0 +1,154 @@
 # Voting Margin Analysis
 **Goal:** Replace binary pass/fail with continuous voting margin as the primary
 success metric for right-wing motions in the Tweede Kamer.
 **Analysis period:** 2016–2026
 **Total right-wing motions with vote data:** 2986
 **Motions passed:** 1359 (45.5%)
 **Motions failed:** 1627 (54.5%)
 ---
 ## 1. Methodology
 The voting margin is computed from `motions.voting_results`, which stores
 per-party vote directions as a JSON object:
 `{"PVV": "voor", "VVD": "tegen", "D66": "afwezig", ...}`.
 ```
 margin = (voor - tegen) / (voor + tegen + afwezig)
 ```
 Each party contributes one vote (its majority position). The margin ranges
 from -1 (unanimous rejection) to +1 (unanimous support). A margin of 0
 indicates an exact tie or no participating parties.
 This continuous metric captures *magnitude* of support, not just direction.
 A motion that passes 14-1 has margin = +0.87, while one that passes 8-7 has
 margin = +0.07. Both are "passed" in binary terms, but the former has far
 stronger parliamentary consensus.
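A minimal sketch of the margin formula, assuming `voting_results` has already been decoded from JSON into a party-to-direction mapping and that any direction other than `voor`, `tegen`, or `afwezig` is ignored (that last rule is an assumption, not something the report states):

```python
import json


def voting_margin(voting_results):
    """Per-party margin: (voor - tegen) / (voor + tegen + afwezig)."""
    counts = {"voor": 0, "tegen": 0, "afwezig": 0}
    for direction in voting_results.values():
        if direction in counts:  # assumption: unknown directions are skipped
            counts[direction] += 1
    total = sum(counts.values())
    if total == 0:
        return 0.0  # no participating parties
    return (counts["voor"] - counts["tegen"]) / total


example = json.loads('{"PVV": "voor", "VVD": "tegen", "D66": "afwezig"}')
print(voting_margin(example))

# The two worked cases from the text, with 15 hypothetical parties P0..P14.
broad = {f"P{i}": "voor" if i < 14 else "tegen" for i in range(15)}
narrow = {f"P{i}": "voor" if i < 8 else "tegen" for i in range(15)}
print(round(voting_margin(broad), 2), round(voting_margin(narrow), 2))
```

Absent parties stay in the denominator, so they pull the margin toward zero without changing its sign.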
 > **Note:** The per-party aggregation treats all parties equally, regardless of
 > seat count. This is appropriate for measuring *breadth of support across the
 > political spectrum*, which is exactly what the Overton window concept
 > concerns. Seat-weighted margins would be confounded by coalition size effects.
 ---
 ## 2. Correlation: Margin vs Centrist Support
 | Metric | Value |
 |--------|-------|
 | Spearman ρ | 0.812 |
 | Spearman p-value | 0.0e+00 |
 | Pearson r | 0.822 |
 | Pearson p-value | 0.0e+00 |
 The Spearman correlation is significant (ρ = 0.812; the p-value underflows to 0.0e+00 in floating point), indicating a strong positive monotonic relationship between centrist support and voting margin.
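Spearman's ρ is the Pearson correlation of the ranks, with tied values sharing their average rank. The dependency-free sketch below illustrates the computation on invented toy data; it is not the report's code, which may well use a library routine.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Invented toy data: centrist support and margin for five motions.
toy_support = [0.0, 0.2, 0.5, 0.8, 1.0]
toy_margin = [-0.4, -0.1, 0.1, 0.3, 0.6]
print(spearman(toy_support, toy_margin))
```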
 ---
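 ## 3. Margin Distribution by Centrist Support Quartile
 The four strata Q1–Q4 are equal-width centrist-support bins rather than equal-count quartiles, so their sizes differ.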
 ### Summary Table
 | Stratum | Q1 [0.00–0.25] | Q2 (0.25–0.50] | Q3 (0.50–0.75] | Q4 (0.75–1.00] |
 |---------|:------:|:------:|:------:|:------:|
 | all | -0.263 (n=1589) | +0.087 (n=536) | +0.212 (n=230) | +0.483 (n=631) |
 | pre-2024 | -0.261 (n=1247) | +0.122 (n=357) | +0.232 (n=10) | +0.420 (n=297) |
 | post-2024 | -0.269 (n=342) | +0.017 (n=179) | +0.211 (n=220) | +0.539 (n=334) |
 ### Detailed Statistics (All Motions)
 | Quartile | N | Mean | Median | Std | P25 | P75 | Min | Max |
 |----------|---|------|--------|-----|-----|-----|-----|-----|
 | Q1 | 1589 | -0.263 | -0.294 | 0.228 | -0.450 | -0.100 | -0.733 | +0.438 |
 | Q2 | 536 | +0.087 | +0.067 | 0.220 | -0.067 | +0.238 | -0.467 | +0.625 |
 | Q3 | 230 | +0.212 | +0.200 | 0.165 | +0.067 | +0.333 | -0.200 | +0.600 |
 | Q4 | 631 | +0.483 | +0.467 | 0.173 | +0.368 | +0.600 | -0.125 | +0.765 |
 **Q4 – Q1 gap in mean margin:** +0.746
 The gap of +0.746 is more than three times the largest within-stratum standard deviation (0.228), so motions with the highest centrist support (Q4) have a substantially higher voting margin than those with the lowest (Q1).
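The stratum boundaries in the summary table (closed at 0.25, 0.50, and 0.75 on the upper side) and the Q4 – Q1 gap can be expressed as below. The function name `support_bin` is illustrative, and the mean margins are copied from the detailed statistics table.

```python
def support_bin(centrist_support):
    """Assign a centrist-support value in [0, 1] to its stratum label."""
    if centrist_support <= 0.25:
        return "Q1"
    if centrist_support <= 0.50:
        return "Q2"
    if centrist_support <= 0.75:
        return "Q3"
    return "Q4"


# Mean margins per stratum, from the detailed statistics table.
MEAN_MARGIN = {"Q1": -0.263, "Q2": 0.087, "Q3": 0.212, "Q4": 0.483}

gap = MEAN_MARGIN["Q4"] - MEAN_MARGIN["Q1"]
print(support_bin(0.25), support_bin(0.26), round(gap, 3))
```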
 ---
 ## 4. Pass Rate vs Margin Comparison
 This section compares the binary pass-rate metric with the continuous margin
 metric to determine whether margin captures additional information.
 | Quartile | N | Pass Rate | Mean Margin |
 |----------|---|-----------|-------------|
 | Q1 | 1589 | 12.7% | -0.263 |
 | Q2 | 536 | 59.3% | +0.087 |
 | Q3 | 230 | 92.6% | +0.212 |
 | Q4 | 631 | 99.2% | +0.483 |
 **Pass rate gap (Q4 – Q1):** +86.5 percentage points
 **Margin gap (Q4 – Q1):** +0.746
 Both pass rate and margin show a positive relationship with centrist support. Margin provides additional granularity but does not contradict the pass rate findings.
 ---
 ## 5. Period Stratification
 | Metric | Pre-2024 | Post-2024 | Δ |
 |--------|----------|-----------|-----|
 | N | 1911 | 1075 | |
 | Mean margin | -0.081 | +0.128 | +0.209 |
 | Mann-Whitney U | | | U=702132, p=6.6e-47 |
 | Cohen's d | | | +0.582 |
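For orientation, a common definition of Cohen's d divides the difference in means by the pooled standard deviation. The sketch below assumes that definition and a post-minus-pre sign convention; the report does not state which variant the script uses, and the data here are invented toy values.

```python
import math
import statistics


def cohens_d(pre, post):
    """Cohen's d (post minus pre) with a pooled sample standard deviation."""
    n_pre, n_post = len(pre), len(post)
    pooled_var = (
        (n_pre - 1) * statistics.variance(pre)
        + (n_post - 1) * statistics.variance(post)
    ) / (n_pre + n_post - 2)
    return (statistics.mean(post) - statistics.mean(pre)) / math.sqrt(pooled_var)


# Invented toy margins, not values from the dataset.
toy_pre = [-0.2, -0.1, 0.0]
toy_post = [0.1, 0.2, 0.3]
print(round(cohens_d(toy_pre, toy_post), 3))
```

Under this convention a positive d, as in the table above, means margins are higher in the post-2024 period.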
 ---
 ## 6. Yearly Breakdown
 | Year | N | Mean Margin | Mean CS (strict) | % Passed |
 |------|---|-------------|-----------------|---------|
 | 2016 | 6 | +0.397 | 0.667 | 100.0% |
 | 2018 | 5 | +0.538 | 1.000 | 100.0% |
 | 2019 | 195 | -0.057 | 0.380 | 42.6% |
 | 2020 | 469 | -0.074 | 0.300 | 40.5% |
 | 2021 | 425 | -0.106 | 0.175 | 34.4% |
 | 2022 | 446 | -0.093 | 0.201 | 32.5% |
 | 2023 | 365 | -0.077 | 0.255 | 34.2% |
 | 2024 | 469 | +0.175 | 0.595 | 69.5% |
 | 2025 | 455 | +0.089 | 0.474 | 57.4% |
 | 2026 | 151 | +0.099 | 0.334 | 47.7% |
 ---
 ## 7. Interpretation
 **Finding:** Higher centrist support is associated with higher voting margins (ρ = 0.812; the p-value underflows to 0.0e+00). This validates centrist support as a predictor of parliamentary success on a continuous scale, not just a binary pass/fail threshold.
 **Margin vs pass rate:** The voting margin captures the *strength* of parliamentary consensus, which the binary pass rate cannot. The two are not interchangeable: the margin weights every party equally while passage depends on seat counts, so a positive margin does not guarantee that a motion passed. Overall, 45.5% of the right-wing motions in this sample passed, so pass rate still varies, but it saturates in the upper centrist-support strata (92.6% in Q3, 99.2% in Q4), where only the margin continues to separate motions.
 ---
 ## 8. Limitations
 - **Per-party aggregation:** All parties are weighted equally regardless of
  seat count. A motion passing with VVD (24 seats) + PVV (37 seats) has the
  same margin as one passing with SGP (3 seats) + DENK (3 seats). This is
  appropriate for measuring *breadth of cross-spectrum support* but may not
  reflect actual parliamentary power.
 - **Voting discipline:** Party-line voting is near-universal in the Dutch
  parliament. The per-party aggregation loses little information.
 - **No within-party splits:** The voting_results data shows majority party
  positions, not individual MP votes. Intra-party dissent is invisible.
 - **Missing data:** Motions without voting_results are excluded.
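The per-party aggregation limitation can be made concrete with the four seat counts quoted above. In the hypothetical four-party vote below (a toy chamber, not a real division), the equal-weight margin is negative while the seat-weighted margin is positive, so the sign of the per-party margin need not match the seat majority.

```python
# Seat counts as quoted in the limitations list.
SEATS = {"PVV": 37, "VVD": 24, "SGP": 3, "DENK": 3}

# Hypothetical vote, for illustration only.
toy_vote = {"PVV": "voor", "VVD": "tegen", "SGP": "tegen", "DENK": "tegen"}


def margin(vote, weights=None):
    """(voor - tegen) / total, with optional per-party weights (default 1 each)."""
    voor = tegen = total = 0
    for party, direction in vote.items():
        weight = 1 if weights is None else weights[party]
        total += weight
        if direction == "voor":
            voor += weight
        elif direction == "tegen":
            tegen += weight
    return (voor - tegen) / total if total else 0.0


party_margin = margin(toy_vote)
seat_margin = margin(toy_vote, SEATS)
print(party_margin, round(seat_margin, 3))
```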
 ---
 ![Figure: Voting margin analysis](voting_margin_figure.png)
 *Report generated by `analysis/right_wing/voting_margin.py`*
--- a/reports/overton_window/voting_margin_figure.png
+++ b/reports/overton_window/voting_margin_figure.png