You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Sven Geboers 2c60f41f29 cleanup: archive stale scripts and delete orphaned generate_extra_charts 4 weeks ago
.compound-engineering chore: add compound-engineering config example 4 weeks ago
.github/workflows cleanup: remove stale .mindmodel, old venvs, orphaned code, and transient artifacts 4 weeks ago
analysis cleanup: remove stale .mindmodel, old venvs, orphaned code, and transient artifacts 4 weeks ago
ansible fix: add health check wait to ansible deploy 2 months ago
data Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
docs cleanup: archive stale scripts and delete orphaned generate_extra_charts 4 weeks ago
health feat: add pipeline health checks module and CLI runner 4 weeks ago
migrations feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry 2 months ago
packages/@ansible/example feat(ansible-example): add @ansible/example package, tests, CI, publish & deploy workflows, docs and changelog 2 months ago
pages UI improvements + add axis orientation test 2 months ago
pipeline feat: persist and load explained variance for scree plots 4 weeks ago
reports/drift cleanup: merge session ledgers into docs/solutions and delete artifacts 4 weeks ago
scripts cleanup: archive stale scripts and delete orphaned generate_extra_charts 4 weeks ago
similarity Refactor tests: replace sys.modules hacks with real DI + in-memory DB 2 months ago
src/validators cleanup: remove stale .mindmodel, old venvs, orphaned code, and transient artifacts 4 weeks ago
tests cleanup: archive stale scripts and delete orphaned generate_extra_charts 4 weeks ago
thoughts cleanup: remove stale .mindmodel, old venvs, orphaned code, and transient artifacts 4 weeks ago
tools Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
.gitignore cleanup: remove stale .mindmodel, old venvs, orphaned code, and transient artifacts 4 weeks ago
.pre-commit-config.yaml infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
.python-version feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
AGENTS.md infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
ARCHITECTURE.md chore: commit remaining modified files from refactoring 2 months ago
CODE_STYLE.md feat(ansible-example): add @ansible/example package, tests, CI, publish & deploy workflows, docs and changelog 2 months ago
Dockerfile feat: add StemAtlas Streamlit app, explorer, Docker deployment, blog charts 2 months ago
Home.py UI improvements + add axis orientation test 2 months ago
README.md infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
ai_provider.py feat: complete parliamentary embedding pipeline with full historical coverage 2 months ago
api_client.py refactor: migrate api_client.py prints to structured logging 4 weeks ago
app.py feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
config.py infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
database.py refactor: tighten exception handling in database.py and add BLE lint rule 4 weeks ago
docker-compose.yml infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
explorer.py refactor: decompose explorer.py into analysis/tabs/ and add scheduler 4 weeks ago
explorer_helpers.py refactor(trajectory): fix code quality issues in centroid diagnostics 2 months ago
logging_config.py feat: add structured logging configuration module 4 weeks ago
pyproject.toml fix: remove duplicate import and add ruff to dev deps 4 weeks ago
requirements-dev.txt chore(deps): move pytest to dev-dependencies 2 months ago
scheduler.py fix: remove duplicate import and add ruff to dev deps 4 weeks ago
streamlit_index.html Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
summarizer.py feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
uv.lock fix: remove duplicate import and add ruff to dev deps 4 weeks ago

README.md

Stemwijzer

A Dutch parliamentary voting compass that lets you vote on real Tweede Kamer motions and see which parties match your positions.

Stemwijzer Explorer

What is Stemwijzer?

Stemwijzer ingests motions and voting records from the Dutch House of Representatives (Tweede Kamer), stores them in DuckDB, generates AI-powered explanations with an LLM, and presents a Streamlit UI where users can vote on real motions and explore party positions through SVD visualizations, trajectory analysis, and embedding-based similarity search.

Features

  • Voting Compass — Vote on real parliamentary motions and see which parties align with your choices
  • Explorer — Interactive SVD visualizations, party trajectories over time, motion browser, and semantic search
  • Analytics — SVD decomposition of voting patterns, UMAP projections, clustering, and drift analysis
  • LLM Enrichment — Automatic generation of layman-friendly motion explanations using QWEN via OpenRouter

Prerequisites

  • Python >= 3.13
  • uv for dependency management
  • (Optional) OPENROUTER_API_KEY for LLM enrichment

Quickstart

# Clone and enter the repository
git clone <your-gitea-url>/sgeboers/stemwijzer.git
cd stemwijzer

# Install dependencies
uv sync

# Run the Streamlit app
uv run streamlit run Home.py

# Run the data pipeline (fetch motions, compute embeddings, etc.)
uv run python pipeline/run_pipeline.py

# Run tests
uv run pytest tests/ -q

The app will be available at http://localhost:8501.

Project Structure

├── app.py              # Streamlit UI entrypoint
├── database.py         # DuckDB schema and queries
├── api_client.py       # Tweede Kamer OData API client
├── explorer.py         # Explorer page with SVD visualizations
├── pipeline/           # Data ingestion and analysis pipelines
├── analysis/           # SVD, clustering, trajectory modules
├── tests/              # pytest test suite
├── docs/               # Documentation, research, and plans
└── data/motions.db     # DuckDB database (~18 GB)

Documentation

  • ARCHITECTURE.md — Comprehensive architecture overview, tech stack, and contributor guidance
  • CODE_STYLE.md — Coding conventions, naming, typing, and testing standards
  • docs/solutions/ — Documented solutions to past bugs and best practices

Tech Stack

  • Language: Python 3.13+
  • Data: DuckDB via ibis-framework
  • UI: Streamlit + Plotly
  • ML/Analysis: scipy, scikit-learn, umap-learn
  • LLM: QWEN via OpenRouter (OpenAI-compatible)
  • Package Manager: uv

Deployment

See docs/deployment/ansible-package-deploy.md for server deployment instructions using the Ansible package.

License

[Your license here]