You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
Sven Geboers e352d7c7bc feat: add pipeline health checks module and CLI runner 4 weeks ago
.github/workflows infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
.mindmodel chore: convert mindmodel from YAML to markdown and clean up 2 months ago
analysis infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
ansible fix: add health check wait to ansible deploy 2 months ago
data Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
docs docs: add improvement roadmap, research notes, and solution docs 4 weeks ago
health feat: add pipeline health checks module and CLI runner 4 weeks ago
migrations feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry 2 months ago
packages/@ansible/example feat(ansible-example): add @ansible/example package, tests, CI, publish & deploy workflows, docs and changelog 2 months ago
pages UI improvements + add axis orientation test 2 months ago
pipeline chore: commit remaining modified files from refactoring 2 months ago
reports/drift cleanup: merge session ledgers into docs/solutions and delete artifacts 4 weeks ago
scripts feat: add pipeline health checks module and CLI runner 4 weeks ago
similarity Refactor tests: replace sys.modules hacks with real DI + in-memory DB 2 months ago
src Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
tests feat: add pipeline health checks module and CLI runner 4 weeks ago
thoughts cleanup: merge session ledgers into docs/solutions and delete artifacts 4 weeks ago
tools Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
.gitignore cleanup: merge session ledgers into docs/solutions and delete artifacts 4 weeks ago
.pre-commit-config.yaml infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
.python-version feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
AGENTS.md infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
ARCHITECTURE.md chore: commit remaining modified files from refactoring 2 months ago
CODE_STYLE.md feat(ansible-example): add @ansible/example package, tests, CI, publish & deploy workflows, docs and changelog 2 months ago
Dockerfile feat: add StemAtlas Streamlit app, explorer, Docker deployment, blog charts 2 months ago
Home.py UI improvements + add axis orientation test 2 months ago
README.md infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
ai_provider.py feat: complete parliamentary embedding pipeline with full historical coverage 2 months ago
api_client.py refactor: migrate api_client.py prints to structured logging 4 weeks ago
app.py feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
config.py infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
database.py refactor: tighten exception handling in database.py and add BLE lint rule 4 weeks ago
docker-compose.yml infra: fix CI, config, docker-compose, README, and pre-commit 4 weeks ago
explorer.py sync to server 1 month ago
explorer_helpers.py refactor(trajectory): fix code quality issues in centroid diagnostics 2 months ago
logging_config.py feat: add structured logging configuration module 4 weeks ago
pyproject.toml refactor: tighten exception handling in database.py and add BLE lint rule 4 weeks ago
requirements-dev.txt chore(deps): move pytest to dev-dependencies 2 months ago
streamlit_index.html Add debug st.info before st.plotly_chart to diagnose invisible chart 2 months ago
summarizer.py feat(pipeline): implement parliamentary embedding pipeline MVP 2 months ago
uv.lock cleanup: merge session ledgers into docs/solutions and delete artifacts 4 weeks ago

README.md

Stemwijzer

A Dutch parliamentary voting compass that lets you vote on real Tweede Kamer motions and see which parties match your positions.

Stemwijzer Explorer

What is Stemwijzer?

Stemwijzer ingests motions and voting records from the Dutch House of Representatives (Tweede Kamer), stores them in DuckDB, generates AI-powered explanations with an LLM, and presents a Streamlit UI where users can vote on real motions and explore party positions through SVD visualizations, trajectory analysis, and embedding-based similarity search.

Features

  • Voting Compass — Vote on real parliamentary motions and see which parties align with your choices
  • Explorer — Interactive SVD visualizations, party trajectories over time, motion browser, and semantic search
  • Analytics — SVD decomposition of voting patterns, UMAP projections, clustering, and drift analysis
  • LLM Enrichment — Automatic generation of layman-friendly motion explanations using QWEN via OpenRouter

Prerequisites

  • Python >= 3.13
  • uv for dependency management
  • (Optional) OPENROUTER_API_KEY for LLM enrichment

Quickstart

# Clone and enter the repository
git clone <your-gitea-url>/sgeboers/stemwijzer.git
cd stemwijzer

# Install dependencies
uv sync

# Run the Streamlit app
uv run streamlit run Home.py

# Run the data pipeline (fetch motions, compute embeddings, etc.)
uv run python pipeline/run_pipeline.py

# Run tests
uv run pytest tests/ -q

The app will be available at http://localhost:8501.

Project Structure

├── app.py              # Streamlit UI entrypoint
├── database.py         # DuckDB schema and queries
├── api_client.py       # Tweede Kamer OData API client
├── explorer.py         # Explorer page with SVD visualizations
├── pipeline/           # Data ingestion and analysis pipelines
├── analysis/           # SVD, clustering, trajectory modules
├── tests/              # pytest test suite
├── docs/               # Documentation, research, and plans
└── data/motions.db     # DuckDB database (~18 GB)

Documentation

  • ARCHITECTURE.md — Comprehensive architecture overview, tech stack, and contributor guidance
  • CODE_STYLE.md — Coding conventions, naming, typing, and testing standards
  • docs/solutions/ — Documented solutions to past bugs and best practices

Tech Stack

  • Language: Python 3.13+
  • Data: DuckDB via ibis-framework
  • UI: Streamlit + Plotly
  • ML/Analysis: scipy, scikit-learn, umap-learn
  • LLM: QWEN via OpenRouter (OpenAI-compatible)
  • Package Manager: uv

Deployment

See docs/deployment/ansible-package-deploy.md for server deployment instructions using the Ansible package.

License

[Your license here]