You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Sven Geboers
daa22c5e2b
feat: complete parliamentary embedding pipeline with full historical coverage
- Add fused (SVD + text) embedding pipeline for annual windows 2016-2026
- Fix store_fused_embedding duplicate bug: DELETE before INSERT (idempotent)
- Add --text-batch-size CLI flag to run_pipeline.py (default 200)
- Add explicit --start-date/--end-date to download_past_year.py
- Backfill mp_votes for all motions (party-level votes, 111k new rows)
- Add similarity cache recompute: 212k rows across 9 annual windows
- Improve ai_provider retry logic, text_pipeline batching
- Improve analysis/political_axis PCA handling and visualizations
- Add diagnostic/utility scripts: compare_svd, generate_compass, inspect_axis, etc.
- Untrack data/motions.db (3.6GB binary), add to .gitignore with outputs/
- Update continuity ledger with full session state
|
1 month ago |
| .. |
|
fixtures
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
integration
|
feat: complete parliamentary embedding pipeline with full historical coverage
|
1 month ago |
|
migrations
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
types
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
utils
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
__init__.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
conftest.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_ai_provider.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_ai_provider_retry.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_analysis.py
|
feat(pipeline): add orchestrator CLI, analysis modules, and ActorFractie ingestion
|
1 month ago |
|
test_extract_mp_votes.py
|
fix(tests): update test_extract_mp_votes for party-level actor inclusion
|
1 month ago |
|
test_fetch_mp_metadata.py
|
feat(analysis): fetch real MP metadata, fix anchor axis for party-level actors
|
1 month ago |
|
test_fusion.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_migration_embeddings.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_migration_pipeline_tables.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_political_compass.py
|
feat(analysis): add 2D political compass and 2D trajectories
|
1 month ago |
|
test_pyproject_deps.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_run_pipeline.py
|
feat(pipeline): add orchestrator CLI, analysis modules, and ActorFractie ingestion
|
1 month ago |
|
test_similarity_compute.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_similarity_db_helpers.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_svd_pipeline.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_text_pipeline.py
|
feat: complete parliamentary embedding pipeline with full historical coverage
|
1 month ago |