You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Sven Geboers
a78bee9b0a
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
- Add similarity/ package (compute.py, lookup.py) with numpy-based
pairwise cosine similarity and cached lookup
- database.py: create embeddings + similarity_cache tables in _init_database(),
add store_similarity_batch/get_cached_similarities/clear_similarity_cache helpers
- pipeline/fusion.py: replace N+1 per-motion embedding SELECT with single
bulk JOIN using DuckDB QUALIFY window function
- ai_provider.py: retry HTTP 429 with Retry-After header support
- migrations/2026-03-22-add-similarity-cache.sql: make executable
- Add tests for similarity compute, db helpers, and 429 retry (34 pass, 2 skip)
|
1 month ago |
| .. |
|
fixtures
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
integration
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
migrations
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
types
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
utils
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
__init__.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
conftest.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_ai_provider.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_ai_provider_retry.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_analysis.py
|
feat(pipeline): add orchestrator CLI, analysis modules, and ActorFractie ingestion
|
1 month ago |
|
test_extract_mp_votes.py
|
fix(tests): update test_extract_mp_votes for party-level actor inclusion
|
1 month ago |
|
test_fetch_mp_metadata.py
|
feat(analysis): fetch real MP metadata, fix anchor axis for party-level actors
|
1 month ago |
|
test_fusion.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_migration_embeddings.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_migration_pipeline_tables.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_political_compass.py
|
feat(analysis): add 2D political compass and 2D trajectories
|
1 month ago |
|
test_pyproject_deps.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_run_pipeline.py
|
feat(pipeline): add orchestrator CLI, analysis modules, and ActorFractie ingestion
|
1 month ago |
|
test_similarity_compute.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_similarity_db_helpers.py
|
feat(similarity): add precomputed similarity cache, fix fusion N+1, add 429 retry
|
1 month ago |
|
test_svd_pipeline.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |
|
test_text_pipeline.py
|
feat(pipeline): implement parliamentary embedding pipeline MVP
|
1 month ago |