infoxtractor

Author	SHA1	Message	Date
goldstein	9c73895318	fix(genai): ollama loose JSON (#39 ) Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 11:59:18 +00:00
Dirk Riemann	2efc4d1088	fix(genai): send format="json" (loose mode) to Ollama All checks were successful tests / test (push) Successful in 1m13s Details tests / test (pull_request) Successful in 1m23s Details Ollama 0.11.8 segfaults on any Pydantic-shaped structured-output schema with $ref, anyOf, or pattern — confirmed on the deploy host with the simplest MVP case (BankStatementHeader alone). The earlier null-stripping sanitiser wasn't enough. Switch to format="json", which is "emit valid JSON" mode. We're already describing the exact JSON shape in the system prompt (via GenAIStep + the use case's citation instruction appendix) and validating the response body through Pydantic on parse — which raises IX_002_001 on schema mismatch, exactly as before. Stronger guarantees can come back later via a newer Ollama, an API fix, or a different GenAIClient impl. None of that is needed for the MVP to work end to end. Unit tests: the sanitiser left in place (harmless, still tested). The "happy path" test now asserts format == "json". Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:59:04 +02:00
goldstein	f6ce97d7fd	fix(compose): persist model caches (#38 ) All checks were successful tests / test (push) Successful in 1m8s Details	2026-04-18 11:49:24 +00:00
Dirk Riemann	9e33923f71	fix(compose): persist Surya + HF caches so rebuilds don't redownload models All checks were successful tests / test (push) Successful in 2m1s Details tests / test (pull_request) Successful in 1m18s Details First /healthz call on a fresh container triggers Surya to fetch the text-recognition (1.34 GB) and detection (73 MB) models from HuggingFace. Without a volume they land in the container fs and vanish on every rebuild, which is every deploy. Mount named volumes for /root/.cache/datalab (Surya) and /root/.cache/huggingface. Rebuild now keeps /healthz warm. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:49:09 +02:00
goldstein	65670af78f	fix(genai): sanitise Optional for Ollama (#37 ) Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 11:48:43 +00:00
Dirk Riemann	9cb62d69af	fix(genai): strip null branches from anyOf before sending to Ollama All checks were successful tests / test (push) Successful in 1m33s Details tests / test (pull_request) Successful in 4m29s Details Ollama 0.11.8's llama.cpp structured-output implementation segfaults on Pydantic v2's standard Optional pattern: {"anyOf": [{"type": "string"}, {"type": "null"}]} Confirmed on the deploy host: /api/chat request with the MVP's ProvenanceWrappedResponse schema crashed Ollama with SIGSEGV; the client saw httpx RemoteProtocolError → IX_002_000. New _sanitise_schema_for_ollama walks the schema recursively and drops "type: null" branches from every anyOf. Single-branch unions are inlined so sibling keys (default, title) survive. This only narrows what the LLM is told it may emit; Pydantic still validates the real response body against the original schema and accepts None for Optional fields if they were absent or explicitly null. Existing unit tests updated: the "happy path" test no longer pins the format to `_Schema.model_json_schema()` verbatim — instead it asserts the sanitisation effect on a known-Optional field. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:48:26 +02:00
goldstein	4c0746950e	fix(deps): surya ^0.17 (#36 ) All checks were successful tests / test (push) Successful in 2m58s Details	2026-04-18 11:21:54 +00:00
Dirk Riemann	a418969251	fix(deps): pin surya-ocr ^0.17 and drop cu124 index All checks were successful tests / test (push) Successful in 1m23s Details tests / test (pull_request) Successful in 2m23s Details Our client code imports surya.foundation (added in 0.17). The earlier cu124 torch pin forced uv to downgrade surya to 0.14.1, which doesn't have that module and depends on a transformers version that lacks QuantizedCacheConfig. Net: ocr: fail at /healthz. Drop the cu124 index pin. surya 0.17.1 needs torch >= 2.7, which the default pypi torch (2.11) satisfies. The deploy host's CUDA 12.4 driver doesn't match torch 2.11's cu13 wheels, so CUDA init warns and the GPU isn't available — torch + Surya transparently fall back to CPU. Slower than GPU but correct for MVP. A host driver upgrade later will unlock GPU with no code changes. Unit suite stays green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:21:40 +02:00
goldstein	fae8c3267f	fix(deps): torch cu124 (#35 ) All checks were successful tests / test (push) Successful in 3m48s Details	2026-04-18 11:02:38 +00:00
Dirk Riemann	d90117807b	fix(deps): pin torch to the CUDA 12.4 wheel channel All checks were successful tests / test (push) Successful in 3m21s Details tests / test (pull_request) Successful in 3m40s Details The default pypi torch (2.11 as of lockfile) ships cu13 wheels, which refuse to initialise against the deploy host's NVIDIA 12.4 driver (UserWarning: "driver on your system is too old (found version 12040)"). /healthz reported ocr: fail because Surya couldn't pick up the GPU. Use `tool.uv.sources` to route torch through PyTorch's cu124 index. That pulls torch 2.6.0+cu124 (still satisfies surya-ocr >= 0.9). Lock updated. transformers downgraded to 4.57.6, triton to 3.2.0 — all compatible with surya and each other. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:02:26 +02:00
goldstein	44c3428993	fix(deploy): network_mode: host (#34 ) Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 11:00:14 +00:00
Dirk Riemann	c7dc40c51e	fix(deploy): switch to network_mode: host — reach postgis + ollama on loopback All checks were successful tests / test (push) Successful in 1m12s Details tests / test (pull_request) Successful in 1m10s Details The shared postgis container is bound to 127.0.0.1 on the host (security hardening, infrastructure §T12). Ollama is similarly LAN-hardened. The previous `host.docker.internal + extra_hosts: host-gateway` approach points at the bridge gateway IP, not loopback, so the container couldn't reach either service. Switch to `network_mode: host` (same pattern goldstein uses) and update the default IX_POSTGRES_URL / IX_OLLAMA_URL to 127.0.0.1. Keep the GPU reservation block; drop the now-meaningless ports: declaration (host mode publishes directly). AppConfig defaults + .env.example + test_config assertions + inline docstring examples all follow. Caught on fourth deploy attempt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:00:02 +02:00
goldstein	39a6c10634	fix(compose): drop runtime: nvidia (#33 ) All checks were successful tests / test (push) Successful in 1m19s Details	2026-04-18 10:56:15 +00:00
Dirk Riemann	9f793da778	fix(compose): drop runtime: nvidia — use deploy.resources.devices only All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m10s Details Docker on the deploy host doesn't register 'nvidia' as a named runtime (modern nvidia-container-toolkit hooks via --gpus all / resources.devices instead). Immich-ml on the same host uses only deploy.resources.devices with driver: nvidia, which is enough. Drop the legacy runtime line. Caught on third deploy attempt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:56:03 +02:00
goldstein	4802e086a0	fix(docker): README.md for hatchling (#32 ) All checks were successful tests / test (push) Successful in 2m27s Details	2026-04-18 10:42:41 +00:00
Dirk Riemann	f54f0d317d	fix(docker): include README.md in the uv sync COPY so hatchling finds it All checks were successful tests / test (push) Successful in 1m22s Details tests / test (pull_request) Successful in 1m36s Details pyproject.toml names README.md as the readme; hatchling validates that file exists when uv sync resolves the editable install of the project itself. Caught on second deploy build. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:42:29 +02:00
goldstein	e6fcd5fc54	fix(docker): uv via standalone installer (#31 ) All checks were successful tests / test (push) Successful in 1m14s Details	2026-04-18 10:33:11 +00:00
Dirk Riemann	1c31444611	fix(docker): install uv via standalone installer (no system pip) All checks were successful tests / test (pull_request) Successful in 1m28s Details tests / test (push) Successful in 1m19s Details Python 3.12 from deadsnakes on Ubuntu 22.04 drops `distutils` from the stdlib, and Ubuntu's system pip still imports from it — so `pip install` fails immediately with ModuleNotFoundError: distutils. Switch to the uv standalone installer, which doesn't need pip at all. Caught during the first deploy build. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:32:55 +02:00
goldstein	a9e510362d	chore(model): qwen3:14b default (#30 ) All checks were successful tests / test (push) Successful in 2m55s Details unblock first deploy	2026-04-18 10:20:38 +00:00
Dirk Riemann	5ee74f367c	chore(model): switch default IX_DEFAULT_MODEL to qwen3:14b (already on host) All checks were successful tests / test (push) Successful in 1m52s Details tests / test (pull_request) Successful in 1m45s Details The home server's Ollama doesn't have gpt-oss:20b pulled; qwen3:14b is already there and is what mammon's chat agent uses. Switching the default now so the first deploy passes the /healthz ollama probe without an extra `ollama pull` step. The spec lists gpt-oss:20b as a concrete example; qwen3:14b is equally on-prem and Ollama-structured-output-compatible. Touched: AppConfig default, BankStatementHeader Request.default_model, .env.example, setup_server.sh ollama-list check, AGENTS.md, deployment.md, live tests. Unit tests that hard-coded the old model string but don't assert the default were left alone. Also: ASCII en-dash in e2e_smoke.py Paperless-style text (ruff RUF001). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:20:23 +02:00
goldstein	f6cc99f062	feat(e2e): e2e_smoke.py deploy gate (#29 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.4.	2026-04-18 10:18:23 +00:00
Dirk Riemann	d0648fe01d	feat(e2e): scripts/e2e_smoke.py — live deploy gate All checks were successful tests / test (push) Successful in 1m11s Details tests / test (pull_request) Successful in 2m14s Details Runs from the Mac after every `git push server main`. Flow: starts a tiny HTTP server on the Mac's LAN IP serving tests/fixtures/synthetic_giro.pdf → POST /jobs with bank_statement_header + Paperless-style texts so text_agreement has something to check against → poll GET /jobs/{id} until terminal → assert status=done, bank_name non-empty, closing_balance.provenance_verified=True, text_agreement=True, elapsed < 60 s. Non-zero exit blocks the deploy. Uses only stdlib (http.server, urllib) — no extra deps on the Mac-side, no test framework overhead. Task 5.4 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:18:07 +02:00
goldstein	5841bc09c0	feat(deploy): setup_server.sh + deployment runbook (#28 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.2.	2026-04-18 10:17:14 +00:00
Dirk Riemann	6d1bc720b4	feat(deploy): setup_server.sh + deployment runbook All checks were successful tests / test (push) Successful in 1m9s Details tests / test (pull_request) Successful in 1m10s Details - scripts/setup_server.sh: idempotent one-shot. Creates bare repo, post-receive hook (which rebuilds docker compose + gates on /healthz), infoxtractor Postgres role + DB on the shared postgis container, .env (0600) from .env.example with the password substituted in, verifies gpt-oss:20b is pulled. - docs/deployment.md: topology, one-time setup command, normal deploy workflow, rollback-via-revert pattern (never force-push main), operational checklists for the common /healthz degraded states. - First deploy section reserved; filled in after Task 5.3 runs. Task 5.2 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:16:58 +02:00
goldstein	3c7d607776	feat(docker): Dockerfile + compose (#27 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.1.	2026-04-18 10:15:45 +00:00
Dirk Riemann	4646180942	feat(docker): Dockerfile (CUDA+python3.12) + compose with GPU reservation All checks were successful tests / test (push) Successful in 1m13s Details tests / test (pull_request) Successful in 1m10s Details - nvidia/cuda:12.4 runtime base matches the deploy host's driver stack (immich-ml / monitoring use the same pattern). - python3.12 via deadsnakes (Ubuntu 22.04 ships 3.10 only). - System deps: libmagic1 (python-magic), libgl1/libglib2 (PIL + PyMuPDF headless), curl (post-receive /healthz probe), ca-certs (httpx TLS). - uv sync --frozen --no-dev --extra ocr installs prod + Surya/torch; dev tooling stays out of the image. - CMD runs `alembic upgrade head && uvicorn ix.app:create_app` — idempotent. - Compose: single service, port 8994, GPU reservation mirroring immich-ml, labels for monitoring dashboard auto-discovery + backup opt-in. - host.docker.internal:host-gateway lets ix reach the host's Ollama and postgis containers (same pattern mammon uses). Task 5.1 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:15:26 +02:00
goldstein	c234b67bbf	Merge pull request 'feat(app): production wiring factories + /healthz real probes (Task 4.3)' (#26 ) from feat/production-wiring into main All checks were successful tests / test (push) Successful in 1m10s Details	2026-04-18 10:09:34 +00:00
Dirk Riemann	ebefee4184	feat(app): production wiring — factories, pipeline, /healthz real probes All checks were successful tests / test (push) Successful in 1m9s Details tests / test (pull_request) Successful in 1m13s Details Task 4.3 closes the loop on Chunk 4: the FastAPI lifespan now selects fake vs real clients via IX_TEST_MODE (new AppConfig field), wires /healthz probes to the live selfcheck() on OllamaClient / SuryaOCRClient, and spawns the worker with a production Pipeline factory that builds SetupStep -> OCRStep -> GenAIStep -> ReliabilityStep -> ResponseHandler over the injected clients. Factories: - make_genai_client(cfg) -> FakeGenAIClient \| OllamaClient - make_ocr_client(cfg) -> FakeOCRClient \| SuryaOCRClient (spec §6.2) Probes run the async selfcheck on a fresh event loop in a short-lived thread so they're safe to call from either sync callers or a live FastAPI handler without stalling the request loop. Drops the worker-loop spawn_worker_task stub — the app module owns the production spawn directly. Tests: +11 unit tests (5 factories + 6 app-wiring / probe adapter / pipeline build). Full suite: 236 passed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:09:11 +02:00
goldstein	b737ed7b21	Merge pull request 'feat(ocr): SuryaOCRClient real OCR backend (spec 6.2)' (#25 ) from feat/surya-client into main All checks were successful tests / test (push) Successful in 1m10s Details	2026-04-18 10:04:41 +00:00
Dirk Riemann	322f6b2b1b	feat(ocr): SuryaOCRClient — real OCR backend (spec §6.2) All checks were successful tests / test (push) Successful in 1m14s Details tests / test (pull_request) Successful in 1m14s Details Runs Surya's detection + recognition over PIL images rendered from each Page's source file (PDFs via PyMuPDF, images via Pillow). Lazy warm_up so FastAPI lifespan start stays predictable. Deferred Surya/torch imports keep the base install slim — the heavy deps stay under [ocr]. Extends OCRClient Protocol with optional files + page_metadata kwargs so the engine can resolve each page back to its on-disk source; Fake accepts-and-ignores to keep hermetic tests unchanged. selfcheck() runs the predictors on a 1x1 PIL image — wired into /healthz by Task 4.3. Tests: 6 hermetic unit tests (Surya predictors mocked, no model download); 2 live tests gated on IX_TEST_OLLAMA=1 (never run in CI). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:04:19 +02:00
goldstein	0f045f814a	Merge pull request 'feat(genai): OllamaClient structured-output /api/chat backend (spec 6)' (#24 ) from feat/ollama-client into main All checks were successful tests / test (push) Successful in 1m15s Details	2026-04-18 09:58:38 +00:00
Dirk Riemann	90e46b707d	feat(genai): OllamaClient — structured-output /api/chat backend (spec §6) All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m5s Details Real GenAIClient for the production pipeline. Sends `format=<pydantic JSON schema>`, `stream=false`, and mapped options (`temperature`; drops `reasoning_effort`). Content-parts lists joined to a single string since MVP models don't speak native content-parts. Error mapping per spec: connection/timeout/5xx → IX_002_000, schema violations → IX_002_001. `selfcheck()` probes /api/tags with a fixed 5 s timeout for /healthz. Tests: 10 hermetic pytest-httpx unit tests; 2 live tests gated on IX_TEST_OLLAMA=1 (never run in CI). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:58:15 +02:00
goldstein	6183b9c886	Merge pull request 'feat(pg-queue): LISTEN ix_jobs_new + 10s fallback poll' (#23 ) from feat/pg-queue-adapter into main All checks were successful tests / test (push) Successful in 1m11s Details	2026-04-18 09:52:45 +00:00
Dirk Riemann	050f80dcd7	feat(pg-queue): LISTEN ix_jobs_new + 10s fallback poll (spec §4) All checks were successful tests / test (push) Successful in 1m8s Details tests / test (pull_request) Successful in 1m9s Details PgQueueListener: - Dedicated asyncpg connection outside the SQLAlchemy pool (LISTEN needs a persistent connection; pooled connections check in/out). - Exposes wait_for_work(timeout) — resolves on NOTIFY or timeout, whichever fires first. The worker treats both wakes identically. - asyncpg_dsn_from_sqlalchemy_url strips the +asyncpg driver segment and percent-decodes the password so the same URL in IX_POSTGRES_URL works for both SQLAlchemy and raw asyncpg. app.py lifespan now also spawns the listener alongside the worker; both are gated on spawn_worker=True so REST-only tests stay fast. 2 new integration tests: NOTIFY path (wake within 2 s despite 60 s poll) + missed-NOTIFY path (fallback poll recovers within 5 s). 33 integration tests total, 209 unit. Forgejo Actions trigger is flaky; local verification is the gate. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:52:26 +02:00
goldstein	415e03fba1	Merge pull request 'feat(worker): async worker loop + one-shot callback delivery' (#22 ) from feat/worker-loop into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:50:11 +00:00
Dirk Riemann	406a7ea2fd	feat(worker): async worker loop + one-shot callback delivery (spec §5) All checks were successful tests / test (push) Successful in 1m15s Details tests / test (pull_request) Successful in 1m8s Details Worker: - Startup: sweep_orphans(now, max_running_seconds) rescues rows stuck in 'running' from a crashed prior process. - Loop: claim_next_pending → build pipeline via injected factory → run → mark_done/mark_error → deliver callback if set → record outcome. - Non-IX exceptions from the pipeline collapse to IX_002_000 so callers see a stable error code. - Sleep loop uses a cancellable wait so the stop event reacts immediately; the wait_for_work hook is ready for Task 3.6 to plug in the LISTEN-driven event without the worker knowing about NOTIFY. Callback: - One-shot POST, 2xx → delivered, anything else (incl. connect/timeout exceptions) → failed. No retries. - Callback record never reverts the job's terminal state — GET /jobs/{id} stays the authoritative fallback. 7 integration tests: happy path, pipeline-raise → error, callback 2xx, callback 5xx, orphan sweep on startup, no-callback rows stay callback_status=None (x2 via parametrize). Unit suite still 209. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:49:54 +02:00
goldstein	ee023d6e34	Merge pull request 'feat(rest): FastAPI adapter + /jobs /healthz /metrics (spec 5)' (#21 ) from feat/rest-adapter into main Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 09:47:35 +00:00
Dirk Riemann	e46c44f1e0	feat(rest): FastAPI adapter + /jobs, /healthz, /metrics routes (spec §5) All checks were successful tests / test (push) Successful in 1m7s Details tests / test (pull_request) Successful in 1m5s Details Routes: - POST /jobs: 201 on first insert, 200 on idempotent re-submit. - GET /jobs/{id}: full Job envelope or 404. - GET /jobs?client_id=&request_id=: correlation lookup or 404. - GET /healthz: {postgres, ollama, ocr}; 200 iff all ok (degraded counts as non-200 per spec). Postgres probe guarded by a 2 s wait_for. - GET /metrics: pending/running counts + 24h done/error counters + per-use-case avg seconds. Plain JSON, no Prometheus. create_app(spawn_worker=bool) parameterises worker spawning so tests that only need REST pass False. Worker spawn is tolerant of the loop module not being importable yet (Task 3.5 fills it in). Probes are a DI bundle — production wiring swaps them in at startup (Chunk 4); tests inject canned ok/fail callables. Session factory is also DI'd so tests can point at a per-loop engine and sidestep the async-pg cross-loop future issue that bit the jobs_repo fixture. 9 new integration tests; unit suite unchanged. Forgejo Actions trigger is flaky; local verification is the gate (unit + integration green locally). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:47:04 +02:00
goldstein	04a415a191	Merge pull request 'feat(store): JobsRepo CRUD over ix_jobs + integration fixtures' (#20 ) from feat/jobs-repo into main All checks were successful tests / test (push) Successful in 1m14s Details	2026-04-18 09:43:28 +00:00
Dirk Riemann	141153ffa7	feat(store): JobsRepo CRUD over ix_jobs + integration fixtures (spec §4) All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m10s Details JobsRepo covers the full job-lifecycle surface: - insert_pending: idempotent on (client_id, request_id) via ON CONFLICT DO NOTHING + re-select; assigns a 16-hex ix_id. - claim_next_pending: FOR UPDATE SKIP LOCKED so concurrent workers never double-dispatch a row. - get / get_by_correlation: hydrates JSONB back through Pydantic. - mark_done: done iff response.error is None, else error. - mark_error: explicit convenience wrapper. - update_callback_status: delivered \| failed (no status transition). - sweep_orphans: time-based rescue of stuck running rows; attempts++. Integration fixtures (tests/integration/conftest.py): - Skip cleanly when neither IX_TEST_DATABASE_URL nor IX_POSTGRES_URL is set (unit suite stays runnable on a bare laptop). - Alembic upgrade/downgrade runs in a subprocess so its internal asyncio.run() doesn't collide with pytest-asyncio's loop. - Per-test engine + truncate so loops never cross and tests start clean. 15 integration tests against a live postgres:16, including SKIP LOCKED concurrency + orphan sweep. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:43:11 +02:00
goldstein	8bb220ae43	Merge pull request 'feat(config): AppConfig + cached get_config()' (#19 ) from feat/config into main All checks were successful tests / test (push) Successful in 59s Details	2026-04-18 09:39:00 +00:00
Dirk Riemann	95728accbf	feat(config): AppConfig + cached get_config() (spec §9) All checks were successful tests / test (push) Successful in 1m1s Details tests / test (pull_request) Successful in 58s Details Typed pydantic-settings view over every IX_* env var, defaults matching spec §9 exactly. @lru_cache-wrapped accessor so parsing/validation happens once per process; tests clear the cache via get_config.cache_clear(). extra="ignore" keeps the container robust against typo'd env vars in production .env files. engine.py's URL resolver now goes through get_config() when ix.config is importable (bootstrap fallback remains so hypothetical early-import callers don't crash). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:38:44 +02:00
goldstein	dc6d28bda1	Merge pull request 'feat(store): Alembic scaffolding + initial ix_jobs migration' (#18 ) from feat/alembic-init into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:37:37 +00:00
Dirk Riemann	1c60c30084	feat(store): Alembic scaffolding + initial ix_jobs migration (spec §4) All checks were successful tests / test (push) Successful in 1m15s Details tests / test (pull_request) Successful in 1m2s Details Lands the async-friendly Alembic env (NullPool, reads IX_POSTGRES_URL), the hand-written 001 migration matching the spec's table layout exactly (CHECK on status, partial index on pending rows, UNIQUE on (client_id, request_id)), the SQLAlchemy 2.0 ORM mapping, and a lazy engine/session factory. The factory reads the URL through ix.config when available; Task 3.2 makes that the only path. Smoke-tested: alembic upgrade head + downgrade base against a live postgres:16 produce the expected table shape and tear down cleanly. Unit tests assert the migration source contains every required column/index so the migration can't drift from spec at import time. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:37:21 +02:00
goldstein	a54a968313	Merge pull request 'test(pipeline): end-to-end hermetic test with fakes + synthetic fixture' (#17 ) from feat/pipeline-e2e-fakes into main Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 09:24:51 +00:00
Dirk Riemann	b109bba873	test(pipeline): end-to-end hermetic test with fakes + synthetic fixture All checks were successful tests / test (push) Successful in 59s Details tests / test (pull_request) Successful in 57s Details Wires the five pipeline steps together with FakeOCRClient + FakeGenAIClient, feeds the committed synthetic_giro.pdf fixture via file:// URL, and asserts the full response shape. - scripts/create_fixture_pdf.py: PyMuPDF-based builder. One-page A4 PDF with six known header strings (bank name, IBAN, period, balances, statement date). Re-runnable on demand; the committed PDF is what CI consumes. - tests/fixtures/synthetic_giro.pdf: committed output. - tests/unit/test_pipeline_end_to_end.py: 5 tests covering * ix_result.result fields populated from the fake LLM * provenance.fields["result.closing_balance"].provenance_verified True * text_agreement True when Paperless-style texts match the value * metadata.timings has one entry per step in the right order * response.error is None and context is not serialised 197 tests total; ruff clean. No integration tests, no real clients, no network. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:24:29 +02:00
goldstein	118d77c428	Merge pull request 'feat(pipeline): ResponseHandlerStep (spec §8)' (#16 ) from feat/step-response-handler into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:21:50 +00:00
Dirk Riemann	565d8d0676	feat(pipeline): ResponseHandlerStep — shape-up final payload (spec §8) All checks were successful tests / test (push) Successful in 1m0s Details tests / test (pull_request) Successful in 1m2s Details Final pipeline step. Three mechanical transforms: 1. include_ocr_text -> concatenate non-tag line texts, pages joined with \n\n, write to ocr_result.result.text. 2. include_geometries=False (default) -> strip ocr_result.result.pages + ocr_result.meta_data. Geometries are heavy; callers opt in. 3. Delete response.context so the internal accumulator never leaks to the caller (belt-and-braces; Field(exclude=True) already does this). validate() always returns True per spec. 7 unit tests in tests/unit/test_response_handler_step.py cover all three branches + context-not-in-model_dump check. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:21:36 +02:00
goldstein	83c1996702	Merge pull request 'feat(pipeline): ReliabilityStep (spec §6)' (#15 ) from feat/step-reliability into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:20:38 +00:00
Dirk Riemann	132f110463	feat(pipeline): ReliabilityStep — writes reliability flags (spec §6) All checks were successful tests / test (push) Successful in 1m3s Details tests / test (pull_request) Successful in 1m1s Details Thin wrapper around ix.provenance.apply_reliability_flags. Validate skips entirely when include_provenance is off OR when no provenance data was built (text-only request, etc.). Process reads context.texts + context.use_case_response and lets the verifier mutate the FieldProvenance entries + fill quality_metrics counters in place. 11 unit tests in tests/unit/test_reliability_step.py cover: validate skips on flag off / missing provenance, runs otherwise; per-type flag behaviour (string verified + text_agreement, Literal -> None, None value -> None, short numeric -> text_agreement None, date with both sides parsed, IBAN whitespace-insensitive, disagreement -> False); quality_metrics verified_fields / text_agreement_fields counters. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:20:18 +02:00

1 2

85 commits