infoxtractor

Author	SHA1	Message	Date
Dirk Riemann	c7dc40c51e	fix(deploy): switch to network_mode: host — reach postgis + ollama on loopback All checks were successful tests / test (push) Successful in 1m12s Details tests / test (pull_request) Successful in 1m10s Details The shared postgis container is bound to 127.0.0.1 on the host (security hardening, infrastructure §T12). Ollama is similarly LAN-hardened. The previous `host.docker.internal + extra_hosts: host-gateway` approach points at the bridge gateway IP, not loopback, so the container couldn't reach either service. Switch to `network_mode: host` (same pattern goldstein uses) and update the default IX_POSTGRES_URL / IX_OLLAMA_URL to 127.0.0.1. Keep the GPU reservation block; drop the now-meaningless ports: declaration (host mode publishes directly). AppConfig defaults + .env.example + test_config assertions + inline docstring examples all follow. Caught on fourth deploy attempt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 13:00:02 +02:00
goldstein	39a6c10634	fix(compose): drop runtime: nvidia (#33 ) All checks were successful tests / test (push) Successful in 1m19s Details	2026-04-18 10:56:15 +00:00
Dirk Riemann	9f793da778	fix(compose): drop runtime: nvidia — use deploy.resources.devices only All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m10s Details Docker on the deploy host doesn't register 'nvidia' as a named runtime (modern nvidia-container-toolkit hooks via --gpus all / resources.devices instead). Immich-ml on the same host uses only deploy.resources.devices with driver: nvidia, which is enough. Drop the legacy runtime line. Caught on third deploy attempt. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:56:03 +02:00
goldstein	4802e086a0	fix(docker): README.md for hatchling (#32 ) All checks were successful tests / test (push) Successful in 2m27s Details	2026-04-18 10:42:41 +00:00
Dirk Riemann	f54f0d317d	fix(docker): include README.md in the uv sync COPY so hatchling finds it All checks were successful tests / test (push) Successful in 1m22s Details tests / test (pull_request) Successful in 1m36s Details pyproject.toml names README.md as the readme; hatchling validates that file exists when uv sync resolves the editable install of the project itself. Caught on second deploy build. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:42:29 +02:00
goldstein	e6fcd5fc54	fix(docker): uv via standalone installer (#31 ) All checks were successful tests / test (push) Successful in 1m14s Details	2026-04-18 10:33:11 +00:00
Dirk Riemann	1c31444611	fix(docker): install uv via standalone installer (no system pip) All checks were successful tests / test (pull_request) Successful in 1m28s Details tests / test (push) Successful in 1m19s Details Python 3.12 from deadsnakes on Ubuntu 22.04 drops `distutils` from the stdlib, and Ubuntu's system pip still imports from it — so `pip install` fails immediately with ModuleNotFoundError: distutils. Switch to the uv standalone installer, which doesn't need pip at all. Caught during the first deploy build. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:32:55 +02:00
goldstein	a9e510362d	chore(model): qwen3:14b default (#30 ) All checks were successful tests / test (push) Successful in 2m55s Details unblock first deploy	2026-04-18 10:20:38 +00:00
Dirk Riemann	5ee74f367c	chore(model): switch default IX_DEFAULT_MODEL to qwen3:14b (already on host) All checks were successful tests / test (push) Successful in 1m52s Details tests / test (pull_request) Successful in 1m45s Details The home server's Ollama doesn't have gpt-oss:20b pulled; qwen3:14b is already there and is what mammon's chat agent uses. Switching the default now so the first deploy passes the /healthz ollama probe without an extra `ollama pull` step. The spec lists gpt-oss:20b as a concrete example; qwen3:14b is equally on-prem and Ollama-structured-output-compatible. Touched: AppConfig default, BankStatementHeader Request.default_model, .env.example, setup_server.sh ollama-list check, AGENTS.md, deployment.md, live tests. Unit tests that hard-coded the old model string but don't assert the default were left alone. Also: ASCII en-dash in e2e_smoke.py Paperless-style text (ruff RUF001). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:20:23 +02:00
goldstein	f6cc99f062	feat(e2e): e2e_smoke.py deploy gate (#29 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.4.	2026-04-18 10:18:23 +00:00
Dirk Riemann	d0648fe01d	feat(e2e): scripts/e2e_smoke.py — live deploy gate All checks were successful tests / test (push) Successful in 1m11s Details tests / test (pull_request) Successful in 2m14s Details Runs from the Mac after every `git push server main`. Flow: starts a tiny HTTP server on the Mac's LAN IP serving tests/fixtures/synthetic_giro.pdf → POST /jobs with bank_statement_header + Paperless-style texts so text_agreement has something to check against → poll GET /jobs/{id} until terminal → assert status=done, bank_name non-empty, closing_balance.provenance_verified=True, text_agreement=True, elapsed < 60 s. Non-zero exit blocks the deploy. Uses only stdlib (http.server, urllib) — no extra deps on the Mac-side, no test framework overhead. Task 5.4 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:18:07 +02:00
goldstein	5841bc09c0	feat(deploy): setup_server.sh + deployment runbook (#28 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.2.	2026-04-18 10:17:14 +00:00
Dirk Riemann	6d1bc720b4	feat(deploy): setup_server.sh + deployment runbook All checks were successful tests / test (push) Successful in 1m9s Details tests / test (pull_request) Successful in 1m10s Details - scripts/setup_server.sh: idempotent one-shot. Creates bare repo, post-receive hook (which rebuilds docker compose + gates on /healthz), infoxtractor Postgres role + DB on the shared postgis container, .env (0600) from .env.example with the password substituted in, verifies gpt-oss:20b is pulled. - docs/deployment.md: topology, one-time setup command, normal deploy workflow, rollback-via-revert pattern (never force-push main), operational checklists for the common /healthz degraded states. - First deploy section reserved; filled in after Task 5.3 runs. Task 5.2 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:16:58 +02:00
goldstein	3c7d607776	feat(docker): Dockerfile + compose (#27 ) Some checks are pending tests / test (push) Waiting to run Details Lands Task 5.1.	2026-04-18 10:15:45 +00:00
Dirk Riemann	4646180942	feat(docker): Dockerfile (CUDA+python3.12) + compose with GPU reservation All checks were successful tests / test (push) Successful in 1m13s Details tests / test (pull_request) Successful in 1m10s Details - nvidia/cuda:12.4 runtime base matches the deploy host's driver stack (immich-ml / monitoring use the same pattern). - python3.12 via deadsnakes (Ubuntu 22.04 ships 3.10 only). - System deps: libmagic1 (python-magic), libgl1/libglib2 (PIL + PyMuPDF headless), curl (post-receive /healthz probe), ca-certs (httpx TLS). - uv sync --frozen --no-dev --extra ocr installs prod + Surya/torch; dev tooling stays out of the image. - CMD runs `alembic upgrade head && uvicorn ix.app:create_app` — idempotent. - Compose: single service, port 8994, GPU reservation mirroring immich-ml, labels for monitoring dashboard auto-discovery + backup opt-in. - host.docker.internal:host-gateway lets ix reach the host's Ollama and postgis containers (same pattern mammon uses). Task 5.1 of MVP plan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:15:26 +02:00
goldstein	c234b67bbf	Merge pull request 'feat(app): production wiring factories + /healthz real probes (Task 4.3)' (#26 ) from feat/production-wiring into main All checks were successful tests / test (push) Successful in 1m10s Details	2026-04-18 10:09:34 +00:00
Dirk Riemann	ebefee4184	feat(app): production wiring — factories, pipeline, /healthz real probes All checks were successful tests / test (push) Successful in 1m9s Details tests / test (pull_request) Successful in 1m13s Details Task 4.3 closes the loop on Chunk 4: the FastAPI lifespan now selects fake vs real clients via IX_TEST_MODE (new AppConfig field), wires /healthz probes to the live selfcheck() on OllamaClient / SuryaOCRClient, and spawns the worker with a production Pipeline factory that builds SetupStep -> OCRStep -> GenAIStep -> ReliabilityStep -> ResponseHandler over the injected clients. Factories: - make_genai_client(cfg) -> FakeGenAIClient \| OllamaClient - make_ocr_client(cfg) -> FakeOCRClient \| SuryaOCRClient (spec §6.2) Probes run the async selfcheck on a fresh event loop in a short-lived thread so they're safe to call from either sync callers or a live FastAPI handler without stalling the request loop. Drops the worker-loop spawn_worker_task stub — the app module owns the production spawn directly. Tests: +11 unit tests (5 factories + 6 app-wiring / probe adapter / pipeline build). Full suite: 236 passed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:09:11 +02:00
goldstein	b737ed7b21	Merge pull request 'feat(ocr): SuryaOCRClient real OCR backend (spec 6.2)' (#25 ) from feat/surya-client into main All checks were successful tests / test (push) Successful in 1m10s Details	2026-04-18 10:04:41 +00:00
Dirk Riemann	322f6b2b1b	feat(ocr): SuryaOCRClient — real OCR backend (spec §6.2) All checks were successful tests / test (push) Successful in 1m14s Details tests / test (pull_request) Successful in 1m14s Details Runs Surya's detection + recognition over PIL images rendered from each Page's source file (PDFs via PyMuPDF, images via Pillow). Lazy warm_up so FastAPI lifespan start stays predictable. Deferred Surya/torch imports keep the base install slim — the heavy deps stay under [ocr]. Extends OCRClient Protocol with optional files + page_metadata kwargs so the engine can resolve each page back to its on-disk source; Fake accepts-and-ignores to keep hermetic tests unchanged. selfcheck() runs the predictors on a 1x1 PIL image — wired into /healthz by Task 4.3. Tests: 6 hermetic unit tests (Surya predictors mocked, no model download); 2 live tests gated on IX_TEST_OLLAMA=1 (never run in CI). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 12:04:19 +02:00
goldstein	0f045f814a	Merge pull request 'feat(genai): OllamaClient structured-output /api/chat backend (spec 6)' (#24 ) from feat/ollama-client into main All checks were successful tests / test (push) Successful in 1m15s Details	2026-04-18 09:58:38 +00:00
Dirk Riemann	90e46b707d	feat(genai): OllamaClient — structured-output /api/chat backend (spec §6) All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m5s Details Real GenAIClient for the production pipeline. Sends `format=<pydantic JSON schema>`, `stream=false`, and mapped options (`temperature`; drops `reasoning_effort`). Content-parts lists joined to a single string since MVP models don't speak native content-parts. Error mapping per spec: connection/timeout/5xx → IX_002_000, schema violations → IX_002_001. `selfcheck()` probes /api/tags with a fixed 5 s timeout for /healthz. Tests: 10 hermetic pytest-httpx unit tests; 2 live tests gated on IX_TEST_OLLAMA=1 (never run in CI). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:58:15 +02:00
goldstein	6183b9c886	Merge pull request 'feat(pg-queue): LISTEN ix_jobs_new + 10s fallback poll' (#23 ) from feat/pg-queue-adapter into main All checks were successful tests / test (push) Successful in 1m11s Details	2026-04-18 09:52:45 +00:00
Dirk Riemann	050f80dcd7	feat(pg-queue): LISTEN ix_jobs_new + 10s fallback poll (spec §4) All checks were successful tests / test (push) Successful in 1m8s Details tests / test (pull_request) Successful in 1m9s Details PgQueueListener: - Dedicated asyncpg connection outside the SQLAlchemy pool (LISTEN needs a persistent connection; pooled connections check in/out). - Exposes wait_for_work(timeout) — resolves on NOTIFY or timeout, whichever fires first. The worker treats both wakes identically. - asyncpg_dsn_from_sqlalchemy_url strips the +asyncpg driver segment and percent-decodes the password so the same URL in IX_POSTGRES_URL works for both SQLAlchemy and raw asyncpg. app.py lifespan now also spawns the listener alongside the worker; both are gated on spawn_worker=True so REST-only tests stay fast. 2 new integration tests: NOTIFY path (wake within 2 s despite 60 s poll) + missed-NOTIFY path (fallback poll recovers within 5 s). 33 integration tests total, 209 unit. Forgejo Actions trigger is flaky; local verification is the gate. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:52:26 +02:00
goldstein	415e03fba1	Merge pull request 'feat(worker): async worker loop + one-shot callback delivery' (#22 ) from feat/worker-loop into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:50:11 +00:00
Dirk Riemann	406a7ea2fd	feat(worker): async worker loop + one-shot callback delivery (spec §5) All checks were successful tests / test (push) Successful in 1m15s Details tests / test (pull_request) Successful in 1m8s Details Worker: - Startup: sweep_orphans(now, max_running_seconds) rescues rows stuck in 'running' from a crashed prior process. - Loop: claim_next_pending → build pipeline via injected factory → run → mark_done/mark_error → deliver callback if set → record outcome. - Non-IX exceptions from the pipeline collapse to IX_002_000 so callers see a stable error code. - Sleep loop uses a cancellable wait so the stop event reacts immediately; the wait_for_work hook is ready for Task 3.6 to plug in the LISTEN-driven event without the worker knowing about NOTIFY. Callback: - One-shot POST, 2xx → delivered, anything else (incl. connect/timeout exceptions) → failed. No retries. - Callback record never reverts the job's terminal state — GET /jobs/{id} stays the authoritative fallback. 7 integration tests: happy path, pipeline-raise → error, callback 2xx, callback 5xx, orphan sweep on startup, no-callback rows stay callback_status=None (x2 via parametrize). Unit suite still 209. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:49:54 +02:00
goldstein	ee023d6e34	Merge pull request 'feat(rest): FastAPI adapter + /jobs /healthz /metrics (spec 5)' (#21 ) from feat/rest-adapter into main Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 09:47:35 +00:00
Dirk Riemann	e46c44f1e0	feat(rest): FastAPI adapter + /jobs, /healthz, /metrics routes (spec §5) All checks were successful tests / test (push) Successful in 1m7s Details tests / test (pull_request) Successful in 1m5s Details Routes: - POST /jobs: 201 on first insert, 200 on idempotent re-submit. - GET /jobs/{id}: full Job envelope or 404. - GET /jobs?client_id=&request_id=: correlation lookup or 404. - GET /healthz: {postgres, ollama, ocr}; 200 iff all ok (degraded counts as non-200 per spec). Postgres probe guarded by a 2 s wait_for. - GET /metrics: pending/running counts + 24h done/error counters + per-use-case avg seconds. Plain JSON, no Prometheus. create_app(spawn_worker=bool) parameterises worker spawning so tests that only need REST pass False. Worker spawn is tolerant of the loop module not being importable yet (Task 3.5 fills it in). Probes are a DI bundle — production wiring swaps them in at startup (Chunk 4); tests inject canned ok/fail callables. Session factory is also DI'd so tests can point at a per-loop engine and sidestep the async-pg cross-loop future issue that bit the jobs_repo fixture. 9 new integration tests; unit suite unchanged. Forgejo Actions trigger is flaky; local verification is the gate (unit + integration green locally). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:47:04 +02:00
goldstein	04a415a191	Merge pull request 'feat(store): JobsRepo CRUD over ix_jobs + integration fixtures' (#20 ) from feat/jobs-repo into main All checks were successful tests / test (push) Successful in 1m14s Details	2026-04-18 09:43:28 +00:00
Dirk Riemann	141153ffa7	feat(store): JobsRepo CRUD over ix_jobs + integration fixtures (spec §4) All checks were successful tests / test (push) Successful in 1m10s Details tests / test (pull_request) Successful in 1m10s Details JobsRepo covers the full job-lifecycle surface: - insert_pending: idempotent on (client_id, request_id) via ON CONFLICT DO NOTHING + re-select; assigns a 16-hex ix_id. - claim_next_pending: FOR UPDATE SKIP LOCKED so concurrent workers never double-dispatch a row. - get / get_by_correlation: hydrates JSONB back through Pydantic. - mark_done: done iff response.error is None, else error. - mark_error: explicit convenience wrapper. - update_callback_status: delivered \| failed (no status transition). - sweep_orphans: time-based rescue of stuck running rows; attempts++. Integration fixtures (tests/integration/conftest.py): - Skip cleanly when neither IX_TEST_DATABASE_URL nor IX_POSTGRES_URL is set (unit suite stays runnable on a bare laptop). - Alembic upgrade/downgrade runs in a subprocess so its internal asyncio.run() doesn't collide with pytest-asyncio's loop. - Per-test engine + truncate so loops never cross and tests start clean. 15 integration tests against a live postgres:16, including SKIP LOCKED concurrency + orphan sweep. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:43:11 +02:00
goldstein	8bb220ae43	Merge pull request 'feat(config): AppConfig + cached get_config()' (#19 ) from feat/config into main All checks were successful tests / test (push) Successful in 59s Details	2026-04-18 09:39:00 +00:00
Dirk Riemann	95728accbf	feat(config): AppConfig + cached get_config() (spec §9) All checks were successful tests / test (push) Successful in 1m1s Details tests / test (pull_request) Successful in 58s Details Typed pydantic-settings view over every IX_* env var, defaults matching spec §9 exactly. @lru_cache-wrapped accessor so parsing/validation happens once per process; tests clear the cache via get_config.cache_clear(). extra="ignore" keeps the container robust against typo'd env vars in production .env files. engine.py's URL resolver now goes through get_config() when ix.config is importable (bootstrap fallback remains so hypothetical early-import callers don't crash). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:38:44 +02:00
goldstein	dc6d28bda1	Merge pull request 'feat(store): Alembic scaffolding + initial ix_jobs migration' (#18 ) from feat/alembic-init into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:37:37 +00:00
Dirk Riemann	1c60c30084	feat(store): Alembic scaffolding + initial ix_jobs migration (spec §4) All checks were successful tests / test (push) Successful in 1m15s Details tests / test (pull_request) Successful in 1m2s Details Lands the async-friendly Alembic env (NullPool, reads IX_POSTGRES_URL), the hand-written 001 migration matching the spec's table layout exactly (CHECK on status, partial index on pending rows, UNIQUE on (client_id, request_id)), the SQLAlchemy 2.0 ORM mapping, and a lazy engine/session factory. The factory reads the URL through ix.config when available; Task 3.2 makes that the only path. Smoke-tested: alembic upgrade head + downgrade base against a live postgres:16 produce the expected table shape and tear down cleanly. Unit tests assert the migration source contains every required column/index so the migration can't drift from spec at import time. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:37:21 +02:00
goldstein	a54a968313	Merge pull request 'test(pipeline): end-to-end hermetic test with fakes + synthetic fixture' (#17 ) from feat/pipeline-e2e-fakes into main Some checks failed tests / test (push) Has been cancelled Details	2026-04-18 09:24:51 +00:00
Dirk Riemann	b109bba873	test(pipeline): end-to-end hermetic test with fakes + synthetic fixture All checks were successful tests / test (push) Successful in 59s Details tests / test (pull_request) Successful in 57s Details Wires the five pipeline steps together with FakeOCRClient + FakeGenAIClient, feeds the committed synthetic_giro.pdf fixture via file:// URL, and asserts the full response shape. - scripts/create_fixture_pdf.py: PyMuPDF-based builder. One-page A4 PDF with six known header strings (bank name, IBAN, period, balances, statement date). Re-runnable on demand; the committed PDF is what CI consumes. - tests/fixtures/synthetic_giro.pdf: committed output. - tests/unit/test_pipeline_end_to_end.py: 5 tests covering * ix_result.result fields populated from the fake LLM * provenance.fields["result.closing_balance"].provenance_verified True * text_agreement True when Paperless-style texts match the value * metadata.timings has one entry per step in the right order * response.error is None and context is not serialised 197 tests total; ruff clean. No integration tests, no real clients, no network. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:24:29 +02:00
goldstein	118d77c428	Merge pull request 'feat(pipeline): ResponseHandlerStep (spec §8)' (#16 ) from feat/step-response-handler into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:21:50 +00:00
Dirk Riemann	565d8d0676	feat(pipeline): ResponseHandlerStep — shape-up final payload (spec §8) All checks were successful tests / test (push) Successful in 1m0s Details tests / test (pull_request) Successful in 1m2s Details Final pipeline step. Three mechanical transforms: 1. include_ocr_text -> concatenate non-tag line texts, pages joined with \n\n, write to ocr_result.result.text. 2. include_geometries=False (default) -> strip ocr_result.result.pages + ocr_result.meta_data. Geometries are heavy; callers opt in. 3. Delete response.context so the internal accumulator never leaks to the caller (belt-and-braces; Field(exclude=True) already does this). validate() always returns True per spec. 7 unit tests in tests/unit/test_response_handler_step.py cover all three branches + context-not-in-model_dump check. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:21:36 +02:00
goldstein	83c1996702	Merge pull request 'feat(pipeline): ReliabilityStep (spec §6)' (#15 ) from feat/step-reliability into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:20:38 +00:00
Dirk Riemann	132f110463	feat(pipeline): ReliabilityStep — writes reliability flags (spec §6) All checks were successful tests / test (push) Successful in 1m3s Details tests / test (pull_request) Successful in 1m1s Details Thin wrapper around ix.provenance.apply_reliability_flags. Validate skips entirely when include_provenance is off OR when no provenance data was built (text-only request, etc.). Process reads context.texts + context.use_case_response and lets the verifier mutate the FieldProvenance entries + fill quality_metrics counters in place. 11 unit tests in tests/unit/test_reliability_step.py cover: validate skips on flag off / missing provenance, runs otherwise; per-type flag behaviour (string verified + text_agreement, Literal -> None, None value -> None, short numeric -> text_agreement None, date with both sides parsed, IBAN whitespace-insensitive, disagreement -> False); quality_metrics verified_fields / text_agreement_fields counters. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:20:18 +02:00
goldstein	6d9c239e82	Merge pull request 'feat(pipeline): GenAIStep (spec §6.3, §7, §9.2)' (#14 ) from feat/step-genai into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:18:59 +00:00
Dirk Riemann	abee9cea7b	feat(pipeline): GenAIStep — LLM call + provenance mapping (spec §6.3, §7, §9.2) All checks were successful tests / test (push) Successful in 1m14s Details tests / test (pull_request) Successful in 1m10s Details Assembles the prompt, picks the structured-output schema, calls the injected GenAIClient, and maps any emitted segment_citations into response.provenance. Reliability flags stay None here; ReliabilityStep fills them in Task 2.7. - System prompt = use_case.system_prompt + (provenance-on) the verbatim citation instruction from spec §9.2. - User text = SegmentIndex.to_prompt_text([p1_l0] style) when provenance is on, else plain OCR flat text + texts joined. - Response schema = UseCaseResponse directly, or a runtime create_model("ProvenanceWrappedResponse", result=(UCR, ...), segment_citations=(list[SegmentCitation], Field(default_factory=list))) when provenance is on. - Model = request override -> use-case default. - Failure modes: httpx / connection / timeout errors -> IX_002_000; pydantic.ValidationError -> IX_002_001. - Writes ix_result.result + ix_result.meta_data (model_name + token_usage); builds response.provenance via map_segment_refs_to_provenance when provenance is on. 17 unit tests in tests/unit/test_genai_step.py cover validate (ocr_only skip, empty -> IX_001_000, text-only, ocr-text path), process happy path, system-prompt shape with/without citation instruction, user text tagged vs. plain, response schema plain vs. wrapped, provenance mapping, error mapping (IX_002_000 + IX_002_001), and model selection (request override + use-case default). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:18:44 +02:00
goldstein	acb2d55ce3	Merge pull request 'feat(pipeline): OCRStep (spec §6.2)' (#13 ) from feat/step-ocr into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:16:04 +00:00
Dirk Riemann	81054baa06	feat(pipeline): OCRStep — run OCR + page tags + SegmentIndex (spec §6.2) All checks were successful tests / test (push) Successful in 1m11s Details tests / test (pull_request) Successful in 1m13s Details Runs after SetupStep. Dispatches the flat page list to the injected OCRClient, writes the raw OCRResult onto response.ocr_result, injects <page file="..." number="..."> open/close tag lines around each page's content, and builds a SegmentIndex over the non-tag lines when provenance is on. Validate follows the spec triad rule: - include_geometries/include_ocr_text/ocr_only + no files -> IX_000_004 - no files -> False (skip) - files + (use_ocr or triad) -> True 9 unit tests in tests/unit/test_ocr_step.py cover all three validate branches, OCRResult written, page tags injected (format + file_index), SegmentIndex built iff provenance on. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:15:46 +02:00
goldstein	632acdcd26	Merge pull request 'feat(pipeline): SetupStep (spec §6.1)' (#12 ) from feat/step-setup into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:14:19 +00:00
Dirk Riemann	97aa24f478	feat(pipeline): SetupStep — validate + fetch + MIME + pages (spec §6.1) All checks were successful tests / test (push) Successful in 1m13s Details tests / test (pull_request) Successful in 1m19s Details First pipeline step. Validates the request (IX_000_002 on empty context), normalises every Context.files entry to a FileRef, downloads them in parallel via asyncio.gather, byte-sniffs MIMEs (IX_000_005 for unsupported), loads the use-case pair from REGISTRY (IX_001_001 on miss), and builds the flat pages + page_metadata list on response_ix.context. Fetcher / ingestor / MIME detector / tmp_dir / fetch_config all inject via the constructor so unit tests stay hermetic — production wires the real ix.ingestion defaults via the app factory. 7 unit tests in tests/unit/test_setup_step.py cover validate errors, happy path (fetcher + ingestor invoked correctly, context populated, use_case_name echoed), FileRef headers pass through, unsupported MIME -> IX_000_005, unknown use case -> IX_001_001, text-only request, and the _InternalContext type assertion. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:14:04 +02:00
goldstein	d801038c74	Merge pull request 'feat(ingestion): fetch_file + MIME sniff + DocumentIngestor' (#11 ) from feat/ingestion into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:12:19 +00:00
Dirk Riemann	290e51416f	feat(ingestion): fetch_file + MIME sniff + DocumentIngestor (spec §6.1) All checks were successful tests / test (push) Successful in 57s Details tests / test (pull_request) Successful in 1m12s Details Three layered modules the SetupStep will wire together in Task 2.4. - fetch.py: async httpx fetch with configurable timeouts + incremental size cap (stream=True, accumulate bytes, raise IX_000_007 when exceeded). file:// URLs read locally. Auth headers pass through. The caller injects a FetchConfig — env reads happen in ix.config (Chunk 3). - mime.py: python-magic byte-sniff + SUPPORTED_MIMES frozenset + require_supported(mime) helper that raises IX_000_005. - pages.py: DocumentIngestor.build_pages(files, texts) -> (list[Page], list[PageMetadata]). PDFs via PyMuPDF (hard 100 pg/PDF cap -> IX_000_006), images via Pillow (multi-frame TIFFs yield multiple Pages), texts as zero-dim Pages so GenAIStep can still cite them. 21 new unit tests (141 total) cover: fetch success with headers, 4xx/5xx mapping, timeout -> IX_000_007, size cap enforced globally + per-file, file:// happy path + missing file, MIME detection for PDF/PNG/JPEG/TIFF, require_supported gate, PDF/TIFF/text page counts, 101-page PDF -> IX_000_006, multi-file file_index assignment. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:12:00 +02:00
goldstein	2709fb8d6b	Merge pull request 'feat(clients): OCRClient + GenAIClient protocols + fakes' (#10 ) from feat/client-protocols into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:08:38 +00:00
Dirk Riemann	118a9abd09	feat(clients): OCRClient + GenAIClient protocols + fakes (spec §6.2, §6.3) All checks were successful tests / test (push) Successful in 1m0s Details tests / test (pull_request) Successful in 1m1s Details Adds the two Protocol-based client contracts the pipeline steps depend on, plus test-oriented fakes. Real engines (Surya, Ollama) land in Chunk 4. - ix.ocr.client.OCRClient — runtime_checkable Protocol with async ocr(). - ix.genai.client.GenAIClient — runtime_checkable Protocol with async invoke(); GenAIInvocationResult + GenAIUsage dataclasses carry the parsed model, token usage, and model name. - FakeOCRClient / FakeGenAIClient: return canned results; both expose a raise_on_call hook for error-path tests. 8 unit tests across tests/unit/test_ocr_fake.py + test_genai_fake.py confirm protocol conformance, canned-return behaviour, usage/model-name defaults, and raise_on_call propagation. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-18 11:08:24 +02:00
goldstein	1344b9ddb4	Merge pull request 'feat(pipeline): Step ABC + Pipeline runner + Timer' (#9 ) from feat/pipeline-core into main Some checks are pending tests / test (push) Waiting to run Details	2026-04-18 09:07:09 +00:00

1 2

74 commits