fix(genai): drop format flag; extract trailing JSON #41

Merged

goldstein merged 1 commit from fix/ollama-extract-json into main

2026-04-18 12:05:47 +00:00

Author

SHA1

Message

Date

Dirk Riemann

81e3b9a7d0

fix(genai): drop Ollama format flag; extract trailing JSON from response

tests / test (push) Successful in 1m30s

tests / test (pull_request) Successful in 1m21s

qwen3:14b (and deepseek-r1, other reasoning models) wrap their output
in <think>…</think> chains-of-thought before emitting real output.
With format=json the constrained sampler terminated immediately at
`{}` because the thinking block wasn't valid JSON; without format the
model thinks normally and appends the actual JSON at the end.

OllamaClient now omits the format flag and extracts the outermost
balanced `{…}` block from the response (brace depth counter,
string-literal aware). Works for reasoning models, ```json``` code-fenced
outputs, and plain JSON alike.
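
For illustration, the extraction described above could be sketched roughly as follows. This is not the code from this commit: the function name `extract_trailing_json` is hypothetical, Python is used only for the example, and the sketch assumes that when several top-level balanced blocks appear, the last one is the intended payload.

```python
import json
from typing import Optional


def extract_trailing_json(text: str) -> Optional[str]:
    """Return the last top-level balanced {...} block in text, or None.

    Hypothetical helper: a brace depth counter that ignores braces
    inside JSON string literals.
    """
    depth = 0          # current brace nesting level
    start = -1         # index of the '{' that opened the current block
    in_string = False  # True while inside a "..." literal within a block
    escaped = False    # True right after a backslash inside a string
    last_block = None

    for i, ch in enumerate(text):
        if depth == 0:
            # Outside any block: prose may contain stray quotes, so only
            # an opening brace is significant here.
            if ch == "{":
                depth = 1
                start = i
            continue
        if in_string:
            if escaped:
                escaped = False
            elif ch == "\\":
                escaped = True
            elif ch == '"':
                in_string = False
            continue
        if ch == '"':
            in_string = True
        elif ch == "{":
            depth += 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                # A top-level block just closed; later blocks overwrite it.
                last_block = text[start:i + 1]
    return last_block


# Usage: a reasoning-style response with a think block and a fenced payload.
response = (
    "<think>Maybe {\"a\": 1}? No, the user wants more.</think>\n"
    "```json\n"
    "{\"msg\": \"brace } inside a string\", \"n\": {\"k\": 2}}\n"
    "```"
)
payload = json.loads(extract_trailing_json(response))
```

One limitation of this sketch: an unmatched `{` inside the thinking text would swallow everything after it, so a production version would likely need a fallback such as retrying from the next opening brace.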

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-04-18 14:05:28 +02:00