Bug Description
When using Ollama models, streaming output stalls mid-generation.
The agent loop pauses and waits for user input before continuing.
Typing anything (Enter or a short message) causes generation to resume.
Expected behaviour: streaming continues uninterrupted without user intervention.
Steps to Reproduce
1. Run kit with any Ollama model (tested: ollama/gemma4:latest, ollama/qwen3:14b)
2. Ask a question that produces a long or multi-step response
3. Observe output stops mid-generation
4. Type anything and press Enter — generation resumes
Relevant Code / Configuration
temperature: 0.6
top-k: 20
top-p: 0.95
max-tokens: 8192
thinking-level: "off"
max-steps: 50
Affected Component
Streaming / Ollama provider
Kit Version
dev (go install github.com/mark3labs/kit@latest, built 2026-05-25)
Additional Context
Ollama version: 0.30.6
Hardware: Apple M4 Mac mini
No keepalive or poll-interval flags available in kit --help.
Issue occurs consistently across multiple sessions and models.
Checklist
Bug Description
When using Ollama models, streaming output stalls mid-generation.
The agent loop pauses and waits for user input before continuing.
Typing anything (Enter or a short message) causes generation to resume.
Expected behaviour: streaming continues uninterrupted without user intervention.
Steps to Reproduce
Relevant Code / Configuration
Affected Component
Streaming / Ollama provider
Kit Version
dev (go install github.com/mark3labs/kit@latest, built 2026-05-25)
Additional Context
Ollama version: 0.30.6
Hardware: Apple M4 Mac mini
No keepalive or poll-interval flags available in kit --help.
Issue occurs consistently across multiple sessions and models.
Checklist