common/chat : fix LFM2/LFM2.5 reasoning round-trip and <think> leak by tdakhran · Pull Request #24234 · ggml-org/llama.cpp

tdakhran · 2026-06-06T15:05:37Z

Overview

Follow-up on review comment #24178 (review) made by @aldehir.

For LFM2/LFM2.5 models, copy reasoning_content into thinking.

LFM2.5-8B-A1B is always a reasoning model. The chat template doesn't have a switch to disable it. This leads to a leak of thinking into content with reasoning disabled (-rea off). (reported here #24178 (comment)).
The workaround is to drop thinking content (as is done for Deepseek).

Additional information

Add more tests, and refactor (unify) test cases for different LFM2/LFM2.5 chat templates.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES, test case refactoring and review.

aldehir · 2026-06-06T17:13:40Z

+        } else if (extract_reasoning) {
+            // Thinking off, but model may still emit <think>, drop it
+            reasoning = p.optional(p.literal(THINK_START) + p.until(THINK_END) + p.literal(THINK_END));
        }


Let's always parse reasoning but gate it by reasoning format and model (i.e. if the think token exists in the template). Simulating this doesn't gain anything and might seem as if the model is doing nothing.

I like it, pushed one more commit, now reasoning parsing is always enabled if there is <think> in the chat template.

aldehir · 2026-06-06T18:31:50Z

Looks good, I'll wait for CI to pass before giving approval. Thank you for your contributions!

common/chat : fix LFM2 reasoning round-trip and stray <think> leak

477a217

tdakhran requested review from a team and pwilkin as code owners June 6, 2026 15:05

github-actions Bot added the testing Everything test related label Jun 6, 2026

tdakhran mentioned this pull request Jun 6, 2026

common/chat : unify and fix LFM2/LFM2.5 tool parser #24178

Merged

aldehir reviewed Jun 6, 2026

View reviewed changes

Gate by reasoning format and whether the template supports <think>

e9c51d9

tdakhran requested a review from aldehir June 6, 2026 18:24

aldehir approved these changes Jun 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

common/chat : fix LFM2/LFM2.5 reasoning round-trip and <think> leak#24234

common/chat : fix LFM2/LFM2.5 reasoning round-trip and <think> leak#24234
tdakhran wants to merge 2 commits into
ggml-org:masterfrom
tdakhran:tarek/feat/lfm2-thinking-fix

tdakhran commented Jun 6, 2026

Uh oh!

aldehir Jun 6, 2026 •

edited

Loading

Uh oh!

tdakhran Jun 6, 2026

Uh oh!

aldehir commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tdakhran commented Jun 6, 2026

Overview

Additional information

Requirements

Uh oh!

aldehir Jun 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tdakhran Jun 6, 2026

Choose a reason for hiding this comment

Uh oh!

aldehir commented Jun 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aldehir Jun 6, 2026 •

edited

Loading