server: add "schema" and validation by ngxson · Pull Request #24150 · ggml-org/llama.cpp

ngxson · 2026-06-04T23:34:37Z

Overview

Add the notion of server_schema that will define the input schema in a more systematic way.

This is actually something I wanted to do since a long time ago. Motivations for this proposal:

To address bugs like GHSA-8947-pfff-2f3c due to lack of input validation
Generalize the validation logic
Inline documentation, can be exported to markdown in the future

TODO in follow-up PRs:

Migrate other schema (for example, chat completion) to using this system
Export schema to markdown
Allow linking arg.cpp <--> server-schema.cpp via an enum? (not sure how it will be useful)

Example of the code:

// old way
params.sampling.top_k              = json_value(data, "top_k",               defaults.sampling.top_k);

// new way
add((new field_num("top_k", params.sampling.top_k))
    ->set_limits(0, INT32_MAX)
    ->set_desc("Limit the next token selection to the K most probable tokens (0 = disabled)"));

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: to spot and fix mistakes during the migration

ServeurpersoCom · 2026-06-05T10:21:03Z

Testing this on my pod. It seems to me that the OAI spec can have sampling values less than 0 even if it is rarely useful in practice, such as encouraging repetitions with frequency_penalty. ( https://developers.openai.com/api/reference/python/resources/completions/methods/create search for "-2")

ServeurpersoCom · 2026-06-05T10:33:25Z

+    auto schema = make_llama_cmpl_schema(params_base, params);
+
+    // eval all fields in the schema
+    for (const auto & f : schema) {


Type errors return the raw nlohmann message without the field name, maybe wrap the eval loop in a try/catch to prepend it so the client knows which param failed?

yes I added it in the last commit(s), along with some corrections for the numerical limits. PTAL

example error message:

"message": "Field 'min_keep': Value must be between 0 <= value <= 2147483647, but got -100" "message": "Field 'min_keep': [json.exception.type_error.302] type must be number, but is string",

ServeurpersoCom · 2026-06-05T11:03:21Z

+using field_handler = std::function<void(field_eval_context &, const json &)>;
+
+struct field {
+    std::set<const char *> name;


std::set<const char *> compares pointer addresses, not strings, so the alias order is not the insertion order. I got max_tokens winning over n_predict on my pod just by changing TU link order. A std::vector fixes it and it's a 2 line change, tested here.

Suggested change

std::set<const char *> name;

std::vector<const char *> name;

And :

- name.insert(n); + name.push_back(n);

yup, nice catch

ngxson added 2 commits June 5, 2026 01:07

wip

9180a69

working

360e66d

ngxson requested a review from a team as a code owner June 4, 2026 23:34

github-actions Bot added examples server labels Jun 4, 2026

ServeurpersoCom reviewed Jun 5, 2026

View reviewed changes

ngxson added 2 commits June 5, 2026 17:37

correct some limits

fcfab9b

add field name to error message

a4d300d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

server: add "schema" and validation#24150

server: add "schema" and validation#24150
ngxson wants to merge 4 commits into
masterfrom
xsn/server_schema

ngxson commented Jun 4, 2026 •

edited

Loading

Uh oh!

ServeurpersoCom commented Jun 5, 2026 •

edited

Loading

Uh oh!

ServeurpersoCom Jun 5, 2026

Uh oh!

ngxson Jun 5, 2026

Uh oh!

ServeurpersoCom Jun 5, 2026

Uh oh!

ngxson Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ngxson commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Requirements

Uh oh!

ServeurpersoCom commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ServeurpersoCom Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

ngxson Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

ServeurpersoCom Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

ngxson Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ngxson commented Jun 4, 2026 •

edited

Loading

ServeurpersoCom commented Jun 5, 2026 •

edited

Loading