For the complete documentation index, see llms.txt. This page is also available as Markdown.

Overview

Overview of the parsing pipeline and the two error classes.

Turning Internet Object text into validated data happens in stages:

  1. Tokenize — split the text into tokens (values, separators, structural characters).

  2. Parse — assemble tokens into a document tree (header, sections, records, values).

  3. Validate — check the data against the schema.

  4. Load — produce the final in-memory values.

Errors fall into two classes, matching the stages that produce them:

Class
Stage
Example

Syntax error

tokenize / parse

unbalanced {, missing comma, unterminated string

Validation error

validate

wrong type, out of range, missing required field

The distinction matters because the two classes recover differently:

  • Syntax errors are bounded by structure — the parser skips to the next boundary (~ or ---) and continues.

  • Validation errors are bounded by the object — each record is validated on its own and may report zero, one, or many errors, without affecting other records.

In this section

See Also

Last updated

Was this helpful?