ZON vs TOON

ZON (Zero Overhead Notation) and TOON (Table Object Object Notation) are both modern data formats designed to be more efficient than JSON for Large Language Models (LLMs). While TOON pioneered the idea of "human-readable compression," ZON optimizes further for token efficiency and explicit schema validation.

At a Glance

Feature	ZON	TOON	Advantage
Token Efficiency	High (28% savings vs JSON)	Moderate (25% savings vs JSON)	ZON saves ~5-15% more tokens
Retrieval Accuracy	100%	100%	Tie
Syntax Style	Tabular/Minimalist	Structured/Verbose	ZON is cleaner
Parsing Speed	Fast (Stream-ready)	Moderate	ZON

Token Efficiency Comparison

Benchmarks across multiple datasets and tokenizers (GPT-4o, Claude 3.5, Llama 3) consistently show ZON outperforming TOON, especially for tabular data.

Unified Benchmark Results

Aggregate results across 8 diverse datasets (E-commerce, Logs, Time-series, etc.):

Metric	ZON	TOON	JSON
Total Tokens	19,995	20,988	28,042
vs JSON	-28.7%	-25.1%	-
vs TOON	-4.7%	-	-

NOTE: ZON achieves these savings without sacrificing accuracy. By using explicit headers (@(N)) and strict boolean tokens (T/F), ZON ensures LLMs understand the structure perfectly while using fewer tokens than TOON's indented format.

Retrieval Accuracy

Both formats are designed to be unambiguous for LLMs. In our benchmarks using gpt-5-nano on Azure OpenAI, both achieved perfect scores.

ZON: 100% Accuracy (309/309 questions)
TOON: 100% Accuracy (309/309 questions)

The Difference: ZON achieves this accuracy with fewer tokens.

Why ZON Wins

1. Tabular Encoding for Arrays

TOON uses a verbose indented syntax for arrays. ZON uses a highly efficient CSV-like tabular format with explicit headers, eliminating repetitive keys and indentation tokens.

context{location:Boulder,season:spring_2025,task:Our favorite hikes together}
friends[ana,luis,sam]
hikes:@(3):companion,distanceKm,elevationGain,id,name,wasSunny
ana,7.5,320,1,Blue Lake Trail,T
luis,9.2,540,2,Ridge Overlook,F
sam,5.1,180,3,Wildflower Loop,T

2. Minimal Syntax Overhead

ZON removes unnecessary punctuation like braces {}, brackets [], spaces and commas , where they aren't strictly needed for parsing. It relies on a strict schema-first approach (@(N):cols) rather than indentation, which is more robust against tokenization artifacts.

3. Stream-First Design

ZON is designed to be parsed byte-by-byte, making it ideal for streaming responses from LLMs. Its structure allows parsers to build objects incrementally without waiting for closing delimiters or tracking indentation levels.

Conclusion

While TOON is a solid improvement over JSON, ZON represents the next generation of efficiency. It provides the same 100% retrieval accuracy but with better compression, faster parsing, and a more robust schema definition for LLMs.

Edit on GitHub

PreviousBenchmarks Next CLI Guide

Essentials

Toolkit

Technical Reference