Benchmarks

Comprehensive performance evaluations of The Token Company compression API. Each benchmark provides detailed methodology, statistical analysis, and reproducible results.

We are working on updating the benchmarks to include more models and domains using our next generation of compression models.

Why benchmark?

Token compression must balance efficiency with quality. Removing too many tokens risks degrading model performance, while removing too few limits cost savings.

Our benchmarks use rigorous statistical methodology — hundreds of measurements per configuration, bootstrap analysis, and transparent reporting of both methodology and limitations.

Every result is reproducible. We publish the exact configurations, datasets, and evaluation criteria so you can verify our claims.