There is a newer version of the record available.

Published March 26, 2026 | Version 2026.3.4
Software Open

TensorZero

Authors/Creators

Description

[!WARNING] Planned Deprecations

  • The configuration for inference evaluations should be nested under the relevant functions moving forward [docs]. You can run evaluations by providing a function name and a list of evaluators. The legacy format will be removed in a future release.
    [functions.write_haiku.evaluators.exact_match]
    type = "exact_match"
    
  • The legacy implementation of GEPA (launch_optimization with GEPAConfig) will be removed in a future release. Please use t0.optimization.gepa.launch instead. [docs]

Bug Fixes

  • Fixed a UI bug where a custom gateway base_path was not handled correctly in certain routes. (thanks @wangfenjin!)

New Features

  • Started including embeddings requests in the Prometheus metrics tensorzero_requests_total and tensorzero_inferences_total.
  • Added the configuration field observability.batch_writes.write_queue_capacity to enable backpressure for observability data in the gateway.

& multiple under-the-hood and UI improvements (thanks @majiayu000)!

[!IMPORTANT]

🆕 TensorZero Autopilot

TensorZero Autopilot is an automated AI engineer powered by TensorZero that analyzes LLM observability data, sets up evals, optimizes prompts and models, and runs A/B tests.

It dramatically improves the performance of LLM agents across diverse tasks:

<img width="600" alt="Bar chart showing baseline vs. optimized scores across diverse LLM tasks" src="https://github.com/user-attachments/assets/aa474fe3-b55a-48aa-9f0d-e7c2f8e32ccd" /> <br>

Learn more →&emsp;&emsp;Schedule a demo →

Notes

If you use this software, please cite it as below.

Files

tensorzero/tensorzero-2026.3.4.zip

Files (28.9 MB)

Name Size Download all
md5:5b5341c50907451a1f35aab42cdea638
28.9 MB Preview Download

Additional details

Related works