16x Eval

Download 16x Eval

Iterate on prompts. Test different models. Find the best combo for your tasks.

Latest version: v0.0.49
Released: July 16, 2025

New updates from the latest releases

0.0.49

July 16, 2025

  • Added links to each provider's API key page on the settings page
  • UI/UX improvements to the experiment page to allow more customization

0.0.48

July 15, 2025

  • Added support for xAI as a first-party model provider
  • Added first-party support for Grok 4 model
  • Changed the default model to Claude Sonnet 4
  • Fixed thoughts/reasoning token counting logic for the OpenRouter provider
  • Various UI/UX improvements

0.0.47

June 20, 2025

  • Added pricing metrics for models on the evals page
  • Added cost metrics for individual evaluations on the evals page

Download 16x Eval

Join AI builders and power users in running your own evaluations