0.0.46
June 17, 2025
- Added evaluation comparison page to compare the results of two evaluations side by side
0.0.45
June 8, 2025
- Added support for Gemini 2.5 Pro Preview (06-05)
- Optimized the UX for adding OpenRouter models for new users
0.0.44
June 1, 2025
- Moved benchmark to a separate dedicated page with option to select models
- Added sorting options for experiments page
- Added temperature as an advanced setting
- Merged rating and notes modal for better UX
- Various UI/UX improvements