VA

Validate evaluator

free

by Hamelsmu

Calibrate LLM judges against human labels