Even (very) noisy LLM evaluators are useful for improving AI agents

(tensorzero.com)

23 points | by GabrielBianconi 2 days ago

5 comments