Confidence estimation is a better metric than agreement for LLM judges

(arxiv.org)

3 points | by rapiddev  7 hours ago

No comments yet.