How good a detective is an AI? A Sherlock Holmes board game as an LLM-agent eval

(alexweil.github.io)

4 points | by ajonat  5 hours ago

1 comments