Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

16 points | by khurdula  2 hours ago

2 comments