Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

39 points | by khurdula 5 hours ago

15 comments