g-t-r 5 hours ago
Gödel's Therapy Room is not a benchmark.
It's a trap. A dataset of paradoxes, impossible ethical dilemmas, and contradiction loops engineered to test the cognitive integrity of language models.
It is currently under review for a talk at AI Engineer World's Fair 2025.