This researcher has a new way to measure AI performance. It's BS, literally.
BullshitBench, created by Peter Gostev, evaluates AI models' ability to detect nonsense. One AI company did way better than everyone else.
Source: Business Insider
BullshitBench, created by Peter Gostev, evaluates AI models' ability to detect nonsense. One AI company did way better than everyone else.