Bluffbench: Effective agents need to prioritize evidence over preconceptions | Not Hacker News!