Attention Sinks in Diffusion Language Models | Not Hacker News!