Fast Matrix Multiply on an Apple GPU | Not Hacker News!