Product Launch
anonymous
2 points
0 comments
Posted3 months ago
Show HN: Optimizing DeepSeek's NSA for TPUs – A Kernel Worklog
henryhmko.github.ioTPU optimizationDeep LearningJAXSparse Attention
Discussion (0 comments)
No comments available in our database yet.
Comments are synced periodically from Hacker News.