Post-transformer inference: 224× compression of Llama-70B with improved accuracy | Not Hacker News!