Last activity 7h agoPosted Nov 25, 2025 at 9:35 AM EST
Sparse Matrix-Vector Multiplication That Works at 30–90% Sparsity
Mood
informative
Sentiment
positive
Category
startup_launch
Key topics
Sparse Matrix-Vector Multiplication
GPU Optimization
Llms
Sparsity
To get benefits from sparsity, you usually need to have very sparse matrices, impose some structure on the sparsity pattern or have specialized hardware. None of it is the case if you want to rune pruned LLMs on consumer devices.
I wanted to see how far can you push it on a GPU and ended up with this.
Blog: https://www.grizzlytech.dev/blog/macko-spmv
Paper: https://arxiv.org/abs/2511.13061
Code (example with torch): https://github.com/vlejd/macko_spmv
Discussion Activity
Light discussionFirst comment
23m
Peak period
4
Day 1
Avg / period
2.5
Key moments
- 01Story posted
Nov 25, 2025 at 9:35 AM EST
1d ago
Step 01 - 02First comment
Nov 25, 2025 at 9:58 AM EST
23m after posting
Step 02 - 03Peak activity
4 comments in Day 1
Hottest window of the conversation
Step 03 - 04Latest activity
Nov 26, 2025 at 1:30 PM EST
7h ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Discussion (0 comments)
Discussion hasn't started yet.
ID: 46046106Type: storyLast synced: 11/25/2025, 2:36:09 PM
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.