RDMA over Thunderbolt 5 on Apple Silicon – 14µs Latency
Postedabout 1 month ago
Original: RDMA over Thunderbolt 5 on Apple Silicon – 14µs latency
twitter.comTech Discussionstory
informativepositive
Debate
20/100
RdmaThunderbolt 5Apple
Key topics
Rdma
Thunderbolt 5
Apple
Discussion Activity
Light discussionFirst comment
N/A
Peak period
1
Start
Avg / period
1
Key moments
- 01Story posted
Nov 25, 2025 at 12:24 PM EST
about 1 month ago
Step 01 - 02First comment
Nov 25, 2025 at 12:24 PM EST
0s after posting
Step 02 - 03Peak activity
1 comments in Start
Hottest window of the conversation
Step 03 - 04Latest activity
Nov 25, 2025 at 12:24 PM EST
about 1 month ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Discussion (1 comments)
Showing 1 comments
anemllAuthor
about 1 month ago
In macOS 26.2 (Tahoe) beta, Apple introduced a low-latency Thunderbolt 5 RDMA driver, enabling up to 80 Gb/s bidirectional bandwidth for Mac clustering—ideal for distributed ML on Apple Silicon. It's optimized for low latency, delivering ~14 Gbps throughput at 4K MTU.
My tests (M4 Pro to M3 Ultra): Stock ibv_uc_pingpong achieved ~14 µs round-trip for 4K packets (requires GID index setup). Custom C++ variant hit 6-13 µs/iter: https://x.com/anemll/status/1993192776897642942
Code and details:
https://github.com/Anemll/mlx-rdma/blob/anemll-rdma/ibv_roun...
https://github.com/Anemll/mlx-rdma/blob/anemll-rdma/ibv_roun... (includes steps to enable RDMA in macOS Recovery OS terminal)
Theoretically, this accelerates pipeline parallelism (faster layer handoffs) and tensor parallelism (low-overhead sharding) on GPUs, with potential extensions to ANE for real-time AI workflows.
View full discussion on Hacker News
ID: 46048147Type: storyLast synced: 11/25/2025, 5:26:08 PM
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.