Splitwise transfers the KV cache layer by layer, overlapping the transfer of layer L's KV with the computation of layer L+1, using MSCCL++ over InfiniBand [9]; Mooncake takes the same idea further with GPUDirect RDMA up to ...
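To make the benefit of layer-wise overlap concrete, the following is a minimal timing model, not the Splitwise or Mooncake implementation. It assumes a single compute stream and a single transfer link, and the function names and per-layer timings are hypothetical values chosen for illustration only.

```python
from typing import List


def serial_time(compute: List[float], transfer: List[float]) -> float:
    """Baseline: compute every layer, then send all KV; nothing overlaps."""
    return sum(compute) + sum(transfer)


def layerwise_overlap_time(compute: List[float], transfer: List[float]) -> float:
    """Layer-wise scheme: the KV of layer i is sent as soon as layer i has
    been computed and the link is free, while layer i+1 is being computed."""
    compute_done = 0.0  # time at which the current layer's compute finishes
    link_free = 0.0     # time at which the link finishes its previous transfer
    for c, t in zip(compute, transfer):
        compute_done += c
        start = max(compute_done, link_free)  # wait for both the data and the link
        link_free = start + t
    return link_free    # the last transfer finishing marks the end of the prefill handoff


# Hypothetical per-layer costs (arbitrary time units) for a 4-layer model.
compute = [2.0, 2.0, 2.0, 2.0]
transfer = [1.0, 1.0, 1.0, 1.0]

print(serial_time(compute, transfer))             # 12.0
print(layerwise_overlap_time(compute, transfer))  # 9.0
```

In this model, when each layer's transfer is shorter than the next layer's compute, only the final layer's transfer remains exposed; when transfers are longer than compute, the link becomes the bottleneck and the overlap hides the compute time instead.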