
@Mascobot @a16z Nice, @mascobot - at OXMIQ, we did a similar project. We took it one step further - we overcame the 2 PCIe GPU limitation enforced by NVIDIA NCCL so that you can use all 384 GB to load a single model - oxmiq.ai/blog/4blackwell
English




