CloudRift

83 posts

CloudRift banner
CloudRift

CloudRift

@CloudRiftAI

The Operating System for Sovereign AI Deployments

Mountain View, CA Katılım Mart 2024
39 Takip Edilen76 Takipçiler
CloudRift
CloudRift@CloudRiftAI·
Most GPU VMs come configured for general workloads. Our team benchmarked what host-level tuning actually changes: memory bandwidth up to 7x on #H200, #NCCL up to +144% on PRO 6000. On the wrong config, #NUMA exposure cuts NCCL by 57%. cloudrift.ai/blog/benchmark…
English
0
0
0
12
CloudRift retweetledi
ElevenLabs
ElevenLabs@ElevenLabs·
Introducing Dubbing v2, our revolutionary new dubbing model. For the first time, the emotion and performance of the original content is carried over into every language.
English
71
188
1.8K
492.6K
CloudRift
CloudRift@CloudRiftAI·
61% of Western European CIOs now prioritize local cloud providers over US hyperscalers. With the EU AI Act fully applicable on August 2, regional GPU capacity is shifting from a preference to a procurement requirement. euronews.com/next/2026/03/0… #SovereignAI #EUAIAct
English
1
0
1
17
CloudRift retweetledi
dstack
dstack@dstackai·
Training models or serving inference on AMD GPUs? We’ve refreshed the AMD accelerator example in the dstack docs, covering on-prem fleets, cloud GPU provisioning, dev environments, training jobs, and production-grade inference. dstack.ai/docs/examples/…
English
2
6
6
1.4K
CloudRift
CloudRift@CloudRiftAI·
If you've ever wished you could read PyTorch's compiler end to end, here's the closest thing: Dmitry built a working ML compiler in about 8,000 lines of Python that's faster than PyTorch eager on average and up to 4.7x faster on small kernels like reductions and k/v projections. cloudrift.ai/blog/building-… @PyTorch #MLcompilers #PyTorch
English
0
0
0
30
CloudRift
CloudRift@CloudRiftAI·
Modern ML compilers all share the same shape: Torch IR → Tensor IR → Loop IR → Tile IR → Kernel IR → CUDA Each lowering moves closer to the hardware: decomposition → fusion → tiling → scheduling → codegen. @ditrifonov rebuilt the whole pipeline in 5K lines of Python to show why. cloudrift.ai/blog/building-… @modular @PyTorch
English
0
0
0
65
CloudRift
CloudRift@CloudRiftAI·
@CatoDigitalInc redeploys GPU servers retired from Meta and NVIDIA fleets, rather than commissioning new ones. Their capacity is now on CloudRift as V100 32GB VMs at $0.29 per GPU/hour. Good for fine-tuning, batch inference, rendering, and HPC. → cloudrift.ai/gpu-rentals
English
0
0
0
47
CloudRift
CloudRift@CloudRiftAI·
V100 32GB VMs are now on CloudRift at $0.29 per GPU/hour, supplied by @CatoDigitalInc. Fits a LoRA fine-tune of Llama 3 8B, Whisper Large inference, or a batch embeddings job on a single GPU. → cloudrift.ai/gpu-rentals @nvidia
English
1
1
5
151.9K
CloudRift
CloudRift@CloudRiftAI·
$0.29 per GPU/hour for a V100 32GB VM on CloudRift. The same hardware on AWS and Azure runs above $3 per GPU/hour, and the 32GB variant is usually only sold in 8-GPU bundles. We offer it as a single-GPU VM. extremely useful if your job runs fine on Volta and does not need Hopper. Supplied by @CatoDigitalInc. → cloudrift.ai/gpu-rentals @huggingface @nvidia
English
0
1
3
80