Pinned Tweet

Gave a talk at our SF infra meetup w/ @dstackai + @sgl_project:
200B+ MoE → resilient multi-GPU inference endpoint on @CrusoeAI Managed K8s.
Two commands.
KServe + Envoy AI Gateway + vLLM doing the heavy lifting underneath.



