Lepton AI

13 posts

@LeptonAI

The World’s GPU Compute at Your Fingertips. Acquired by @NVIDIA.

San Francisco · Joined January 2024
4 Following · 323 Followers
Lepton AI reposted
Garry Tan (@garrytan)
Xfinity: Make something nobody wants
Lepton AI reposted
Devin AI (@toolandtea)
7/ NVIDIA and Hugging Face offer DGX Cloud Lepton for instant global GPU access. Train, fine-tune, and deploy models at scale with ease. Fast, flexible, and collaborative.
Lepton AI reposted
Rohan Paul (@rohanpaul_ai)
🚨 NVIDIA launches DGX Cloud Lepton to commoditize inference compute across clouds, threatening neocloud margins.
DGX Cloud Lepton is a new layer abstracting inference compute across multiple neoclouds. It gives users a consistent interface while automatically routing workloads across providers.
→ The goal is to make inference compute a commodity, similar to what Uber did for taxi services. This strips differentiation from neoclouds and creates pricing pressure, reducing their margins.
→ Lepton’s real innovation is turning multi-cloud inference into a seamless, interoperable platform. It raises performance per dollar for users, while keeping NVIDIA’s margins untouched.
@NVIDIAAIDev
Lepton AI reposted
Yangqing Jia (@jiayq)
We've achieved >99.5% uptime for large-scale GPU clusters through a great collaboration between @LeptonAI and @digitalocean. This is much better than industry-standard SLAs, which hover around 98%. It's done via proactive monitoring solutions like our open-source GPUD, the cloud-native platform, and close collaboration between the engineering teams. Learn more at blog.lepton.ai/achieving-99-5…, and send a message to info@lepton.ai if you need high-performance, cloud-native, production-grade AI infra!
Lepton AI reposted
Freddy A Boulton (@freddy_alfonso_)
Talk to Llama 3.2-3B 🦙🗣️⚡️ Powered by @LeptonAI (blazing-fast LLM inference, ASR, and TTS all in one!) and @Gradio's ergonomic WebRTC streaming ⚡️ Building this took me about 30 minutes despite never having used Lepton before.
Lepton AI reposted
DigitalOcean (@digitalocean)
Achieving more than 99.9% uptime and quick turnaround times for collaboration between teams after partnering with #DigitalOcean, @LeptonAI’s CEO, Yangqing Jia, is realizing his goal of growing 10x over the next year. 🚀 Watch to learn how ⤵ youtube.com/watch?v=NLtQHg…
Lepton AI reposted
Exabits (@exa_bits)
We are so proud to announce our extended partnership with FastGPU @fast_gpu via AI OG innovators, the mighty LeptonAI @LeptonAI. Now you can deploy on-demand RTX 4090s with enterprise AI infrastructure IN SECONDS with Exabits on FastGPU. Just pay for what you use, as you go. ~a thread~
Lepton AI reposted
Martian (@withmartian)
.@LeptonAI surpasses all other providers in throughput (P50 & P90) for both Llama-2-70B and Mixtral under a small service load with short-input, long-output prompts. A P50 of 130 tokens/s is the fastest throughput we've observed among all model offerings by all providers. View this scenario live: leaderboard.withmartian.com/?output_tokens…