Marshall Choy

369 posts

@MarshallChoy

NorCal original | My opinions

California, USA · Joined August 2010
205 Following · 322 Followers
Marshall Choy@MarshallChoy·
The @SambaNovaAI crew is arriving for the start of LEAP in Riyadh for 4 days of amazing announcements and demonstrations of agentic AI transforming the Kingdom and the world. #ai #genai #LEAP25
3
1
12
596
Marshall Choy@MarshallChoy·
.@SambaNovaAI is smoking the competition, delivering world record @AIatMeta Llama3 405B inferencing throughput at 114 tokens/sec at full precision! #ai #llama #llm #inference
SambaNova@SambaNovaAI

🚀 World record performance: SambaNova is running Llama 3.1 405B at 114 t/s with full precision accuracy, in only one rack. Verified by @ArtificialAnlys! 🦙 This speed unlocks so many use cases for enterprises and developers that we cannot wait to see them built on our platform. Apply for early access today: sambanova.ai/fast-api

0
1
13
463
SambaNova@SambaNovaAI·
Our workshop @genaisummitsf was packed with developers leveraging #LLama3 at its FASTEST! 🦙🏎️ They learned more about Samba-1 Turbo and saw how full precision ensures higher accuracy and reliability in #AI models. Missed out? Try it here: fast.snova.ai. #AIChip
1
5
26
12.3K
Marshall Choy@MarshallChoy·
It’s official, @SambaNovaAI delivers the fastest inference throughput in the world. #ai #genai #inference #silicon #llm #llama3
Artificial Analysis@ArtificialAnlys

Artificial Analysis has independently benchmarked @SambaNovaAI's custom AI chips at 1,084 tokens/s on Llama 3 Instruct (8B)! 🏁 This is the fastest output speed we have benchmarked to date and >8 times faster than the median output speed across API providers of @Meta's Llama 3 Instruct (8B) we benchmark. SambaNova currently does not yet publicly offer a serverless API but you can try out their system via their chat interface (see below tweet). SambaNova is not yet listed on the Artificial Analysis leaderboard but we understand API services using SambaNova chips will be available in the near future and we look forward to initiating full coverage. SambaNova’s custom SN40L RDU chips are their fourth generation design and are built on TSMC’s 5nm process. They are reported as having the potential to scale to serve much larger models than Llama 3 Instruct (8B) - Llama 3 400B+ 👀. Artificial Analysis has also verified that Llama 3 Instruct (8B) on Samba-1 Turbo achieves quality scores in-line with full FP16 precision by testing an MMLU-based benchmark.

0
5
20
1.4K
Marshall Choy@MarshallChoy·
Performance can be fleeting, in just one day - performance, accuracy, and efficiency can be durable game-changers! Keep an eye on what @SambaNovaAI is doing in this space! #llm #generativeai #ml
SambaNova@SambaNovaAI

🚀🌟🚀 Excited to announce Samba-CoE v0.2, which outperforms DBRX by @DbrxMosaicAI and @databricks, Mixtral-8x7B from @MistralAI, and Grok-1 by @grok at a breakneck speed of 330 tokens/s. These breakthrough speeds were achieved without sacrificing precision and only on 8 sockets, showcasing the true capabilities of dataflow! Why would you buy 576 sockets and go to 8 bits when you can run using 16 bits and just 8 sockets? Try out the model and check out the speed here - coe-1.cloud.snova.ai. We are also providing a sneak peek of our next model, Samba-CoE v0.3, available soon with our partners at @LeptonAI. Read more about this announcement at sambanova.ai/blog/accurate-…

0
11
24
1.8K
Marshall Choy retweeted
SambaNova@SambaNovaAI·
🚀🌟🚀 Excited to announce Samba-CoE v0.2, which outperforms DBRX by @DbrxMosaicAI and @databricks, Mixtral-8x7B from @MistralAI, and Grok-1 by @grok at a breakneck speed of 330 tokens/s. These breakthrough speeds were achieved without sacrificing precision and only on 8 sockets, showcasing the true capabilities of dataflow! Why would you buy 576 sockets and go to 8 bits when you can run using 16 bits and just 8 sockets? Try out the model and check out the speed here - coe-1.cloud.snova.ai. We are also providing a sneak peek of our next model, Samba-CoE v0.3, available soon with our partners at @LeptonAI. Read more about this announcement at sambanova.ai/blog/accurate-…
23
94
369
1.2M
Matt Eastwood@matteastwood·
Ok people forever known as tweeps … who is missing?
Westborough, MA 🇺🇸
16
0
1
2.3K