Joyce

235 posts

@joyceerhl

product @cerebras prev eng @code ✨ opinions mine

San Francisco · Joined June 2019
274 Following · 522 Followers
Joyce reposted
Michael Magán @mrmagan_
generative user interfaces at the speed of thought. you can now build "tab autocomplete" for every app. ultra-fast inference from @cerebras & your components rendered by the @tambo_ai agent.
11 replies · 16 reposts · 219 likes · 24K views
Joyce reposted
Tibo @thsottiaux
We’ve made GPT-5.3-Codex-Spark about 30% faster. It is now serving at over 1200 tokens per second. More to come on speed across the board.
212 replies · 117 reposts · 2.6K likes · 348.7K views
Joyce reposted
DHH @dhh
@kevinwestmx @Zai_org GLM-4.7 on @cerebras is insane. I was impressed with the model's performance on my sample test, but I was BLOWN AWAY by the speed. A real window into the future of AI.
13 replies · 16 reposts · 223 likes · 63.5K views
Joyce reposted
OpenAI @OpenAI
GPT-5.3-Codex-Spark is now in research preview. You can just build things—faster.
597 replies · 642 reposts · 5.8K likes · 1.5M views
Joyce @joyceerhl
@burkeholland my kingdom for an alt modifier to engage shell mode
0 replies · 0 reposts · 0 likes · 19 views
Burke Holland @burkeholland
Now that we're all coding in terminals this is my life
2 replies · 0 reposts · 14 likes · 2.3K views
Joyce reposted
Andrew Feldman @andrewdfeldman
@OpenAI and @Cerebras have signed a multi-year agreement to deploy 750 megawatts of Cerebras wafer-scale systems to serve OpenAI customers. This has been a decade in the making. Deployment begins in early 2026, and when fully rolled out, it will be the largest high-speed AI inference deployment in the world.

OpenAI and Cerebras were both founded in 2015 with radically ambitious goals. OpenAI set out to build the software that would push AI toward general intelligence. Cerebras set out to rethink computing hardware from first principles. Our teams met as far back as 2017. We shared ideas, early work, and a common belief: there would come a point when model scale and hardware architecture would have to converge. That point has arrived.

ChatGPT set the direction for the entire industry. It showed the world what AI could be. Now we're in the next phase - not proving capability, but delivering it at global scale.

The history of technology is clear on one thing: speed drives adoption. The PC industry didn't operate at kilohertz. The internet didn't change the world on dial-up. AI is no different. As models grow more capable, speed becomes the bottleneck. Slow systems limit what users can do, how often they engage, and whether AI becomes infrastructure or remains a novelty.

Cerebras was built for this moment. By keeping computation and memory on a single wafer-scale processor, we eliminate the data-movement penalties that dominate GPU systems. The result is up to 15× faster inference, without sacrificing model size or accuracy. That speed changes product design, user behavior, and ultimately productivity. For consumers, it means AI that feels instantaneous. For the economy, it means agents that can finally drive serious productivity growth.

For Cerebras, 2026 will be a defining year. With this collaboration with OpenAI, Cerebras' wafer-scale technology will reach hundreds of millions - and eventually billions - of users. We're proud to work alongside OpenAI to bring fast, frontier AI to people around the world. This is what a decade of long-term thinking looks like.
56 replies · 70 reposts · 498 likes · 157.3K views
Joyce reposted
Nathan Lambert @natolambert
The combo of improvements in reasoning efficiency (fewer tokens per answer, still a very new research area) and faster chips is going to make coding agents so, so much faster in 6-12 months. The products in 2+ years will feel approximately instantaneous relative to today.
9 replies · 5 reposts · 165 likes · 17.5K views
Joyce reposted
Vercel Developers @vercel_dev
You can now use GLM-4.7 through Cerebras on AI Gateway.
Cerebras @cerebras (quoted tweet)
GLM-4.7 from @Zai_org is live on Cerebras!
- Frontier intelligence for coding, tool-driven agents, and multi-turn reasoning
- Record coding speed: ~1,000 tokens per second (up to 1,700 TPS for other uses)
- Strong price-performance: ~10x higher than Sonnet 4.5

3 replies · 7 reposts · 65 likes · 9.4K views
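A minimal sketch of such a Gateway call, assuming AI Gateway exposes an OpenAI-compatible chat completions endpoint; the endpoint URL and the `zai/glm-4.7` model slug are assumptions to verify against the Gateway model catalog, as is the mechanism for pinning routing to Cerebras.

```python
# Hedged sketch: calling GLM-4.7 through Vercel AI Gateway, assuming an
# OpenAI-compatible chat completions endpoint. The URL and the "zai/glm-4.7"
# model slug are assumptions; check the AI Gateway docs for the exact
# identifier and for how to route requests to Cerebras specifically.
import os

import requests

resp = requests.post(
    "https://ai-gateway.vercel.sh/v1/chat/completions",  # assumed endpoint
    headers={"Authorization": f"Bearer {os.environ['AI_GATEWAY_API_KEY']}"},
    json={
        "model": "zai/glm-4.7",  # placeholder slug
        "messages": [{"role": "user", "content": "Summarize what changed in this diff."}],
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```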
SPS @spsbuilds
@cerebras @Zai_org Does the API support tool calling with structured output?
1 reply · 0 reposts · 0 likes · 891 views
Joyce reposted
Cerebras @cerebras
GLM-4.7 from @Zai_org is live on Cerebras!
- Frontier intelligence for coding, tool-driven agents, and multi-turn reasoning
- Record coding speed: ~1,000 tokens per second (up to 1,700 TPS for other uses)
- Strong price-performance: ~10x higher than Sonnet 4.5
94 replies · 123 reposts · 1.4K likes · 134.4K views
Joyce @joyceerhl
@AI_GPT42 @cerebras @Zai_org 👋 By default, GLM 4.7 on Cerebras will use reasoning. You can opt to disable reasoning by setting `disable_reasoning: true` in your request.
0 replies · 0 reposts · 0 likes · 39 views
Joyce @joyceerhl
@AonSayyed @cerebras @Zai_org 👋 we support a 131K context window for GLM 4.7, and these are the full model weights (non-REAPed).
0 replies · 0 reposts · 1 like · 54 views
Joyce @joyceerhl
@gustojs @cerebras @Zai_org 👋 Caching and interleaved thinking are both supported. For optimal caching perf, we recommend setting `clear_thinking=false` in your requests. Learn more: inference-docs.cerebras.ai/api-reference/…
0 replies · 0 reposts · 1 like · 43 views
Wali Mohammad Kadri @_wmk0_
Hello @pierceboggan 👋🏻 When are we getting some better "0x" models in Copilot chat? There are better, cheaper & open-source alternatives; why not provide them?
2 replies · 0 reposts · 2 likes · 895 views