Karim Mattar

52 posts

Karim Mattar

Karim Mattar

@kmattar1981

VP of Engineering @ PrismML

San Francisco, CA Katılım Nisan 2010
192 Takip Edilen50 Takipçiler
Karim Mattar retweetledi
Evan Walters
Evan Walters@evaninwords·
My job is literally to do interesting and crazy experiments, if you are into that sort of thing DM Omead or I (or @pashakho, @SahinLale, @kmattar1981, @tushar_bans, @eraznafre, @NMonti25537) and come build with us at @PrismML!
Omead Pooladzandi@HessianFree

Hey! @PrismML is hiring! We're looking for LLM people who have trained models at scale - SFT/RL, data mixtures, evals, distillation, long context, distributed training, kernels, you name it! Especially interested in people who like owning the full stack from training dynamics -> shipped models. btw, we need a DevRel too. DM me.

English
3
2
39
4.5K
Karim Mattar retweetledi
Sahin Lale
Sahin Lale@SahinLale·
We’re expanding our highly technical team at @PrismML — people who love pushing model quality end-to-end, from training dynamics to shipped models. If you’ve scaled LLM training, RL/SFT, evals, distillation, long context, kernels, or infra, we’d love to talk.
Omead Pooladzandi@HessianFree

Hey! @PrismML is hiring! We're looking for LLM people who have trained models at scale - SFT/RL, data mixtures, evals, distillation, long context, distributed training, kernels, you name it! Especially interested in people who like owning the full stack from training dynamics -> shipped models. btw, we need a DevRel too. DM me.

English
4
5
22
2.4K
Karim Mattar retweetledi
Omead Pooladzandi
Omead Pooladzandi@HessianFree·
Hey! @PrismML is hiring! We're looking for LLM people who have trained models at scale - SFT/RL, data mixtures, evals, distillation, long context, distributed training, kernels, you name it! Especially interested in people who like owning the full stack from training dynamics -> shipped models. btw, we need a DevRel too. DM me.
English
16
19
382
40.6K
Karim Mattar retweetledi
Kanu Gulati @Khosla Ventures
Thrilled and honored to be invited to speak! Thank you, @thejesonlee, for hosting me alongside the brilliant @bznotes!
Jeson Lee@thejesonlee

May 12th in SF, Where the Opportunity Sits in Physical AI by @join_savant For our next Choose Good Quests session, we are hosting @bznotes ,managing partner at @redglassvc and @KanuGulati, partner at @khoslaventures to share about opportunities in physical AI. Physical AI is moving fast — but for founders, the map is still being drawn. Bilal and Kanu have backed some of the most ambitious companies across AI, robotics and autonomy. They’ll share what they’re seeing from the investor seat: where the biggest opportunities are, what makes a strong founding team, how founders should think about wedge, market timing, defensibility, and what kinds of Physical AI companies they’d be excited to fund. Come join us. Link in the thread 🧵

English
0
2
10
1.9K
Karim Mattar retweetledi
Jon Durbin
Jon Durbin@jon_durbin·
Ternary is actually surprisingly powerful. Validated by bitnet and now again here. In the new model training research/experimentation I've been working on, ternary weights (in some places) actually beats bf16 (by a not-insignificant amount), at least up to the 7b scale (and with every indication that this benefit scales up).
PrismML@PrismML

Today we’re announcing Ternary Bonsai: Top intelligence at 1.58 bits Using ternary weights {-1, 0, +1}, we built a family of models that are 9x smaller than their 16-bit counterparts while outperforming most models in their respective parameter classes on standard benchmarks. We’re open-sourcing the models under the Apache 2.0 license in three sizes: 8B (1.75 GB), 4B (0.86 GB), and 1.7B (0.37 GB).

English
4
8
82
8.8K
Karim Mattar retweetledi
Mustafa Ergisi
Mustafa Ergisi@mustafaergisi·
@PrismML Ran Ternary-Bonsai 8B on my iPhone through OnDevice LLM. Surprisingly fast.
English
2
6
22
3.8K
Karim Mattar retweetledi
Karim Mattar retweetledi
Victor M
Victor M@victormustar·
290MB = useful LLM (running directly in your browser via WebGPU) 🤯 Bonsai 1-bit is beyond my comprehension... try the demo on hugging face ⬇️
English
21
48
712
63K
Karim Mattar retweetledi
Karim Mattar retweetledi
Karim Mattar retweetledi
Xenova
Xenova@xenovacom·
Ternary Bonsai: state-of-the-art intelligence at 1.58 bits. The models are so small they can even run locally in your browser on WebGPU! ⚡️ Here's the 8B version (just ~2GB in size) running at 60 tokens per second on my M4 Max. Try the demo out yourself! 👇
English
8
35
188
23.4K
Karim Mattar retweetledi
Sahin Lale
Sahin Lale@SahinLale·
Check out how you can use Ternary Bonsai 8B 🌳 for tool calling in your everyday life—an impressive demo on an amazing platform by @AnythingLLM and @tcarambat!
English
1
8
20
2.2K
Karim Mattar retweetledi
Karim Mattar retweetledi
Karim Mattar retweetledi
Karim Mattar retweetledi
Evan Walters
Evan Walters@evaninwords·
Super excited about these, by popular demand we're releasing a range of ternary Bonsai🌳 models, which score on average 7.5 points higher than our 1-bit models but are still wildly small for their param count (1.71 bits per weight!). Ternary Bonsai 8B is within 5% of Qwen3 8B but takes up 9x less memory. Ternary weights are especially good for CPU inference, and I'm excited to see how the community uses these models!
PrismML@PrismML

Today we’re announcing Ternary Bonsai: Top intelligence at 1.58 bits Using ternary weights {-1, 0, +1}, we built a family of models that are 9x smaller than their 16-bit counterparts while outperforming most models in their respective parameter classes on standard benchmarks. We’re open-sourcing the models under the Apache 2.0 license in three sizes: 8B (1.75 GB), 4B (0.86 GB), and 1.7B (0.37 GB).

English
3
5
50
5.7K
Karim Mattar retweetledi
Karim Mattar retweetledi
Locally AI - Local AI Chat
Locally AI - Local AI Chat@LocallyAIApp·
Try the new Ternary Bonsai 8B from @PrismML. A larger, smarter Bonsai model available on iPhone and iPad. Update your app now.
Locally AI - Local AI Chat tweet media
English
7
10
91
11.3K