PrismML

43 posts


@PrismML

Centering AI research on efficiency. https://t.co/88MQHGCeFD

United States · Joined March 2025
15 Following · 6.7K Followers
Pinned Tweet
PrismML
PrismML@PrismML·
Today, we are emerging from stealth and launching PrismML, an AI lab with Caltech origins that is centered on building the most concentrated form of intelligence. At PrismML, we believe that the next major leaps in AI will be driven by order-of-magnitude improvements in intelligence density, not just sheer parameter count. Our first proof point is the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GB of memory and delivers over 10x the intelligence density of its full-precision counterparts. It is 14x smaller, 8x faster, and 5x more energy efficient on edge hardware while remaining competitive with other models in its parameter class. We are open-sourcing the model under the Apache 2.0 license, along with Bonsai 4B and 1.7B models. When advanced models become small, fast, and efficient enough to run locally, the design space for AI changes immediately. We believe in a future of on-device agents, real-time robotics, offline intelligence, and entirely new products that were previously impossible. We are excited to share our vision with you and to keep pushing the frontier of intelligence to the edge.
155 replies · 500 reposts · 3.7K likes · 1.1M views
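As a rough sanity check on the numbers in the announcement (a sketch of the arithmetic, not PrismML's own accounting): at 1 bit per weight, 8B parameters occupy just under 1 GB for the weights alone, which is consistent with the quoted 1.15 GB once components presumably kept at higher precision (e.g. embeddings and metadata) are added.

```python
# Back-of-envelope memory footprint of an 8B-parameter model
# at different weight precisions (weights only, no KV cache).
params = 8e9

def footprint_gb(bits_per_weight):
    """Bytes needed for the weights alone, in GiB (2**30 bytes)."""
    return params * bits_per_weight / 8 / 2**30

print(f"fp16 : {footprint_gb(16):6.2f} GB")   # ~14.9 GB full-precision baseline
print(f"int8 : {footprint_gb(8):6.2f} GB")
print(f"1-bit: {footprint_gb(1):6.2f} GB")    # ~0.93 GB, near the quoted 1.15 GB
```

The fp16-to-1-bit ratio (~16x on weights alone) also lines up with the claimed "14x smaller" once the higher-precision components are included.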
PrismML reposted
AnythingLLM
AnythingLLM@AnythingLLM·
Today, we demoed @PrismML's Bonsai 8B model in @AnythingLLM desktop to see if the 1-bit model architecture is the revolution local AI needs. Turns out, this is not an April Fools' joke: a 1.1 GB memory footprint for a whole 8B model with high accuracy.
1 reply · 2 reposts · 9 likes · 270 views
PrismML reposted
うみゆき@AI研究
うみゆき@AI研究@umiyuki_ai·
Whoa!! This looks seriously impressive! PrismML has released an open model called 1-bit Bonsai! Apache license. Three sizes, 1.7B, 4B, and 8B, published in mlx and gguf formats! What's wild is that it's 1 bit per parameter, i.e. a natively 1-bit quantized model! 8B parameters, yet the model is only 1.16 GB, so it really is 1-bit! 1-bit means nothing but zeros and ones, right? Can that actually work? And yet its benchmark scores land above Llama3.1-8B and below Qwen3-8B. Then again, quantizing Qwen3-8B down to 1 bit is impossible anyway. It's blazing fast too: 368 tps on an RTX 4090! And it was supported in Llama.cpp from day one!
2 replies · 30 reposts · 196 likes · 20.8K views
PrismML reposted
Tushar Bansal
Tushar Bansal@tushar_bans·
The journey to the next phase of AI will not only be about maximizing intelligence, but dramatically concentrating it from its current state. In 1976, Seymour Cray introduced the world to the Cray-1. It was a 5.5-ton, C-shaped monolith that redefined the limits of computation. Problems that once required the resources of entire institutions suddenly became tractable. Scientific discovery accelerated. Weather modeling improved. Physics, defense, and engineering entered a new era because unprecedented compute had been made real. Today, a processor thousands of times more powerful sits in your pocket. AI will follow the same arc. Right now, we are still in the "supercomputer era" of intelligence: extraordinary capability, but concentrated in a few hands and mediated by enormous infrastructure. That is not the endpoint. The true measure of progress will be defined by intelligence density: how much intelligence the world can hold, carry, and wield.
PrismML@PrismML
[Quoted tweet: the PrismML launch announcement pinned above]
0 replies · 6 reposts · 30 likes · 2.9K views
PrismML reposted
nisten🇨🇦e/acc
It actually got half of my standard very hard question right, even when running it with 8-bit KV cache activations.

Command used:
llama.cpp/build/bin/llama-cli -m ~/1bit/Bonsai-8B.gguf -c 12000 -ngl 99 -t 4 --mlock --chat-template chatml -cnv -p "You are a helpful assistant that thinks in first principles" --temp 0.5 -ctk q8_0 -ctv q8_0

Prompt: calculate how long a mass driver rail would need to be to accelerate people comfortably at max 2Gs on Mars, travelling along the slope of and launching from the top of Olympus Mons, and what speed it would need to achieve at the top in order to reach escape velocity from Mars' gravity well, or at least the minimum Martian orbital speed. Do thorough calculations with actual numbers and facts. Use emojis and point form to communicate it all /think
1 reply · 3 reposts · 32 likes · 2.3K views
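For reference, a back-of-envelope version of the calculation the prompt asks for (my own constants and simplifications, not nisten's grading key or the model's output): assume constant 2 g along the rail and ignore drag, Mars's rotation, and gravity loss during the run, then apply v² = 2aL.

```python
import math

# Physical constants and rough Mars figures (assumed values for illustration)
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M = 6.417e23       # mass of Mars, kg
R = 3.3895e6       # mean radius of Mars, m
h = 21.9e3         # approx. summit height of Olympus Mons above datum, m
a = 2 * 9.81       # "2 g" comfortable acceleration (Earth g), m/s^2

r = R + h                              # launch radius from Mars's center
v_escape = math.sqrt(2 * G * M / r)    # escape speed at the summit, m/s
v_orbit  = math.sqrt(G * M / r)        # circular orbital speed at that radius, m/s
L_escape = v_escape**2 / (2 * a)       # v^2 = 2 a L  ->  rail length, m
L_orbit  = v_orbit**2 / (2 * a)

print(f"escape speed  ≈ {v_escape / 1000:.2f} km/s")
print(f"orbital speed ≈ {v_orbit / 1000:.2f} km/s")
print(f"rail length (escape) ≈ {L_escape / 1000:.0f} km")
print(f"rail length (orbit)  ≈ {L_orbit / 1000:.0f} km")
```

Under these assumptions the rail comes out to roughly 640 km for escape velocity (~5.0 km/s), which gives a sense of the scale the model needed to land on to get the question "half right."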
PrismML reposted
Kanu Gulati @Khosla Ventures
AI isn’t just a scale race anymore. It’s an efficiency war. The winners will be the ones squeezing the most intelligence out of every watt and every dollar, i.e. intelligence density. That’s @PrismML’s bet. Their first proof point is the 1-bit Bonsai 8B, a 1-bit weight model that fits into 1.15 GB of memory and delivers over 10x the intelligence density of its full-precision counterpart. This rewrites the economics of AI. @khoslaventures @vkhosla @SStrohband and I are thrilled to back this team: @BabakHassibi, @SahinLale, @HessianFree, @rsadri_ml
PrismML@PrismML
[Quoted tweet: the PrismML launch announcement pinned above]
5 replies · 5 reposts · 48 likes · 7.9K views
PrismML reposted
Flem
Flem@GrahamFleming·
@PrismML Pied Piper compression algorithm IRL
2 replies · 5 reposts · 54 likes · 5.6K views