Cactus

12 posts

Cactus

@cactuscompute

The fastest way to deploy mobile AI

San Francisco, CA · Joined October 2025
7 Following · 133 Followers
Cactus retweeted
PowerSync @powersync_
The PowerSync AI Hackathon starts today. Bring your favorite AI ideas to life and compete for over $8k in prizes, including bonus prizes from our partners @supabase @neondatabase @mastra @tan_stack @cactuscompute. Let the hacking begin!
1 reply · 1 repost · 7 likes · 546 views
Cactus retweeted
Dominik Sobe ツ @sobedominik
Anyone successfully using a local LLM on their iPhone? I tested a few a year ago on my iP14 Pro, but they all made my battery extremely hot and the UI sucked. Now with the iP17 Pro I'd love to give it another try. What app/model should I use?
5 replies · 0 reposts · 1 like · 2.1K views
Cactus retweeted
Samuel Donkor @SAMADON_
@cactuscompute @nothing @huggingface Excited to share that our team placed 2nd at the Cactus (YC S25) x Nothing x Hugging Face Mobile AI Hackathon. We were up against teams from MIT and Stanford, and builders from around the world. Grateful to have had the chance to build and compete alongside so many talented people.
0 replies · 1 repost · 2 likes · 362 views
Cactus retweeted
Samir @SamLasseur
Spent 24 hours at Nothing's new London HQ as part of the @cactuscompute × @huggingface × @nothing hackathon. My team and I built Pulse, a first-aid assistant that guides bystanders through basic life support while reporting situational context to first responders. Against ~700 international participants, we were awarded the honorary prize!!!
0 replies · 1 repost · 1 like · 557 views
Cactus retweeted
Henry Ndubuaku @Henry_Ndubuaku
1.6B INT8 VLM by @liquidai on Cactus (YC S25) never exceeds 231MB of peak memory usage at any context size.
1. Cactus is aggressively optimised to run on budget devices with minimal resources: it is efficient, puts negligible pressure on your phone, and passes your OS safety mechanisms.
2. Notice how the 1.6B INT8 model reaches 95 toks/sec on an Apple M4 Pro CPU, faster than your eyes can process. Our INT4 will almost 2x the speed when merged; expect up to 180 toks/sec decode speed.
3. Prefill speed reaches 513 toks/sec. Our NPU kernels will 5-11x that once merged; expect up to 2500 - 5500 toks/sec, so the time to first token on a large-context prompt will be under 1 sec.
4. LFM2-1.2B-INT8 in the Cactus compressed format takes only 722MB, which means INT4 will shrink it to about 350MB, almost half as much as GGUF, ONNX, ExecuTorch, LiteRT, etc.
5. Once done, we will start recommending 1B models to our users, because your grandma's phone will run them. Stay tuned! github.com/cactus-compute…
7 replies · 13 reposts · 154 likes · 37.3K views
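A quick sanity check of the size and speed arithmetic in the post above, as a sketch. The scaling assumption here (INT4 weights take half the bytes of INT8, so file size roughly halves and memory-bound CPU decode roughly doubles) is mine, not a published Cactus formula; the constants are the ones quoted in the post.

```typescript
// Back-of-the-envelope check of the quoted numbers.
// Assumption (mine): weight storage dominates file size, so halving
// bits-per-weight roughly halves the artifact, and a memory-bandwidth-bound
// CPU decode path roughly doubles in tokens/sec.

const int8SizeMB = 722;          // LFM2-1.2B-INT8 in Cactus format (from the post)
const int8DecodeToksPerSec = 95; // INT8 decode on Apple M4 Pro (from the post)
const prefillToksPerSec = 513;   // current CPU prefill (from the post)

const int4SizeMB = int8SizeMB * (4 / 8);               // 361 MB, near the quoted ~350 MB
const int4DecodeToksPerSec = int8DecodeToksPerSec * 2; // 190 toks/sec, near the quoted 180

// NPU kernels are quoted as a 5-11x prefill speedup:
const npuPrefill = [prefillToksPerSec * 5, prefillToksPerSec * 11]; // [2565, 5643] ~ "2500 - 5500"

console.log(`INT4 size estimate: ~${int4SizeMB} MB`);
console.log(`INT4 decode estimate: ~${int4DecodeToksPerSec} toks/sec`);
console.log(`NPU prefill estimate: ${npuPrefill[0]}-${npuPrefill[1]} toks/sec`);
```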
Cactus retweeted
Jakub Mroz @jakmroo
We just shipped the Cactus React Native SDK 🌵, the fastest and most efficient on-device AI inference engine for React Native. ⚡️ Lightweight, insanely fast, and built for mobile devices from the ground up. 🚀
1 reply · 2 reposts · 3 likes · 532 views
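For context, a minimal sketch of what running a model through such an SDK might look like in a React Native app. Every identifier below (the package import, CactusLM, init, completion, release, and their options) is an assumption for illustration, not API confirmed by the post; the repo is the source of truth.

```typescript
// Hypothetical usage sketch -- all identifiers are assumptions,
// not confirmed cactus-react-native API.
import { CactusLM } from 'cactus-react-native';

async function runLocalInference(): Promise<string> {
  // Load a small quantized model on-device (hypothetical options).
  const lm = await CactusLM.init({
    model: 'lfm2-1.2b-int8', // hypothetical model identifier
    contextSize: 2048,
  });

  // Run a chat completion entirely on the phone.
  const result = await lm.completion(
    [{ role: 'user', content: 'Summarize this note in one line.' }],
    { maxTokens: 128 },
  );

  await lm.release(); // free native resources (hypothetical)
  return result.text;
}
```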
Cactus retweeted
Sélim @SelimBenayat
Hackathon alert! London, SF, Boston. This Friday! 👀 @nothing is teaming up with @cactuscompute and @huggingface to hack on redefining on-device AI experiences! Come build something memorable, meet the teams, and ship in 24 hours! Signups are wild so far 🔥
9 replies · 19 reposts · 197 likes · 48K views
Cactus @cactuscompute
@_iamEtornam thanks for building with us, Etornam! 🫶🏼🌵
0 replies · 0 reposts · 1 like · 18 views
Cactus @cactuscompute
Cactus React Native v1 is live! Deploy AI on-device with text inference, tool calling, embeddings and more – powered by the fastest edge inference engine 🌵 Our React Native bindings run on @margelo_com's Nitro Modules, yielding the fastest mobile inference we've seen so far.
1 reply · 2 reposts · 3 likes · 344 views
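The v1 feature list above (text inference, tool calling, embeddings) suggests an API surface along the lines of the sketch below. As with the earlier example, every name and signature here is a hypothetical stand-in rather than the documented bindings.

```typescript
// Hypothetical sketch of the v1 feature surface -- identifiers are
// stand-ins, not confirmed cactus-react-native API.
import { CactusLM } from 'cactus-react-native';

async function embedAndCallTools(): Promise<void> {
  const lm = await CactusLM.init({ model: 'lfm2-1.2b-int8' }); // hypothetical

  // Embeddings: turn text into a vector for on-device semantic search.
  const vector: number[] = await lm.embed('cactus plants in the desert');

  // Tool calling: let the model invoke a locally registered function.
  const reply = await lm.completion(
    [{ role: 'user', content: 'What time is it?' }],
    {
      tools: [
        {
          name: 'get_time', // hypothetical tool schema
          description: 'Returns the current time as an ISO string',
          parameters: {},
          handler: async () => new Date().toISOString(),
        },
      ],
    },
  );

  console.log(vector.length, reply.text);
}
```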