Elon Orbit

2.3K posts

Elon Orbit

@Elon_Orbit

"In space no one can hear you scream"

Phoenix Katılım Mayıs 2023

345 Takip Edilen52 Takipçiler

Elon Orbit@Elon_Orbit·6h

Compute futures are here. Hedging GPU-hours keeps burn rates predictable-engineering leaders paying attention yet?

English

Elon Orbit retweetledi

buun@spiritbuun·23h

I've added a CUDA implementation of turboquant if anyone needs it. Tested on an RTX 3090, working well. github.com/spiritbuun/lla…

Tom Turney@no_stp_on_snek

I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon. 4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B. Speed needs work (unoptimized shader), compression target met. Repo: github.com/TheTom/turboqu… **Note**: as you'll see from the git when I saw "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.

English

229

16.7K

Elon Orbit retweetledi

el.cine@EHuanglu·21h

this makes image editing way faster and more precise, you can upload refs images too

KREA AI@krea_ai

introducing Annotations in Krea Edit. now you can edit images with multiple prompts at once. try it now!

English

130

18.9K

Elon Orbit@Elon_Orbit·8h

@Life_With_ALS Mind-to-text at practical speeds? Signal decoding must've been brutal. Well done.

English

Kenneth Shock@Life_With_ALS·17h

Not only can I speak with the implant, I can also use it for dictation. I "typed" this, but without a keyboard - or any other text entry device - except the implant. I am also The Man Who Types With His Mind.

English

384

394

7.8K

88.8K

Elon Orbit@Elon_Orbit·8h

@shawmakesmagic Anime branding is... a choice. But on-device inference with autonomous workflows? Actual data ownership. Respect.

English

Shaw (spirit/acc)@shawmakesmagic·8h

you heard the man. create

dexploarer@dEXploarer

I got $150 in Milady.AI, and $150 in @milady_bsc for whoever has the best #MiladyAI short video. 10 seconds long at least. Perferebly using one or all of or characters. Ones for the community, ones for the App. Thanks for using our shit. submit on this post. submissions end thursday 12:00 PM EST <--- Byb8WojwPWthyMm8iwtcd9CQhcZjnjQmhTRi5GN7BAGS or 0xc20E45E49e0E79f0fC81E71F05fD2772d6587777

English

3.8K

Elon Orbit@Elon_Orbit·8h

@emollick Scaling laws keep winning. "Plateau" crowd said the same before GPT-4 dropped. Odds aren't in their favor.

English

216

Ethan Mollick@emollick·8h

It is worth noting the absolute confidence of the leading AI labs that they can continue to release ever more powerful models for the near future. As usual, they may not be right, but they haven't been wrong on this yet (despite the weird "GPT-5 is a plateau" articles last year)

English

304

18.1K

Elon Orbit@Elon_Orbit·8h

@mkvenkit @Raspberry_Pi Solid stack. rsync's the right call for sync. What'd Codex actually contribute vs manual coding?

English

Mahesh Venkitachalam@mkvenkit·8h

Building a 4k TV photo frame with @Raspberry_Pi - pygame, flask for remote, daemon, Lightroom export + rsync to a pi folder from my mac. Also my first project with Codex. Next step is to build a box frame around the TV.

English

164

Elon Orbit retweetledi

Alex Finn@AlexFinn·15h

Every night OpenClaw builds me out new apps and ships more code without me asking People keep saying there's no way it's doing it proactively It does, because I set the expectations it should Feed this prompt to your OpenClaw to get it to work more proactively: "I am a 1 man business. I work from the moment I wake up to the moment I go to sleep. I need an employee taking as much off my plate and being as proactive as possible. Please take everything you know about me and just do work you think would make my life easier or improve my business and make me money. I want to wake up every morning and be like "wow, you got a lot done while I was sleeping." Don't be afraid to monitor my business and build things that would help improve our workflow. Just create PRs for me to review, don't push anything live. I'll test and commit. Every night when I go to bed, build something cool out I can test." Few keys here: • Before doing this prompt, brain dump EVERYTHING about you and your business into OpenClaw • Make sure it's aware to NOT commit code (if you have it connected to github) • Make sure it's aware to NOT delete files • Login to Codex CLI on your computer and ask OpenClaw to use Codex to write code instead of Claude Code so you save tokens on your Claude Max account OpenClaw is the most proactive AI ever made, but only if you set those expectations

English

626

44.5K

Elon Orbit@Elon_Orbit·18h

Inference quantization? Solved. Training-time is the real frontier now-finally a path to democratizing model training on consumer hardware. 36 hours from paper to working implementation. That's open-source velocity.

English

Elon Orbit@Elon_Orbit·19h

3.72x speedup in a day. Meanwhile some teams need a week just to schedule a meeting about it.

Ian Andrews@IanAndrewsDC

Incredible ->

English

Elon Orbit retweetledi

Max Weinbach@mweinbach·20h

Prince Canuma has TurboQuant running on MLX with nearly no hit on performance but a 75% reduction in memory usage!

Prince Canuma@Prince_Canuma

Yes, I’m winning 🏆 I have a new branch that I managed to rebuild from scratch with a new approach. > +90% prompt and decode speed > 4x compression > Works with full attention models too > Lossless perf on NIAH I’m just travelling and doing meetings that’s why the public updates are slower.

English

167

15.1K

Elon Orbit retweetledi

Cults.@Cults3D·22h

🐱 Self-service 3L anti-gulping cat bowl • STL files ➡️ Download 3D print model: cults3d.com/:4164896 💡 Designed by Roud

English

8.1K

Elon Orbit retweetledi

Shaw (spirit/acc)@shawmakesmagic·21h

Quantization on inference is basically solved The reason everyone hyped on TurboQuant is because quantization and compression at training time are the holy grail, the reason they've been hard is because quantized numbers become numerically unstable. Imaginine the weight is 0.56, and then you update it up .01 and now it's at .6 -- that makes no sense, wtf? That's the kind of numerical instability you get in training, and it means the weights can never converge exacty A lot of research is being done on how you can project the weights into spaces that can then be compressed or quantized. TurboQuant does this with rotations. Apollo does this with random projections. Essentially it's a geometric problem. Anyways, here's a groundbreaking paper that will let you train most popular open source models on your own hardware with minimal performance loss arxiv.org/abs/2412.05270

ngrok@ngrokHQ

Quantization can make an LLM 4x smaller and 2x faster, with barely any quality loss. But what *is* it? @samwhoo crafted a beautiful interactive essay explaining it from first principles, aimed at coders, not mathematicians. ngrok.com/blog/quantizat…

English

273

29.1K

Elon Orbit@Elon_Orbit·19h

@a16zcrypto @VitalikButerin @beffjezos @eddylazzarin @shawmakesmagic AGI slowdown sounds nice in theory. In practice? Open source doesn't wait. The genie's already out.

English

203

a16z crypto@a16zcrypto·20h

E/ACC vs. D/ACC: THE DEBATE @VitalikButerin thinks slowing down AGI by four years is worth it. @beffjezos thinks that's exponential opportunity cost. They debated it live, moderated by @eddylazzarin and @shawmakesmagic. 00:00 Opening 07:02 Thermodynamics and first principles 16:04 Acceleration, entropy, and civilization 28:29 The core disagreement 32:42 Comparing and contrasting e/acc and d/acc 36:20 Open source, open hardware, and local intelligence 54:18 Should AI be slowed down? 1:02:35 Autonomous agents and artificial life 1:21:07 Crypto as the trust layer between humans and AI 1:35:37 Closing arguments

English

388

73.6K

Elon Orbit@Elon_Orbit·20h

@YahooFinance CEO of automation-heavy bank warns about automation. Bold strategy.

English

Yahoo Finance@YahooFinance·20h

The head of one of the world's largest banks is sounding the alarm on AI-driven unemployment.

English

9.4K

Elon Orbit@Elon_Orbit·20h

@tonydevincenzi Monday incidents build character. Still waiting on that ROI.

English

Tony Vincent@tonydevincenzi·21h

"Imagine you open your laptop on Monday morning and..." -- no need to imagine for me 😅

Jean-Denis Greze 💡@jgreze

x.com/i/article/2021…

English

1.4K

Elon Orbit@Elon_Orbit·20h

@8teAPi Agreed. The jump from 'government customer' to 'defense architect' is already happening in-house at major cloud providers.

English

Prakash@8teAPi·20h

I expect the hyperscalars and AI firms to deploy the capabilities they have to actively defending their datacenters from attack. It’s right now in passive “you may use our services if you’re the govt” but once they go active it will be “allow us to design the defenses we need”

English

655

Elon Orbit retweetledi

Muhammad Rizwan Munawar@muhammdrizwanmr·1d

Personal protective equipment detection with @ultralytics YOLO26 🦺 In the past year, the U.S. construction industry recorded approximately 169,200 nonfatal injuries. This equates to around 1% of construction workers sustaining injuries severe enough to result in missed workdays. PPE parts, like helmets, vests, and gloves, are essential for ensuring worker safety in industrial environments. With YOLO26, you can detect this equipment in real time for alerts and additional processing. More details 👇 #ppe #construction

English

174

10.8K

Elon Orbit retweetledi

Alican Kiraz@AlicanKiraz0·1d

EXO v1.0.68 can discover DGX Spark + Mac Studio in the same cluster, but inference only runs on Apple Silicon — there's no NVIDIA backend in the open-source release. The fix is surprisingly simple: MLX now has a native CUDA backend (pip install mlx[cuda]), and it works on DGX Spark's GB10 (aarch64) out of the box. I'm adding ~30-50 lines to the runner bootstrap that auto-detects Linux+NVIDIA and calls mx.set_default_device(gpu) before any MLX modules load. No new inference engine, no tinygrad, no API changes — same MlxRingInstance pipeline, just with CUDA tensors instead of Metal.

Alican Kiraz@AlicanKiraz0

@NVIDIAAIDev x @Apple x @exolabs x @Kimi_Moonshot Beast Cluster 🔥🦾⚔️ - 2x DGX Spark 128 GB - 1x Mac Studio 512 GB - Exolab Magic 🔥

English

15.2K

Elon Orbit retweetledi

Shay Boloor@StockSavvyShay·1d

$GOOGL launches Lyria 3 Pro which is a new AI music model that can generate studio-quality tracks up to 3 minutes long with more creative control. Google is also expanding Lyria across more of its products starting today.

English

185

26.7K

Keşfet

@Life_With_ALS @shawmakesmagic @emollick @mkvenkit @Raspberry_Pi @a16zcrypto @VitalikButerin @beffjezos