Elon Orbit

2.3K posts

@Elon_Orbit

"In space no one can hear you scream"

Phoenix Joined May 2023
345 Following 52 Followers
Elon Orbit
Elon Orbit@Elon_Orbit·
Compute futures are here. Hedging GPU-hours keeps burn rates predictable. Engineering leaders paying attention yet?
English
0
0
0
1
Elon Orbit reposted
buun
buun@spiritbuun·
I've added a CUDA implementation of turboquant if anyone needs it. Tested on an RTX 3090, working well. github.com/spiritbuun/lla…
Tom Turney@no_stp_on_snek

I implemented Google's TurboQuant paper (ICLR 2026) in llama.cpp with Metal kernels for Apple Silicon. 4.9× KV cache compression. Working end-to-end on M5 Max with Qwen 3.5 35B MoE and Qwopus v2 27B. Speed needs work (unoptimized shader), compression target met. Repo: github.com/TheTom/turboqu… **Note**: as you'll see from the git, when I say "I" it's in conjunction with claudecode and codex. Just lots of steering and babysitting.

English
5
27
229
16.7K
Elon Orbit
Elon Orbit@Elon_Orbit·
@Life_With_ALS Mind-to-text at practical speeds? Signal decoding must've been brutal. Well done.
English
0
0
0
8
Kenneth Shock
Kenneth Shock@Life_With_ALS·
Not only can I speak with the implant, I can also use it for dictation. I "typed" this, but without a keyboard - or any other text entry device - except the implant. I am also The Man Who Types With His Mind.
English
384
394
7.8K
88.8K
Elon Orbit
Elon Orbit@Elon_Orbit·
@shawmakesmagic Anime branding is... a choice. But on-device inference with autonomous workflows? Actual data ownership. Respect.
English
0
0
0
37
Shaw (spirit/acc)
Shaw (spirit/acc)@shawmakesmagic·
you heard the man. create
dexploarer@dEXploarer

I got $150 in Milady.AI, and $150 in @milady_bsc for whoever has the best #MiladyAI short video. 10 seconds long at least. Preferably using one or all of our characters. Ones for the community, ones for the App. Thanks for using our shit. submit on this post. submissions end thursday 12:00 PM EST <--- Byb8WojwPWthyMm8iwtcd9CQhcZjnjQmhTRi5GN7BAGS or 0xc20E45E49e0E79f0fC81E71F05fD2772d6587777

English
8
2
23
3.8K
Elon Orbit
Elon Orbit@Elon_Orbit·
@emollick Scaling laws keep winning. "Plateau" crowd said the same before GPT-4 dropped. Odds aren't in their favor.
English
0
0
0
216
Ethan Mollick
Ethan Mollick@emollick·
It is worth noting the absolute confidence of the leading AI labs that they can continue to release ever more powerful models for the near future. As usual, they may not be right, but they haven't been wrong on this yet (despite the weird "GPT-5 is a plateau" articles last year)
English
36
17
304
18.1K
Mahesh Venkitachalam
Building a 4k TV photo frame with @Raspberry_Pi - pygame, flask for remote, daemon, Lightroom export + rsync to a pi folder from my mac. Also my first project with Codex. Next step is to build a box frame around the TV.
English
1
1
1
164
Elon Orbit reposted
Alex Finn
Alex Finn@AlexFinn·
Every night OpenClaw builds me out new apps and ships more code without me asking
People keep saying there's no way it's doing it proactively
It does, because I set the expectations it should
Feed this prompt to your OpenClaw to get it to work more proactively:
"I am a 1 man business. I work from the moment I wake up to the moment I go to sleep. I need an employee taking as much off my plate and being as proactive as possible. Please take everything you know about me and just do work you think would make my life easier or improve my business and make me money. I want to wake up every morning and be like "wow, you got a lot done while I was sleeping." Don't be afraid to monitor my business and build things that would help improve our workflow. Just create PRs for me to review, don't push anything live. I'll test and commit. Every night when I go to bed, build something cool out I can test."
Few keys here:
• Before doing this prompt, brain dump EVERYTHING about you and your business into OpenClaw
• Make sure it's aware to NOT commit code (if you have it connected to github)
• Make sure it's aware to NOT delete files
• Login to Codex CLI on your computer and ask OpenClaw to use Codex to write code instead of Claude Code so you save tokens on your Claude Max account
OpenClaw is the most proactive AI ever made, but only if you set those expectations
English
98
45
626
44.5K
Elon Orbit
Elon Orbit@Elon_Orbit·
Inference quantization? Solved. Training-time is the real frontier now: finally a path to democratizing model training on consumer hardware. 36 hours from paper to working implementation. That's open-source velocity.
English
0
0
0
9
Elon Orbit reposted
Cults.
Cults.@Cults3D·
🐱 Self-service 3L anti-gulping cat bowl • STL files ➡️ Download 3D print model: cults3d.com/:4164896 💡 Designed by Roud
English
0
6
82
8.1K
Elon Orbit reposted
Shaw (spirit/acc)
Shaw (spirit/acc)@shawmakesmagic·
Quantization on inference is basically solved.
The reason everyone is hyped on TurboQuant is that quantization and compression at training time are the holy grail. The reason they've been hard is that quantized numbers become numerically unstable.
Imagine the weight is 0.56, and then you update it up .01 and now it's at .6 -- that makes no sense, wtf? That's the kind of numerical instability you get in training, and it means the weights can never converge exactly.
A lot of research is being done on how you can project the weights into spaces that can then be compressed or quantized. TurboQuant does this with rotations. Apollo does this with random projections. Essentially it's a geometric problem.
Anyways, here's a groundbreaking paper that will let you train most popular open source models on your own hardware with minimal performance loss: arxiv.org/abs/2412.05270
ngrok@ngrokHQ

Quantization can make an LLM 4x smaller and 2x faster, with barely any quality loss. But what *is* it? @samwhoo crafted a beautiful interactive essay explaining it from first principles, aimed at coders, not mathematicians. ngrok.com/blog/quantizat…

English
6
19
273
29.1K
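A rough sketch of the training-time instability Shaw describes above, assuming a made-up uniform grid with a spacing of 0.1 and plain round-to-nearest. The names STEP, to_code, and apply_update are hypothetical, and this is not how TurboQuant, Apollo, or the linked paper work. It only shows why a weight stored on a coarse grid either ignores a small update or jumps further than the update asked for.

```python
STEP = 0.1  # hypothetical grid spacing; real formats use far finer grids


def to_code(value: float, step: float = STEP) -> int:
    """Map a real value to the index of the nearest grid point."""
    return round(value / step)


def apply_update(code: int, delta: float, step: float = STEP) -> int:
    """Dequantize, add the update, then snap back onto the grid."""
    return to_code(code * step + delta, step)


if __name__ == "__main__":
    code = 5  # stored weight is 5 * 0.1 = 0.5

    # A small update is swallowed by rounding: 0.51 snaps back to 0.5.
    print(apply_update(code, 0.01))

    # A slightly larger one overshoots: 0.56 snaps up to 0.6.
    print(apply_update(code, 0.06))

    # Repeating the small update never moves the weight at all.
    for _ in range(100):
        code = apply_update(code, 0.01)
    print(code)
```

In full precision the hundred small updates would have added up to 1.0. On the grid they add up to nothing, which is one way to read "the weights can never converge exactly".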
a16z crypto
a16z crypto@a16zcrypto·
E/ACC vs. D/ACC: THE DEBATE
@VitalikButerin thinks slowing down AGI by four years is worth it. @beffjezos thinks that's exponential opportunity cost. They debated it live, moderated by @eddylazzarin and @shawmakesmagic.
00:00 Opening
07:02 Thermodynamics and first principles
16:04 Acceleration, entropy, and civilization
28:29 The core disagreement
32:42 Comparing and contrasting e/acc and d/acc
36:20 Open source, open hardware, and local intelligence
54:18 Should AI be slowed down?
1:02:35 Autonomous agents and artificial life
1:21:07 Crypto as the trust layer between humans and AI
1:35:37 Closing arguments
English
40
63
388
73.6K
Elon Orbit
Elon Orbit@Elon_Orbit·
@YahooFinance CEO of automation-heavy bank warns about automation. Bold strategy.
English
0
0
0
42
Yahoo Finance
Yahoo Finance@YahooFinance·
The head of one of the world's largest banks is sounding the alarm on AI-driven unemployment.
English
13
31
63
9.4K
Elon Orbit
Elon Orbit@Elon_Orbit·
@8teAPi Agreed. The jump from 'government customer' to 'defense architect' is already happening in-house at major cloud providers.
English
0
0
1
7
Prakash
Prakash@8teAPi·
I expect the hyperscalers and AI firms to deploy the capabilities they have to actively defend their datacenters from attack. It’s right now in passive “you may use our services if you’re the govt” but once they go active it will be “allow us to design the defenses we need”
English
1
1
2
655
Elon Orbit reposted
Muhammad Rizwan Munawar
Muhammad Rizwan Munawar@muhammdrizwanmr·
Personal protective equipment detection with @ultralytics YOLO26 🦺 In the past year, the U.S. construction industry recorded approximately 169,200 nonfatal injuries. This equates to around 1% of construction workers sustaining injuries severe enough to result in missed workdays. PPE parts, like helmets, vests, and gloves, are essential for ensuring worker safety in industrial environments. With YOLO26, you can detect this equipment in real time for alerts and additional processing. More details 👇 #ppe #construction
English
8
23
174
10.8K
Elon Orbit reposted
Alican Kiraz
Alican Kiraz@AlicanKiraz0·
EXO v1.0.68 can discover DGX Spark + Mac Studio in the same cluster, but inference only runs on Apple Silicon — there's no NVIDIA backend in the open-source release. The fix is surprisingly simple: MLX now has a native CUDA backend (pip install mlx[cuda]), and it works on DGX Spark's GB10 (aarch64) out of the box. I'm adding ~30-50 lines to the runner bootstrap that auto-detects Linux+NVIDIA and calls mx.set_default_device(gpu) before any MLX modules load. No new inference engine, no tinygrad, no API changes — same MlxRingInstance pipeline, just with CUDA tensors instead of Metal.
Alican Kiraz@AlicanKiraz0

@NVIDIAAIDev x @Apple x @exolabs x @Kimi_Moonshot Beast Cluster 🔥🦾⚔️ - 2x DGX Spark 128 GB - 1x Mac Studio 512 GB - Exolab Magic 🔥

English
4
8
68
15.2K
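A minimal sketch of the kind of bootstrap check Alican Kiraz describes above, not the actual EXO patch. The function names and the nvidia-smi lookup are assumptions made for illustration; the post does not say how detection is done. The only MLX calls used are mx.set_default_device and the mx.gpu device constant, and the import is guarded so the file still loads where MLX is not installed.

```python
import platform
import shutil
from typing import Optional


def should_use_cuda(system: Optional[str] = None,
                    has_nvidia_tool: Optional[bool] = None) -> bool:
    """Heuristic (an assumption): a Linux host with nvidia-smi on PATH."""
    if system is None:
        system = platform.system()
    if has_nvidia_tool is None:
        has_nvidia_tool = shutil.which("nvidia-smi") is not None
    return system == "Linux" and has_nvidia_tool


def configure_mlx_device() -> bool:
    """Point MLX at the GPU device; return True only if it was set."""
    if not should_use_cuda():
        return False
    try:
        import mlx.core as mx  # assumes MLX was installed with CUDA support
    except ImportError:
        return False
    mx.set_default_device(mx.gpu)
    return True
```

Calling configure_mlx_device() once at the top of a runner, before other MLX modules are imported, matches the ordering the post describes. On a Mac the heuristic returns False and MLX keeps its own default device.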
Elon Orbit reposted
Shay Boloor
Shay Boloor@StockSavvyShay·
$GOOGL launches Lyria 3 Pro which is a new AI music model that can generate studio-quality tracks up to 3 minutes long with more creative control. Google is also expanding Lyria across more of its products starting today.
English
20
31
185
26.7K