Rami Khalil

1.1K posts

Rami Khalil banner
Rami Khalil

Rami Khalil

@ramikhalil

Principal Research Engineer @boundless_xyz | Computing PhD @imperialcollege

Katılım Temmuz 2010
1.6K Takip Edilen1K Takipçiler
Rami Khalil retweetledi
Kun Chen
Kun Chen@kunchenguid·
@thdxr these LLM companies seem confused about what they are they are the power plants we want them to make power cleaner, cheaper, and more reliable instead, they keep building dishwashers and refrigerators
English
17
15
350
16.6K
Sudo su
Sudo su@sudoingX·
after 21 days through customs, one missed call, one phone number update, and the dgx spark is finally on my desk. state of the art supercomputer, 128gb unified memory, sitting in my lab in bangkok. new monitor and desk arriving in a few days, then this beast goes live and a lot of real work is coming through it. thank you @nvidia and @Coolmark482 for trusting builders and recognizing real ones. i wish you could do this for 10 million more builders around the world, there are so many of us out there building with whatever we can scrape together, hardware like this in the right hands changes what's possible. now i set up.
Sudo su tweet media
Sudo su@sudoingX

dgx spark arriving this week. shipped directly from nvidia. upgraded my lab to gigabit wifi. the benchmarks i'm about to publish will make some people very uncomfortable.

English
82
10
677
45.2K
Rami Khalil
Rami Khalil@ramikhalil·
Long weekends are just different now
Rami Khalil tweet media
English
2
0
2
209
Rami Khalil retweetledi
Rami Khalil retweetledi
superwhisper
superwhisper@superwhisper·
Superwhisper's next update might be too powerful to release publicly. The new voice model is so fast at transcription it started finishing sentences users hadn't thought of yet... We even put it in a sandbox and it dictated its way out. It also identified a flaw in the English language that had gone unnoticed for 600 years. Linguists have been informed. Out of an abundance of caution, we are withholding the update until further notice. Sincerely, The Superwhisper Team
English
247
430
6K
265.7K
Rami Khalil retweetledi
George Fahmy
George Fahmy@kajogo777·
Resume local Claude Code sessions on remote machines, with your session history, and all your workspace files github.com/kajogo777/bento "Portable Agent Workspaces" for other agents soon #claude #agents #oci
GIF
English
2
3
13
599
Sudo su
Sudo su@sudoingX·
i am not being able to recover from this one. 27B dense on a $900 RTX 3090 outperforming 120B MoE on a $70K production node with 2x H200 NVL at full precision. this is not easy to process. it changes the way we pick models for any task. if you're an AI startup running 120B MoE inference for agent workflows and a 27B dense with all parameters active on a single consumer GPU does it better, your compute bill might be solving the wrong problem. i am writing the full deep dive article to document everything here and share with you all so we can reproduce and verify. the reproduction test is coming first. same 3090, same 27B dense Q4, same prompt, same harness. if it holds twice it's not a fluke. it's architecture. and based on what the VRAM poll is showing me right now, most of you are sitting on the exact hardware that already won this fight. article drops this week.
Sudo su@sudoingX

i am still in shock that Qwen 3.5 27B dense on a single RTX 3090, a $900 GPU, one shotted a game challenge that 120B MoE at full precision on $70K+ production hardware could not. this is leading me to doubt if it was a fluke. so i am going to reproduce it. i will test 27B dense Q4 on my single 3090 again paired with Hermes Agent and have it reproduce the results. after that i will test the same dense 27B but unquantized because if Q4 can one shot something that 120B full precision cannot then i wonder what dense 27B unquantized would do. dense models with all parameters active on every token might matter more than total parameter count for agent coding. if this reproduces it changes how i think about what hardware you actually need. this is not letting me sleep well since yesterday. i will report back.

English
92
119
1.7K
216K
Rami Khalil retweetledi
Ryan Leachman
Ryan Leachman@RG_Leachman·
“Kids I have good news. Daddy is out of Claude tokens until 3PM. He has time to play with you now.”
Ryan Leachman tweet media
English
207
1.6K
25.2K
871.2K
dom (bitvm/acc) 🇪🇺
dom (bitvm/acc) 🇪🇺@dom60808·
What are people using for memory management for their AIs? I've build a little memory agent (called Paul) that I can use as a skill in claude. Under the hood, it uses a vector DB and a SQLite instance to have multiple levels of memory: - Permanent memory -> always true, highest prio - Quarterly goals -> changes over time - Weekly priorities -> what's important right now, very temporary, Paul knows not to give context from weeks that are in the past Then a few other specific DBs that handle other cases. I basically use a cheap model call to decide from which memories to load and then a mid-tier model to summarize and forward to the right query. So I'd do something like "Use /brainstorming with /paul as context to ..." To me, this works really nice compared to the memory.md approach since it works across projects and instances. But I'm curious, what are others doing?
English
4
0
10
239
Rami Khalil retweetledi
Shiv Shankar
Shiv Shankar@sshankar·
Optimistic about the future of blockchains, thanks to ZK proofs on Boundless.. We have been working closely with the team at Optimism to develop Kailua, an extension of the @boundless_xyz Protocol, built with rollups, for rollups. Recently we made integration even simpler to lower switching costs to practically zero. Along with that, the cost of proving has been dropping to below $0.0001 per billion cycles (it was 4x two weeks ago). There are no more excuses. Every blockchain will be a ZK chain. @Optimism is early.
Optimism@Optimism

Markets work better when withdrawals are fast, efficient, and secure. That’s why we’re accelerating ZK proving on the OP Stack with @SuccinctLabs as our first preferred ZK proving partner.

English
3
2
20
2.8K
Rami Khalil
Rami Khalil@ramikhalil·
web of trust will likely get a lot of attention very soon
Dustin Burnham@ModernDad

My wife calls me, panicked. The call is from her number, and her voice is unmistakable- that’s my wife. ‘Babe, our son is hurt. He got in a bike wreck. I’m at the emergency room but they won’t take our insurance and I need cash to get him help. Please send me 3000 dollars as soon as you can, he’s really not doing well.’ Me- ‘Wow, that’s scary. Tell me our passphrase and then I’ll send the money.’ Her (it) - ‘What? What passphrase? This is your wife, our son is hurt. Send the money now!!’ Me- ‘I’ll call you back. I don’t believe that this is my wife. If it is, I’m sorry, but we discussed this.’ The number? Spoofed. Easy to do and there’s no way to tell if a phone number is being spoofed aside from hanging up and calling back to confirm. The voice? AI generated. Easily done. A few seconds of audio is all it takes to create a realistic audio deepfake. What can you do? 1) Create a family safe word or passphrase. Ours is definitely not ‘Keep Going’ although we considered it. Discuss the passphrase far away from phones or any recording device. This is as analog as possible. Don’t forget that the trigger for the passphrase is just as important as the phrase itself. So instead of asking ‘what’s the safe word?’ have a separate triggering question. For example, you could say ‘I’m eating banana cream pie’ and this would trigger your spouse to respond ‘purple velvet pillows’ if that’s the safe word. Make it fun, silly, and easy to remember. And DON’T WRITE IT DOWN. 2) Cognitive security is an essential skill in 2026. Assume every image and video you see online is fake until proven otherwise. Expect scams and spammers, and be pleasantly surprised when it’s not. 3) Figure out a backup communication option with people who you absolutely need to be able to reach. Don’t just rely on a phone number for communication. Have redundant, ideally encrypted methods of communication with family. What did I miss? I think (hope) Nikita is wrong on the timeframe- agentic bots like Claude bot are impressive but not quite ready to flood the phone lines in just 90 days. But I think it’s going to be a huge problem by the end of the year. I already get dozens of increasingly realistic spam calls and texts daily- it’s only going to get more annoying. Have a plan to keep your family and your finances safe!

English
0
0
1
83
Rami Khalil retweetledi
Jonas Theis
Jonas Theis@jonastheis_·
I'm joining @boundless_xyz as a protocol engineer. After building at @Scroll_ZKP and @iota, I have a good idea how ZK will reshape what's possible onchain and what matters. What drew me to Boundless wasn't just the tech, but the thesis behind it. Most teams in ZK are making bets on one chain, one proving system and hope to be right. Boundless is building something different: infrastructure that works across chains and across zkVMs. A genuinely universal protocol that doesn't care which L1 wins or which proving system becomes dominant. It just works. It's already the largest proof marketplace and the only one that is truly open and permissionless. The ZK future isn't about picking winners. It's about building the connective layer that makes all of web3 useful. Excited to build with this team.
English
23
2
114
12.3K
Rami Khalil retweetledi
Jacob
Jacob@jacobeverly·
Reminder, @boundless_xyz has generated a proof of consensus for 6 months now and anyone can use them by contacting the team
English
0
1
5
140