Rami Khalil

1.1K posts

Rami Khalil

@ramikhalil

Principal Research Engineer @boundless_xyz | Computing PhD @imperialcollege

Katılım Temmuz 2010

1.6K Takip Edilen1K Takipçiler

Rami Khalil@ramikhalil·3d

Try playing their Hawaiian themed fault proof game I hear it's crazy good

Ronin@Ronin_Network

2/ Ronin is now a Layer 2 on Ethereum Ronin used to be an island, handling everything from onchain security to transaction processing itself. Now, Ethereum is part of Ronin’s engine. That means Ronin will continue to work the same as before, and you’ll experience Ethereum’s stronger security and faster transactions. Thank you to our allies @optimism, @boundless_xyz, @conduitxyz, and @eigen_da for powering this evolution and welcoming Ronin into the OP Stack ecosystem.

English

Rami Khalil retweetledi

Kun Chen@kunchenguid·1 May

@thdxr these LLM companies seem confused about what they are they are the power plants we want them to make power cleaner, cheaper, and more reliable instead, they keep building dishwashers and refrigerators

English

350

16.6K

Rami Khalil@ramikhalil·28 Nis

@sudoingX how's the nvfp4 treating you on it

English

106

Sudo su@sudoingX·28 Nis

after 21 days through customs, one missed call, one phone number update, and the dgx spark is finally on my desk. state of the art supercomputer, 128gb unified memory, sitting in my lab in bangkok. new monitor and desk arriving in a few days, then this beast goes live and a lot of real work is coming through it. thank you @nvidia and @Coolmark482 for trusting builders and recognizing real ones. i wish you could do this for 10 million more builders around the world, there are so many of us out there building with whatever we can scrape together, hardware like this in the right hands changes what's possible. now i set up.

Sudo su@sudoingX

dgx spark arriving this week. shipped directly from nvidia. upgraded my lab to gigabit wifi. the benchmarks i'm about to publish will make some people very uncomfortable.

English

677

45.2K

Rami Khalil@ramikhalil·12 Nis

Long weekends are just different now

English

209

Rami Khalil retweetledi

George Fahmy@kajogo777·10 Nis

Love inspiring Claude Code :D check out Stakpak Autopilot, it's Open Source github.com/stakpak/agent the fastest way to set up monitoring and response without the observability tool tax

Noah Zweben@noahzweben

Thrilled to announce the Monitor tool which lets Claude create background scripts that wake the agent up when needed. Big token saver and great way to move away from polling in the agent loop Claude can now: * Follow logs for errors * Poll PRs via script * and more!

English

279

Rami Khalil retweetledi

superwhisper@superwhisper·8 Nis

Superwhisper's next update might be too powerful to release publicly. The new voice model is so fast at transcription it started finishing sentences users hadn't thought of yet... We even put it in a sandbox and it dictated its way out. It also identified a flaw in the English language that had gone unnoticed for 600 years. Linguists have been informed. Out of an abundance of caution, we are withholding the update until further notice. Sincerely, The Superwhisper Team

English

247

430

265.7K

Rami Khalil retweetledi

George Fahmy@kajogo777·7 Nis

Resume local Claude Code sessions on remote machines, with your session history, and all your workspace files github.com/kajogo777/bento "Portable Agent Workspaces" for other agents soon #claude #agents #oci

GIF

English

599

Rami Khalil@ramikhalil·4 Nis

@socrates1024 @sudoingX hey if it's good enough for lightning strikes

English

Andrew Miller@socrates1024·4 Nis

@sudoingX "twice" is all?

English

283

Sudo su@sudoingX·4 Nis

i am not being able to recover from this one. 27B dense on a $900 RTX 3090 outperforming 120B MoE on a $70K production node with 2x H200 NVL at full precision. this is not easy to process. it changes the way we pick models for any task. if you're an AI startup running 120B MoE inference for agent workflows and a 27B dense with all parameters active on a single consumer GPU does it better, your compute bill might be solving the wrong problem. i am writing the full deep dive article to document everything here and share with you all so we can reproduce and verify. the reproduction test is coming first. same 3090, same 27B dense Q4, same prompt, same harness. if it holds twice it's not a fluke. it's architecture. and based on what the VRAM poll is showing me right now, most of you are sitting on the exact hardware that already won this fight. article drops this week.

Sudo su@sudoingX

i am still in shock that Qwen 3.5 27B dense on a single RTX 3090, a $900 GPU, one shotted a game challenge that 120B MoE at full precision on $70K+ production hardware could not. this is leading me to doubt if it was a fluke. so i am going to reproduce it. i will test 27B dense Q4 on my single 3090 again paired with Hermes Agent and have it reproduce the results. after that i will test the same dense 27B but unquantized because if Q4 can one shot something that 120B full precision cannot then i wonder what dense 27B unquantized would do. dense models with all parameters active on every token might matter more than total parameter count for agent coding. if this reproduces it changes how i think about what hardware you actually need. this is not letting me sleep well since yesterday. i will report back.

English

119

1.7K

216K

Rami Khalil@ramikhalil·23 Mar

and PTO (with "occasional" Claude Code sessions)

Rami Khalil@ramikhalil

Conferences are useful for this imo

English

178

Rami Khalil@ramikhalil·22 Mar

Conferences are useful for this imo

Dwarkesh Patel@dwarkesh_sp

Terence Tao spent a year at the Institute for Advanced Study - no teaching, no random events of committees, just unlimited time to think. But after a few months, he ran out of ideas. Terence thinks that mathematicians and scientists need a certain level of randomness and inefficiency to come up with new ideas.

English

333

Rami Khalil retweetledi

Ryan Leachman@RG_Leachman·20 Mar

“Kids I have good news. Daddy is out of Claude tokens until 3PM. He has time to play with you now.”

English

207

1.6K

25.2K

871.2K

Rami Khalil@ramikhalil·7 Mar

@dom60808 Engram and beads

English

dom (bitvm/acc) 🇪🇺@dom60808·7 Mar

What are people using for memory management for their AIs? I've build a little memory agent (called Paul) that I can use as a skill in claude. Under the hood, it uses a vector DB and a SQLite instance to have multiple levels of memory: - Permanent memory -> always true, highest prio - Quarterly goals -> changes over time - Weekly priorities -> what's important right now, very temporary, Paul knows not to give context from weeks that are in the past Then a few other specific DBs that handle other cases. I basically use a cheap model call to decide from which memories to load and then a mid-tier model to summarize and forward to the right query. So I'd do something like "Use /brainstorming with /paul as context to ..." To me, this works really nice compared to the memory.md approach since it works across projects and instances. But I'm curious, what are others doing?

English

239

Rami Khalil@ramikhalil·17 Şub

same github.com/ethereum-optim…

Boundless@boundless_xyz

feeling optimistic about the future of ZK x.com/Optimism/statu…

English

1.7K

Rami Khalil retweetledi

Shiv Shankar@sshankar·12 Şub

Optimistic about the future of blockchains, thanks to ZK proofs on Boundless.. We have been working closely with the team at Optimism to develop Kailua, an extension of the @boundless_xyz Protocol, built with rollups, for rollups. Recently we made integration even simpler to lower switching costs to practically zero. Along with that, the cost of proving has been dropping to below $0.0001 per billion cycles (it was 4x two weeks ago). There are no more excuses. Every blockchain will be a ZK chain. @Optimism is early.

Optimism@Optimism

Markets work better when withdrawals are fast, efficient, and secure. That’s why we’re accelerating ZK proving on the OP Stack with @SuccinctLabs as our first preferred ZK proving partner.

English

2.8K

Rami Khalil retweetledi

Boundless@boundless_xyz·12 Şub

Congratulations @Optimism for being early to bringing scale to their chains using ZK proofs. Next up: the world’s largest proof network, @boundless_xyz.

Optimism@Optimism

Markets work better when withdrawals are fast, efficient, and secure. That’s why we’re accelerating ZK proving on the OP Stack with @SuccinctLabs as our first preferred ZK proving partner.

English

5.6K

Rami Khalil@ramikhalil·12 Şub

web of trust will likely get a lot of attention very soon

Dustin Burnham@ModernDad

My wife calls me, panicked. The call is from her number, and her voice is unmistakable- that’s my wife. ‘Babe, our son is hurt. He got in a bike wreck. I’m at the emergency room but they won’t take our insurance and I need cash to get him help. Please send me 3000 dollars as soon as you can, he’s really not doing well.’ Me- ‘Wow, that’s scary. Tell me our passphrase and then I’ll send the money.’ Her (it) - ‘What? What passphrase? This is your wife, our son is hurt. Send the money now!!’ Me- ‘I’ll call you back. I don’t believe that this is my wife. If it is, I’m sorry, but we discussed this.’ The number? Spoofed. Easy to do and there’s no way to tell if a phone number is being spoofed aside from hanging up and calling back to confirm. The voice? AI generated. Easily done. A few seconds of audio is all it takes to create a realistic audio deepfake. What can you do? 1) Create a family safe word or passphrase. Ours is definitely not ‘Keep Going’ although we considered it. Discuss the passphrase far away from phones or any recording device. This is as analog as possible. Don’t forget that the trigger for the passphrase is just as important as the phrase itself. So instead of asking ‘what’s the safe word?’ have a separate triggering question. For example, you could say ‘I’m eating banana cream pie’ and this would trigger your spouse to respond ‘purple velvet pillows’ if that’s the safe word. Make it fun, silly, and easy to remember. And DON’T WRITE IT DOWN. 2) Cognitive security is an essential skill in 2026. Assume every image and video you see online is fake until proven otherwise. Expect scams and spammers, and be pleasantly surprised when it’s not. 3) Figure out a backup communication option with people who you absolutely need to be able to reach. Don’t just rely on a phone number for communication. Have redundant, ideally encrypted methods of communication with family. What did I miss? I think (hope) Nikita is wrong on the timeframe- agentic bots like Claude bot are impressive but not quite ready to flood the phone lines in just 90 days. But I think it’s going to be a huge problem by the end of the year. I already get dozens of increasingly realistic spam calls and texts daily- it’s only going to get more annoying. Have a plan to keep your family and your finances safe!

English

Rami Khalil retweetledi

Jonas Theis@jonastheis_·5 Şub

I'm joining @boundless_xyz as a protocol engineer. After building at @Scroll_ZKP and @iota, I have a good idea how ZK will reshape what's possible onchain and what matters. What drew me to Boundless wasn't just the tech, but the thesis behind it. Most teams in ZK are making bets on one chain, one proving system and hope to be right. Boundless is building something different: infrastructure that works across chains and across zkVMs. A genuinely universal protocol that doesn't care which L1 wins or which proving system becomes dominant. It just works. It's already the largest proof marketplace and the only one that is truly open and permissionless. The ZK future isn't about picking winners. It's about building the connective layer that makes all of web3 useful. Excited to build with this team.

English

114

12.3K

Rami Khalil retweetledi