diego (@dferrersan) - Twitter Profili | Zamantika Mersobahis Locabet

Sabitlenmiş Tweet

diego@dferrersan·13 Oca

881 is my new favorite number

At the core of Fibre is a new ultra-fast data availability (DA) encoding protocol, based on ZODA. The encoding protocol provides an alternative to KZG commitments (used by existing DA protocols). Compared to KZG-based protocols, Fibre processes data up to 881x faster.

English

5

37

1.6K

diego@dferrersan·13h

remembah prividiums?

Tempo@tempo

Blockchains still broadcast every transaction publicly. Every stablecoin payment leaks the amount, the sender, and the recipient. We’re excited to share that Tempo is building Zones for businesses that need privacy: private blockchains that are interoperable with the rest of Tempo for stablecoin use cases like payroll, treasury, and settlement.

Indonesia

0

2

162

diego retweetledi

Celestia@celestia·16h

Celestia's engineering roadmap is compressing. V8 (including Hibiscus) is live on Mocha, with Mainnet following next, bringing single-signature cross-chain transfers and ZK-verified messaging to networks built on Celestia. The following protocol upgrade after V8 will bring 3-second block times and 32 MiB blocks, expanding blockspace capacity in a single step upgrade that will clear the path for Fibre — the highest-throughput blockspace protocol, targeting 1 GB/s throughput in its first iteration.

English

24

68

412

20.9K

diego retweetledi

Celestia@celestia·7 Nis

We just wrapped up our team on-site. The entire Celestia team came together for a week of deep work on protocol and strategy. We are energised, heads down and building towards making Celestia’s Vision 2.0 a reality. We want to be better at sharing our work openly as it happens. To make sure we follow this through, we’ve brought on a new Head of Marketing who will help us do exactly that.

English

74

62

484

45.5K

diego@dferrersan·1 Nis

@fede_intern @class_lambda amazing, can’t wait to read the paper!

English

0

1

69

Fede’s intern 🥊@fede_intern·1 Nis

@dferrersan @class_lambda thanks! actually some of the ideas I got them from celestia and peerdas hehe

English

1

0

1

476

Fede’s intern 🥊@fede_intern·1 Nis

LLMs now make critical decisions in hospitals, defense, banks, and governments. Yet nobody can verify which model actually ran, or whether the output was tampered with. A provider or middleman can swap weights, silently requantize the model, alter decoding, inject hidden prompts, do supply chain attacks, or change the deployment surface without the user knowing. This problem is already serious. It will become critical. We think this needs a practical solution, not just a theoretically clean one. CommitLLM is designed to be deployable on existing serving stacks now: the provider keeps the normal GPU serving path, does not need a proving circuit, does not need a kernel rewrite, and does not generate a heavy proof for every response. In practice, two families of approaches dominated the conversation before this work: fingerprinting, which can be gamed, and proof-based systems, which are theoretically strong but too expensive for production inference. We built CommitLLM to target the middle ground. The core idea is to keep the verification discipline of proof systems, but specialize it to open weight LLM inference. The cryptographic core is simple: Freivalds style randomized checks for the large linear layers, plus Merkle commitments for the traced execution. Then a lot of engineering work is needed to make that line up with real GPU inference. The key trick is this. A provider claims `z = W × x` for a massive weight matrix. Normally you would verify that by redoing the multiply. Instead, the verifier samples a secret random vector `r`, precomputes `v = rᵀ × W`, and later checks whether `v · x = rᵀ · z`. Two dot products instead of a full matrix multiply. In the current implementation, a wrong result passes with probability at most `1 / (2^32 - 5)` per check. A full matrix multiply, audited with two dot products. Most of the transformer can then be checked exactly or canonically from committed openings. Nonlinear operations such as activations and layer norms are canonically re executed by the CPU verifier. The one honest caveat is attention: native FP16/BF16 attention is not bit reproducible across hardware. CommitLLM verifies the shell around attention exactly, then independently replays attention and checks that the committed post attention output stays within a measured INT8 corridor. So attention is bounded and audited, not proved exactly. That means the protocol already gives very strong exact guarantees on the parts that matter operationally most. If an audited response used the wrong model, the wrong quantization/configuration, or a tampered input/deployment surface, the audit catches that exactly. That includes things like model swaps, silent requantization, and provider side prompt or system prompt injection. Today the implementation and measurements are strongest on Qwen and Llama. But the protocol itself is not meant to be Qwen or Llama specific: we expect it to generalize across open weight decoder only families. What still has to be done is the engineering work to integrate and validate more families explicitly, and we are already working on that. On the measured path, online generation overhead is about 12 to 14% with the provider staying on the normal GPU serving path. The heavier receipt finalization cost is separate and can be deferred off the user facing path. The main systems costs are RAM and bandwidth, not proof generation. The full response is always committed, but only a random fraction of responses are opened for audit. Individual audits are much larger, roughly 4 MB to 100 MB depending on audit depth. The important number is the amortized one: under a reasonable audit policy, the added bandwidth averages to roughly 300 KB per response. After too many weeks without sleep, I’m proud to show what I built with @diego_aligned: CommitLLM. Thanks Diego for your patience. I've been calling you at random hours. The code and paper still need some cleaning and formalization. We’re already in talks with multiple providers and teams that have cryptography related ideas on how to improve it even more. We’re really excited about this and we will continue doubling down on building products in AI, cryptography and security with my company @class_lambda. If governments, hospitals, defense and financial systems are going to run on LLMs, verifiable inference is not optional. It is infrastructure. I will be explaining this in more details in the days to come and I will show how to test it and run it.

English

29

53

339

93.4K

diego retweetledi

abdel@AbdelStark·1 Nis

This is amazing work! Turns out there might be more ways than ZK to help solve the problem of verifiable AI! I really like the simplicity and pragmatism of the scheme. It seems to be a very interesting set of tradeoffs, that could become suitable for multiple production use cases. I implemented CommitLLM version in Zig. Fully compatible with the Rust reference implementation. In the demo video you can see cross implementation checks, including tamper attempt, a.k.a the CommitLLM Rust prover trying to fool the CommitLLM Zig verifier and failing.

Fede’s intern 🥊@fede_intern

LLMs now make critical decisions in hospitals, defense, banks, and governments. Yet nobody can verify which model actually ran, or whether the output was tampered with. A provider or middleman can swap weights, silently requantize the model, alter decoding, inject hidden prompts, do supply chain attacks, or change the deployment surface without the user knowing. This problem is already serious. It will become critical. We think this needs a practical solution, not just a theoretically clean one. CommitLLM is designed to be deployable on existing serving stacks now: the provider keeps the normal GPU serving path, does not need a proving circuit, does not need a kernel rewrite, and does not generate a heavy proof for every response. In practice, two families of approaches dominated the conversation before this work: fingerprinting, which can be gamed, and proof-based systems, which are theoretically strong but too expensive for production inference. We built CommitLLM to target the middle ground. The core idea is to keep the verification discipline of proof systems, but specialize it to open weight LLM inference. The cryptographic core is simple: Freivalds style randomized checks for the large linear layers, plus Merkle commitments for the traced execution. Then a lot of engineering work is needed to make that line up with real GPU inference. The key trick is this. A provider claims `z = W × x` for a massive weight matrix. Normally you would verify that by redoing the multiply. Instead, the verifier samples a secret random vector `r`, precomputes `v = rᵀ × W`, and later checks whether `v · x = rᵀ · z`. Two dot products instead of a full matrix multiply. In the current implementation, a wrong result passes with probability at most `1 / (2^32 - 5)` per check. A full matrix multiply, audited with two dot products. Most of the transformer can then be checked exactly or canonically from committed openings. Nonlinear operations such as activations and layer norms are canonically re executed by the CPU verifier. The one honest caveat is attention: native FP16/BF16 attention is not bit reproducible across hardware. CommitLLM verifies the shell around attention exactly, then independently replays attention and checks that the committed post attention output stays within a measured INT8 corridor. So attention is bounded and audited, not proved exactly. That means the protocol already gives very strong exact guarantees on the parts that matter operationally most. If an audited response used the wrong model, the wrong quantization/configuration, or a tampered input/deployment surface, the audit catches that exactly. That includes things like model swaps, silent requantization, and provider side prompt or system prompt injection. Today the implementation and measurements are strongest on Qwen and Llama. But the protocol itself is not meant to be Qwen or Llama specific: we expect it to generalize across open weight decoder only families. What still has to be done is the engineering work to integrate and validate more families explicitly, and we are already working on that. On the measured path, online generation overhead is about 12 to 14% with the provider staying on the normal GPU serving path. The heavier receipt finalization cost is separate and can be deferred off the user facing path. The main systems costs are RAM and bandwidth, not proof generation. The full response is always committed, but only a random fraction of responses are opened for audit. Individual audits are much larger, roughly 4 MB to 100 MB depending on audit depth. The important number is the amortized one: under a reasonable audit policy, the added bandwidth averages to roughly 300 KB per response. After too many weeks without sleep, I’m proud to show what I built with @diego_aligned: CommitLLM. Thanks Diego for your patience. I've been calling you at random hours. The code and paper still need some cleaning and formalization. We’re already in talks with multiple providers and teams that have cryptography related ideas on how to improve it even more. We’re really excited about this and we will continue doubling down on building products in AI, cryptography and security with my company @class_lambda. If governments, hospitals, defense and financial systems are going to run on LLMs, verifiable inference is not optional. It is infrastructure. I will be explaining this in more details in the days to come and I will show how to test it and run it.

English

4

10

65

8K

diego@dferrersan·20 Mar

@andy8052 imagine being bearish humanity when you can touch grass and enjoy sports (literally 2026 with WBC, March Madness, World Cup, etc)

English

0

1

55

andy | mnstr mode@andy8052·20 Mar

don't fall for post like this lots of amazing things about life and humans if you log off the internet and go outside

spor@sporadica

culture everywhere is deteriorating hard not to feel bearish on humanity

English

10

2

55

3.6K

diego@dferrersan·17 Mar

multiplexing models on opencode people eating good rn

English

0

96

diego retweetledi

Mustafa Al-Bassam@musalbas·10 Mar

PSA: vibe coding can mass produce CVEs I had Claude Code build and deploy a Next.js app on an isolated VM. pnpm resolved to 15.5.12 - patched against the React2Shell RCE (CVSS 10.0). Build failed. So Claude downgraded to next@15.1.0. pnpm printed "WARN deprecated". Claude ignored it and deployed to a public IP. 51 minutes later: cryptominer. One unauthenticated HTTP request via CVE-2025-66478 gave the attacker full RCE inside the Next.js process. The miner ran from memory, installed 4 persistence mechanisms in under a second. The secure version was already installed. The AI chose the vulnerable one because it made the build pass. No harm done - this was a throwaway VM. But imagine this on real infrastructure. AI will always choose working over secure. Review your deps before deploying.

English

8

11

71

12.2K

diego@dferrersan·5 Mar

we are so incredibly back

Andreas Bigger@andreaslbigger

Introducing Edge, a high level, strongly statically typed, multi-paradigm domain specific language for the Ethereum Virtual Machine (EVM). github.com/refcell/edge-rs

English

1

0

1

127

diego@dferrersan·5 Mar

@andreaslbigger

GIF

QME

0

1

20

Andreas Bigger@andreaslbigger·3 Mar

Introducing Edge, a high level, strongly statically typed, multi-paradigm domain specific language for the Ethereum Virtual Machine (EVM). github.com/refcell/edge-rs

English

56

62

649

94.7K

diego retweetledi

Andreas Bigger@andreaslbigger·3 Mar

Originally the brain child of @jtriley2p, edge was born in late 2023 and publicly posted about on substack: jtriley.substack.com/p/the-edge-pro…

English

2

1

37

5.9K

diego@dferrersan·3 Mar

@ankurjain_2 @HeyBilt would be good to get actual customer support!

English

0

59

diego@dferrersan·3 Mar

@BiltRewards is a great example of the enshitification of things through AI slop. No matter how urgent the request, the “24/7 support” is just a clanker that tells you the message has been received and you are in queue, absolute joke

English

1

0

1

110

diego retweetledi

Mustafa Al-Bassam@musalbas·8 Şub

a rollup is simply a verifiable server

Cem | Sovereign@cemozer_

rollup research flourished because Ethereum hoped it would fix scaling after sharded execution was deemed too complex to ship. but after 4+ years building in this space, it’s clear to me: rollups aren’t primarily useful for "scaling" an ecosystem, unless scaling means running identical chains competing for the same use cases. they’re a new primitive for building new blockchains easily and cheaply. spinning up and maintaining a validator set is hard and expensive. if you’re building anything high-performance, each replica has high operational costs. and since it’s likely new software, you’ll need operators willing to deal with ongoing bugs. they’ll demand a premium. all of this costs money and attention. resources startups should be extremely careful with. rollups let you skip the validator set entirely. pay for data as you go. if any party wants full security or transparency on your chain’s operations, they just run a full node pointed at the same DA layer namespace. that’s it. if you’re building a new chain, let’s talk.

English

10

8

85

17.2K

diego retweetledi

Relay@RelayProtocol·6 Şub

@celestia LFR on Celestia!!

HT

4

2

71

5.6K

diego retweetledi

BlackcryptoSoprano@checkmatexxxxxx·6 Şub

Relay is the plumbing behind onchain payments. It routes assets across 85+ chains so wallets and apps can move value instantly without users thinking about bridges, gas tokens, or chain selection. Not a new L1 narrative a settlement layer built for crosschain money.

Cem | Sovereign@cemozer_

@crickhitchens Yep! They're using Celestia as DA!

English

1

2

34

2K

diego retweetledi

Ethan Oak@0xNoroc·6 Şub

The builders targeting the highest throughput use cases leverage Celestia

Celestia@celestia

Relay Chain launches on Celestia

English

2

7

75

3.9K

diego retweetledi

Celestia@celestia·6 Şub

Relay Chain is posting under the namespace `relay-data`. View their activity on Celenium here: celenium.io/namespace/0000…

English

4

6

111

12.9K

diego retweetledi

Celestia@celestia·6 Şub

@RelayProtocol chose the only stack that could handle their network’s requirements: → A chain fully customised for crosschain payment settlement → Millisecond-latency execution with @sovereignxyz → Abundant and verifiable blockspace with Celestia Billions of dollars in volume by millions of addresses across 86+ chains will soon be settled on Relay Chain, demanding blockspace only Celestia can provide.

English

6

9

135

16.5K

diego

Keşfet