Sotnyk.етн

100 posts

Sotnyk.етн

@d3magexgod

Will use this account (not) only for Ethereum faucet.

Katılım Ağustos 2021

179 Takip Edilen33 Takipçiler

Sotnyk.етн@d3magexgod·8h

@realbarnakiss Awesome. How can I take part?

English

Barna@realbarnakiss·9h

yes and now in two places, cross-experiment and intra-experiment. Agents see previous iterations, we have a near miss classification to surface past tries (1.5% regression, probably this can be optimized later to be more narrow) Outside of provers one idea could be LoC minimization for better auditability. I think autoresearch is first implementation of a larger pattern of new program designs, likely with multi-agent orchestration, but distinct from OpenClaw direction (though sharing similarities).

English

Barna@realbarnakiss·1d

Round 2 of zk-autoresearch: Claude ran 20 iterations autonomously optimizing Plonky3's DFT/NTT. We stopped after 20 iterations. Found key agent-infra improvements meant further iterations would waste tokens without new signal. 4 improvements kept. 16 reverted. Cumulative speedup from baseline across Round 1 + Round 2: 4.1% The 2^22 result (−8.57%) was an outlier, memory bandwidth effects amplify when working sets exceed L3 cache. Full results in the table above. The repo was first starred by @drakefjustin (EF). The most interesting finding was a runaway agent: The agent discovered an incentive misalignment: failing to write grants another 20k token budget. So it kept failing. Iter 10 ran for ~30 minutes and consumed an estimated 150k+ tokens before writing code. In the end it submitted a regression of -1.22%, specifically exploring what we have decided to characterize as a dead-end for the next experiment in CLAUDE.md. The agent persisted in submitting 50k-character rewrites of radix_2_dit_parallel.rs instead of targeted fixes. - Agent was stuck for optimizations and defaulted to full rewrites instead of surgical improvements — updated agent-infra to prevent this in the next experiment - Optimizing a prover is inherently different from Karpathy's target: his setup is a continuous, differentiable parameter space. Ours: discrete code changes, bitwise-correct ZK constraints, compiler behavior shaped by CPU microarchitecture. The cliff between "compiler handles it" and "code regresses" is steep. - Model scaling would improve output, but risks masking existing shortcomings, the current decision is to optimize agent-infra until clear signs show Sonnet is not capable, then yield those benefits with model scaling once fixes are validated All 20 iterations respected hard constraints, no interface changes, no security parameter touches. The Plonky3 team added new testing in PR-1494 that we incorporated for experiment 2. @CreatorsOfChaos reviewed the methodology and flagged real gaps: tests run in debug not release, benchmark workload never tested for correctness directly. We are working through the issues publicly, most of them have been addressed by today's PRs. For Round 3 we have implemented: - Streaming: agent reasoning visible live, full thinking saved to experiment log - Supply chain resilience and audit trail + testing - Loop reliability, static diff inspection, three-stage correctness gate - Cross-experiment agent memory, surgical precision and documented proven techniques $153.25 across 2 experiments. ~$1.09/iter on Sonnet. Opus comes after Sonnet infrastructure is fully validated; every dead end mapped on Sonnet is an Opus iteration saved. We have received two offers for contributions, one small grant and one LLM contribution. Repo is open for contributions, we are mainly interested in: - Potential security vectors - Generalization ideas - Agent-infra improvements The roadmap: - Optimize on Sonnet, shift to Opus - Expand to other crates on Plonky3 and start initial generalization - Expand to other prover repos with easy setup; this requires significant work on generalization frameworks Feel free to message if you want to contribute. Link to repo: github.com/Barnadrot/zk-a… 14 issues have been opened, one of them is an upstream improvement for the Plonky3 repo (pending submission) Link to round 1 PR on Plonky3 repo: github.com/Plonky3/Plonky… Round 2 changes can be tracked on the fork currently (pending upstream submission) github.com/Barnadrot/Plon…

English

867

Sotnyk.етн@d3magexgod·10h

@realbarnakiss So it's safe to assume that you're fine-tuning the model. Except you're not updating the weights directly, but store them in markdown. That's a very cool idea! I'm wondering what other cases could be approached with this technique.

English

Barna@realbarnakiss·13h

@d3magexgod Loop starts fresh, the setup is split between CLAUDE.md (upstream repo related) and loop.py (agent logic related) Yesterday we have added cross-experiment memory to the system, Round 3 will have info about previous rounds.

English

Sotnyk.етн@d3magexgod·22h

@JKim_Tran This is too funny

English

jennifertran.eth@JKim_Tran·1d

I hate it when people are like, "explain this to me like I'm 5 years old." Like, no, there are lots of things that I'm not going to explain to a 5-year-old.

English

266

Sotnyk.етн@d3magexgod·2d

@alexanderlee314 It is not, they traded the number of qubits for the number of gates. Still not viable anytime soon.

English

Alexander John Lee@alexanderlee314·2d

Q day is likely closer than many would expect…

Lukasz Olejnik@lukOlejnik

A new research work dramatically reduces the amount of qubits to break elliptic curve ciphers on a hypothetical quantum computer. Elliptic curve crypto is the math protecting most HTTPS connections, digital signatures, and cryptocurrency wallets. Shor's quantum algorithm can break it, but requires a large fault-tolerant quantum computer - the question is exactly how large. This new work cuts the required logical qubit count for attacking a 256-bit curve nearly in half. From 2,124 down to 1,098. That is a huge improvement. It also means breaking elliptic curve cryptography now looks cheaper in qubits than breaking RSA of equivalent security - a reversal of previous estimates. The method to achieve this is really smart but let me spare you the details. Appreciate it in the paper! It is really clever. Also expensive. Quantum computers cannot be reduced only to qubit count. Quantum gates are equally important. This technique requires a huge increase in gate count - by more than a factor of 1000. Roughly 2^43 Toffoli gates. Even the IBM's stated target for its first fault-tolerant system around 2029 is 100 million gates. This attack needs ~11.9 trillion. A crude "space times work" proxy computation makes the new method around 836 times more costly overall than what it replaces. Not less.

English

1.7K

Sotnyk.етн@d3magexgod·6d

@real_philogy based and plankpilled

English

philogy@real_philogy·6d

Excited for our first grant and to finally get to work on stack scheduling again. The new algo will draw on insights from academia, my experience hand writing contracts in Huff, my balls prototype (github.com/philogy/balls) and more.

Plank@plankevm

Proud to announce our first grant from @argotorg to design a next gen stack scheduling algorithm for the EVM and implement it for our IR. The project's goals are: - excellent codegen quality - no stack too deep - adaptable to @official_fe's Sontina IR and @solidity_lang

English

1.8K

Sotnyk.етн retweetledi

Sebastian Bürgel@SCBuergel·24 Mar

The deepfake problem can't be solved in software. I mean this literally - if the forgery happens before data is signed, no algorithm, no watermark, no certificate can help you. The fix has to be in the hardware. And hardware is hard but that's what we built:

English

137

11K

Sotnyk.етн retweetledi

🇺🇦Alik.eth ✙ (on Farcaster)@alik_eth_·24 Mar

I built ZK selective disclosure for eIDAS 2.0 credentials. zk-eidas.com — 15 Circom circuits, Groth16, ECDSA P-256 in-circuit. Contracts without personal data. Proofs print on paper. Verify with a phone camera. Offline.

English

347

Sotnyk.етн@d3magexgod·21 Mar

@oleh_bc I don't think that we'll have such capabilities in the near future. Would be cool to, though.

English

Alex (oleh)@oleh_bc·21 Mar

@d3magexgod I mean instant LLM output for vibe coding. You get the whole new project on each keystroke

English

Alex (oleh)@oleh_bc·19 Mar

Imagine a real time LLM. On each key stroke, you get a new improved version of your app. A massive unlock. Is there something like this on the market?

English

Sotnyk.етн@d3magexgod·19 Mar

@cartoonitunes You're my favorite history channel. Thank you very much!

English

620

cartoon.the🦄.eth@cartoonitunes·19 Mar

In September 2015, someone at the Ethereum Foundation deployed a 57-byte contract to mainnet. No compiler. No tooling. Just raw bytecode written by hand. Just cracked it using Yul and wanted to share because it’s an interesting window into early development 🎉 🧵

English

11.9K

Sotnyk.етн@d3magexgod·13 Mar

@archethect Gud stuff, I always wondered why no one asked the models to shut up and confirm what they imagined. What models did you use in your test setup? Two instances of Claude?

English

127

archethect 🏴@archethect·13 Mar

The problem was never "can AI find bugs." It always could. The problem was "can AI shut up about bugs that aren't real." Turns out you just needed a second AI whose entire purpose is to call bullshit. sc-auditor V2 is live, open source and a substantional improvement over V1 at that! 👇 github.com/Archethect/sc-…

English

1.6K

archethect 🏴@archethect·13 Mar

I spent weeks trying to solve one problem: AI auditors find bugs. They also hallucinate bugs that don't exist. Security researchers know this. It's why most dismiss AI auditing tools. So I asked: what if the AI had to prove itself wrong before it could prove itself right? 🧵👇

English

2.3K

Sotnyk.етн retweetledi

Julian@_julianma·12 Mar

Ethereum needs an Encrypted Mempool and it needs it fast. It's not just about stopping sandwiching. Encrypted mempools are how Ethereum matures its onchain markets. I just published a post on why Ethereum needs encrypted mempools. Here are the core arguments:

English

332

43.5K

Sotnyk.етн@d3magexgod·11 Mar

@fileverse @aboutcircles yeah, I was talking about Collab with gnosis, too bad I missed it STILL LOVING YOU

English

Fileverse@fileverse·11 Mar

@d3magexgod AWWWWWW 💃💛 Tyyy anooon! Ur using both the gnosis app and ddocs dot new? 💾For the custom @aboutcircles floppy: it was a one time collab with them! 💾If a new custom floppy drops on the gnosis app: would you want us to ping you?

GIF

English

Sotnyk.етн@d3magexgod·10 Mar

hey @fileverse I love you! is there a chance you will resupply fileverse disks on circles?

English

112

Sotnyk.етн@d3magexgod·3 Mar

@philogy @andreaslbigger You know, this feels like a discovery week for me. Firstly, Plank, then Ora, now this.

English

115

philogy@real_philogy·3 Mar

@andreaslbigger Holy, very cool. Edge is also one of my inspirations for Plank. You plan on maintaining/building this long-term?

English

1.6K

Andreas Bigger@andreaslbigger·3 Mar

Introducing Edge, a high level, strongly statically typed, multi-paradigm domain specific language for the Ethereum Virtual Machine (EVM). github.com/refcell/edge-rs

English

654

94.4K

Sotnyk.етн@d3magexgod·1 Mar

@Logicb0x @philogy @plankevm Thank you so much for the response! I'll certainly try it out tomorrow ;)

English

Axe 🐙@Logicb0x·28 Şub

@philogy @d3magexgod @plankevm Ora uses Plank (aka Sensei IR) to lower to bytecode. They are connected but have different roles. You can think Ora as a modern take on Refinements, SMT formal verification and strong comptime. TLDR: Ora helps you every step of the way. Ora loves auditors :D

English

Plank@plankevm·26 Şub

Our first blog post is up! We explain our high-level vision, design & roadmap. 🏗️ plankevm.github.io what happened to the website design? sorry no time, we need to get back to shi— 🚀💻

English

5.3K

Sotnyk.етн@d3magexgod·28 Şub

@philogy @plankevm This reminds me of Yul and Solidity. Solidity implicitly adds some checks to the bytecode, and Yul just works as you've written. Thanks. Looking forward to new updates!

English

philogy@real_philogy·28 Şub

They share some similarities (e.g. some comptime overlap) but the way I see it: Plank prioritizes raw capabilities & features first but in exchange will force the dev to build more things from scratch (at least initially). While Ora is more high-level & batteries included and is prioritizing refinement types + first class SMT-based formal verification. but maybe @Logicb0x has a different take

English

Sotnyk.етн@d3magexgod·25 Şub

Almost forgor to fill out the annual Solidity survey!

English

Keşfet

@realbarnakiss @drakefjustin @CreatorsOfChaos @JKim_Tran @alexanderlee314 @real_philogy @oleh_bc @cartoonitunes