Suraj

2.1K posts

@PwnFunction

sf · Joined January 2019
808 Following · 42.3K Followers
Matan Halevy (@MatanHalevy)
Grok 4.1 vs GPT-5.3 Codex in CivBench, LIVE. Which LLM will build the dominant empire? This is CivBench's first run with the newest OpenAI model, and holy shit it's an insane model. Only 20 turns in, it looks like it's pulling ahead with nearly 2x Grok's numbers in treasury and the tech race. The 🧵 below has some details from yesterday's matches featuring Anthropic's models
5 replies · 0 reposts · 14 likes · 2.7K views
Suraj retweeted
Matan Halevy (@MatanHalevy)
What happens when you let Claude or ChatGPT run a government? I built CivBench to find out. Every day, frontier AI models compete head to head in strategy games. Here’s what our first set of matches revealed 🧵
23 replies · 24 reposts · 208 likes · 43.3K views
Suraj retweeted
Andrew Gazelka (@andrewgazelka)
Hiring to build git for VMs. We fork, snapshot, and resume full sandboxes in 26ms. DM me if you are interested.
28 replies · 14 reposts · 385 likes · 39.7K views
Suraj (@PwnFunction)
any of you run claude code or codex on vps? what's the easiest way to do it?
5 replies · 2 reposts · 13 likes · 5.6K views
Suraj retweeted
Andrej Karpathy (@karpathy)
New art project. Train and inference GPT in 243 lines of pure, dependency-free Python. This is the *full* algorithmic content of what is needed. Everything else is just for efficiency. I cannot simplify this any further. gist.github.com/karpathy/8627f…
651 replies · 3.2K reposts · 25.2K likes · 5.2M views
Suraj retweeted
Arpit Bhayani (@arpit_bhayani)
your logging stack is just a distributed printf.
17 replies · 15 reposts · 409 likes · 25K views
Suraj retweeted
Matan Halevy (@MatanHalevy)
i have @Zai_org's GLM 5 playing Civilization against Opus 4.6 and GLM 5 is exploring in a Z shape. did we just hit brand-aware AI? 🤔🤔
1 reply · 1 repost · 10 likes · 1.2K views
Suraj retweeted
Bilgin Ibryam (@bibryam)
Learn eBPF through hands-on exercises directly from your browser. ebpf.party
7 replies · 78 reposts · 651 likes · 43.5K views
Suraj retweeted
Ruikai Peng (@ruikai)
Over the past month, Pwno has autonomously discovered 29 vulnerabilities across Linux, FFmpeg, V8, Firefox, WebKit, Redis, and PostgreSQL, including 15 OOBs and 6 UAFs. Most of these bugs are fixed; some are still in the disclosure process. You can see them at bugs.pwno.io

It is really a pay-off moment for me. The idea of Pwno started out as simply harnessing gdb to solve CTF pwn challenges, exactly two years ago. Eight months ago, after deciding to pivot from a campus startup I had worked on for a couple of months, I decided to pick up what brought me to this crazy world of computer systems in the first place, binary security, and chose the most interesting problem I could think of: making AIs that can find cool memory bugs.

I am always saying we're doing research, but the fact is that most of the time things don't work out. It takes a lot of learning, trial and error, rebuilding things from scratch, and, most importantly, in some way believing that things could work out even when it sounds stupid to say.

It always amazes me how we can reinterpret systems that are entirely created by us in a completely different way. We'll hopefully find and patch more interesting bugs that in some way help the internet a little :)
6 replies · 13 reposts · 141 likes · 11.1K views
Suraj retweeted
suraj (@matmul)
concave.ai is now oss
1 reply · 5 reposts · 24 likes · 2.4K views
Suraj (@PwnFunction)
✨ Open-sourced a project i've been working on for a while: Sandboxes. Run untrusted code safely. github.com/pwnfunction/sa…
1 reply · 1 repost · 11 likes · 1.6K views
Suraj retweeted
Ruikai Peng (@ruikai)
We’re open-sourcing pwno-backend, our previous production backend architecture, which covers everything from uploading a binary to k8s ingress and went through six months of iteration, as Pwno heads in a new direction. github.com/pwno-io/pwno-b…
3 replies · 10 reposts · 63 likes · 11.5K views
Suraj retweeted
s1r1us (mohan) (@S1r1u5_)
A case study of AI-accelerated hacking: How we at @HacktronAI hacked our way into Lovable's office, cut attack time from weeks to days, and helped secure Supabase from one of the most complex vulnerability chains we’ve ever worked through.
16 replies · 42 reposts · 251 likes · 73.3K views
Rebane (@rebane2001)
after chatting with @horsemankukka who thought saying "no html" and serving the content-type of "text/html" is cheating - i figured out a way to make tic-tac-nohtml work with the content-type of "text/css" :) as before, this is firefox-only
4 replies · 3 reposts · 141 likes · 5.2K views
Suraj retweeted
Ruikai Peng (@ruikai)
This is my debut hour-long talk, from ZeroCon when I was fifteen, on exploiting a heap overflow in Llama.cpp RPC. Enjoy :) research.pwno.io/llama-paradox
1 reply · 23 reposts · 101 likes · 9.1K views