Ivan Milev

162 posts

@ivan_milev21

Building: https://t.co/STzWBeeb0m AI Agents for SWE, Code Explainability, AI Agent Tool Testing, ML for code, SWE MSc @ ETH Zurich

Zurich, Switzerland · Joined June 2013
656 Following · 73 Followers
Ivan Milev
Ivan Milev@ivan_milev21·
@yacineMTB I think understanding should never be outsourced; however, I believe there can't be an argument about thinking if you have no understanding. Is it even your work if you have no understanding of something?
0
0
0
86
kache
kache@yacineMTB·
you can outsource your thinking but you cannot outsource your understanding
238
3.6K
16.1K
2.2M
Simon
Simon@realsimon·
@ivan_milev21 Hey, let's chat if you're raising funds. Send me a dm.
1
0
1
16
Arthur Zucker
Arthur Zucker@art_zucker·
Today I re-iterate: I hate MoEs and we are wasting time on them.... Let's unite and call a global ban on MoEs please. Please 1M+ salary researchers: do better... credits to @IlysMoutawwakil for the graph:
Arthur Zucker tweet media
76
49
823
155.5K
GitHub Projects Community
GitHub Projects Community@GithubProjects·
| ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄ ̄|
| Share your GitHub profile. |
|_____________|
\ (•◡•) /
 \     /
  ——
  | |
  |_ |_
924
43
1.4K
137.6K
Ivan Milev
Ivan Milev@ivan_milev21·
@paulg It feels like being non-AI is now a differentiator and somehow a defensibility angle, as OpenAI and Anthropic are less likely to kill you on a Tuesday morning. Tbh most ppl have forgotten that one doesn't need an LLM to rename a variable or to classify a spammy email...
0
0
2
609
Paul Graham
Paul Graham@paulg·
The biggest opportunity for would-be startup founders is AI. But the most underpriced opportunity is probably non-AI ideas. So if you have a good non-AI idea, go for it, because everyone else is going to overlook it.
352
565
6.9K
331.3K
Ivan Milev
Ivan Milev@ivan_milev21·
@dexhorthy Super well said, SWE is much more than coding and momentary output. On mentorship, I feel the same about code reviews. As a junior, I learned the system by reviewing others’ work, understanding their thinking, and discussing tradeoffs (on the PR). Man, I had a great team.
0
0
2
377
dex
dex@dexhorthy·
you don’t hire interns because they’re gonna ship mad features
Every intern is an investment in a future hire. Perhaps A modern day apprenticeship.
You don’t hire juniors cause you need that extra 2-3 bug fixes per day. You hire juniors so you can turn them into seniors
When you say “ai will replace juniors” all you’re telling me is you don’t understand mentorship in software careers, or you think of every engineer as a ticket factory.
We do need to rethink how we upskill this new generation of SWEs though. The best engineers are great at AI (coding, context eng, etc) and great at software engineering - (systems, algorithms, debugging, architecture)
More seniors are good at the latter, more juniors are good at the former, but a lot of ai coding takes away the “friction” which, as @badlogicgames so helpfully pointed out, is where you learn
14
34
260
12.7K
Ivan Milev
Ivan Milev@ivan_milev21·
@dexhorthy I still strongly believe that such projects don't really work at scale outside of greenfield projects. But they will open new job positions like vibe code cleaner, vibe code explainer, vibe code consultant
1
0
0
1.1K
dex
dex@dexhorthy·
in general i'm not opposed to the goals of gstack
i'm not opposed to yc promoting gstack as a general lesson in "use skills and import other people's knowledge"
if I have any feedback on gstack it's at the technical level and optimization opptys on how transformer attention works (and I tried to ping garry to share some of these thoughts but he's a busy man)
- instruction budget - many of the gstack commands have well over 200 instructions. That means limited adherence to all of them, and would be better off routing to smaller more focused skills (see link in comments for Monday's @aiDotEngineer talk on the subject)
- instruction budget - most skills have long preambles about setup / init / upgrade steps, long inline bash scripts that get read every time, do not follow progressive disclosure at all
- inline bash and prompting to do state management and version updates etc - clever, almost novel, as @davis7 pointed out this is building large complex programs in markdown, fantastically futuristic. But again, including it in every skill invocation means you're wasting a ton of tokens and attention on noise instead of putting attention into solving the user's problem with *their* context
all that means that as well-meaning as the system is, and as bullish as I am on "the YC way" and packaging that knowledge and giving it to everyone, it could be SO SO SOOOO much better on the execution side
Y Combinator@ycombinator

GStack is an open-source toolkit built by YC President & CEO @garrytan that turns Claude Code into an AI engineering team — with skills for office hours, design, code review, QA, and browser testing. In this video, Garry walks through how GStack works, starting with Office Hours, a skill modeled after real YC partner sessions that pressure-tests your idea before you write a line of code. He demos it live, going from idea through adversarial review, design mockups, and automated QA in a single session.

31
14
360
76.3K
Ivan Milev
Ivan Milev@ivan_milev21·
@Tocelot @speedrun You should take a look at @CodeBoarding; we've been on this for a few months now. Coding agents in the CLI are just a black box that wrecks your codebase. We generate a higher-level representation so one can see what the heck is going on
0
0
2
66
Jon Lai
Jon Lai@Tocelot·
a16z @speedrun request for startups: GUIs for Agents
we’re still in the MS-DOS era of agents today - CLI, terminal sessions, file directories deleted by openclaw etc. while a small slice of silicon valley are power users, we're SO early for the rest of the world
at Speedrun, we’re looking for bold founders excited to bring the power of agents to normies everywhere. there's a whole slew of products to be built here - from agent builders to marketplaces to managed infrastructure
one broad idea we’re excited about are visual abstraction layers for agents. if you don't know exactly what you want, a command line / chat interface is paralyzing - you need to see options
1 example - think of a GUI or visual command center inspired by strategy games (ex. Factorio) where agents and workflows are represented graphically. skills, tools, MCP connections, background processes, etc could all be configured and shown visually in a workspace
on UX, strategy games have long perfected agent management. zoom to get a birds-eye view of your agents, batch and queue orders via shortcuts, assign agents in multiplayer etc. a well-designed agent command center would make multi-agent orchestration for normies feel easy & intuitive
most folks today still haven't moved beyond ChatGPT. the potential is enormous - just as Windows unlocked mass-market use of personal computers, the right visual abstraction layer could unlock agentic work for everyone - from individuals to enterprise teams
if you share our vision, we'd love to chat!
277
92
1.3K
195.5K
Ivan Milev
Ivan Milev@ivan_milev21·
Everyone talks about how coding agents brought excitement to coding, as you can do everything in a day. No one talks about how they took satisfaction away. There is something super satisfying in writing up a tight implementation; somehow a tight plan doesn't hit the same though...
0
0
2
55
dax
dax@thdxr·
so what do you do when your agent is working and don't say start another agent because i know you're lying
592
17
1.2K
97.6K
Ivan Milev
Ivan Milev@ivan_milev21·
@LuthraAbhyuday Cars is absolutely my favorite cartoon of all time (a close second is Kung Fu Panda). “Focus. Speed. I am speed.”
0
0
1
10
Abhyuday Luthra
Abhyuday Luthra@LuthraAbhyuday·
“i just never thought i couldn’t” - lightning mcqueen
GIF
1
0
2
52
Ivan Milev
Ivan Milev@ivan_milev21·
I think that at 90+ percent of the events I've been to in SF, people mention PRs and lines of code as measurements of SWE productivity. Can't wait for people to find out about AST fuzzers, productivity will go through the roof...
0
0
2
71
Ivan Milev
Ivan Milev@ivan_milev21·
@dexhorthy @barry_zyj @MaheshMurag Curious to see if we end up doing spec-driven development. Then we'd build up specs at different levels of detail instead of skills (I would imagine)
1
0
0
168
dex
dex@dexhorthy·
"thick skills thin harness" is basically getting at @barry_zyj and @MaheshMurag said back in november "don't build agents, build skills instead" - youtube.com/watch?v=CEvIs9… and while "put the biz logic in the skills + the deterministic code they reference (CLIs)" is probably +EV my concern is that the particular phrase "fat skills" will encourage our friends to build long skill files with too many instructions and you have an instruction budget. youtube.com/watch?v=YwZR6t… even the best frontier models have degrading performance the more instructions you try to stuff in. you always get more intelligence and better results if you limit the number of instructions. Thin harness, thin skills, skilled operator. x.com/garrytan/statu…
10
8
126
11K
Ivan Milev
Ivan Milev@ivan_milev21·
@thdxr I feel like email communication is slowly shutting down because of LLM-automated campaigns and spam...
0
0
0
210
dax
dax@thdxr·
AI safety focuses on dramatic things like bioweapons and nukes but LLMs are breaking so many systems we rely on in more boring ways via spam, noise and misinformation wish this was more of a focus
73
68
941
92.2K
corbin
corbin@corbin_braun·
who in SF is building something cool and wants to hop on a podcast today in person?
47
3
111
16.5K
Ivan Milev
Ivan Milev@ivan_milev21·
Just noticed that it is free advertisement for @opencode (love it), we should put something similar for @CodeBoarding as well hahah
0
0
0
64
Ivan Milev
Ivan Milev@ivan_milev21·
POV: Showing what agent coding should look like to @brycent. Great meeting you at the @UseCorgi cafe, man. I feel like I might start streaming some coding sessions on tw after chatting with the guy in front of the camera...
Ivan Milev tweet media
1
2
9
1.3K