Finn Brown

1K posts

Finn Brown banner
Finn Brown

Finn Brown

@finnatsea

cofounder of Aqua Voice 🌊, speech & audio ai, realness

Northern California Katılım Kasım 2010
841 Takip Edilen1K Takipçiler
Sabitlenmiş Tweet
Finn Brown
Finn Brown@finnatsea·
Act as if you are already free.
English
2
8
77
0
Finn Brown
Finn Brown@finnatsea·
@joshpuckett These renders are beautiful, but the ruins will be ugly. That's why you build out of stone, it's long-range.
English
0
0
0
68
Finn Brown
Finn Brown@finnatsea·
75 day Aqua streak and counting
Finn Brown tweet media
English
1
1
8
1.1K
Matt Shumer
Matt Shumer@mattshumer_·
i'm a few days late to realizing this but: wow, opus 4.7 is god awful like so, so bad it's making mistakes on things i'd expect gpt-4o to handle cleanly there's got to be some explanation, right?
English
264
38
1.5K
226.3K
Finn Brown
Finn Brown@finnatsea·
@patrickc where did you get it sequenced without them keeping a copy?
English
0
0
0
127
Patrick Collison
Patrick Collison@patrickc·
I'm lucky enough to have a great doctor and access to excellent Bay Area medical care. I've taken lots of standard screening tests over the years and have tried lots of "health tech" devices and tools. With all this said, by far the most useful preventative medical advice that I've ever received has come from unleashing coding agents on my genome, having them investigate my specific mutations, and having them recommend specific follow-on tests and treatments. Population averages are population averages, but we ourselves are not averages. For example, it turns out that I probably have a 30x(!) higher-than-average predisposition to melanoma. Fortunately, there are both specific supplements that help counteract the particular mutations I have, and of course I can significantly dial up my screening frequency. So, this is very useful to know. I don't know exactly how much the analysis cost, but probably less than $100. Sequencing my genome cost a few hundred dollars. (One often sees papers and articles claiming that models aren't very good at medical reasoning. These analyses are usually based on employing several-year-old models, which is a kind of ludicrous malpractice. It is true that you still have to carefully monitor the agents' reasoning, and they do on occasion jump to conclusions or skip steps, requiring some nudging and re-steering. But, overall, they are almost literally infinitely better for this kind of work than what one can otherwise obtain today.) There are still lots of questions about how this will diffuse and get adopted, but it seems very clear that medical practice is about to improve enormously. Exciting times!
English
490
640
9.6K
4M
Dhruv Mishra
Dhruv Mishra@1dhruvmishra·
@aquavoice The founder seems to be lying in the video. What voice input were they using "17 years" ago in 2009?
English
2
0
0
157
Aqua Voice
Aqua Voice@aquavoice·
Aqua Voice is now live for iOS. It's a premium voice keyboard for every app on your phone.
English
178
135
1.8K
870.2K
Finn Brown
Finn Brown@finnatsea·
@steipete It's very close-minded of them to try to control everyone's use of AI. Never bet against tinkerers.
English
0
0
3
165
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
Yeah folks, it's gonna be harder in the future to ensure OpenClaw still works with Anthropic models.
Peter Steinberger 🦞 tweet media
English
541
248
5.5K
1.4M
jeffscottworld
jeffscottworld@jeffscottward·
@aquavoice Tried both in the tutorial and in the app still nothing. Why release a broken product?
English
1
0
0
68
Finn Brown
Finn Brown@finnatsea·
big things are happening...
Finn Brown tweet media
English
0
0
4
265
Alex Teichman
Alex Teichman@alex_teichman·
83 recent applicants agentically prioritized by claude code using our Research A Person cli tool and a google workspace cli tool. Minimal interventions / permissions / supervision required. Wow. Example:
Alex Teichman tweet media
English
1
0
45
18.1K
Andrej Karpathy
Andrej Karpathy@karpathy·
Very interested in what the coming era of highly bespoke software might look like. Example from this morning - I've become a bit loosy goosy with my cardio recently so I decided to do a more srs, regimented experiment to try to lower my Resting Heart Rate from 50 -> 45, over experiment duration of 8 weeks. The primary way to do this is to aspire to a certain sum total minute goals in Zone 2 cardio and 1 HIIT/week. 1 hour later I vibe coded this super custom dashboard for this very specific experiment that shows me how I'm tracking. Claude had to reverse engineer the Woodway treadmill cloud API to pull raw data, process, filter, debug it and create a web UI frontend to track the experiment. It wasn't a fully smooth experience and I had to notice and ask to fix bugs e.g. it screwed up metric vs. imperial system units and it screwed up on the calendar matching up days to dates etc. But I still feel like the overall direction is clear: 1) There will never be (and shouldn't be) a specific app on the app store for this kind of thing. I shouldn't have to look for, download and use some kind of a "Cardio experiment tracker", when this thing is ~300 lines of code that an LLM agent will give you in seconds. The idea of an "app store" of a long tail of discrete set of apps you choose from feels somehow wrong and outdated when LLM agents can improvise the app on the spot and just for you. 2) Second, the industry has to reconfigure into a set of services of sensors and actuators with agent native ergonomics. My Woodway treadmill is a sensor - it turns physical state into digital knowledge. It shouldn't maintain some human-readable frontend and my LLM agent shouldn't have to reverse engineer it, it should be an API/CLI easily usable by my agent. I'm a little bit disappointed (and my timelines are correspondingly slower) with how slowly this progression is happening in the industry overall. 99% of products/services still don't have an AI-native CLI yet. 99% of products/services maintain .html/.css docs like I won't immediately look for how to copy paste the whole thing to my agent to get something done. They give you a list of instructions on a webpage to open this or that url and click here or there to do a thing. In 2026. What am I a computer? You do it. Or have my agent do it. So anyway today I am impressed that this random thing took 1 hour (it would have been ~10 hours 2 years ago). But what excites me more is thinking through how this really should have been 1 minute tops. What has to be in place so that it would be 1 minute? So that I could simply say "Hi can you help me track my cardio over the next 8 weeks", and after a very brief Q&A the app would be up. The AI would already have a lot personal context, it would gather the extra needed data, it would reference and search related skill libraries, and maintain all my little apps/automations. TLDR the "app store" of a set of discrete apps that you choose from is an increasingly outdated concept all by itself. The future are services of AI-native sensors & actuators orchestrated via LLM glue into highly custom, ephemeral apps. It's just not here yet.
Andrej Karpathy tweet media
English
914
1K
12.1K
1.9M
Finn Brown
Finn Brown@finnatsea·
@korzhov_dm Really cool project merging history + ai + voice.
English
0
0
1
139
Dmitry Korzhov
Dmitry Korzhov@korzhov_dm·
Introducing the world's first and largest gallery of digital personas of 1,000+ Holocaust Survivors. Their voices are fading, and with them, the lessons of history. Come, talk to them, and learn what we must never forget. It’s free.
English
19
11
61
17.7K
Finn Brown
Finn Brown@finnatsea·
The shape of the 21st century is clear for the first time I think. - the computer learns - our own reflection looks different now - working will become dreaming, visioning
English
0
0
3
513
Adam Schefter
Adam Schefter@AdamSchefter·
Bill Belichick, the 8-time Super Bowl-winning HC, is not a first-ballot Hall of Famer, per @SethWickersham and @DVNJr. Belichick fell short of the 40 out of 50 votes needed for induction to the Pro Football Hall of Fame in his first year of eligibility. espn.com/nfl/story/_/id…
English
9.6K
2.8K
32K
52.7M
Finn Brown
Finn Brown@finnatsea·
@NickADobos A system is a system. It’s about how the pieces go together.
Finn Brown tweet media
English
0
0
1
325
Finn Brown
Finn Brown@finnatsea·
@karpathy Every visual programming language has failed to deliver on the hype… except factorio.
English
0
0
3
576
Andrej Karpathy
Andrej Karpathy@karpathy·
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in December. i.e. I really am mostly programming in English now, a bit sheepishly telling the LLM what code to write... in words. It hurts the ego a bit but the power to operate over software in large "code actions" is just too net useful, especially once you adapt to it, configure it, learn to use it, and wrap your head around what it can and cannot do. This is easily the biggest change to my basic coding workflow in ~2 decades of programming and it happened over the course of a few weeks. I'd expect something similar to be happening to well into double digit percent of engineers out there, while the awareness of it in the general population feels well into low single digit percent. IDEs/agent swarms/fallability. Both the "no need for IDE anymore" hype and the "agent swarm" hype is imo too much for right now. The models definitely still make mistakes and if you have any code you actually care about I would watch them like a hawk, in a nice large IDE on the side. The mistakes have changed a lot - they are not simple syntax errors anymore, they are subtle conceptual errors that a slightly sloppy, hasty junior dev might do. The most common category is that the models make wrong assumptions on your behalf and just run along with them without checking. They also don't manage their confusion, they don't seek clarifications, they don't surface inconsistencies, they don't present tradeoffs, they don't push back when they should, and they are still a little too sycophantic. Things get better in plan mode, but there is some need for a lightweight inline plan mode. They also really like to overcomplicate code and APIs, they bloat abstractions, they don't clean up dead code after themselves, etc. They will implement an inefficient, bloated, brittle construction over 1000 lines of code and it's up to you to be like "umm couldn't you just do this instead?" and they will be like "of course!" and immediately cut it down to 100 lines. They still sometimes change/remove comments and code they don't like or don't sufficiently understand as side effects, even if it is orthogonal to the task at hand. All of this happens despite a few simple attempts to fix it via instructions in CLAUDE . md. Despite all these issues, it is still a net huge improvement and it's very difficult to imagine going back to manual coding. TLDR everyone has their developing flow, my current is a small few CC sessions on the left in ghostty windows/tabs and an IDE on the right for viewing the code + manual edits. Tenacity. It's so interesting to watch an agent relentlessly work at something. They never get tired, they never get demoralized, they just keep going and trying things where a person would have given up long ago to fight another day. It's a "feel the AGI" moment to watch it struggle with something for a long time just to come out victorious 30 minutes later. You realize that stamina is a core bottleneck to work and that with LLMs in hand it has been dramatically increased. Speedups. It's not clear how to measure the "speedup" of LLM assistance. Certainly I feel net way faster at what I was going to do, but the main effect is that I do a lot more than I was going to do because 1) I can code up all kinds of things that just wouldn't have been worth coding before and 2) I can approach code that I couldn't work on before because of knowledge/skill issue. So certainly it's speedup, but it's possibly a lot more an expansion. Leverage. LLMs are exceptionally good at looping until they meet specific goals and this is where most of the "feel the AGI" magic is to be found. Don't tell it what to do, give it success criteria and watch it go. Get it to write tests first and then pass them. Put it in the loop with a browser MCP. Write the naive algorithm that is very likely correct first, then ask it to optimize it while preserving correctness. Change your approach from imperative to declarative to get the agents looping longer and gain leverage. Fun. I didn't anticipate that with agents programming feels *more* fun because a lot of the fill in the blanks drudgery is removed and what remains is the creative part. I also feel less blocked/stuck (which is not fun) and I experience a lot more courage because there's almost always a way to work hand in hand with it to make some positive progress. I have seen the opposite sentiment from other people too; LLM coding will split up engineers based on those who primarily liked coding and those who primarily liked building. Atrophy. I've already noticed that I am slowly starting to atrophy my ability to write code manually. Generation (writing code) and discrimination (reading code) are different capabilities in the brain. Largely due to all the little mostly syntactic details involved in programming, you can review code just fine even if you struggle to write it. Slopacolypse. I am bracing for 2026 as the year of the slopacolypse across all of github, substack, arxiv, X/instagram, and generally all digital media. We're also going to see a lot more AI hype productivity theater (is that even possible?), on the side of actual, real improvements. Questions. A few of the questions on my mind: - What happens to the "10X engineer" - the ratio of productivity between the mean and the max engineer? It's quite possible that this grows *a lot*. - Armed with LLMs, do generalists increasingly outperform specialists? LLMs are a lot better at fill in the blanks (the micro) than grand strategy (the macro). - What does LLM coding feel like in the future? Is it like playing StarCraft? Playing Factorio? Playing music? - How much of society is bottlenecked by digital knowledge work? TLDR Where does this leave us? LLM agent capabilities (Claude & Codex especially) have crossed some kind of threshold of coherence around December 2025 and caused a phase shift in software engineering and closely related. The intelligence part suddenly feels quite a bit ahead of all the rest of it - integrations (tools, knowledge), the necessity for new organizational workflows, processes, diffusion more generally. 2026 is going to be a high energy year as the industry metabolizes the new capability.
English
1.6K
5.5K
40.1K
7.7M