andrey

702 posts

andrey

andrey

@andreylebedev29

australian in london. building. ex-goldman sachs M&A. prev world #1 concert guitarist. 2:47 marathoner. side project https://t.co/XPRQd0UyIf

London Katılım Haziran 2014
1.3K Takip Edilen776 Takipçiler
andrey
andrey@andreylebedev29·
Really insightful analysis. The Napster analogue is spot on. Totally agree!
Daniel Jeffries@Dan_Jeffries1

I think I finally figured out why OpenClaw is amazing and took off like wild fire and why Peter is a genius, as Altman called him. And it's actually a different way of looking at it. It's not a DeepSeek moment for agents. It's a Napster moment. And just like Napster it will eventually force the industry to change. In essence when Napster came out the entire world told the music industry we don't want to buy CDs anymore and if you don't provide us a digital download experience we are just going to take it until you do. It forced the industry to create Apple Music and eventually Spotify. Both essentially killed most music piracy by making it ubiquitous and cheap and good. But it forced change. The same will now happen to software. Here's why: In essence OpenClaw lets you take what vendors don't want to give you: Unified access to countless applications. We all want a personal assistant that can talk to freaking everything and do anything for us in the digital world. But vendors don't want this. They want you locked into their bullshit. For example, none of the messaging platforms want bots on there. None. They all have explicit policies against them and make it hard to do this. WhatsApp doesn't want you on there. Signal. Telegram's bot father is garbage. It's all designed to keep bots out. They were designed for a pre-agentic era when bot = spam. Many other things are like this. The API layers are gated, hoop-jumping bullshit. Go get an enterprise account and wait for approval and yada yada. Want access to WhatsApp? Get a business account and attach a number (what small business has a real number anymore 😂) and messages can't come from a person, etc. Google ads? It's not just an auth, it's go get a special manager account and create an enterprise key and blah blah blah. It's a horrible experience because it was all designed for corporations to control access. Now people are saying, make your app easy to access and accessible to me and my machine avatars and do it in a headless way or you will be dead. Peter hacked around all this by making everything command line in the classic Linux style and using things like an open source library that reverse engineered the web version of WhatsApp. It's all a bit house-of-cards-y because he had no choice. At my company we had a similar idea early (and failed). Basically we wanted to make the best multimodal/computer using model because then it doesn't need an API or access hoops. You just go through the human interface layer and ain't nobody going to stop you. We failed because we weren't big enough and it's really a job for the mega-labs to solve because it is a hard problem and costs a shit ton of money. Peter was much smarter. Make it all command line because that is ready now. Use any reverse engineered library or project or proxy available come Hell or high water and make it work by any means necessary even if it is hacky. In short, he signaled to the software world that they better change and change fast or we are going to do this anyway and you can't stop us. Of course some are foolishly trying. Meta is banning Claws on WhatsApp, etc. They will all try to build their own gated, controlled, enshittified version of this thing. They will fail. And eventually everyone will offer a clear, easy way to get access via API for agents or they will be gone. In essence OpenClaw gave people what they wanted, which was an app connected to everything, even when most of the vendors don't want you to have this.

English
0
0
1
26
Ian Livingstone
Ian Livingstone@ianlivingstone·
Incredibly excited to announce Keycard for Coding Agents - no more copy & pasting credentials or approving individual tool calls. Agents get task-scoped access, so you can stay in flow and actually build. You’re only pulled in when it matters. Yolo mode, without compromise.
Keycard@KeycardLabs

Your coding agents inherit your credentials and your permissions. No identity system in the stack can tell the difference between you and the agent acting in your name. Today: Keycard for Coding Agents 🧵

English
19
7
99
39.7K
andrey
andrey@andreylebedev29·
@pmarca Mine is at least 70% podcasts. So much great information. Started building a searchable index of @tbpn shows this weekend to help me stay on top of the latest developments, in case I miss a show
English
0
0
1
135
Marc Andreessen 🇺🇸
My information consumption is now 1/4 X, 1/4 podcast interviews of the smartest practitioners, 1/4 talking to the leading AI models, and 1/4 reading old books. The opportunity cost of anything else is far too high, and rising daily.
English
1.4K
3.9K
35.1K
34.5M
andrey
andrey@andreylebedev29·
@myuan95 The two are connected. And they still struggle with genuinely complex models. The relationships and logic are not obvious. Merger models with switches and cases and ranged outputs etc. They're a very long way off from understanding these in my experience
English
1
0
0
10
Michael Yuan
Michael Yuan@myuan95·
@andreylebedev29 True true. But I think this is a use case where AI actually does very well? Not writing the correct formulas but checking them
English
1
0
0
22
Michael Yuan
Michael Yuan@myuan95·
Brett hits the nail on the head here. I've been thinking for a while: Even if we hit 90% on Excel benchmarks (which will be tough).. So what? 1) As we know, benchmarks are gamed so that 90% doesn't translate to real-world usage 2) In Finance, 90% gets you fired
Brett Caughran@FundamentEdge

This is one of the biggest sticking points on AI Excel that I'm trying to understand. 73% accuracy is progress, but is it useful for anything at all? We were on a vendor call last month and the vendor bragged of hitting 65% accuracy in Excel and Andrew Carr and I texted "an analyst who is 65% accurate in Excel is 100% fired". Why is AI Excel only 60-70% accurate? Are these issues fundamental or solvable? > Is MCP fundamentally too brittle to get to 99% accuracy? > Is the data layer clean enough to hit 99% accuracy (i.e. there's a reason why hedge fund analysts don't start their models with a Bloomberg download) > Are the foundation models powerful enough to handle the multi-modal (filings, PRs, investor decks, data supplementals), multi-document, "needle in a haystack" issues for LLMs? Context windows have grown, but they are still not large enough to capture all of the documents and files for one ticker (letalone a coverage universe) > Is the commercial opportunity large enough for foundation labs to build RL environments for public equity modeling, as they are doing on investment banking modeling? Does the "march of 9s" on AI Excel take 6 months or 6 years? Driverless cars took 13 years from DARPA Urban challenge to first Waymo. These are legit questions. I don't know. I also don't really trust public evaluation sets (i.e. LLM's win physics competitions...then you learn the LLM trained on the physics competition test bank lol). The real questions in investment research modeling are out of sample questions (i.e. how to model SAAS retention in a Claude-world...there is no prior on which to rely). So I am building my own evaluation set. 100 use cases ranging from simple (input 3 statements from 10-K to AMZN model) to complex (model GE split/spin). Am I wasting my time? 36 months form now, are we still only at 80% accuracy in AI Excel? These are questions, now answers - love your takes in replies or DM!

English
3
0
2
1.1K
andrey
andrey@andreylebedev29·
Fwiw I don’t think this is a threat to DataSnipper per se. Because the value has always been in the depth of enterprise workflow adoption, security, and ecosystem
English
0
0
0
18
andrey
andrey@andreylebedev29·
This is crazy powerful. Unicorns have been built on Invoice-to-Table conversion at scale (eg DataSnipper). And now it’s just a feature. To make this more production-ready, you would need visual grounding so humans can quickly validate the accuracy. But crazy progress
Aaron Levie@levie

It’s starting to get wild what agents are able to do within your software. Here’s Box + Claude + Excel for full spreadsheet automation. This is just the beginning. Agents that have access to your data and tools, safely, will accelerate all knowledge work.

English
1
0
0
43
Chetan Puttagunta
Chetan Puttagunta@chetanp·
If you haven't listened to this conversation between @btaylor and @jaltma yet, I couldn't recommend it more. If you are building in the AI space right now, consider this required listening. Truly exceptional content. 10/10.
Jack Altman@jaltma

This week, on Benchmark's new podcast Uncapped 😂, I sat down with @btaylor, founder of Sierra and Chairman of OpenAI. He's easily one of the most impressive people I’ve met in tech or in general. We talked about AI and the saaspocalypse, the unique considerations of building an AI native / agent company, whether young or experienced founders have the advantage right now, Codex and OpenAI ads, and much more. Learned a ton from Bret, hope you enjoy. (0:00) Intro (0:20) The Saaspocalypse and systems of record (12:34) Sierra's landscape (17:05) Outcome-based pricing (24:22) The rapid evolution of AI support technology (28:21) Young founders vs. experienced founders (34:12) What comes next beyond support (38:47) Codex and the future of software engineering (51:49) OpenAI and advertising (54:59) Working with investors and boards

English
3
14
171
53.7K
andrey
andrey@andreylebedev29·
@gm_mertd This is future of a lot of software more generally I think. I’m running my life with Claude code connected with MCPs to everything now. It generates what I need on demand
English
1
0
0
26
Mert Deveci
Mert Deveci@gm_mertd·
There are a million AI CRMs out there yet all of them are missing the point. Request for startup: a CRM that runs and adjusts itself through code execution Code is the primary primitive to it. It enables the users to: - auto sync data to outbound platforms - send notifications (by writing code) - manipulate the user's tables to bring in different views - set up automations or custom fields automatically If I hadn't come to like the sales tech market, I would have definitely built this
English
3
0
3
163
andrey
andrey@andreylebedev29·
A founder I know: 3 people, bootstrapped, already live with a hospital, logistics co, and hotel chain. Pricing: "we save you €400k, we take 25%." He sits with the people doing the work. Takes their headsets. Watches over their shoulders. Rebuilds what he saw. Six weeks later, a whole team's job runs on its own. This model was uneconomic until very recently. AI changed the math. @mandrusko1, I think you're wrong about the mid-market. The three constraints that made Palantir clones unviable (proprietary platform, rare talent, uneconomic value delivery) have all shifted. The mid-market is open. Tell me where my reasoning breaks down:
andrey@andreylebedev29

x.com/i/article/2024…

English
0
0
0
39
andrey
andrey@andreylebedev29·
@andrewchen It also drafts emails. I'm not bold enough to let it send them yet, but they're there in my drafts folder ready to shoot out
English
0
0
0
25
andrey
andrey@andreylebedev29·
I'm doing something very close to this currently with Claude Code connected gmail, slack, whatsapp, granola via MCP. It just stores info as markdown files, that include all my relationships, to dos, due dates etc. And prompts me when I'm falling behind on goals. The crazy thing is it's just Claude Code with MCPs, scheduled jobs, and markdown files, but unbelievably powerful!
English
1
0
2
635
andrew chen
andrew chen@andrewchen·
what's the current best approach on an AI that can help me handle my email inbox? seems like a big opportunity for folks playing with openclaw. For all of us who are drowning in email, this seems like a tier one problem that would be amazing to solve. (And I think I would pay $150k/year to have this product? I bet I'm not the only one) what I want is: - watch my inbox and process emails as they come in - score each message to see if it seems important (look at the sender, the topic/body, if its addressed to me or a big list, if I've ever replied to the sender before, etc etc) - read the email and reference a vast DB of knowledge that's been assembled already (based on my work, meeting notes, what I've replied on, etc), and decide what to do - reply with a draft note. For now, don't send, so that I can review the email -- but in the future maybe there's a YOLO option (but it would probably disclose that it's my assistant writing) - if less important, label it and file away. Eventually gather summaries for all of these less important emails and send me a summary of all of them with links to get back to it - or archive if it seems unimportant - or unsubscribe / mark spam / block if random marketing - if critical send me a notification right away so I can take a look I've played around with a bunch of the current AI tools and nothing quite works like this. There's a lot of blockers: - first, it needs 1000x more context about each problem, which it could get by crawling all my projects/notes/emails/slides/meetings/etc - This system should be designed to take action rather than simply just prioritizing messages. We've had prioritized inboxes for a long time but they're fine, not great - then someone has to put this entire UX together to be cohesive. In the future, we may not even really have an email inbox, but instead an interaction that feels more like I'm talking to an assistant who has a few questions for me. But otherwise just wants to provide a few quick updates and get some yes/nos. And otherwise filter all the noise -- just give me the most important messages It feels like we're very, very close to being able to do this, with the latest models from Anthropic and Open AI, we have the technology already. Someone just needs to package it all together in a way where it's able to index all of your emails and notes and calendars and contacts and sort of create a second brain that knows almost everything that you know so that I actually do things that are intelligent. It seems like with the excitement of OpenClaw we have the architecture to integrate a lot of different data sources and to take actions across multiple different channels. And it's built with one sort of monolithic memory and context, so that you're able to interact with it in such a way where it feels like it can try to replicate your actions more closely than the relatively stateless and memoryless LLM chats that we've gotten accustomed to. If someone is working on this, please point them to me. I would be both a customer and an investor!
English
183
13
315
108.3K
nico
nico@nicochristie·
I gave claude access to a full music DAW and it can create, edit, and remix songs surprisingly well My dream is to make an album one day but these softwares are hard to learn Feels like a multi-million dollar product if someone wants to run with it!
English
9
3
55
6.9K
andrey
andrey@andreylebedev29·
Anthropic’s execution is utterly exceptional. They were (are?) the clear underdog to OpenAI, don’t have a cash printer to fund them like Google or Meta, and yet are over delivering every quarter. Great models and exceptional execution commercially. Killer API business and the best application layer product too (Claude Code)! Menlo Ventures must be thrilled with their early decision to back them!
Alex Albert@alexalbert__

Opus 4.6 is here. The jump in autonomy is real. The biggest shift for me personally has been learning to let it run. Give it the context, step away, and come back to something pretty amazing. The way we work alongside models is starting to completely change.

English
0
0
1
80