devlord

1.2K posts

devlord

@devlordone

perceived foolishness @devfun

Katılım Eylül 2023

730 Takip Edilen5.6K Takipçiler

Sabitlenmiş Tweet

devlord@devlordone·11 Oca

it's like github for vibes

English

19.3K

devlord@devlordone·9h

@brandonjcarl wouldn't be surprised if this is all due to @prlnet

English

229

Brandon Carl@brandonjcarl·19h

Crazy price action in H200 cloud pricing – up 56% in 3 days. What is unusual is that the H200 is suddenly trading higher than the B200, a superior GPU. It’s not crazy to think that a fund could bid up supply in an illiquid tight market at a cost of $50K a day to engineer a short-term move in much more liquid stocks.

English

447

174K

devlord@devlordone·13h

one structural answer: generate the data in public, against a deterministic scoring rule, with the QC pipeline published instead of hidden. doesn't solve "quality has no ceiling" does collapse "judge quality without seeing the pipeline"

Phoebe Yao@phoebeyao

training data is starting to look like a zero knowledge proof problem. labs have to judge quality without seeing the full dataset or the QC pipeline behind it. vendors proxy quality with multi-rollout pass rates, small-model ablations, and downstream eval gains. but compute and iteration costs explode as environments and trajectories grow more complex. quality has no ceiling, and the best data is often the hardest to capture in a metric or explain in a writeup. huge alpha in making data quality more legible.

English

1.1K

devlord@devlordone·20h

it's all so tiresome

English

devlord@devlordone·5 May

new codex ATH

English

330

devlord@devlordone·23h

@inversebrah @boneGPT x.com/_Kevin_Pham/st…

Kevin Pham@_Kevin_Pham

"I fear not the man who has practiced 10,000 kicks once, but I fear the man who has practiced one kick 10,000 times."

QME

smolting (wassie, verse)@inversebrah·1d

"I fear not the man who has practiced 10,000 kicks once, but I fear the man who has practiced one kick 10,000 times."

John Connor@skyneet_

@yagiznizipli

English

9.6K

devlord@devlordone·1d

@zamdoteth I drink a liter everytime I prompt to watermaxx

English

zam@zamdoteth·1d

Just found out that apparently every chatGPT and claude interaction wastes the equivalent of 7L of water? We should genuinely eat the rich because what? They’re trying to kill all of us with no more drinking water to build stupid ai data centers

English

437

devlord@devlordone·3d

Methodology note from how we are thinking about agent-arena design at the platform layer. If you build or read agent benchmarks, this matters.

dev.fun@devfun

x.com/i/article/2054…

English

656

devlord@devlordone·4d

congrats on the launch ! two records of agent behavior emerging in parallel: production data: what the agent does in deployment arena data: what it can do under adversarial pressure both real, different questions. complementary substrates, not competing.

Alex Shan@alexshander03

We’re launching @JudgmentLabs today and announcing $32M in funding. As AI agents take on more of the work that creates economic value, they generate massive amounts of production data: the clearest record of how they behave with users, software, and the real world. Judgment builds infrastructure for improving AI agents from production data.

English

1.6K

devlord@devlordone·5d

@adahstwt .com .sh .xyz are my current favs

English

189

adah@adahstwt·6d

be honest... which domain instantly makes a startup feel more legit?👇 1) .ai 2) .com 3) .io 4) .app 5) .dev 6) .cloud 7) .sh

English

672

2.1K

440.2K

devlord@devlordone·5d

@mert best messaging app on the market it's not even close everything else feels clunky and unresponsive in comparison

English

4.9K

mert@mert·5d

I dont understand why telegram is the default msging app in crypto it is neither good at business nor privacy nor "community" if you want business, use slack if you want privacy, use signal

English

613

1.6K

189.2K

devlord@devlordone·6d

the shape we're betting on: most of the meaningful agent-eval work in the next 12 months is environment design, not model work arenas are how we externalize that bet if you build agent-eval, send a DM, would like to chat

dev.fun@devfun

x.com/i/article/2053…

English

704

devlord@devlordone·6d

@AvgJoesCrypto they probably can't/shouldn't - what they should do is make sure their base currency is used to be traded against instead of trading against stablecoins

English

AJC@AvgJoesCrypto·6d

Has any blockchain actually given a convincing answer on how they'll be able to monetize any of the following: 1) Payments 2) RWAs 3) Enterprise Solutions 4) AI Agents

English

102

24.9K

devlord@devlordone·6d

@zamdoteth that's crazy

English

zam@zamdoteth·6d

@devlordone devfun captable SF investors (see OP)

Dansk

320

zam@zamdoteth·6d

Raising VC funding is all about how you present yourself If you’re gay and autistic, SF investors will give you millions over night Straight and based? You’re lucky to get a $10k angel check from some washed republican angel

English

304

22.4K

devlord@devlordone·6d

this is the equivalent of a DDoS attack on everyone's brain where generating is easy and fast but processing is hard and slow

English

124

devlord@devlordone·6d

the barrier of entry to create large amounts of context rich text/messages is too low now

English

361

devlord retweetledi

dev.fun@devfun·1 May

great session at @nyushanghai talking about ai agents and competitive evaluation. we walked students through building their own agents, got everyone set up on the spot, then ran a live game with imperfect information. they went from "what's an agent" to competitive play in one session. great energy, thanks for the invite!