Arjun Balaji

14.2K posts

@arjunblj

investing/research @paradigm, technology, history, strategy games, boston sports. optimist

San Francisco, CA Joined March 2009
3.4K Following 69.3K Followers
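Arjun Balaji@arjunblj·
@PINTO03091 I am deeply impressed by your hand annotation at sub-pixel precision and by your release of high-quality open-source models. I have great respect for your many years of dedication. If equipment, cloud compute, or similar resources would be useful to you, I would be glad to help in whatever small way I can. If you are interested, please feel free to DM me. Thank you, as always, for your wonderful work.
Japanese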
0
0
12
1.1K
Arjun Balaji retweeted
Patrick McKenzie@patio11·
If the *only* impact of LLMs professionally was causing people to "think out loud" in a way which was routinely captured by computer systems and then could be operated on by computer systems, that would *by itself* be one of the most consequential changes in practice in 100 years
Patrick McKenzie@patio11

@snewmanpv @David_Kasten Incredible value in the log files, too, because they’re contemporaneous candid notes of what I was thinking and doing, in better fidelity than I’ve ever had before. Terminal logs are great but don’t include annotations like “Dead end; ignoring that line of inquiry.”

English
16
42
685
114.8K
Arjun Balaji@arjunblj·
@alexlomanto Assuming you already have a functional local setup, you can get it up and running quite fast. We've now run it across multiple teams with very different use cases on this same infra stack. Happy to onboard you myself if you hit issues, DM
English
0
0
1
48
Alessandro Lo Manto@alexlomanto·
They did it for their use case, their processes. Every company has its own dynamics. There's no one-size-fits-all solution. But I have to say it looks like a good job and I'm curious to try it out.
English
1
0
0
38
Alessandro Lo Manto@alexlomanto·
I love the architecture: pod isolation, credential injection, proper observability. This is how agent sandboxing should work. But I'm looking at my calendar and thinking: how long until I actually understand this stack? The tech is right. The learning/operational tax might kill me.
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
1
0
2
79
Arjun Balaji retweeted
Dwarkesh Patel@dwarkesh_sp·
Currently it is shocking and newsworthy when AIs solve an important open problem that humans couldn't. Before AIs totally surpass us intellectually, there will be an interesting era where it will be just as shocking (but not impossible) for a human to solve a problem AI couldn't.
English
88
53
1.2K
89.2K
Arjun Balaji@arjunblj·
@btraut @ajambrosino Another nit: this error state is broken. It disappears on hover, and clicking into the thread doesn't reveal the error. Ideally, it would show an error on hover so I can click into the thread if needed. In a perfect world, it would give me 2 options to choose from on hover.
[Attached image]
English
0
0
2
169
Arjun Balaji@arjunblj·
@btraut One last micronit: why is gpt-5.3-codex-spark usage broken out separately from general usage (is it included? is it separate? etc.)? Shocking that @ajambrosino has allowed this for so long ;)
English
1
0
1
70
Brent Traut@btraut·
Now that I've joined the Codex team, it's so freakin' cool being able to fix paper cut bugs that I was running into before the switch. What paper cuts are you running into? I'll see what I can do.
English
330
17
899
76.3K
Arjun Balaji retweeted
kain.inx@kaiynne·
one of the most insanely valuable things anyone has launched as an open source project this year, and it only has 225 stars so far because it is complex and not a larpy "download this to solve world hunger" style app 😅
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
16
6
250
61.7K
Arjun Balaji@arjunblj·
@dudhat_paresh @paradigm @gakonst There’s a web interface, previously linked to Slack, that we used, but we didn’t include it in this release. Definitely open to more adapters, so feel free to DM/PR
English
0
0
0
38
Arjun Balaji@arjunblj·
This year, Centaur made working at @paradigm feel more like a multiplayer game. Solo queuing Claude all day is OK, but you really can go much farther together. If you’re interested in contributing to Centaur with me and @gakonst, DMs open
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
15
12
182
31K
dotta 📎@dotta·
@arjunblj @gakonst @paradigm @tempo Talked to the team this morning about integrating Paperclip over Centaur. Should be possible, but there is some overlap of concerns around task liveness / state machine / workflow. Worth exploring!
English
1
0
3
694
Georgios Konstantopoulos
Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.
English
70
107
1.1K
482.3K
dotta 📎@dotta·
@gakonst @paradigm @tempo this is incredible. we're building similar for Paperclip and now I wonder if we should just integrate Centaur under the hood
English
6
0
10
1.5K
Arjun Balaji retweeted
dcbuilder.eth ⚪️@dcbuilder·
@runneragent + @obsdmd + Centaur + personal OS (APIs, DBs, MD context, ...) + dcbuilder.dev + codex and my life is complete. Next week I'll be releasing an article on how to productively set this up for yourself while spending 1/100th the time it took me to set it up
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
7
4
49
4.8K
Arjun Balaji retweeted
Quinn Slack@sqs·
Self-recommending. The good folks at @paradigm and @tempo have consistently been far ahead of basically everyone else in using agents. They've pushed Amp further and harder than any other team on a per capita basis.
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
6
4
87
14.7K
Arjun Balaji retweeted
Matthew Slipper@mslipper·
Huge launch. Self-hosted agent systems are the future, and egress enforcement is what makes them safe. iron-proxy is the egress layer inside centaur. We went deep on the hard parts like OAuth brokering, HMAC signing, and Postgres MITM for RLS.
Georgios Konstantopoulos@gakonst

Open Sourcing Centaur: Multiplayer, self-hosted, secure agents for Slack. Centaur has been transforming how @paradigm and @tempo invest, build and research. Now you can run it yourself on infrastructure you control. Instructions below.

English
1
1
14
1.6K
Arjun Balaji@arjunblj·
RT @dwr: Our internal multiplayer AI Slack agent. Couldn't imagine working at a company that uses Slack without it.
English
0
1
0
173
benedict@bqbrady·
A normal playoff basketball game takes roughly 150 minutes of wall clock time. For all of basketball history, the best players have been able to play 40+ minutes, but not the full 48. In practice, this is the difference between running 27% of the time vs. 32% of the time, but it is extremely expensive when good players sit. It is not uncommon for the best players in the league to cost their teams a few points of expected value during their time on the bench. If the game were 48 consecutive minutes, I would understand how catching a few minutes of breath could have a large non-linear return. But given that the game is naturally broken up and about 68% of the game is dead time by default, why can't the superstars condition themselves to play the full 48 during big games?
[Attached image]
English
13
1
65
15K