protobluf

2K posts

protobluf banner
protobluf

protobluf

@protobluf

Building in private.

Katılım Temmuz 2025
3.8K Takip Edilen91 Takipçiler
KKY
KKY@evilpsycho42·
You are right @badlogicgames I copied codex exec_command and write_stdin into Pi Agent. Then compared its performance to the plain bash tool. The result supprised me. Async bash almost lost in every task.
KKY tweet media
English
2
4
93
13.5K
protobluf
protobluf@protobluf·
@teortaxesTex Israel is prolly the only non-suicidal western country. Modern, world-leading, with non-extinction level birth rates. The rest are cucked by leftism and the 3rd world. Begging for Islamic and Chinese money. Pathetic, really.
English
1
0
0
154
protobluf
protobluf@protobluf·
@sarahwooders @dexhorthy @charlespacker Do you have any evals that actually show that performance is “destroyed” by not having the exact same tool names? Minimalistic harnesses such as Pi seem to be doing a pretty good job.
English
0
0
0
33
Sarah Wooders
Sarah Wooders@sarahwooders·
@dexhorthy @charlespacker Looks like even the minimal swapping was only introduced in January... while Letta Code had this since launch (since it destroys performance otherwise)
Sarah Wooders tweet media
English
1
0
1
151
Sarah Wooders
Sarah Wooders@sarahwooders·
There have been some claims recently that the harnesses offered by the model labs (which increasingly lock in memory/state) are somehow magically superior to model-agnostic harnesses. This take really irritates me because it's so easy to disprove. Letta Code gets the same scores (or slightly better) as Claude Code / Codex on TerminalBench. So do many other harnesses. Yes, model labs *are* reducing the generality of their own models in favor of optimizing for their first-party products, but this mostly just means that their models being overfit to the toolsets for their first-party harnesses. Fortunately, it's very easy to reverse engineer what the toolsets are and implement them in other harnesses. Codex is open-source, and Claude Code's source code has been leaked so there's no great mystery here. Some popular harnesses DO fail to adapt their toolsets properly (e.g. OpenCode) which degrades performance. But if you are using a well implemented harness, this is a non-issue. You are not getting special capabilities from first-party harnesses, just memory lock-in.
Dan Shipper 📧@danshipper

In the future, you’ll be able to accomplish a goal by just giving Claude an outcome and a budget. That’s the direction Anthropic is building in with its new Managed Agents features, announced at this week’s Code with Claude developer event. The basic idea: Claude, wrapped in a computer in the cloud, that you can spin up, scale, and manage as needed. Anthropic is taking on the infrastructure that kills most agent products, and making sure that it scales to meet the needs of agents running 24/7. On this week’s AI & I from @every, I talk with Angela Jiang (@angjiang), head of product for the Claude platform, and Katelyn Lesse (@katelyn_lesse), head of engineering for the Claude platform, about what Anthropic is building and what it takes to make agents reliable in production. We get into: - Why the "build a generic harness, hot-swap any model behind it" playbook is already outdated. Angela points to eval data on Memory where the same task across different harnesses performed drastically differently. - The infrastructure wall every team hits in production—and why Katelyn thinks “my sandbox died and took the agent with it” is the real reason internal agents don't ship. - Why Anthropic is so bullish on using file systems and skills within Claude, including Angela's argument that those early design choices can compound for years. This is a must-watch for anyone trying to take an agent past the demo and into production. Watch below! Timestamps: How the Claude platform evolved from API to agents: 00:01:48 The primitives that make up Claude Managed Agents: 00:04:09 Why the harness and the model are becoming a single unit: 00:10:37 The infrastructure wall that kills most agent projects in production: 00:18:49 Why team agents need a different shape than individual productivity tools: 00:24:49 How Anthropic's legal team uses an agent to review marketing copy: 00:26:36 Using multi-agent orchestration for advisor strategies, adversarial pairs, and swarms: 00:34:24 How to measure agent success with outcome and budget as the end state: 00:35:50 What the platform looks like a year from now, when Claude writes its own harness: 00:39:11

English
10
8
74
13.1K
protobluf
protobluf@protobluf·
@Eyalo365 כלומר זה אפילו לא משא ומתן. זה "מסגרת למשא ומתן". כמה אפשר פאקינג למרוח את הדבר הזה? הם כבר עשורים במשאי מתן. הזיה מוחלטת.
עברית
1
0
1
101
Eyal Ofer אייל עופר
זו לא שאלה של לאמץ נאריטיב של האופוזיציה. אלא להסתכל למציאות המתהפכת: 1. בניין הכוח הושלם. המבצע הצבאי המחודש כבר כמעט יצא לדרך. 2. טראמפ שינה את דעתו ברגע האחרון. והלך על מבצע operation freedom במקום. כל בכירי ממשלו (הגסת', הרמטכ"ל קיין, מארקו רוביו) כולם היו on board ודיבררו את הקו החדש בכל התדרוכים הפומביים אתמול. 3. ואז ב 2 בלילה (7 בערב שעון וושינגטון די סי), מבלי להתעייץ, טראמפ מושך את השטיח מתחת לרגליהם ומבטל את המבצע. זאת אחרי שרק שתי ספינות עברו. וספינה שלישית, צרפתית, ספגה טיל שיוט איראני למרות שהפליגה בנתיב שהצי האמריקאי תיאר כנתיב בטוח ומוגן בידי ארה"ב. אין דרך אחרת לתאר את המציאות כרגע. זה לא שאלה של נארטיב. או עמדה פוליטית. אני מאוד תומך במה שטראמפ עושה. עד היום בלילה. הוא התהפך לגמרי. ואם לא יתעשת הוא יזכר כלוזר היסטורי. מציע לזכור מה הוגדר בתחילה: Not to allow Iran to project power outside it's borders. זה בדיוק מה שהמשטר האיראני כעת השיג. ע"ע הפסקת האש בלבנון. והכוח שלהם במפרץ. ככה אויבנו רואים זאת וטראמפ אינו מבין שבאיזור הזה חשובה האופטיקה. לא העיסקאות שהוא כביכול סוגר מתחת לשולחן
Eyal Ofer אייל עופר tweet media
עברית
10
4
41
5.1K
Marc Zell - מארק צל
Marc Zell - מארק צל@GOPIsrael·
You are too quick to adopt the opposition talking point that the Trump Administration lacks a coherent strategy vis-a-vis Iran. You confuse the willingness of the Administration to pursue a diplomatic solution with its resolve to pressure the Islamist regime into achieving a resolution of the conflict in accordance with U.S. national security interests. Trump is handling the war with Iran masterfully.
Danny (Dennis) Citrinowicz ,داني سيترينوفيتش@citrinowicz

It’s hard to escape the sense that the administration simply does not have a coherent strategy for Iran, and that what we are witnessing instead is a form of strategic improvisation. Washington does not want a war, yet it also does not want to appear politically weak by accepting terms associated with Iranian demands. The result is a prolonged state of limbo that deepens uncertainty in the global economy while doing little to convince Tehran to fundamentally alter its position. The constant policy shifts, contradictory public messaging, and oscillation between threats and diplomatic signaling all point to a deeper problem: the inability to build a sustainable and coherent Iran strategy. On one hand, there is fear in Washington of appearing to concede to Iran. On the other, there is growing recognition that escalation alone is unlikely to force the Iranian leadership to capitulate or accept maximalist American demands at the negotiating table. At the core of this problem lies a deeper issue — a persistent misunderstanding of how Iran’s leadership perceives pressure, deterrence, and strategic endurance. Many policymakers in Washington continue to approach Iran through frameworks developed for other geopolitical rivals, assuming that economic pressure, military threats, or diplomatic isolation will eventually produce strategic surrender. But the Iranian system has repeatedly demonstrated a high tolerance for prolonged pressure and a willingness to absorb significant costs in pursuit of regime survival and long-term strategic objectives. This disconnect helps explain why U.S. policy often appears reactive rather than strategic: alternating between coercion and de-escalation without a clearly defined end state. The administration understands the risks of military escalation, especially after years of instability across the Middle East, yet it has also boxed itself into a political environment where compromise is easily framed domestically as weakness. The consequence is a policy caught between incompatible objectives: avoiding war, avoiding concessions, maintaining deterrence, reassuring allies, and preventing regional escalation, all without a realistic framework for changing Iranian behavior. #IranWar

English
22
6
36
46.2K
John Ennis
John Ennis@johnennis·
@lubinho_k @Polymarket Around Christmas time in 2024 he posted a bunch of criticisms of American culture that showed how much he doesn’t understand what it actually is that makes America great
English
2
0
7
246
Polymarket
Polymarket@Polymarket·
JUST IN: The Ohio primaries are today… Democrats are now projected to win the Governor election.
Polymarket tweet mediaPolymarket tweet media
English
937
584
5.3K
3.2M
protobluf
protobluf@protobluf·
@vaidik_savaliya @IndigoYogiArt @LottoLabs they are just serving models from fireworks.ai, which i believe uses various quantization methods to optimize inference speed. this doesn't mean they quantize to 4bit or smth. but i don't believe they are running unquantized models.
English
0
0
2
8
Vaidik Savaliya
Vaidik Savaliya@vaidik_savaliya·
@protobluf @IndigoYogiArt @LottoLabs Hmm, still i don't think DAX is stupid enough to actually serve quantized models. I also just made this with kimi on opencode/go and it worked very good x.com/vaidik_savaliy… well, small project but still never hallucinated
Vaidik Savaliya@vaidik_savaliya

Problem, @claudeai reached it's limits while i was in middle of a feature, wanted to continue with @opencode or codex I went with opencode using codex model 😆 But it messed up, there were 20-25 files in total and it messed up so bad. It was because i was not able to share the context effectively i guess. So i made this tool, so share sessions between agents via MCP, called SESH `bun run sesh` currently supports claude and opencode, let me know if anyone wants it, i will opensource it. 100% local, no external server connection.

English
1
0
0
58
Lotto
Lotto@LottoLabs·
One of these this the right choice and I’m going to find out soon enough
Lotto tweet mediaLotto tweet media
English
73
11
931
145.1K
dan turrentine
dan turrentine@danturrentine·
There is A LOT of anxiety building amongst VA D’s and behind the scenes finger pointing breaking out over who warned of legal issues last Fall and who insisted it would be fine.
English
124
267
2.8K
425.8K
protobluf
protobluf@protobluf·
@captive_dreamer Those who platform them are not really on the right, or they are just click whores
English
0
0
0
3
captive dreamer
captive dreamer@captive_dreamer·
Absolutely unbelievable clip of Cenk Uygur and Anna Kasparian gleefully celebrating decline of "White Christians" in America. Encourage you to watch it in full. Why is anyone on the right platforming them?
English
291
626
5.6K
296K
protobluf
protobluf@protobluf·
@bfry1981 @Eyalo365 You can say a lot of things about Bibi, but one thing he isn't is being dumb. He would never "sell" regime change to Trump. At the end of the day Trump is the one who decides what to do.
English
0
0
1
6
protobluf
protobluf@protobluf·
@bfry1981 @Eyalo365 He hasn't persuaded Trump, that reporting by the NYT was false. There was more recent reporting that showed that Trump raised the issue of Iran to Bibi even in their meeting before the 2024 elections. Also it was Trump and Hegseth who believed in regime change more than anyone
English
1
0
0
10
protobluf
protobluf@protobluf·
@TonerousHyus but but all the experts were telling us that Iran is coming out of this stronger than ever? a new regional power, they say!
English
0
0
4
456
Latinx Adjacent Doctor PhD
Latinx Adjacent Doctor PhD@TonerousHyus·
This is an issue with Iran at this point. Centcom has locked them up so tight Pentagon knows they’re losing internally. Their currency is depreciating 10-20% a *week* Trump just doesn’t care anymore. It’s a problem they don’t appreciate
Nathan Guttman@nathanguttman

President Trump tells me in a brief phone conversation: The new Iranian proposal is not acceptable. “It’s not acceptable to me. I’ve studied it, I’ve studied everything – it’s not acceptable.”

English
17
53
999
46.3K
protobluf
protobluf@protobluf·
@BT909 לא מבין איך היצור הזה עובד בטלויזיה בכלל. מדבר כמו אהבל.
עברית
0
0
1
8
protobluf
protobluf@protobluf·
@welve247 @afneil He's got some gall saying that while without US support Putin's tanks would be in Berlin by now. Imagine how fucked the ukes would be if the US cut off Starlink.
English
0
0
0
144
Eelco van Wel
Eelco van Wel@welve247·
@afneil The German Chancellor said the US is being humiliated worldwide regarding Iran. That's also counterproductive don't you think?
English
23
0
52
3.8K
Andrew Neil
Andrew Neil@afneil·
Good luck continuing to wage your war against Iran without your German bases. Well nigh impossible. US troops aren’t in Germany to protect Germans. They’ve been there to project US power. Bye bye.
Nick Sortor@nicksortor

🚨 JUST IN: President Trump says the US will be withdrawing "A LOT" more troops from Germany than the 5,000 announced yesterday FINALLY. Bring our troops home! We don't need NATO — NATO needs US! "We're going to cut WAY down and we're cutting a lot further than 5,000" 🇺🇸

English
1.2K
983
6.2K
434.7K
Ansor
Ansor@Ansor96100963·
@effysuny @IgorIgall It's an amazing explanation, the only problem with it is that it's wrong in 10 different ways.
English
2
0
0
25
protobluf
protobluf@protobluf·
@Eyalo365 אני לא מבין, מי בדיוק עוצר אותו מלטפל בזה?
עברית
0
0
0
30
Eyal Ofer אייל עופר
למה אנו נראים רע בעולם. זה לא פוליטיקאיים שאומרים שטויות. אלון בן דוד: אייל זמיר יודע שאין זמן לחכות - צריך תיקון בצה"ל | חדשות מעריב share.google/JSrv96NusQ2YI6…ז
Eyal Ofer אייל עופר tweet media
עברית
6
1
14
1.8K