bryan

170 posts

bryan banner
bryan

bryan

@sudo_bryan

Taming agents in the wild

San Francisco, CA Katılım Mayıs 2026
116 Takip Edilen22 Takipçiler
bryan
bryan@sudo_bryan·
@TheAhmadOsman Not true as being able to serve and inference these at anthropic scale even matters! Unless Huawei ships a micro cluster with enough ram to ram these locally! Then maybe enterprises would opt for them and not pay for tokens! Because the cost benefits changes from open to capex
English
0
0
0
2.9K
Ahmad
Ahmad@TheAhmadOsman·
Wanna know why Anthropic hates Opensource AI? GLM 5.2 being free and available to download made their $1 Trillion valuation make no sense
English
196
445
7.2K
250.2K
Matthew Berman
Matthew Berman@MatthewBerman·
> mythos is so good at cyber it can't be released also > mythos can't detect 20k fraudulent chinese accounts attacking it
English
358
1K
18.6K
508.5K
bryan
bryan@sudo_bryan·
@yashmp2004 5.8 million indians but no one knows to code
English
0
0
0
311
yash.jsx
yash.jsx@yashmp2004·
Let me tell you a fun fact.... China has 4.4 million software engineers. India has 5.8 million. Russia has 1.3 million. USA has 4.4 million. Among these four countries... India is the only one that hasn't built an app, operating system, website, database, cloud platform, AI, CDN, or social media platform that's widely used across the world.
English
131
48
682
39.9K
Ahmad
Ahmad@TheAhmadOsman·
GPT 5.5 > GLM 5.2 But GLM 5.2 > Opus 4.8
Indonesia
76
16
638
58.2K
bryan
bryan@sudo_bryan·
@EXM7777 Please support your claims
English
0
0
0
6
Machina
Machina@EXM7777·
GLM-5.2 isn't a great model and anyone calling it good is reading the benchmark card, not running it in a real agent loop
English
159
11
370
186.3K
bryan
bryan@sudo_bryan·
@Linahuaa Who the f is Hasan picket
English
0
0
0
147
LinaHua
LinaHua@Linahuaa·
Girls like money because they like luxury. Guys like money as a dick measuring stick and to invest in big stuff for legacy. Thus, money has huge diminishing returns for girls once they reach a level that can maintain fine dining, luxury brands, business class flights and fancy hotels. At that point, clout, looks, good sex, fun etc are all wayyyy more important than money. And that's why athletes, rock stars, finance guys, and actors all have WAYYYYY higher quantity and quality of girlfriends compared to post-exit AI startup guys.
LinaHua tweet media
English
112
76
1.1K
246.1K
bryan
bryan@sudo_bryan·
@AlexFinn How does it perform when you have multiple agents running and doing work in parallel?
English
0
0
1
38
Alex Finn
Alex Finn@AlexFinn·
I can't believe this is real I have GLM 5.2 running 100% locally on my Mac Studio. 2 bit quant. The results I'm getting are better than Opus 4.8 It's now powering my Hermes Agent and Codex. 100% free, local, private super intelligence on my desk I also have it in a loop coding for me 24/7 now I thought we were at least a year away from this type of event. It happened today. The model takes up about 250gb of memory. So you can technically run it on a Mac Studio with 256gb, but you probably want the 512gb memory version (please tell me you listened to me 5 months ago when these were sitting on store shelves) With Fable gone, I now have Opus 4.8 level intelligence on my desk for free. This is the future. Local, private, secure, personal super intelligence. If you're still writing off local AI as a fad or engagement bait, you are officially delusional
English
529
478
5.5K
610K
bryan
bryan@sudo_bryan·
@0xCodez As long as it’s free I’ll run a 1000 swarms
English
0
0
0
63
Codez
Codez@0xCodez·
Anthropic research lead: "99% of our engineers are running swarms of 300+ self-improving agents. close the agent loop. Give the model a way to verify its own output" in a 20-minute session, Anthropic team member explains how to build a model that improves itself. Claude + loops + plan mode + dynamic workflows -that’s the secret. Watch the talk, then save the playbook below.
Movez@0xMovez

x.com/i/article/2067…

English
124
340
3.5K
659.2K
bryan
bryan@sudo_bryan·
@amasad @agupta @danwwang and why shouldnt they? its crystal clear how important it is! Actually the only mandate is probably open source. no wonder ever single one of them is open source releases
English
0
0
0
63
Amjad Masad
Amjad Masad@amasad·
@agupta I’d be surprised if China’s government is not subsidizing LLM development, like they did with EVs. Breakneck by @danwwang is a good book on this.
English
6
0
100
8.9K
Ankit Gupta
Ankit Gupta@agupta·
in china, there are ~10 frontier AI labs, about half of them the offshoot of a money printer (Tencent, Bytedence, Alibaba, etc) and the other half of them new startups. America has the startups, but has very few money-printer-attached labs. It's really just Meta, Google, and sort of nvidia. why is an IBM or Cisco or Netflix or amazon frontier AI lab not a thing (w/ actually shipped products not just a research team).
English
94
22
607
87.7K
Marcos Rico Peng
Marcos Rico Peng@Marcos12345rico·
today we're launching @Palmier_io, a video editor Claude can edit. use AI to edit, organize, and generate footage directly in the timeline. finally, a video editor built for AI. open-source. mac native. available now.
English
573
978
12.8K
2.7M
bryan
bryan@sudo_bryan·
@bryantchou Is the latency better than e2b?
English
0
0
0
8
bryan
bryan@sudo_bryan·
@ryanbrewer Only difference a midget seems to be screaming
English
0
0
0
332
bryan
bryan@sudo_bryan·
@khole_emily Can you even automate it? Imagine everything looks the same
English
0
0
0
351
Emily Segal
Emily Segal@khole_emily·
When taste is fully automated it ceases to function as taste
Thais Castello Branco@thaiscbranco_

We’re excited to introduce Taste Labs. Our mission is to end AI slop. We’re building the data and infrastructure layer to give AI models and agents taste. And today we’re coming out of stealth, announcing our $18.5M seed funding, co-led by @CRV and @AmplifyPartners AI has nailed objective domains and made it easy to generate anything. But it still feels off. Now, the challenge is judgement. What fits, what feels like you, what’s GREAT. This requires turning a fuzzy, subjective domain into something we can measure and codify. We’re starting with design. There are two sides to cracking this, the foundation model layer and the agent layer: - We’ve already been working with the top frontier labs to evaluate and improve their models, crafting the right post-training data and RL environments. - We’ve also been working with app-layer companies to build the context and verification tools for their agents to produce better, more on-brand, more creative outputs. We want a future where AI feels right. If you’re passionate about this mission, join us!

English
42
107
1.8K
100.3K
TBPN
TBPN@tbpn·
Y Combinator's @garrytan says he wants his new project GBrain to be the Postgres for agents: "The thing I realized is, a human can only keep 7, plus or minus 3, things in their head. But a computer with an LLM can keep about three Harry Potter books in its head." "Then, when you think about what most computer systems are, you should think of the Library of Alexandria — thousands, maybe millions, of books. It's even bigger than that. It's the whole internet." "You could basically take all the relevant info about customers, or any person that anyone at the company has ever even met. You can have that in like, 100,000 or a million markdown files that comprises everything that the business is. That's basically what GBrain can do." "The magic moment for GBrain is basically being able to take any 'book' that exists in your entire business, and making sure the 3 books that really matter for the thing you're trying to do are loaded." "And that's basically ASI. You don't have to write software anymore. You can just straight-up use Hermes agent or OpenClaw plus GBrain."
Garry Tan@garrytan

Humans can keep 7 +or- 3 things in their head Your AI agent can keep 3 whole Harry Potter books in context You could have 300,000 books in your library GBrain will make sure your AI agent has the 3 books out of 300,000 loaded in context for your task presently Big unlock

English
43
20
360
529.6K
George Pu
George Pu@TheGeorgePu·
I'm trying out DeepSeek V4 Pro, and really like it. Super underrated model. As good as Opus 4.8 from the few tests I ran.
English
36
4
98
8.3K
bryan
bryan@sudo_bryan·
@ml_angelopoulos Stop posting bs benchmarks! Post actual outputs via same prompts!
English
0
0
1
280
Anastasios Nikolas Angelopoulos
Anastasios Nikolas Angelopoulos@ml_angelopoulos·
Just to be clear, if you remove Fable which is unavaialble, GLM-5.2 (Max) is the #1 model in the world for frontend coding. This is a huge moment. OSS has caught up with proprietary, and China has caught up with the US, in this very important domain.
Arena.ai@arena

Exciting news: GLM-5.2 (Max) ranks #2 in Code Arena: Frontend, with +29pt over Claude Opus 4.7 (Thinking) and only behind Fable 5! GLM-5.2 is the best open model vs Kimi-K2.6 and Minimax-M3 by a large margin. - #2 React and #4 HTML sub-leaderboards - Ranks as the top model in nearly all sub categories: Brand & Marketing, Reference-Based Design, Data & Analytics, Consumer Product, Gaming, and Simulations. Congrats @Zai_org for the incredible milestone!

English
145
358
4.3K
594.9K
bryan
bryan@sudo_bryan·
@thaiscbranco_ @m0recilantr0 But this can be solved by curating skills! The models are really capable of your know what you want and what to expect. I don’t know if you used fable and saw the results
English
0
0
0
107
Thais Castello Branco
Thais Castello Branco@thaiscbranco_·
What if there’s something that can help you uncover your own taste? Taste is a skill that takes a ton of work! Repetoire, reps, time, point of view. Tastemakers want to spend the time honing this. The avg person is strapped for time and doesn’t necessarily want to put in the work. I’d love to have this person better understand their own taste and create great things (not as good as tastemakers). Imo you only reduce slop it you raise the floor for everyone, not just the “top”
English
10
0
24
10.8K
Tyler Cecchi
Tyler Cecchi@m0recilantr0·
Two things. 1. AI slop is not a design problem, it's a meaning problem. 2. If you haven't yourself discovered what 'feels like you', AI can't tell you. Bonus. Slop, by definition, is the algorithmic interpretation of taste. Someone please explain what the point of this is.
Thais Castello Branco@thaiscbranco_

We’re excited to introduce Taste Labs. Our mission is to end AI slop. We’re building the data and infrastructure layer to give AI models and agents taste. And today we’re coming out of stealth, announcing our $18.5M seed funding, co-led by @CRV and @AmplifyPartners AI has nailed objective domains and made it easy to generate anything. But it still feels off. Now, the challenge is judgement. What fits, what feels like you, what’s GREAT. This requires turning a fuzzy, subjective domain into something we can measure and codify. We’re starting with design. There are two sides to cracking this, the foundation model layer and the agent layer: - We’ve already been working with the top frontier labs to evaluate and improve their models, crafting the right post-training data and RL environments. - We’ve also been working with app-layer companies to build the context and verification tools for their agents to produce better, more on-brand, more creative outputs. We want a future where AI feels right. If you’re passionate about this mission, join us!

English
18
14
186
19.6K
bryan
bryan@sudo_bryan·
@tankots @WisprFlow Don’t need it anymore! Rebuilt it with Claude in two prompts! Have spent 0.2 on api costs since! And 5 dollars on building it! Incase anyone wants a copy dm me
English
0
0
0
5
Tanay Kothari
Tanay Kothari@tankots·
Calling all haters of @WisprFlow - give me your biggest issue with Wispr. Yes I will personally read through each and every comment and have our team right some wrongs.
English
830
20
931
281.2K
komal 🤸🏽‍♀️
komal 🤸🏽‍♀️@komal_42·
@jasveer10 every app can be used by scammers. Including Whatsapp. And X. And instagram. And your email as well. You want the govt to act like a nanny to small babies and ban everything in existence in India? What an insane thing to expect that too by a person who is an "entrepreneur"
English
5
3
183
8K
Jasveer Singh
Jasveer Singh@jasveer10·
Oh hello, @durov Nobody is using Telegram in India for messaging. Telegram is mostly used by scammers in India. Most financial fraud (Billions of dollars) in India happens through Telegram The Indian government should have banned Telegram years ago. It is long overdue. I’ve been noticing the same pattern for years. Almost every fraudster immediately moves to Telegram. it’s harder to trace, easier to operate. Calling this an internet freedom issue misses the point completely. Telegram became one of the preferred platforms for financial fraud, scam networks, betting groups, piracy, and other illegal activities in India.
Pavel Durov@durov

India’s IT ministry banned Telegram for one week because some users shared leaked exam questions. This punishes 150M+ ordinary Telegram users in India — not the insiders who leaked the exam materials. And the ban hasn't stopped anything. The leaks just moved to other apps.

English
3.4K
342
2.9K
1.5M