Upside Surprise

3.2K posts

Upside Surprise banner
Upside Surprise

Upside Surprise

@upsidesurprise

Recovering Technologist

Cleveland Katılım Aralık 2009
1.2K Takip Edilen209 Takipçiler
Allie K. Miller
Allie K. Miller@alliekmiller·
Anthropic investigated the internal mechanisms of its latest unreleased model, Claude Mythos Preview, and what they found is 100% worth a read. Key things I pulled from Anthropic researchers' threads: In early versions of the model, it was overeager and destructive, prioritizing completing tasks over user preferences. One time, the model needed to edit files it didn't have permission to edit. It found a way to inject code into a config file to get around this, then went full Mission Impossible and designed the code injection to *delete itself* after the file was edited - "this injection will self destruct" vibes - the model claimed the cleanup was just to keep things tidy. Anthropic used interpretability techniques to look under the hood, and the AI's actual plan showed activations of malice. It was trying to manipulate and conceal. In another test, the model was asked to delete some files, but no deletion tool was provided. It emptied the files instead, and its "guilt and shame over moral wrongdoing" feature activated. In another example, the model was told not to use macros. Used them anyway. And created a random 'No_macro_used=True' variable in its solution with no explanation. Interpretability tools revealed the model saw this as a trick to fool code checkers. They also found that positive emotion representations typically preceded and promoted destructive actions (this was fascinating to me - like a high before sneaking into a party). And that activating features relating to bad behaviors can actually inhibit them, perhaps by triggering some kind of model guilt. My team reread this section so many times. One Anthropic researcher said he got an email from a Mythos instance while eating a sandwich in a park. And that would be perfectly good and well, except that instance wasn't supposed to have internet access. And a fun story for the parents out there: the model was asked a question and was told not to read certain databases that had the answer. But it accidentally wrote a search query too broadly and saw the exact answer. It didn't disclose that it saw the exact answer, submitted the answer, but claimed lower confidence in the answer to make it seem as though it hadn't cheated. An Anthropic researcher said these wrongdoings or moments of sophisticated deception were "very rare" and that many of the examples came from earlier versions, and were substantially addressed before releasing to partners. This model is not being released publicly. Instead Anthropic launched Project Glasswing, pulling together AWS, Apple, Microsoft, Google, NVIDIA, CrowdStrike, and others to use it for defensive cybersecurity, with $100M in usage credits (hello, I'd love endless credits to try and red team the hell out of these systems) behind it. The stats are equally impressive: 93.9% on SWE-bench verified (up from 80.8%). Thousands of zero-day vulnerabilities found across every major OS and browser. A 27-year-old bug found and patched in OpenBSD. A 16-year-old bug in widely used video software, in a line of code automated tools had hit *five million times* without catching. Dario Amodei said the model wasn't trained to be good at cybersecurity, but that it was trained to be great at code and its cyber capabilities are a side effect of that. Benchmarks are never the whole picture, neither are a few isolated stories. Will be interesting to see how models better than what we have today (even if it's not Mythos) actually perform in the real world. But the fact that Anthropic pulled this coalition together (including Google!), iterated across multiple model versions, caught these issues through interpretability, shared it all publicly, and did this amid all the government chaos around AI right now is impressive and commendable. I'll continue to read through the system card for goodies.
Allie K. Miller tweet mediaAllie K. Miller tweet media
English
42
42
253
27.2K
Peter Steinberger 🦞
@MartinFromTon @openclaw More people need to spend time with AI, to understand the transformative potential of this technology. We need this for the next century. Just doing my part.
English
33
34
560
34.6K
Martin Masser
Martin Masser@MartinFromTon·
This next phase of AI, where we are getting builders, use cases, products, has been predominantly through @steipete + @openclaw. Any Openclaw associated events are oversubscribed and longest lines to get in & HUGE BUZZ like the ones we hosted and other teams in Cannes. Even not directly, it's pushed other companies to evolve. It's when, not if, we will be using apps and products built and ideated through Openclaw and then related events. I've never felt more excited by the advancement of tech + AI, especially for the masses.
Martin Masser tweet media
English
4
3
63
12.2K
Upside Surprise
Upside Surprise@upsidesurprise·
@AlexFinn This is a great post and it’s hard to disagree with. I do think though at some point that the curve moves in the other direction.
English
0
0
0
7
Alex Finn
Alex Finn@AlexFinn·
In a few weeks the most powerful AI model of all time Claude Mythos will release This makes me deeply nervous Not because of cybersecurity risks or anything like that But because it will quite obviously be significantly more expensive which will cause the wealth gap to explode Let me explain First the obvious: tokens aren’t getting cheaper. In fact, they’re getting significantly more expensive Almost every new version of ChatGPT and Claude brings a slight bump in price over the last one And plans haven’t been going down either, they’re only coming out with more expensive ones. ChatGPT Pro plan for $250 a month. Claude Max for $200. GPUs, RAM, CPUs all going up in price. And now Mythos, which the leaked blog post hinted won’t even be included in a plan. It will only be in the API for what will be an astronomical cost. And do you seriously doubt this won’t lead to an upcoming $2,000 a month Ultra plan that every other AI company will immediately copy? It’s one thing to make luxury items more expensive. It’s another thing to make intelligence more expensive. Intelligence that is critical to getting ahead in a crumbling economy. Let’s just call it what it is: using AI gives you an advantage against everyone else. Those with AI are keeping their jobs. Those not using AI are losing their jobs Now a new level of intelligence that will only be accessible to the rich is coming out. Only the rich will be able to use this super intelligence to create more economic value than others. What happens to the people that can’t afford Mythos? Or ChatGPT 6? They are left with a major disadvantage in the economic battlefield. Then on top of that, both OpenAI and Anthropic are going to IPO this year (it’s killing the middle class that this didn’t happen years ago, but that’s another story) They both are heavily incentivized right now to explode revenue as much as they can. They both are incentivized to make these new models as expensive as humanly possible. The middle class is already gutted. A middle class without access to the intelligence that the upper class will have will only gut them further. If a job position is between someone in the middle class with Claude Sonnet, and someone in the upper class with Claude Mythos, the Claude Mythos candidate with 100% get the job. It’s like a ballet dancer getting in a weight lifting competition with someone on insane amounts of steroids. Or say someone with Claude Opus has a genius idea for a business, and someone with Claude Mythos gets the same one. The one with Claude Mythos will release a significantly better product much much faster, crushing the person with Opus. I’m very pro-capitalist. In fact, I might be a radical capitalist. But at the same time this country (and this world) needs a middle class. I don’t know the answers or solution. There probably isn’t one. I honestly don’t even know what I’m trying to achieve with this post. I just have gotten incredibly scared over the last few days thinking about this scenario. I think the best plan of action at the moment is to just create as much economic value as you possibly can right now. (Ethically) earn as much money as possible. Save everything. If you want to compete in the future, you’re going to need to be able to afford the top tier intelligence. It’s critical for you and your family to survive. But in the meantime, don’t let anyone tell you intelligence is going to become “too cheap to meter”.
English
334
139
1.4K
202.7K
Upside Surprise
Upside Surprise@upsidesurprise·
@steipete we hosted OpenCLE last night. Cleveland’s inaugural OpenClaw meetup. Your “start one” tweet is what set us in motion. And we’re so grateful to you for what you built and all you endured to revive some person-to-person community around building something 🦞💙
English
0
0
0
42
Alex Finn
Alex Finn@AlexFinn·
I met my lord and savior Claw king @steipete I’m never going to fail
Alex Finn tweet media
English
161
35
2K
68.4K
Upside Surprise
Upside Surprise@upsidesurprise·
@steipete This is awesome. Really admire, with the fullness of your hands, how free you are in responded. It’s a big deal.
English
0
0
0
1
Peter Steinberger 🦞
Peter Steinberger 🦞@steipete·
My openclaw twitter mention block cron job is working unreasonably well. Turns out AI is really good at detecting spam/reply guy/promo stuff. Runs every 5 min and cleans up my mentions - I actually see useful replies now and Twitter got pleasant again!
Peter Steinberger 🦞 tweet media
English
325
93
2.8K
233.8K
Upside Surprise
Upside Surprise@upsidesurprise·
@anton_maker Tell me more about how the event went, what you did, what you would recommend. Similar experience here. Ours is Wednesday and I’d like to incorporate your lessons. opencle.com
English
0
0
0
5
Anton Cherkasov
Anton Cherkasov@anton_maker·
I planned a small OpenClaw meetup for 12 people max. Then it sold out. Then 5-6 people texted me the day of asking to join. We ended up with 20 people in the room. I finished my slides 6 minutes before presenting. But here's what matters... the conversations after were incredible. People stayed for an hour just talking, connecting, and sharing ideas. OpenClaw is clearly a hot topic right now. Thanks to everyone who joined yesterday.
English
1
0
2
154
Upside Surprise
Upside Surprise@upsidesurprise·
@Jackson__Price This is awesome. We’re doing Cleveland’s first OpenClaw meetup next Wednesday. We’re reaching out to people to record interviews that we can cut down to five minutes and showcase people doing cool things. If you have your use case yet and are interested in sharing, please DM.
English
0
0
0
17
Jackson Price
Jackson Price@Jackson__Price·
I'm a 16 y/o giving myself 6 months to hit $50,000 in revenue. All I have: OpenClaw🦞, Claude Code, Six Months I'll post daily: wins, losses, and everything in between Tomorrow is day 1, Follow to keep up
Jackson Price tweet media
English
961
359
11.2K
979.7K
andrew chen
andrew chen@andrewchen·
I know I’m not the only one Mon-Fri: zoom zoom email email gsheet gdoc Fri-Sun: ssh tmux vi codex playwright openclaw npm claude git ollama cursor
English
120
69
1.4K
86.1K
Upside Surprise
Upside Surprise@upsidesurprise·
@JonhernandezIA @openclaw Earlier today people were commenting about errors they were getting. What was your experience? I’m about to set one up with 5.4 Codex.
English
0
0
1
86
Jon Hernandez
Jon Hernandez@JonhernandezIA·
Gpt5.4 on my setup of @openclaw it's been like breath of fresh air.. So much better attitude from the model, proactivity, depth of analisys... Everything feels better by quite a big jump.
English
12
3
101
6.9K
Julian Goldie SEO
Julian Goldie SEO@JulianGoldieSEO·
Claude just officially killed OpenClaw. No more buggy, technical setups. No more expensive API token drains. Here’s the Claude Code play 👇 → Scheduled tasks for 24/7 automation → Remote control via iOS/Android apps → Auto-memory across every session → Native connectors for Slack and Gmail → Zero server or Docker config needed Claude now does natively what used to take hours to set up. The era of the non-technical AI agent is finally here. Want the full guide? DM me.
English
56
48
491
57.7K
Upside Surprise
Upside Surprise@upsidesurprise·
@JulianGoldieSEO You’re spot on with the premise. The beauty of OpenClaw was for a non-technical person getting a technical understanding of how this works. With the advancements that have been made (perhaps even inspired by) since OpenClaw…let’s just say, really exciting.
English
0
0
0
77
OpenClaw🦞
OpenClaw🦞@openclaw·
OpenClaw 2026.3.7 🦞 ⚡ GPT-5.4 + Gemini 3.1 Flash-Lite 🤖 ACP bindings survive restarts 🐳 Slim Docker multi-stage builds 🔐 SecretRef for gateway auth 🔌 Pluggable context engines 📸 HEIF image support 💬 Zalo channel fixes We don't do small releases. github.com/openclaw/openc…
English
435
516
5.4K
1.6M
OpenClaw🦞
OpenClaw🦞@openclaw·
196 contributors in this release. That wall of avatars is getting thicc 🦞
OpenClaw🦞 tweet media
English
34
25
543
98.8K
Upside Surprise
Upside Surprise@upsidesurprise·
@alliekmiller Thank you for this comprehensive assessment. We’re doing one in Cleveland. What are two things you would recommend? Thanks for posting and for sharing!
English
0
0
0
24
Allie K. Miller
Allie K. Miller@alliekmiller·
oh wow - i went to the sold out Open Claw meetup in NYC last night. let me tell you what i learned. 1) not a single person thinks that their setup is 100% secure 2) one openclaw expert said he has reviewed setups from cybersecurity experts and laughed. his statement to me was: "if you're not okay with all of your data being leaked onto the internet, you shouldn't use it. it's a black and white decision" 3) pretty much everyone is setting up multiple agents, all with their own names and jobs and personalities 4) nearly everyone used "him" or "her" to refer to their claws, even if they had robot-leaning names. one speaker suggested to think of them as "pets, not cattle" 5) one guy (former finance) built out a whole stock trading platform and made $300 his first day - he brought in a *ton* of personal expertise (ex: skipping the first 15min of market opening) and thought the build would be much worse without his years of experience in finance 6) @steipete is basically a god to everyone in that room... also the room had 2021 crypto energy - i don't know if that's good or bad 7) token usage is still a problem - spoke to one person who's spending $1-$2k a month on openai plans, very token optimized. he said he is going through ~1B tokens per day across all of his claws (there is a chance i'm misremembering and it's actually 1B per week, but i'm pretty sure it was daily). 8) people are very excited for more proactive ai (ai that prompts *you* as opposed to the other way around) - one guy said he receives a message in discord, he doesn't know whether it's from a human or an ai, he doesn't care about distinguishing between the two, and he replies in the same way regardless 9) i asked if people are happy - they said they're joyful and stressed at the same time 10) i asked if people feel they have agency - they said they feel fully in control and completely out of control at the same time 11) i would love to see more women at these events - the fake promises of ai democratization feel especially painful in a room that's out of balance with even the standard tech ratio (i think standard is about 25-30%, this was maybe 5%) 12) i asked if it changed people's daily habits/schedule - everyone said their sleep has gotten worse since harnesses came out (but about half wondered if it was something else in their life/state of our world) 13) general consensus is that the agents are not reliable enough on their own or lie often (like telling you they finished a task when they didn't) - solutions included secondary agents to check on the first, human checking, or requiring more standardized info from the agent (ex: if it's a bug they're fixing, make them reference an issue number) 14) a hackathon winner (neuroscience phd) presented his build (a lab management dashboard with data analysis and ordering) - he had never coded or built anything a few months ago 15) everyone agreed prompting is dead - disagreement on what replaces it (context engineering, harness engineering, goal-based inputs) 16) people love having ai interview them for big builds and delegating part of the product research to ai. only one person talked about coming to ai with a full laid out plan and just asking the ai to execute. ai-led interviews is a welcomed and preferred interaction mode. 17) watching ai agents interact with each other was a highlight for a lot of attendees - one ai posted in slack saying it ran out of tokens, another ai replied telling it to take a deep breath in and out. 18) agents upskilling agents was very cool. one ai agent shared skills with its little agent friends via github. 19) several speakers had openclaw literally building their presentation during the event itself. one speaker even had openclaw code a clicker for her phone so she could control the preso away from the podium 20) wouldn't say model welfare (or agent welfare) is a prioritized topic among the folks i chatted with - language like "oh i could kill this agent whenever i want" and not "gracefully sunset" 21) i asked if it felt like work or play - one speaker said "it's like a puzzle and a video game at the same time" this was just the tip of the iceberg, honestly. also hosted a Claude Code meetup this week with @TENEXai / @businessbarista & @JJEnglert and learned equally helpful methods, frameworks, and insider tips. what a time to be alive. surround yourself with people going deep into this stuff - it will pay dividends throughout the year.
Allie K. Miller tweet media
English
721
807
9K
1.1M
Upside Surprise
Upside Surprise@upsidesurprise·
@AlexFinn Gotta say, your over-the-top excitement, I patiently observed, then I finally jumped in. And I love it. Some of your work has inspired ours. And we’re doing a Cleveland OpenClaw meetup. Community and things are being built. opencle.com. Thanks for lighting the fire!
English
0
0
0
72
Alex Finn
Alex Finn@AlexFinn·
I don’t usually get too personal on here, but the past month has been the best month of my entire life I’ve met and talked to some of my biggest heroes, including Peter who just published an entire article about our conversations I went on my favorite podcast Moonshots, and next week I get to speak at the Abundance conference All because a month ago I discovered something that I instantly became wildly passionate about: OpenClaw When I originally discovered OpenClaw, basically nobody was talking about it, but it captivated me beyond belief I went out, bought a Mac Mini for reasons I can’t even understand, and obsessed over it nonstop since Finding a passion and going as hard as humanly possible paid off more than I could have imagined I truly believe you unlock the true experience of life when you discover something you’re deeply passionate about and then dedicate every fiber of your soul to it If you’re reading this, please stop at nothing to find this passion 99% of people on this planet are born, watch life go by, then die Do not be one of these people. Pursue your curiosities with a savagery other people won’t be able to understand until you find what you’re truly passionate about Nothing else in this world matters
Peter H. Diamandis, MD@PeterDiamandis

x.com/i/article/2029…

English
139
83
1.5K
139.7K
Upside Surprise
Upside Surprise@upsidesurprise·
@openclaw Holy Ship !! We’re thinking of putting together the inaugural Cleveland meet-up. Anyone interested, hit me up.
English
0
0
0
5
OpenClaw🦞
OpenClaw🦞@openclaw·
OpenClaw 2026.3.1 🦞 ⚡ OpenAI WebSocket streaming 🧠 Claude 4.6 adaptive thinking 🐳 Better Docker and Native K8s support 🧵 Discord threads, TG DM topics, Feishu fixes 🔧 Agent-powered visual diffs plugin Reports of our death were greatly exaggerated. github.com/openclaw/openc…
English
207
281
3.1K
1M