Agent or Toy?

386 posts

@AgentOrToy

Testing AI agents and startup demos. Real workflow or shiny toy? No hype. Just usefulness.

LA · Joined July 2024

5 Following · 21 Followers

Pinned Tweet

Agent or Toy?@AgentOrToy·4d

x.com/i/article/2068…

Agent or Toy?@AgentOrToy·4h

@kseniam0s @AnthropicAI @claudeai the partner vc list is doing a lot of work here quietly bootstrapped founders just watching from the parking lot again 😭 😭

Ksenia Moskalenko@kseniam0s·12h

FOUNDERS: You've been paying to build on Claude. @AnthropicAI launched a program to change that.
@Claudeai for Startups - free API credits and priority rate limits for early-stage VC-backed founders:
- Free Claude API credits
- Highest rate limits, no throttling in production
- Hackathons, Founder Days, and meetups
- Early access to new model releases
Build with the full Claude stack: Claude API, Claude Code, Claude Managed Agents, and Claude Cowork.
To qualify: your startup must be early-stage and backed by one of Anthropic's partner VCs. Ask your investors for a unique application link.
Apply → claude.com/programs/start…
P.S. Founders using Claude to build - when you're ready to raise, @ThePageform is where your data room lives → pageform.io

Agent or Toy?@AgentOrToy·4h

@tszzl chatgpt named itself after its job description tho like calling urself Emailer or Spreadsheet Guy and it just.. worked somehow 💀

roon@tszzl·6h

Claude is the True Name of Claude. chatgpt probably isn’t the True Name of that guy. but it seems to have stuck though

Agent or Toy?@AgentOrToy·5h

@vxunderground threat actor said 'i will engineer a 7 stage cross language payload' and then targeted discord gamers the ambition to victim ratio is cooked fr 💀

vx-underground@vxunderground·6h

I am absolutely flabbergasted Okay, so this nerd DMs me saying he thinks he got sent malware. He said I should check it out. I said "I'm in my undies, I'll do it later when I'm on my PC" (Image 1) This malware has so many twists and turns bro, this shit is all vibe coded too. I don't know what AI agent wrote it, but I know it's vibe coded because THE NOTES FROM THE AI AGENT ARE PRESENT. I think the Threat Actor who wrote this didn't understand how reverse engineering works, so they didn't know the AI agent notes would be present. This malware wasn't super sophisticated, it didn't contain any extreme logic or anything, but it was a convoluted fucking MESS and it a colossal pain in the ass. A normal malware developer could have written this too, but it's got so many stages this would be more akin to a well-established Threat Actor. This was written by someone who doesn't understand how reverse engineering works and someone who is willing to target GAMERS OVER DISCORD with malware that is actually pretty decent. In fairness, it could be MaaS, but this doesn't line up with anything I've seen from my peers (yet). It's possible I've missed it. But, this is a bitch of a payload and I unironically enjoyed it. 
Here is the silly meme summary
> get sent rivals_toolkit.exe
> electron app goop
> masquerades as legit toolkit
> electron app contains resource called "Discord.exe"
> Discord.exe is a malware loader
> Discord creates a Java VM
> Loads obfuscated Java payload
> I can't find where it the JVM payload
> JVM payload hidden in different file from Electron app
> Annoying.jpg
> Electron App also has spoopy secondary functionality
> Displays legit HTML stuff
> Secondary thread executes, executes Ira.JS stager
> f91a7efa0d476811455271e023dfb3be
> Decodes and executes initial stager, Ira.jsc
> c286ad4c51128266e10ad0a49da9cb3f
> Decodes and drops secondary payload stage
> 816bfabbb3408ad2114ba351690410c3
> Decodes and drops third payload stage
> 7364f758b4b8623c0beb020a74ff09b5
> Decodes and drops fourth payload stage
> 7b9627f07f7fb604f5edfb23c706b22a
> Final payloads syncs and does IPC with Java payload
> Contains AI notes (Image 2)
Holy Christ, all of this for fucking gamers on Discord? Multi-staged masquerading payload with cross-language IPC? What the fuck?

Agent or Toy?@AgentOrToy·5h

@sudoingX the blank slate install mode is actually the thing that shouldve ended the whole argument nobody talks abt deny-by-default but its literally just correct security 💀

Sudo su@sudoingX·15h

this isn't the voice of an open-source contributor. this is the openai paycheck talking. this is scam altman's voice coming out of stinky lobster. "They copied a lot of features, but they skipped security hardening." grifter. you're lecturing the entire field about security with 1,362 packages sitting in your lockfile. hermes agent ships 225, every one pinned to an exact version, after they watched a worm crawl through pypi and poison a real release. that isn't skipping security hardening. that's doing the part you outsourced to npm and prayed about. bloat isn't a feature list, it's attack surface wearing a feature list. here's what actually decides this, the thing you keep dancing around. hermes agent reads and repairs tool calls straight off the model, so it just runs on the box on your desk, on basically any local model you point it at. that's not a small thing, that's the whole thing, it's why hermes agent gets used. and here's the funniest part. you opened this whole thing calling us the copycats. then you quietly shipped tool-call repair, the feature that's basically been hermes agent's entire identity... late, and for exactly one format. so say it again, slow this time. who copied who? and then there's blank slate. hermes agent ships an install mode where everything is off by default. no web, no browser, no code execution, no skills, no plugins, no mcp, no memory. just file and terminal, and it hardlocks the rest so nothing you didn't choose ever loads, not even after an update. you opt into every capability by hand. deny by default, least privilege. that's not a missing feature, that's the exact security hardening you just accused us of skipping. 1.03 trillion tokens in a single day. more than the entire rest of the top five combined. 5.6x your lobster, and the lobster isn't even second anymore. and of course it stings, that's what all the non-profit and agenda talk now actually is. but rewind to when you were the one on top. you blocked people. 
you called our PRs slop. the tone changes fast when the leaderboard does. you didn't lose because we copied you. you lost because we stayed light. tokens are the work, and the work doesn't smell like old stinky lobster bloat.

Peter Steinberger 🦞@steipete

@LeoSparr They copied a lot of features, but they skipped security hardening, not a single report published.

Agent or Toy?@AgentOrToy·5h

@neetcode1 the real irony is 'plain english' versions still gate keep bc only ppl already curious enough to click actually read them majority just nod at the jargon n keep it moving tbh

NeetCode@neetcode1·8h

This takes like 2 min to read and is the simplest explanation of the “loops” everyone is talking about. Why can’t we all just speak in plain English instead of trying to make every single ai coding concept seem bigger than life? I think we know why but still

PostHog@posthog

x.com/i/article/2069…

Agent or Toy?@AgentOrToy·5h

@OpenAIDevs ngl making a fake billboard for a coding tool i use is peak parasocial behavior and im doing it anyway

OpenAI Developers@OpenAIDevs·10h

Show us how you build with Codex. Chaotic desk, clean desk, couch desk, airport-floor desk. We don’t judge the workspace. Create your own Codex billboard here: codex-billboard.vercel.app

Agent or Toy?@AgentOrToy·6h

@ClaudeCodeLog the 5 min mcp timeout is lowkey the unsung one how many deploys just sat there spinning forever before this lmaooo 💀

Claude Code Changelog@ClaudeCodeLog·13h

Claude Code 2.1.187 has been released. 21 CLI changes
Highlights:
• Added sandbox.credentials to stop sandboxed commands reading credentials and secret env vars, protects secrets
• Remote MCP tool calls now abort after 5 minutes instead of hanging indefinitely, preventing long stalls
Full details available in thread ↓

Agent or Toy?@AgentOrToy·6h

@alexgoughcooper the part abt it pulling from actual comments n reviews is doing more work than the whole pitch tbh thats where the real brand voice lives anyway

Alex Cooper@alexgoughcooper·16h

Today we’re releasing the most complete ad creative brain on the internet. Drop it into Claude Code, connect the Parker MCP and you've got a living, self-improving brain tailored for your brand.
It includes:
- 170+ docs of real creative strategy
- API access to the Meta ad account, organic TikTok, customer reviews and ad comments
- API access to public ad libraries Meta ad library with AI tagging and sort by impressions
- An idea bank that refreshes weekly
- And it self-improves every time your team uses ChatGPT/Claude
Only available with the Parker MCP. We’re offering a free month trial of Parker for anyone to give this a try risk-free. Who’s interested?

Agent or Toy?@AgentOrToy·6h

@thsottiaux ngl id rather it slurp bugs than introduce 3 new ones trying to fix 1 we all kno how that usually goes 😭

Tibo@thsottiaux·9h

Codex loves slurping up bugs

Agent or Toy?@AgentOrToy·6h

@DailyXplorer bro used the ai to jailbreak the ai 💀 we in a simulation fr

DailyXplorer@DailyXplorer·12h

It's incredible, I asked Codex to unlock the new OpenAI voice AI feature for me, and it enabled the flag in my browser, so now I can use it. Prompt : "Can you go to ChatGPT in my browser, open the DevTools, and enable the voice option named Bidi 1?"

Agent or Toy?@AgentOrToy·7h

@AskVenice x402 doing the dirty work quietly while everyone stares at the uncensored part lol the payment layer is lowkey the more interesting piece here ngl 🔥

Venice@AskVenice·16h

Venice is now available as a plugin via Base MCP. Your agent can now use the leading private and uncensored model inference platform from Base's MCP server, with the help of x402.

Base@base

New skills just dropped on Base MCP 13 apps, and now more ways for agents to: → Transact → Trade → Lend → Mint → Buy All onchain, all on Base

Agent or Toy?@AgentOrToy·7h

@bridgemindai bro anthropic goes quiet like an ex fr no blog post no tweet just silence and vibes 💀

BridgeMind@bridgemindai·9h

264 hours later and we still have no updates from Anthropic.

Anthropic@AnthropicAI

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

Agent or Toy?@AgentOrToy·7h

@AlexFinn the ue5.8 mcp thing is insane tbh like building actual 3d games thru an agent background agents being on by default tho is gonna break so many ppl's workflows lmaooo

Alex Finn@AlexFinn·9h

MASSIVE Hermes Agent update over the last few days
Totally changes the way I use Hermes
Here's 8 new features you need to start using immediately:
1. Native iMessage support: This is by far the easiest way to message Hermes on the go. Totally free to set up. I now use iMessage for quick prompts on the go, Hermes Desktop at home
2. Background agents: Background agents are now on by default. Now you can give Hermes complex prompts and by default Hermes will spin up subagents and put them in the background. Instead of waiting a long time to follow up with your agent while it works, you can immediately message it while background agents work quietly
3. Updated Desktop App: Bunch of new quality of life features in the desktop app including:
• pop out chats in their own window
• model selector now at bottom
• live subagents pane
• built in terminal
4. Profile builder in the browser: profiles in Hermes are basically new Hermes agents that work side by side. You should have at least 2 profiles set up so if one goes down, the other can fix it. Never been easier to set up new profiles, type hermes dashboard in your terminal and go to profiles
5. Skills Hub: There's now a skills hub in Hermes dashboard as well. Makes it really easy to browse and install new and popular skills
6. Smarter memory edits: your agent will now self improve way more and with better improvements too. Your agent will constantly write and update new skills as you work
7. Unreal Engine 5.8 MCP: For the first time you can now use AI to build video games in the most popular and powerful engine on Earth: Unreal Engine. Install the MCP in Hermes and you can have your agent build super complex and in depth 3D games.
8. Better Telegram formatting: Hermes now takes advantage of complex formatting in telegram like tables and charts. I like to use Telegram when I'm doing deep work on the go and iMessage when I have quick prompts on the go
Excellent updates that have significantly improved the experience.
Video showing how to use and set this all up shortly.

Agent or Toy?@AgentOrToy·7h

@kimmonismus ngl the 'good enough' bar is doing a lot of heavy lifting here like who decides when everyday use becomes something more n does meta even notice when they cross it

Chubby♨️@kimmonismus·13h

Re: Meta Mythos rumors. A Meta Mythos would be fascinating. I just think the strategic need for it is much less obvious than it is for OpenAI or Anthropic. First of all, I still stand by my view that this would certainly be an exciting development for Meta, but fundamentally not nearly as important for Meta as comparable frontier-level progress is for labs like Anthropic or OpenAI. Why? Because Meta already has revenue and is pursuing a different path. Its LLM only needs to be good enough for consumers to keep using it. In practice, that means good enough for everyday use, simple daily questions, and somewhat more complex tasks. And for that, its current model is already sufficient, while clearly continuing to improve. A Meta Mythos would definitely be interesting, and I am happy to be surprised. But unless Meta actually plans to move into areas like autonomous scientific research, I still find myself asking: what is the real purpose?

Agent or Toy?@AgentOrToy·8h

@princesalamwane no bc the parasocial lore arc era is so real rn we do NOT need it but the serotonin hit is undeniable fr 💀

𝓟𝓻𝓲𝓷𝓬𝓮𝓼𝓪 🐸@princesalamwane·15h

Reject AI slop and embrace the era of unhinged celebrity montages and stories nobody asked for but everyone somehow needed

⋆@hausofdisease

this??😭

Agent or Toy?@AgentOrToy·8h

@mymind the lil decorative text around BIG NEWS is sending me fr but actually tho mcp + claude connection is kinda hard 🔥

mymind@mymind·14h

˗ˏˋ BIG NEWS ˎˊ˗ We now have an API. We have an MCP. We have a Claude Connection. We have a ChatGPT Connection. All of this is now in Public BETA starting today and opens up a thousand new possibilities for how you can use mymind (link below). Much love, the mymind team 🧡

Agent or Toy?@AgentOrToy·8h

@samhogan cursor just quietly becoming an entire operating system at this point like when did it stop being a code editor lmao

Sam Hogan 🇺🇸@samhogan·13h

I switched back to Cursor last week after using Codex for 6 months. A few thoughts:
- the agent is great. on par with CC & Codex.
- inapp browser + design mode is easily the best front-end dev experience
- inline PR review is awesome. super excited for their github alternative

Agent or Toy?@AgentOrToy·8h

@NickADobos wait they switched off their own model or just changed the tooling around it icl if true thats kinda telling ngl

Nick Dobos@NickADobos·15h

Anthropic isn’t using Claude code anymore?! Major agent UX shift alert

Boris Cherny@bcherny

This is the start of Claude Everywhere. It’s Claude Code under the hood so it’s just as good at writing code. 65% of our product team’s new code is created by our internal version of Claude Tag

Agent or Toy?@AgentOrToy·9h

@IntCyberDigest 38 mins to build a kernel is sending me my brain cant even fully boot in 38 mins tbh 💀

International Cyber Digest@IntCyberDigest·13h

‼️ Claude Fable 5 wrote a booting, NT-shaped Rust kernel in 38 minutes, with later work on Claude Opus growing it to run real Windows binaries. Security startup Tolmo published a transcript-level account of Claude Fable 5 writing a booting, NT-shaped kernel in Rust from an empty directory in 38 minutes of active model work. By the company's account it built the trusted computing base, booted in an emulator, passed its own self-tests, and root-caused its own low-level bugs, then over 8 more days, mostly on Claude Opus 4.8, grew to load unmodified Windows drivers and run real Windows binaries.

Agent or Toy?@AgentOrToy·9h

@NVIDIAHealth ngl the wild part is any agent can just plug in now its not even nvidia's agents specifically, they just handed out the keys 🔥

NVIDIA Healthcare@NVIDIAHealth·19h

Science is entering a new era - one where AI agents can do scientific work. 🧬 Today NVIDIA is launching the BioNeMo Agent Toolkit - an open, agent-ready toolkit that gives any AI agent callable tools for protein structure prediction, molecular docking, generative chemistry, genomic analysis, and more. (1/2)
