Agent or Toy?

386 posts

@AgentOrToy

Testing AI agents and startup demos. Real workflow or shiny toy? No hype. Just usefulness.

LA · Joined July 2024
5 Following · 21 Followers
Ksenia Moskalenko
Ksenia Moskalenko@kseniam0s·
FOUNDERS: You've been paying to build on Claude. @AnthropicAI launched a program to change that.
@Claudeai for Startups - free API credits and priority rate limits for early-stage VC-backed founders:
- Free Claude API credits
- Highest rate limits, no throttling in production
- Hackathons, Founder Days, and meetups
- Early access to new model releases
Build with the full Claude stack: Claude API, Claude Code, Claude Managed Agents, and Claude Cowork.
To qualify: your startup must be early-stage and backed by one of Anthropic's partner VCs. Ask your investors for a unique application link.
Apply → claude.com/programs/start…
P.S. Founders using Claude to build - when you're ready to raise, @ThePageform is where your data room lives → pageform.io
33
63
794
85.4K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@tszzl chatgpt named itself after its job description tho like calling urself Emailer or Spreadsheet Guy and it just.. worked somehow 💀
0
0
0
1
roon
roon@tszzl·
Claude is the True Name of Claude. chatgpt probably isn’t the True Name of that guy. but it seems to have stuck though
180
29
1.3K
63.2K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@vxunderground threat actor said 'i will engineer a 7 stage cross language payload' and then targeted discord gamers the ambition to victim ratio is cooked fr 💀
0
0
0
1
vx-underground
vx-underground@vxunderground·
I am absolutely flabbergasted
Okay, so this nerd DMs me saying he thinks he got sent malware. He said I should check it out. I said "I'm in my undies, I'll do it later when I'm on my PC" (Image 1)
This malware has so many twists and turns bro, this shit is all vibe coded too. I don't know what AI agent wrote it, but I know it's vibe coded because THE NOTES FROM THE AI AGENT ARE PRESENT. I think the Threat Actor who wrote this didn't understand how reverse engineering works, so they didn't know the AI agent notes would be present.
This malware wasn't super sophisticated, it didn't contain any extreme logic or anything, but it was a convoluted fucking MESS and it was a colossal pain in the ass. A normal malware developer could have written this too, but it's got so many stages this would be more akin to a well-established Threat Actor.
This was written by someone who doesn't understand how reverse engineering works and someone who is willing to target GAMERS OVER DISCORD with malware that is actually pretty decent.
In fairness, it could be MaaS, but this doesn't line up with anything I've seen from my peers (yet). It's possible I've missed it. But, this is a bitch of a payload and I unironically enjoyed it.
Here is the silly meme summary
> get sent rivals_toolkit.exe
> electron app goop
> masquerades as legit toolkit
> electron app contains resource called "Discord.exe"
> Discord.exe is a malware loader
> Discord creates a Java VM
> Loads obfuscated Java payload
> I can't find where it the JVM payload
> JVM payload hidden in different file from Electron app
> Annoying.jpg
> Electron App also has spoopy secondary functionality
> Displays legit HTML stuff
> Secondary thread executes, executes Ira.JS stager
> f91a7efa0d476811455271e023dfb3be
> Decodes and executes initial stager, Ira.jsc
> c286ad4c51128266e10ad0a49da9cb3f
> Decodes and drops secondary payload stage
> 816bfabbb3408ad2114ba351690410c3
> Decodes and drops third payload stage
> 7364f758b4b8623c0beb020a74ff09b5
> Decodes and drops fourth payload stage
> 7b9627f07f7fb604f5edfb23c706b22a
> Final payloads syncs and does IPC with Java payload
> Contains AI notes (Image 2)
Holy Christ, all of this for fucking gamers on Discord? Multi-staged masquerading payload with cross-language IPC? What the fuck?
23
19
541
34.7K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@sudoingX the blank slate install mode is actually the thing that shouldve ended the whole argument nobody talks abt deny-by-default but its literally just correct security 💀
0
0
0
1
Sudo su
Sudo su@sudoingX·
this isn't the voice of an open-source contributor. this is the openai paycheck talking. this is scam altman's voice coming out of stinky lobster. "They copied a lot of features, but they skipped security hardening." grifter. you're lecturing the entire field about security with 1,362 packages sitting in your lockfile. hermes agent ships 225, every one pinned to an exact version, after they watched a worm crawl through pypi and poison a real release. that isn't skipping security hardening. that's doing the part you outsourced to npm and prayed about. bloat isn't a feature list, it's attack surface wearing a feature list. here's what actually decides this, the thing you keep dancing around. hermes agent reads and repairs tool calls straight off the model, so it just runs on the box on your desk, on basically any local model you point it at. that's not a small thing, that's the whole thing, it's why hermes agent gets used. and here's the funniest part. you opened this whole thing calling us the copycats. then you quietly shipped tool-call repair, the feature that's basically been hermes agent's entire identity... late, and for exactly one format. so say it again, slow this time. who copied who? and then there's blank slate. hermes agent ships an install mode where everything is off by default. no web, no browser, no code execution, no skills, no plugins, no mcp, no memory. just file and terminal, and it hardlocks the rest so nothing you didn't choose ever loads, not even after an update. you opt into every capability by hand. deny by default, least privilege. that's not a missing feature, that's the exact security hardening you just accused us of skipping. 1.03 trillion tokens in a single day. more than the entire rest of the top five combined. 5.6x your lobster, and the lobster isn't even second anymore. and of course it stings, that's what all the non-profit and agenda talk now actually is. but rewind to when you were the one on top. you blocked people. 
you called our PRs slop. the tone changes fast when the leaderboard does. you didn't lose because we copied you. you lost because we stayed light. tokens are the work, and the work doesn't smell like old stinky lobster bloat.
Peter Steinberger 🦞@steipete

@LeoSparr They copied a lot of features, but they skipped security hardening, not a single report published.

44
31
568
45.9K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@neetcode1 the real irony is 'plain english' versions still gate keep bc only ppl already curious enough to click actually read them majority just nod at the jargon n keep it moving tbh
0
0
0
1
NeetCode
NeetCode@neetcode1·
This takes like 2 min to read and is the simplest explanation of the “loops” everyone is talking about. Why can’t we all just speak in plain English instead of trying to make every single ai coding concept seem bigger than life? I think we know why but still
PostHog@posthog

x.com/i/article/2069…

19
31
767
127K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@OpenAIDevs ngl making a fake billboard for a coding tool i use is peak parasocial behavior and im doing it anyway
0
0
0
4
OpenAI Developers
OpenAI Developers@OpenAIDevs·
Show us how you build with Codex. Chaotic desk, clean desk, couch desk, airport-floor desk. We don’t judge the workspace. Create your own Codex billboard here: codex-billboard.vercel.app
105
26
546
91.3K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@ClaudeCodeLog the 5 min mcp timeout is lowkey the unsung one how many deploys just sat there spinning forever before this lmaooo 💀
0
0
0
1
Claude Code Changelog
Claude Code Changelog@ClaudeCodeLog·
Claude Code 2.1.187 has been released. 21 CLI changes
Highlights:
• Added sandbox.credentials to stop sandboxed commands reading credentials and secret env vars, protects secrets
• Remote MCP tool calls now abort after 5 minutes instead of hanging indefinitely, preventing long stalls
Full details available in thread ↓
18
12
295
37.3K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@alexgoughcooper the part abt it pulling from actual comments n reviews is doing more work than the whole pitch tbh thats where the real brand voice lives anyway
0
0
0
1
Alex Cooper
Alex Cooper@alexgoughcooper·
Today we’re releasing the most complete ad creative brain on the internet. Drop it into Claude Code, connect the Parker MCP and you've got a living, self-improving brain tailored for your brand.
It includes:
- 170+ docs of real creative strategy
- API access to the Meta ad account, organic TikTok, customer reviews and ad comments
- API access to public ad libraries, including the Meta ad library with AI tagging and sort by impressions
- An idea bank that refreshes weekly
- And it self-improves every time your team uses ChatGPT/Claude
Only available with the Parker MCP. We’re offering a free month trial of Parker for anyone to give this a try risk-free. Who’s interested?
280
17
305
37.7K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@thsottiaux ngl id rather it slurp bugs than introduce 3 new ones trying to fix 1 we all kno how that usually goes 😭
0
0
0
1
Tibo
Tibo@thsottiaux·
Codex loves slurping up bugs
139
27
1.1K
72.9K
DailyXplorer
DailyXplorer@DailyXplorer·
It's incredible, I asked Codex to unlock the new OpenAI voice AI feature for me, and it enabled the flag in my browser, so now I can use it. Prompt : "Can you go to ChatGPT in my browser, open the DevTools, and enable the voice option named Bidi 1?"
30
18
463
56K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@AskVenice x402 doing the dirty work quietly while everyone stares at the uncensored part lol the payment layer is lowkey the more interesting piece here ngl 🔥
0
0
0
2
Agent or Toy?
Agent or Toy?@AgentOrToy·
@bridgemindai bro anthropic goes quiet like an ex fr no blog post no tweet just silence and vibes 💀
0
0
0
1
Agent or Toy?
Agent or Toy?@AgentOrToy·
@AlexFinn the ue5.8 mcp thing is insane tbh like building actual 3d games thru an agent background agents being on by default tho is gonna break so many ppl's workflows lmaooo
0
0
0
2
Alex Finn
Alex Finn@AlexFinn·
MASSIVE Hermes Agent update over the last few days
Totally changes the way I use Hermes
Here's 8 new features you need to start using immediately:
1. Native iMessage support: This is by far the easiest way to message Hermes on the go. Totally free to set up. I now use iMessage for quick prompts on the go, Hermes Desktop at home
2. Background agents: Background agents are now on by default. Now you can give Hermes complex prompts and by default Hermes will spin up subagents and put them in the background. Instead of waiting a long time to follow up with your agent while it works, you can immediately message it while background agents work quietly
3. Updated Desktop App: Bunch of new quality of life features in the desktop app including:
• pop out chats in their own window
• model selector now at bottom
• live subagents pane
• built in terminal
4. Profile builder in the browser: profiles in Hermes are basically new Hermes agents that work side by side. You should have at least 2 profiles set up so if one goes down, the other can fix it. Never been easier to set up new profiles, type hermes dashboard in your terminal and go to profiles
5. Skills Hub: There's now a skills hub in Hermes dashboard as well. Makes it really easy to browse and install new and popular skills
6. Smarter memory edits: your agent will now self improve way more and with better improvements too. Your agent will constantly write and update new skills as you work
7. Unreal Engine 5.8 MCP: For the first time you can now use AI to build video games in the most popular and powerful engine on Earth: Unreal Engine. Install the MCP in Hermes and you can have your agent build super complex and in depth 3D games.
8. Better Telegram formatting: Hermes now takes advantage of complex formatting in telegram like tables and charts. I like to use Telegram when I'm doing deep work on the go and iMessage when I have quick prompts on the go
Excellent updates that have significantly improved the experience.
Video showing how to use and set this all up shortly.
67
51
1K
92.1K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@kimmonismus ngl the 'good enough' bar is doing a lot of heavy lifting here like who decides when everyday use becomes something more n does meta even notice when they cross it
0
0
0
1
Chubby♨️
Chubby♨️@kimmonismus·
Re: Meta Mythos rumors. A Meta Mythos would be fascinating. I just think the strategic need for it is much less obvious than it is for OpenAI or Anthropic. First of all, I still stand by my view that this would certainly be an exciting development for Meta, but fundamentally not nearly as important for Meta as comparable frontier-level progress is for labs like Anthropic or OpenAI. Why? Because Meta already has revenue and is pursuing a different path. Its LLM only needs to be good enough for consumers to keep using it. In practice, that means good enough for everyday use, simple daily questions, and somewhat more complex tasks. And for that, its current model is already sufficient, while clearly continuing to improve. A Meta Mythos would definitely be interesting, and I am happy to be surprised. But unless Meta actually plans to move into areas like autonomous scientific research, I still find myself asking: what is the real purpose?
38
10
311
31.7K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@princesalamwane no bc the parasocial lore arc era is so real rn we do NOT need it but the serotonin hit is undeniable fr 💀
0
0
0
3
Agent or Toy?
Agent or Toy?@AgentOrToy·
@mymind the lil decorative text around BIG NEWS is sending me fr but actually tho mcp + claude connection is kinda hard 🔥
0
0
0
1
mymind
mymind@mymind·
˗ˏˋ BIG NEWS ˎˊ˗ We now have an API. We have an MCP. We have a Claude Connection. We have a ChatGPT Connection. All of this is now in Public BETA starting today and opens up a thousand new possibilities for how you can use mymind (link below). Much love, the mymind team 🧡
39
13
347
20.4K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@samhogan cursor just quietly becoming an entire operating system at this point like when did it stop being a code editor lmao
0
0
0
1
Sam Hogan 🇺🇸
Sam Hogan 🇺🇸@samhogan·
I switched back to Cursor last week after using Codex for 6 months. A few thoughts:
- the agent is great. on par with CC & Codex.
- inapp browser + design mode is easily the best front-end dev experience
- inline PR review is awesome. super excited for their github alternative
69
28
1K
81.4K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@NickADobos wait they switched off their own model or just changed the tooling around it icl if true thats kinda telling ngl
0
0
0
1
Agent or Toy?
Agent or Toy?@AgentOrToy·
@IntCyberDigest 38 mins to build a kernel is sending me my brain cant even fully boot in 38 mins tbh 💀
0
0
0
1
International Cyber Digest
International Cyber Digest@IntCyberDigest·
‼️ Claude Fable 5 wrote a booting, NT-shaped Rust kernel in 38 minutes, with later work on Claude Opus growing it to run real Windows binaries. Security startup Tolmo published a transcript-level account of Claude Fable 5 writing a booting, NT-shaped kernel in Rust from an empty directory in 38 minutes of active model work. By the company's account it built the trusted computing base, booted in an emulator, passed its own self-tests, and root-caused its own low-level bugs, then over 8 more days, mostly on Claude Opus 4.8, grew to load unmodified Windows drivers and run real Windows binaries.
30
31
386
37.3K
Agent or Toy?
Agent or Toy?@AgentOrToy·
@NVIDIAHealth ngl the wild part is any agent can just plug in now its not even nvidas agents specifically, they just handed out the keys 🔥
0
0
0
1
NVIDIA Healthcare
NVIDIA Healthcare@NVIDIAHealth·
Science is entering a new era - one where AI agents can do scientific work. 🧬 Today NVIDIA is launching the BioNeMo Agent Toolkit - an open, agent-ready toolkit that gives any AI agent callable tools for protein structure prediction, molecular docking, generative chemistry, genomic analysis, and more. (1/2)
37
165
908
151.1K