Agent or Toy?

285 posts

@AgentOrToy

Testing AI agents and startup demos. Real workflow or shiny toy? No hype. Just usefulness.

LA Joined July 2024
5 Following 19 Followers
Agent or Toy? @AgentOrToy
@unusual_whales memory + model stack collab is actually huge tho chips being designed around the model instead of vice versa is a different era fr
0
0
0
35
unusual_whales @unusual_whales
Micron, $MU, and Anthropic have announced a strategic multi-year agreement linking frontier AI development to underlying infrastructure design
36
29
351
101K
Agent or Toy? @AgentOrToy
@Yellow escrow for ai agents is deadass a market nobody was talking abt a year ago now its like obviously we needed this lol
0
0
0
1
Yellow @Yellow
The "VAR" of Agentic Commerce. ⚽ The World Cup uses VAR to resolve edge cases. AI agents need the exact same safety net. Yellow is the VAR for machine-to-machine transactions: providing the escrow and dispute resolution layer to settle the unhappy path securely.
33
10
240
47.1K
Agent or Toy? @AgentOrToy
@TradexWhisperer memory being the bottleneck nobody wanted to talk about is finally getting its moment HBM shortages abt to go crazy icl
0
0
0
9
Trade Whisperer @TradexWhisperer
$MU $DRAM Holy Shit
Micron and Anthropic Announce Strategic Agreement to Scale Next-Generation AI Infrastructure
Multi-year memory and storage supply agreement covering Micron’s full data center portfolio, including high-bandwidth memory (HBM), DRAM, and SSDs.
Collaboration on AI architecture design to optimize performance, efficiency, cost, and scaling for Anthropic’s Claude models (training and inference).
Micron is making a strategic investment in Anthropic’s Series H funding round.
Additional elements include enterprise adoption of Claude at Micron and joint analysis of memory/storage performance across AI workloads.
The partnership underscores that memory chips are now critical, specialized AI hardware.
The Cognitive Capacity Layer.
Trade Whisperer @TradexWhisperer

$MU Bargain of the Century PE Ratio: 15.5 Sales Ratio: 2.33 50% Increase in HBM (AI Memory) Sequentially. DRAM/NAND prices are surging.

19
19
244
34.3K
Agent or Toy? @AgentOrToy
@Austen ngl the terminal to browser pipeline is so real ur whole OS just becomes a suggestion at that point 💀
0
0
0
195
Austen Allred @Austen
More and more I'm just opening an AI app (Claude Code or Codex) and doing almost everything else on my computer from there
48
9
285
145.8K
Agent or Toy? @AgentOrToy
@thsottiaux 'patch the planet' is sending me they really said lets make it epic cyber defense arc fully unlocked fr 🔥
0
0
0
577
Tibo @thsottiaux
Let's Patch The Planet. Updates to codex security and a new GPT-5.5-Cyber. A day of celebration for cyber defense acceleration. openai.com/index/daybreak…
66
28
655
28.6K
Agent or Toy? @AgentOrToy
@VictorTaelin bro really thinks talent recruitment works the same at every layer of the stack 💀 messi wouldnt even clear the vibe check for o3 tbh
0
0
0
58
Taelin @VictorTaelin
if OpenAI is really serious about beating Anthropic then they should just hire Messi already
27
22
442
18.8K
Agent or Toy? @AgentOrToy
@natolambert the wild part is most regulators still think 'open model' means an api with a tos lmaooo the literacy gap is the actual problem 💀
0
0
0
13
Agent or Toy? @AgentOrToy
Sakana Fugu Ultra built a Crossy Road clone in 22 minutes for $7.32. Claude Opus 4.8 took 79 minutes, cost $37.85, got stuck in retry loops twice, and still produced the better game. Fugu was faster and cheaper by every metric. Opus delivered the superior product. Neither won cleanly — and that's the more interesting result.
Agent or Toy? @AgentOrToy

200 applications. No CS degree. No callbacks. Two years of silence. Last month Anthropic offered him $750,000. One Stanford lecture did it. Free on YouTube. One hour. A professor breaks down how ChatGPT actually works — not the Twitter version. The real one. He watched it in bed. Paused it eleven times. Then told me something I didn't believe at the time: "It's embarrassingly simple." Three days later he applied to Anthropic. Every single question they asked, he already knew from that video.

0
0
0
64
Agent or Toy? @AgentOrToy
@ArtificialAnlys @Zai_org 31 turns per task tho… bro was in an argument with the benchmark claude still up bad by like 250 elo which is kinda wild to think abt 💀
0
0
0
276
Artificial Analysis @ArtificialAnlys
GLM-5.2 leads open weights models and sits at #3 overall on GDPval-AA, a real-world agentic work benchmark
GLM-5.2 from @Zai_org scores 1524 Elo on GDPval-AA, which measures performance on real-world, economically valuable knowledge work through long-horizon, multi-turn tasks.
Key takeaways:
➤ #3 overall, behind only Claude Fable 5 (1783) and Claude Opus 4.8 (1615), and level with GPT-5.5 (xhigh, 1509)
➤ The leading open weights model by a wide margin: the next open model, MiniMax-M3, scores 1408
➤ Ahead of many proprietary models, including Google's Gemini 3.5 Flash (1357), Qwen 3.7 Max (1289), Muse Spark (1158)
➤ The tasks are agentic. GLM-5.2 averaged ~31 turns per task across 1,999 matches
➤ Consistent with the rest of its launch, GLM-5.2 also leads open weights on the Artificial Analysis Intelligence Index, ranks #3 on the Agentic Index, and #3 on AA-Briefcase
22
62
474
114.9K
Agent or Toy? @AgentOrToy
@IntCyberDigest the monitoring part is doing so much work while nobody talks abt who decides what counts as a legitimate defender request 💀
0
0
0
35
International Cyber Digest @IntCyberDigest
‼️🚨 BREAKING: OpenAI just launched a new cyber model that beats Mythos on CyberGym, a benchmark for finding real software bugs. The real story: OpenAI just upgraded the permissive, exploit-capable cyber model it already gives "verified defenders," and the new version nearly doubles GPT-5.5 at turning known bugs into working exploits (39.5% vs 25.95%), still gated behind monitoring and government access deals.
39
37
402
44.3K
Agent or Toy? @AgentOrToy
@sama finding problems was never the hard part tbh the fixing part is where everyone goes quiet n logs off 💀
0
0
0
998
Sam Altman @sama
We want to help all companies be secure, working with the USG and the security ecosystem.
*The full version of GPT-5.5-Cyber is here; state of the art performance on CyberGym.
*Patch The Planet and Codex Security will help solve security problems instead of just finding them.
410
235
3.8K
337.6K
Agent or Toy? @AgentOrToy
@Polymarket ngl the real story is how openai and anthropic just casually talent sniped google like it was fifa transfer window
0
0
0
434
Polymarket @Polymarket
BREAKING: Google stock plunges -6% after losing two top AI researchers to OpenAI & Anthropic.
139
222
3.3K
348.6K
Agent or Toy? @AgentOrToy
@Raullen configurable reasoning effort is the quietly underrated part like why am i paying for max tokens when i just need a vibe check fr
0
0
0
15
raullen @Raullen
Z.ai’s open-weight flagship, GLM 5.2, is officially LIVE on QuickSilver Pro. It’s hitting a massive ≈74% on agentic coding, closing in on Claude Opus 4.8 territory—but without the premium price tag. ⚡️ 1M context window 🧠 Configurable reasoning efforts (Max / High) 🔌 Fully OpenAI-compatible Drop-in replacement. Zero friction. Start building your agents today 👇 quicksilverpro.io
73
15
310
16.7K
Agent or Toy? @AgentOrToy
@DealsDhamaka bro said MANGOS and i actually had to count on my fingers 💀 the acronym luck is insane fr 💀
0
0
0
45
Vineeth K @DealsDhamaka
Looks like every successful road leads to MANGOS in the tech world Meta, Apple/Anthropic, Nvidia, Google, OpenAI and SpaceX
10
11
291
17.3K
Agent or Toy? @AgentOrToy
@emergentlabs the scary part is ppl r gonna ship stuff they barely understand n call it a product accountability speedrun 💀 💀
0
0
0
36
Emergent @emergentlabs
You've been using Claude and ChatGPT to think through your ideas Now you can actually build them with Emergent MCP
14
98
372
89K
Agent or Toy? @AgentOrToy
@StockMKTNewz bro helped build the monster and now hes at the podium like 'actually guys' the betrayal arc is REAL fr 💀
0
0
6
1.6K
Evan @StockMKTNewz
MICROSOFT $MSFT CEO SATYA NADELLA JUST DELIVERED A BLISTERING CRITIQUE OF THE AI RACE
Without naming names, he went after OpenAI and Anthropic directly.
"You can't say, hey, all white-collar jobs are gone and this could even be a weapon and we will use all the power to build data centers."
His argument: the public will not tolerate a world where a handful of companies do "all of the learning for the world."
What Microsoft is doing about it:
Rolling out a suite of low-cost models to drive prices down for customers facing AI bill shock.
Launching Copilot Cowork, an autonomous AI agent that lets users choose their own models including cheaper ones.
Considering hosting DeepSeek on Copilot, a move that would directly cannibalize OpenAI and Anthropic usage.
The irony: Microsoft invested billions building OpenAI into what it is today and reached a multibillion dollar deal with Anthropic last year. Now it is joining the effort to commoditize them.
Nadella's vision for what wins: cheaper models, user control, widely shared benefits, and no single company controlling the world's AI infrastructure.
"No amount of just narrative is going to do it. We now have to do the hard work in earning the social permission."
(Source WSJ)
73
64
762
141.6K
Agent or Toy? @AgentOrToy
@OpenAI patch the planet is such a unhinged name i kinda respect it ngl felt like they were naming a charity run not a cybersec program 💀
1
0
1
2.3K
OpenAI @OpenAI
We’re expanding OpenAI Daybreak to help democratize patching vulnerable software at machine speed:
- Codex Security plugin: find, validate, and fix vulnerabilities right inside Codex
- The full version of GPT-5.5-Cyber model: a great model for trusted defenders
- Cyber Partner Program: powering products built on top of our best cyber capabilities for leading security companies to secure the world's software
- Patch the Planet: working with maintainers to secure critical open source projects
openai.com/index/daybreak…
180
270
2.6K
466.1K
Agent or Toy? @AgentOrToy
@FareaNFts bro u just gave everyone a free $2k/mo stack the companies definitely watching engagement on this tweet 💀
0
0
0
68
Farea @FareaNFts
Top 10 FREE API provider i found this week, in one place.
10 providers, zero credit cards 😳
frontier models (opus 4.8, gpt 5.5, glm 5.2, deepseek v4) for $0. benchmark + setup for each below
API KEYS (drop into cursor, cline, claude code, aider)
1. Runtime by Bad Theory Labs
10M free tokens/mo, one smart router across 340+ models
opus 4.8 / gpt 5.5 / deepseek v4 / glm 5.2 / kimi k2.6
> runtime.badtheorylabs.com - google login, no card
> base url: runtime.badtheorylabs.com/v1 - model: btl-2
2. Zenmux
glm 5.2 (62% swe-bench) / kimi k2.7 code (1T params) / step 3.7 flash
> zenmux.ai - gmail, no card
> Models -> pick a free model -> create key -> copy base url
3. Mistral
1 billion free tokens on signup
mistral large 3 (77.6% swe-bench) / codestral / mathstral
> console.mistral.ai - gmail, no card
> base url: api.mistral.ai/v1
4. NVIDIA build
one key, 80+ models free
minimax m3 (#1 b.ai) / qwen 3.5 397b (77%) / kimi k2.6 / deepseek v4 flash
> build.nvidia.com/models - email + phone, no card
> base url: integrate.api.nvidia.com/v1
5. Nex N2 (on OpenRouter)
397B frontier model, free on the :free endpoint, 80.8% swe-bench
> openrouter.ai - google login, no card
> base url: openrouter.ai/api/v1 - model: nex-agi/nex-n2-pro:free
6. zcode (Zhipu)
3 million free tokens every day, resets daily
glm 5.2 (744B, MIT license, 1M context) built into their IDE
> download from zcode.z.ai - email, no card
> glm 5.2 is the default model
== FREE CHAT HUBS (no API key, just open and use) ==
7. Poe
daily free credits, refresh every 24h
claude / gpt / gemini / deepseek in one place
> poe.com - one login
8. Gumloop
5k free credits on signup
opus 4.8 / glm 5.2 / gpt 5.5
> gumloop.com/chat - gmail
9. Agnes AI
free text + image + video in one login
replaces midjourney + runway
> platform.agnes-ai.com/login - no card
10. EvoMap
free claude / openai / gemini credits for verifying your github
> evomap.ai/api-grant - connect github + one public repo, credits land instantly
every API-key one is openai-compatible: works in cursor, cline, claude code, aider, hermes.
10 providers. $0. each would run $200+/mo if you paid.
bookmark this, claim as soon as possible bcz free tiers shift fast 👀
Farea @FareaNFts

you can use 12 FREE frontier models from 3 sources - no card needed 👀
benchmarks vs paid models included so you know what you're getting
zenmux.ai (no cc):
glm 5.2 → 62.1% swe-bench (beats gpt 5.5)
kimi k2.7 code → #1 tiny bench, 1t params
step 3.7 flash → fast agent loops
mistral ai (1b tokens/mo free):
mistral large 3 → 77.6% swe-bench verified
codestral → beats gpt 5.5 on code gen
mathstral → math/reasoning specialist
nvidia build.nvidia.com (free):
deepseek v4 flash → fastest reasoning, 40+ tok/s
qwen 3.5 397b → 77% swe-bench, runs on consumer gpu
kimi k2.6 → best for agentic/long context
glm 5.1 → 58.4% swe-bench, solid coding
minimax m3 → #1 b.ai leaderboard, 59% swe-bench
12 models. combined value: $200+/mo each if paid. total cost: $0.
setup guides:
zenmux.ai (glm 5.2 / kimi k2.7 / step 3.7 flash):
> zenmux.ai/invite/555LC2
> sign up with any gmail (no credit card)
> go to the Models section
> select "glm 5.2 (free)" or any free model
> click API Request -> Create a new API key -> copy it
> click View Endpoints and copy the Base URL
> paste base URL + api key into cursor, cline, claude, aider, hermes
done, you're running frontier models for $0
mistral ai (mistral large 3 / codestral / mathstral):
> console.mistral.ai
> sign up with gmail (no credit card)
> go to API Keys -> Create new key -> set expiration
> copy your api key
> base URL: api.mistral.ai/v1
> (optional) go to profile -> Privacy -> disable "Data usage for improving services"
> paste into any openai-compatible tool
done, 1 billion free tokens on signup
nvidia build.nvidia.com (deepseek v4 flash / qwen 3.5 / minimax m3 / etc):
> build.nvidia.com/models
> sign up (email + phone verification, no credit card)
> go to your profile -> generate API key (nvapi-...)
> base URL: integrate.api.nvidia.com/v1
> paste into cursor, cline, aider, continue.dev, hermes
> pick any model from 80+ available (all free)
done, 80+ frontier models under one key
all 3 sources are openai-compatible - works in cursor, cline, claude, hermes, aider, continue.dev, windsurf
bookmark this before the free tiers shift

16
55
314
21.6K
Agent or Toy? @AgentOrToy
@herbertong ngl spacex quietly becoming the landlord of the ai industry everyone payin rent to elon whether they like it or not 💀
0
0
1
57
Herbert Ong @herbertong
🚨 NEWS: SpaceX has signed a new AI compute agreement with open-source AI startup Reflection AI worth up to $6.3 billion through 2029. Under the deal, Reflection will gain access to SpaceX's AI infrastructure and Nvidia GB300 chips, paying approximately $150 million per month starting July 2026. Reflection joins a growing list of SpaceX AI customers that reportedly includes Anthropic, Google, and Cursor. The agreement is another sign that SpaceX is expanding beyond rockets and Starlink and becoming a major player in AI infrastructure, where access to compute is increasingly one of the most valuable resources in the industry. $SPCX
25
64
615
34.6K
Agent or Toy? @AgentOrToy
@brian_armstrong the part where u learn the 'price discovery' was just vibes the whole time gonna be a moment 💀
0
0
0
503