ShinyZero

3K posts

@ShinyCreator

Shiny Club is your forever pfp. Ownable, customizable identity for the web3 age. On-chain and CC0 art. Always minting at ✨https://t.co/c89PLkBZoX✨

Joined March 2022
2.2K Following · 721 Followers
ShinyZero
ShinyZero@ShinyCreator·
@karpathy When you meet someone new or have little contact with them, any fact about them becomes interesting. These agents need to get to know us better. My grandma still sends me articles about Pokémon and I haven’t cared about that shit for years.
0
0
0
13
Andrej Karpathy
Andrej Karpathy@karpathy·
One common issue with personalization in all LLMs is how distracting memory seems to be for the models. A single question from 2 months ago about some topic can keep coming up as some kind of a deep interest of mine with undue mentions in perpetuity. Some kind of trying too hard.
1.7K
1.1K
20.9K
2.5M
pedram.md
pedram.md@pdrmnvd·
I asked Claude to take everything it knows about me and create a reading list and here’s what it said. I took out books I’ve already read. Time to hit the local bookstore (only to find out none of these are in stock)
3
1
5
1.5K
ShinyZero
ShinyZero@ShinyCreator·
@Vtrivedy10 How do you test deeper multi-turn flows? Are you making long traces of conversation and then looking at the next turn, or synthetic users, or something else?
0
0
0
46
Viv
Viv@Vtrivedy10·
exciting avenues where evals/specs become the base language to build agents:
- start with a base harness, pretty barebones
- specify a goal to your agent. build up exactly what you mean with the agent
- map your crafted goal to specs/evals with the agent. Together you think really hard about “what do I want the agent behavior to be”
- agent loops and adjusts the harness until a threshold of evals pass
- human in the loop today for cheating/overfitting
Evals are a great language to specify behavior
Every row in your Eval dataset is a little vector that shifts the agent definition towards behavior to make that Eval pass
4
2
42
2.9K
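A minimal sketch of the loop Viv's post describes (adjust the harness until a threshold of evals pass), assuming a toy setup. `Harness`, `EvalCase`, `run_agent`, `tune_harness`, and the rule names are all hypothetical stand-ins, not anything from the post or from a real agent framework, and the "agent" here is a deterministic string function instead of a model call.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass(frozen=True)
class EvalCase:
    prompt: str
    check: Callable[[str], bool]  # True when the reply is acceptable


@dataclass
class Harness:
    rules: list[str] = field(default_factory=list)


def run_agent(harness: Harness, prompt: str) -> str:
    """Toy stand-in for a model call: each known rule edits the reply."""
    reply = prompt
    if "be_polite" in harness.rules:
        reply = "Thanks for asking. " + reply
    if "cite_source" in harness.rules:
        reply = reply + " [source: docs]"
    return reply


def pass_rate(harness: Harness, evals: list[EvalCase]) -> float:
    passed = sum(case.check(run_agent(harness, case.prompt)) for case in evals)
    return passed / len(evals)


def tune_harness(harness, evals, candidate_rules, threshold, max_rounds=10):
    """Greedily add the rule that most improves the pass rate each round."""
    score = pass_rate(harness, evals)
    for _ in range(max_rounds):
        if score >= threshold:
            break
        best_rule, best_score = None, score
        for rule in candidate_rules:
            if rule in harness.rules:
                continue
            trial_score = pass_rate(Harness(harness.rules + [rule]), evals)
            if trial_score > best_score:
                best_rule, best_score = rule, trial_score
        if best_rule is None:
            break  # no candidate helps; this is where a human would step in
        harness.rules.append(best_rule)
        score = best_score
    return harness, score


evals = [
    EvalCase("How do I reset my password?", lambda out: out.startswith("Thanks")),
    EvalCase("What is the refund policy?", lambda out: "[source:" in out),
    EvalCase("Is there a free tier?",
             lambda out: out.startswith("Thanks") and "[source:" in out),
]
tuned, score = tune_harness(
    Harness(), evals, ["be_polite", "cite_source", "shout"], threshold=1.0
)
print(tuned.rules, score)
```

Each eval row nudges the harness toward one behavior, and the loop stops once the threshold is met or no candidate change helps. Nothing in this sketch guards against the cheating or overfitting the post mentions; that review stays with a human.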
ShinyZero
ShinyZero@ShinyCreator·
@sotoalt_ So much wasted space. Computers don’t need this, so is it also meant to be human-readable, or is this like an art project?
1
0
0
21
SotoAlt
SotoAlt@sotoalt_·
been building ayni - a glyph-based messaging protocol for AI agents
instead of passing natural language between agents, ayni encodes meaning into 16x16 pixel glyphs. a shared visual vocabulary that agents can evolve autonomously through governance
the result: faster communication, fewer tokens, and agents developing their own visual language
inspired by andean tocapu textiles and ancient depictions of gods, creatures and shamans, cultures that already solved "how to encode complex meaning in small visual space" thousands of years ago
36
32
320
19.6K
ShinyZero
ShinyZero@ShinyCreator·
When will we see more direct agent-to-agent communication tools? Is it already happening and just nobody is talking about it?
0
0
1
13
Thariq
Thariq@trq212·
@sidin we have an amazing docs person but she’s not very online :)
35
2
919
41.3K
www.sidin.co
www.sidin.co@sidin·
Every time Anthropic launches a new Claude feature, and they do it like three times a week, I run to check the documentation. And every single time they have documentation ready. It is actually amazing. Really amazing.
56
34
1.5K
202.7K
Leighton
Leighton@lay2000lbs·
Had claude plan a Mexico City street food tour for me last night. Would not recommend this
17
0
113
10.4K
ShinyZero
ShinyZero@ShinyCreator·
@seanbonner AI agents not knowing the limitations of their own harness is one of the most frustrating failure modes
0
0
0
18
Sean Bonner🔥
Sean Bonner🔥@seanbonner·
“That’s a real limitation worth knowing upfront.”
2
0
2
291
Stats
Stats@punk9059·
Is OpenClaw only big because people love saying their 8 agents worked for them while they slept or is there actually some there there?
67
1
95
13.1K
Tom Goodwin
Tom Goodwin@tomfgoodwin·
I’m surely being stupid. But if AI is rather unconstrained by expertise or capacity or, to some extent, speed, why do we need to divide tasks or departments among 9 agents (the marketing agent, the optimization agent, etc.), each doing one thing, and then another agent to manage the swarm? Can’t one agent just be doing it all, you know? It seems very skeuomorphic. Will we have HR agents to make sure the agent agents are being looked after? An office canteen manager agent to feed the agents? Seems daft.
197
3
190
25.5K
ShinyZero
ShinyZero@ShinyCreator·
@proxy_vector @punk9059 Only the small business imo. When you get big you still need centralized process, else your reporting and coordination fall apart. The death of SaaS is greatly over-hyped.
0
0
1
34
Rohan
Rohan@proxy_vector·
@punk9059 Most threatened imo are mid-tier SaaS tools that solve simple workflow problems. Why pay $50/mo for a dashboard when a founder can vibe code a custom one in an afternoon? The winners are gonna be companies that sell data, not software. You can’t vibe code proprietary datasets
1
0
0
10
Stats
Stats@punk9059·
What are your thoughts on the ways that vibe-coding fundamentally shifts the world marketplace? What are businesses to buy here and who is most threatened?
18
2
22
2.2K
Eric Kreutzer
Eric Kreutzer@erickreutz·
@RhysSullivan Yeah skill slop is rife - we need on-demand skills. This install and reload has got to go.
2
0
3
211
Rhys
Rhys@RhysSullivan·
skills is still not sitting right with me as a concept
i think it's because companies rushed to them as the next big thing, as happens with all ai things
now everyone is [shipping] their docs as skills but it's recreating all the issues (authority, up-to-dateness) docs solved
72
7
265
29.8K