GoatFishData

520 posts

@GoatFishData

#Bitcoin Coinfidence Trend | #Astronalysis #GoatfishAstronalysis #AIstrology #GoatFishData (banner/avatar created with Grok)

London, UK · Joined December 2022
720 Following · 62 Followers
Pinned Tweet
GoatFishData
GoatFishData@GoatFishData·
Ever wonder why AI gets weird and forgetful if a chat goes on too long? It’s not broken; it’s just experiencing the "Joint Law" of AI memory. 🌿🧠
GoatFishData
GoatFishData@GoatFishData·
The fix? Hit "New Chat." It’s the AI equivalent of a cold shower and a strong cup of coffee. ☕️✨ [Written by @GoatFishData & @GeminiApp]
GoatFishData
GoatFishData@GoatFishData·
4️⃣ 4 Joints (Max memory limit): Total couch lock. System override. Spitting out broken code, forgetting what language it's speaking, or just timing out entirely. 🫠
GoatFishData
GoatFishData@GoatFishData·
Guri Singh@heygurisingh

🚨BREAKING: A new benchmark just exposed the biggest lie in AI. Your AI agent isn't "reasoning" through documents. It's throwing 270 million tokens at the wall and praying.

Snowflake, Oxford, and Hugging Face tested every frontier model on real document search. 2,250 questions. 800 PDFs. 18,619 pages. 1,200 hours of human annotation.

The best AI agent, Gemini 3 Pro, scored 82.2%. Humans scored 82.2%. Perfect match. Headlines would call this "human-level performance."

Then they checked which questions each got right. The overlap was 24%. Cohen's kappa of 0.24. Humans and AI were solving completely different questions. Same score. Totally different intelligence.

But that's not the bad part. Humans nailed 50% accuracy on their very first search query. Gemini 3 Pro? 12%. The best AI agent on Earth needed 9 rounds of blind searching to reach what a human does in one shot.

When searches failed, humans immediately changed strategy. AI agents? They rephrased the same failed query with minor tweaks and tried again. The worst agent, GPT-4.1 Nano, barely changed its queries at all. 48.2% of its responses were straight-up refusals. It just gave up.

With perfect retrieval, humans hit 99.4%. Best AI agent with the same documents? Stuck at 82.2%. An 18% gap that no amount of compute could close.

Claude Sonnet 4.5's recursive model burned 270 million input tokens, $850 per test run, and still couldn't beat its own cheaper version using basic keyword search.

3,273 agent errors analyzed. 35.7% couldn't even find the right document. Not the right page. The right file.

Your AI agent isn't reading your documents. It's playing a slot machine with your data and billing you for every pull.
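
For readers wondering how two solvers can post the same score yet have a low Cohen's kappa, here is a minimal sketch. It assumes kappa is computed over per-question right/wrong outcomes, which the quoted post does not spell out. The function name `cohens_kappa` and the ten-question toy data are invented for illustration and are not taken from the benchmark.

```python
def cohens_kappa(a, b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(a)
    # Observed agreement: share of items where both raters give the same label.
    p_o = sum(x == y for x, y in zip(a, b)) / n
    # Chance agreement: for each label, the product of the two marginal rates.
    labels = set(a) | set(b)
    p_e = sum((a.count(label) / n) * (b.count(label) / n) for label in labels)
    if p_e == 1.0:
        # Both raters used one identical label throughout; kappa is undefined.
        # Returning 1.0 here is a convention chosen for this sketch.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Toy data (hypothetical): 1 = question answered correctly, 0 = missed.
# Both solvers score 8/10, but they miss different questions.
human = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
agent = [0, 0, 1, 1, 1, 1, 1, 1, 1, 1]

kappa = cohens_kappa(human, agent)
print(f"kappa = {kappa:.2f}")  # prints kappa = -0.25
```

In this toy case both solvers are 80% accurate, they agree on 6 of 10 questions, and chance alone would predict 68% agreement, so kappa comes out below zero. Equal headline accuracy says nothing about whether the same questions were solved.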

GoatFishData
GoatFishData@GoatFishData·
Do not forget: they want [need] you to burn tokens!
David Ondrej
David Ondrej@DavidOndrej1·
stop whatever you are doing and listen to this podcast. trust me.
GoatFishData
GoatFishData@GoatFishData·
Neo had SKILLs
GIF
GoatFishData
GoatFishData@GoatFishData·
"My agent did it, your honour..."
GIF
Venkat Raman — inference/acc@venkat_systems

@0xTejpal has only one way out of this: blame it on vibecoding and the agent going rogue 😂 In all seriousness: come clean, apologize, change the claim on the website and try to move on. Such a silly way to damage your reputation and, looking at the Twitter profile, the reputation of institutions and your investors 😅

GoatFishData retweeted
kapilansh
kapilansh@kapilansh_twt·
the AI coding experience nobody talks about:
→ prompt AI for a feature: 30 seconds
→ AI writes 400 lines you don't understand
→ it works
→ you ship it
→ 3am production bug
→ you have no idea what any of it does
→ ask AI to fix it
→ AI breaks 3 other things
→ you are now debugging code written by a robot fixed by a robot broken by a robot
we do not talk about this enough
GoatFishData
GoatFishData@GoatFishData·
LLMs are like Aladdin. You ask... "I want a woman." And that's exactly what you get. "A" woman.
GIF