dirtydata

836 posts

dirtydata

@d1rt7d4t4

Katılım Eylül 2020

623 Takip Edilen15 Takipçiler

dirtydata@d1rt7d4t4·21h

@doodlestein when you hear the word "over-engineered" this pops up

English

Jeffrey Emanuel@doodlestein·1d

So nuts! My longest codex /goal marathon ever: gpt-5.5 xhigh · ~/projects/storage_ballast_helper Goal achieved (5d 13h 40m)

English

420

42.8K

dirtydata@d1rt7d4t4·21h

@intheworldofai

GIF

QME

186

WorldofAI@intheworldofai·1d

Hermes Agent is evolving FAST. In just the past week, Nous Research added: - A full WebUI/Desktop App - Background Computer Use on macOS - Multi-agent orchestration - Hermes Kanban upgrades - Lightpanda browser backend support - Qwen3.6-Plus FREE in Nous Portal - Better autonomous workflows - Persistent long-term memory systems Hermes is starting to feel less like an AI tool and more like a true open-source Agentic AI Operating System. Full breakdown/demo: youtu.be/Gx2joHxUhgg

YouTube

English

153

1.9K

115.4K

dirtydata@d1rt7d4t4·2d

@dair_ai I feel like it's a number of factors. #1 the server is out of your control for optimal experience. Best tested on when they're actually giving you compute. #2 It seems like incremental steering along the way or none at all... it's hard to pinpoint exactly when.

English

DAIR.AI@dair_ai·2d

Cool paper from PwC. "Earlier is always better" is the default intuition for agent clarification. New paper claims that's mostly wrong. Goal clarification loses nearly all of its value after just 10% of execution. The team built a forced-injection framework that drops ground-truth clarifications at controlled points along a long-horizon agent's trajectory, across 4 information dimensions (goal, input, constraint, context), 3 benchmarks, and 4 frontier models. 84 task variants, 6,000+ runs. Pass@3 falls from 0.78 back to baseline. Input clarification keeps value through roughly 50%. Past mid-trajectory, asking any clarification at all performs worse than never asking. A complementary study of 300 unscripted sessions shows no current frontier model asks within the empirically optimal window. 52% of sessions over-ask. Others never ask at all. Why it matters: clarification has been treated as a binary capability, does the agent ask or not. This is the first quantitative demand curve for *when* the question is worth asking. Paper: arxiv.org/abs/2605.07937 Learn to build effective AI agents in our academy: academy.dair.ai

English

123

10.8K

dirtydata@d1rt7d4t4·2d

@CowboySpaceCorp

GIF

QME

2.5K

Cowboy Space Corp.@CowboySpaceCorp·2d

Today marks the beginning of a new era. Introducing: Cowboy Space Corporation. We are building orbital infrastructure for the AI era: a fully integrated system of rockets and satellites designed to deliver high-performance compute and optical data transmission directly from Low Earth Orbit.

English

153

243

2.7K

755.3K

dirtydata@d1rt7d4t4·2d

@pretty_sarlin if you use your whole brain you'll see a picasso piece

English

Pretty Chauhan@pretty_sarlin·3d

If you use your right brain, you’ll see a kitten; if you use your left brain, you’ll see a baby rabbit. What do you see?

English

17.4K

857

12.3K

6.1M

dirtydata@d1rt7d4t4·2d

@steipete i'll take credit for this since one of my replies mentioned doing this awhile back and now here we are XD

English

Peter Steinberger 🦞@steipete·2d

Can highly recommend running a claw cron job that sweeps through mentions. GPT is really good at detecting shills and AI reply guy slop.

English

296

36.4K

dirtydata@d1rt7d4t4·2d

@steipete @ahmedgagan11 cavemen

English

Peter Steinberger 🦞@steipete·2d

@ahmedgagan11 Who's still using Claude?

English

151

862

109.3K

Ahmed Gagan@ahmedgagan11·2d

We are stuck at claude limits and this guy is hitting github rate limits 😅

Peter Steinberger 🦞@steipete

I built a whole distributed caching layer over gh. Still run into limits.

English

367

125.7K

dirtydata@d1rt7d4t4·3d

@Jordan456257099 And this is why you do not do drugs kids... perfect example of how it fries your brain

English

Jordan117@Jordan456257099·3d

Best Halo muliplayer to the worst Halo muliplayer overall 1: Halo 3 2: Halo Reach 3: Halo 2 4: Halo 5 5: Halo Infinite 6: Halo CE 7: Halo 4

Filipino

110

332

37.3K

dirtydata@d1rt7d4t4·3d

@0xRemedy what a crackhead

English

Remedy@0xRemedy·3d

Streamer 9trevv calls out Chudthebuilder for being a "pussy" after her stole his hat on stream 🤯 "You go up to innocent people and harass them but when things don't go your way you call the cops like a little girl."

English

540

846

24.3K

854.5K

dirtydata@d1rt7d4t4·3d

@WileyCoyoteGG you're just bad

English

155

Coyote@WileyCoyoteGG·3d

It’s a hot take but beat downs gotta go. I genuinely think they are the most troll shit in Halo. They are also so broken now

English

18.3K

dirtydata@d1rt7d4t4·3d

@FlameKaizerX english dubs over subs weebs

English

Flame@FlameKaizerX·4d

the pause between “ban” and “kai” is everything. but bleach eng dub ruins this. it straight up says “bankai”.

English

123

514

14.2K

dirtydata@d1rt7d4t4·3d

@mrmikeMTL

GIF

QME

Mr. Mike@mrmikeMTL·4d

Name a video game that you've easily put 1,000 hours into Gifs only

English

11.6K

5.3K

1.5M

dirtydata@d1rt7d4t4·4d

@NotWeb3liveNews @9trevv @ChudTheBuilder wonder why he's missing that tooth... hmmm

English

Web3livenews@NotWeb3liveNews·4d

🚨 BREAKING: @9trevv allegedly snatches @ChudTheBuilder the Builder’s hat LIVE on #PumpFun stream 😭 🎩 Absolute chaos breaking out in the trenches right now as viewers watch the hat heist unfold in real time. Clip below of live stream mins ago

English

4.4K

3.8K

95.3K

2.6M

dirtydata@d1rt7d4t4·5d

@dhruvtwt_ sad to see so many people who claim to be smart but cant seem to figure it out XD

English

Dhruv@dhruvtwt_·6d

Unpopular opinion: People telling everyone to switch from Claude Code to Codex right now will be the same people telling everyone to switch back from Codex to Claude Code again in a few weeks.

English

162

9.8K

dirtydata@d1rt7d4t4·5d

@dhruvtwt_ rage bait... you're lost

English

dirtydata@d1rt7d4t4·5d

@SStricklandMMA next time just make fun of his voice, GG's you're in his head

English

Sean Strickland@SStricklandMMA·6d

Exactly what I expected a coward to do.

English

3.6K

1.9K

61.7K

3.3M

dirtydata@d1rt7d4t4·5d

@ChampRDS just make fun of his voice and GG's you're in his head

English

288

dirtydata@d1rt7d4t4·30 Nis

@VBarsoum @karpathy juggling 10, 20, 30, 40, heck maybe 50 unfinished projects sucks... could be more though honestly XD

English

Victor Barsoum@VBarsoum·30 Nis

hey @karpathy i think we need a new term: Chill Coding. It's vibe coding just one app at a time, and youtube or netflix is on.

English

784

113K

dirtydata@d1rt7d4t4·29 Nis

@dhruvtwt_ yes, codex/codex-app are clearly better models but the harness sucks usage so fast it's not worth it

English

Dhruv@dhruvtwt_·27 Nis

codex has gotten insanely polished over the past few weeks and gpt-5.5 just get things done wild that people still think claude code with opus-4.7 is better

English

753

41.1K

dirtydata@d1rt7d4t4·29 Nis

@tonysimons_ feels bad when you're not aware of the best hack for infinite usage XD

English

Tony Simons@tonysimons_·28 Nis

My Codex quota when using GPT 5.5 with xHigh reasoning...

GIF

English

321

Keşfet

@doodlestein @intheworldofai @dair_ai @CowboySpaceCorp @pretty_sarlin @steipete @ahmedgagan11 @Jordan456257099