GlitchNarwal

941 posts

GlitchNarwal

@JamKo_Ams

Dull boy, work all day

Amsterdam Katılım Şubat 2010

292 Takip Edilen101 Takipçiler

GlitchNarwal@JamKo_Ams·1d

@claudeai @lydiahallie Bro’s cooking

English

Claude@claudeai·1d

New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.

English

2.8K

38.1K

GlitchNarwal@JamKo_Ams·1d

@_jaydeepkarale Also not on a VM?

English

Jaydeep@_jaydeepkarale·2d

90% companies dont allow this btw

Claude@claudeai

You can now enable Claude to use your computer to complete tasks. It opens your apps, navigates your browser, fills in spreadsheets—anything you'd do sitting at your desk. Research preview in Claude Cowork and Claude Code, macOS only.

English

249

371.8K

GlitchNarwal@JamKo_Ams·1d

@JeroenBreevoort @cursor_ai Need to RL composer to effectively use it I guess?

English

893

Jeroen Breevoort@JeroenBreevoort·1d

@cursor_ai Cursor is not doing this, It's the Figma MCP that is doing this right?

English

7.4K

Cursor@cursor_ai·1d

Cursor can now create new components and frontends in Figma using your team's design system.

English

109

269

3.7K

590.9K

GlitchNarwal@JamKo_Ams·2d

@slow_developer GPT 5.4 is behind 5.3 Codex for 99% of coding tasks

English

Haider.@slow_developer·3d

anthropic keeps doing this thing where its models are brilliant at launch, then much worse a month later opus 4.6 now is far behind gpt-5.3 codex, and even more behind gpt-5.4 high when working on large codebases not surprised though this has been happening since sonnet 4

English

141

1.7K

160.4K

GlitchNarwal@JamKo_Ams·2d

@clairevo @openclaw Why would other people tell what you’ll be spending on your openclaw…

English

claire vo 🖤@clairevo·3d

since no one else will say it: i'm on my way to spending $15-25k this year on my @openclaw agents

English

287

55.7K

GlitchNarwal@JamKo_Ams·2d

@AliR_Ahmadi x.com/jamko_ams/stat…

GlitchNarwal@JamKo_Ams

@business Funny how there hasn’t been a fire on a carrier for almost 20 years but Iran doesn’t has anything to do with this one. Also what’s up with this new trend where everybody in Israel/UAE that shares rocket impacts is getting indicted?

QME

Ali Ahmadi@AliR_Ahmadi·3d

If you buy what Centcom is selling, this is the most accident-prone the US military has been in its entire history. The F-15 entered service 60 years ago and its never been downed during a war. Now 5 of them have been downed by "malfunction" and "friendly fire" in 2 weeks.

China Global@ChinaGlobl

🚨 Breaking | U.S. Central Command: “Iran’s claim that it shot down an F-15 aircraft is a ‘rumor.’ It was a technical malfunction.”

English

317

28.4K

GlitchNarwal@JamKo_Ams·2d

@TheDefiantGhost How am I seeing Tucker Carlson doing good shit everywhere while in my mind he was a right wing extremist?

English

Defiant Ghost@TheDefiantGhost·3d

Remember when Tucker Carlson straight-up silenced Mark Cuban? Mark: "Half my family is Ukrainian. I think we should help." Tucker: "How much money have you sent to Ukraine?" Mark: "None." Tucker: "So what do you mean by we? If you think we need to help, why don't you start? How about you first?" This never gets old.

English

1.2K

8.9K

345.9K

GlitchNarwal@JamKo_Ams·3d

@archiexzzz @karpathy This kind of thing + automated scripted voice models + scammers are going to be a pain in the but in the comming years

English

Archie Sengupta@archiexzzz·15 Mar

Introducing AutoVoiceEvals I've applied the @karpathy autoresearch loop to voice AI agents. It's open source. Your voice agent has a system prompt. That prompt determines how it handles every call - bookings, complaints, edge cases, background noises, long pauses, people trying to trick it. Most teams write it once, test manually, and hope for the best. autovoiceevals makes it a loop. One artifact (system prompt), one metric (adversarial eval score), keep what improves it, revert what doesn't. Run it overnight. Wake up to a better agent. > How it works: You describe your agent in a config file - what it does, its services, policies, and what it should never do. You don't write test cases. You don't define attack vectors. provider: vapi / smallest ai assistant: id: "your-agent-id" description: | Voice receptionist for a hair salon. Maria does coloring only. Jessica does cuts only. $25 cancellation fee under 24 hours notice. Cannot advise on skin conditions. Closed Sundays. From that description alone, Claude generates adversarial caller personas - each with an attack strategy, a voice profile (accents, background noise, mumblers, interrupters), a multi-turn caller script, and pass/fail evaluation criteria. The eval suite is generated once and held fixed for the entire run, like a validation set. > The loop: 1. Read the agent's current prompt from the platform 2. Generate adversarial eval suite from your description 3. Run baseline 4. Claude proposes ONE surgical change to the prompt 5. Push the modified prompt to the agent via API 6. Run all scenarios against the updated agent 7. Score improved? Keep. Same score but shorter prompt? Keep. Otherwise revert. 8. Go to 4. Run until Ctrl+C. The system sees its own experiment history. When a change fails, the next proposal knows what was tried and why it didn't work. We ran 20 experiments on a live Vapi dental scheduling agent. 0 human intervention. > Score: 0.728 → 0.969 (+33%) > CSAT: 45 → 84 > Pass rate: 25% → 100% > 9 kept, 10 discarded > Prompt: 1191 → 1139 chars (better AND shorter) You describe your agent. It figures out how to break it.

English

1.2K

276.1K

GlitchNarwal@JamKo_Ams·5d

@VantagePoi64328 @tim_cook lol. Everybody in here got rage baited hardcore

English

Vantage Point@VantagePointHQx·5d

@tim_cook Can you make a Mac that doesn’t become a potato after 4 years?

English

60.8K

Tim Cook@tim_cook·5d

Mac just had its best launch week ever for first-time Mac customers. We love seeing the enthusiasm!

English

1.3K

1.4K

30K

5.1M

GlitchNarwal@JamKo_Ams·6d

@cursor_ai @GeoffreyHuntley Everybody who criticises Cursor: “don’t get met wrong, I use Cursor and Love it, but:”

English

Cursor@cursor_ai·6d

We're also sharing an early alpha of our new interface. cursor.com/glass

English

142

117

744.4K

Cursor@cursor_ai·6d

Composer 2 is now available in Cursor.

English

634

896

9.8K

5.3M

GlitchNarwal@JamKo_Ams·19 Mar

@GeospyAI crazy. How are you trying to make cash out of this slop app when literally every button on your site is broken.

English

GlitchNarwal@JamKo_Ams·19 Mar

@protosphinx They made us promise to undersell it so Nvidia can have 70% margins

English

sphinx@protosphinx·18 Mar

ASML is alien tech. The aliens bequeathed it to the Dutch. Ain’t no way anyone else is getting that tech.

The AI Investor@The_AI_Investor

Rumor: Tesla is going to build its own EUV lithography machines because ASML is “too slow.” to supply to the new chip plant. ASML currently produces <100 EUV machines per year. Apparently that’s not fast enough for Elon Musk. The new plan: • 1,000 EUV machines per week starting 2027 • 10,000 per week by 2028 And since HBM memory is also a bottleneck, Tesla will casually build 10 new memory fabs in the next two years producing 5× the rest of the industry combined. Meanwhile, Micron Technology, SK Hynix, and Samsung Electronics are panicking.

English

128

223

5.9K

503.5K

GlitchNarwal@JamKo_Ams·19 Mar

English

1.9K

Bloomberg@business·18 Mar

The USS Gerald R. Ford aircraft carrier is leaving the fight with Iran and heading back to port, a source said, after a fire broke out in its laundry area and left at least two sailors with non-life-threatening injuries. bloomberg.com/news/articles/…

English

178

536

1.8K

577.7K

GlitchNarwal@JamKo_Ams·19 Mar

@MarioNawfal Funny how there hasn’t been a fire on a carrier for almost 20 years but Iran doesn’t has anything to do with this one. Also what’s up with this new trend where everybody in Israel/UAE that shares rocket impacts is getting indicted?

English

Mario Nawfal@MarioNawfal·18 Mar

🚨 BREAKING: The USS Ford is now leaving combat after a a major fire broke out last week injuring ~200 sailors and knocking out about out ~100 bunsleeping quarters. It took 30 hours to fully control the blaze Iran claim they struck the aircraft carrier. The U.S. states damage is non-combat related Not sure what to believe anymore

English

1.6K

2.4K

12.7K

GlitchNarwal@JamKo_Ams·19 Mar

@ericzakariasson @branmcconnell @cursor_ai Correction… codex 5.3 is a better model, And cheaper 💸. 5.4 supposedly incorporated codex coding skills is bs.

English

eric zakariasson@ericzakariasson·18 Mar

@branmcconnell @cursor_ai tbh gpt 5.4 is a better model (except for ui)

English

266

13K

Brandon McConnell@branmcconnell·17 Mar

.@cursor_ai JUST CHARGE ME DOUBLE PLEASE

English

17.2K

GlitchNarwal@JamKo_Ams·18 Mar

@trq212 I do keep wondering how long these kinds of things will be necessary. They feel overly manual and prime targets to bake into models capabilities, and a lot of it will be obsolete once memory is - finally - evolving into a more robust system

English

Thariq@trq212·17 Mar

x.com/i/article/2033…

ZXX

371

2.2K

16K

6.6M

GlitchNarwal@JamKo_Ams·17 Mar

@sarahwooders Same. Blew away tons of tokens on mid gpt performance. Why did they even claim to have integrated Codex performance into the model 🤷🏻‍♂️

English

Sarah Wooders@sarahwooders·17 Mar

To the people who had early access to GPT-5.4 and told us all it was amazing for coding - did you even use it?? I'm fully back to Codex 5.3

English

338

86K

GlitchNarwal@JamKo_Ams·17 Mar

@TheFl0orIsLaVa What oil? 🤷🏻‍♂️

English

Kadi🇪🇪🌻@TheFl0orIsLaVa·16 Mar

USA: We're lifting sanctions on russian oil Ukraine: Hold my beer 🍺💥

English

105

1.1K

7.5K

52.2K

GlitchNarwal@JamKo_Ams·17 Mar

@maddenifico You guys got played so hard. It’s been crazy to see al this shit go down from outside the US

English

Bill Madden@maddenifico·17 Mar

This is a great idea. 👇

English

1.6K

10.9K

47.5K

348.9K

GlitchNarwal@JamKo_Ams·16 Mar

@leerob Couple more years and we don’t even have to let ‘em play, we’ll just have ASI simulate based on probabilities and do NIL payouts accordingly

English

143

Lee Robinson@leerob·16 Mar

I built an app to simulate the 2026 NCAA tournament! It uses historical data, KenPom rankings, game locations, and more to determine the win probability. ...but then has an AI model review the results and prompt for the reality of March Madness, unpredictable!

English

101

2.3K

425.3K

Keşfet

@claudeai @lydiahallie @_jaydeepkarale @JeroenBreevoort @cursor_ai @slow_developer @clairevo @openclaw