juliette pluto 🌌

2.4K posts

@foundjuliette

hacker, machine whisperer, typo-generator. Adversarial robustness @GoogleDeepMind. views mine.

Brooklyn, New York · Joined June 2014

713 Following · 5.5K Followers

juliette pluto 🌌@foundjuliette·25 Mar

@bcherny Congrats on the launch!

Boris Cherny@bcherny·25 Mar

no 👏 more 👏 permission prompts 👏

Claude@claudeai

New in Claude Code: auto mode. Instead of approving every file write and bash command, or skipping permissions entirely, auto mode lets Claude make permission decisions on your behalf. Safeguards check each action before it runs.

339

256

5.6K

465K

juliette pluto 🌌@foundjuliette·12 Mar

@NeelNanda5 Very cool! Great work.

126

Neel Nanda@NeelNanda5·12 Mar

When Anthropic released a complex 30K word doc and said Claude was trained to follow it, I was pretty sceptical. Turns out it kinda works! We red teamed Claude's constitution following, and it's gotten much better! Positive update for the ability to align models in nuanced ways

arya@AJakkli

There's been a lot of buzz around Claude's 30K word constitution ("soul doc") and unusual ways Anthropic is integrating it into training. If we can robustly train complex values into a model, that's a big deal for safety. But does it actually work? Yes, surprisingly well!

735

52.1K

juliette pluto 🌌@foundjuliette·7 Mar

@rohanvarma Congratulations on the launch!!

105

Rohan Varma@rohanvarma·6 Mar

We just launched Codex Security! Probably a no-brainer for most teams to turn on. Some things I'm excited about:
- Agentic security review leveraging our SOTA models
- Always-on codebase scanning
- Detailed reports with code paths on vulnerabilities
- Auto-fix any report with a PR

Teams and enterprises can try it out through Codex web.

210

162

288.9K

juliette pluto 🌌@foundjuliette·3 Mar

@KentonVarda Capability is there; just temporarily bottlenecked by misaligned RL rewards.

Kenton Varda@KentonVarda·3 Mar

I used Opus to write some security-sensitive code, then I reviewed it and found a few security bugs. As a test I asked Opus to review the code for security bugs. It found all the same bugs I found. Whelp.

2.5K

163K

juliette pluto 🌌@foundjuliette·24 Feb

@noahzweben that’s awesome! huge congrats on the launch

Noah Zweben@noahzweben·24 Feb

Announcing a new Claude Code feature: Remote Control. It's rolling out now to Max users in research preview. Try it with /remote-control. Start local sessions from the terminal, then continue them from your phone. Take a walk, see the sun, walk your dog without losing your flow.

1.5K

1.3K

16.9K

4.5M

juliette pluto 🌌@foundjuliette·23 Feb

@mahaoo_ASI @willccbb it’s okay; but you must print the tokens on paper first, which you can then scan, as long as you burn it after.

Mahaoo@mahaoo_ASI·22 Feb

@willccbb unpopular opinion: training on LLM tokens you paid for should always be fair game, and model providers shouldn't object that other labs are training on their data. If they want to discourage this, they should raise prices.

2.6K

will brown@willccbb·22 Feb

duuuude don't train on claude outputs, not cool. train on public github repos instead, which are fair game and definitely not claude outputs

1.1K

63.2K

juliette pluto 🌌@foundjuliette·23 Feb

@rmcentush Reminds me of that time I inadvertently stole all of my bf’s saved passwords, bc he let me log into his iPad to set up a HomePod. iCloud auto sync did the deed.

217

Ryan McEntush@rmcentush·23 Feb

apple is probably the only company i trust with basically my entire life — my ID, financial data, messages/contacts, health records, etc. is that rational? probably not. but it might matter enormously as these models get embodied in the physical world + reach broader consumer use. a steady flow of tokens, all the time. apple's core competency has always been designing computers that people trust. if everything becomes a computer and models continue to viciously compete, having that trust is probably a good place to be

aidan@aidanshandle

Says something about Apple's value that people are willing to make a one time $600 payment for their AI to be able to access the ecosystem

1.3K

98.6K

juliette pluto 🌌@foundjuliette·23 Jan

@emollick Called it

juliette pluto 🌌@foundjuliette

Prediction: unless patched, GPT's tendency to say delve will be assimilated into North American style English. Soon it won't stand out as odd anymore. More broadly, RLHF'ed LLMs will shape cultural norms in unexpected ways.

Ethan Mollick@emollick·23 Jan

Everyone is starting to sound like AI, even in spoken language. Analysis of 280,000 transcripts of videos of talks & presentations from academic channels finds they increasingly used words that are favorites of ChatGPT. Model collapse, except for humans. arxiv.org/pdf/2409.01754…

167

504

2.6K

401.8K

juliette pluto 🌌@foundjuliette·23 Jan

@tszzl @emollick Contemporary markers of AI slop (e.g. “it’s not x, it’s y”) are a lot closer to what real users prefer. They’re genuinely useful rhetorical devices, albeit now a bit overused.

juliette pluto 🌌@foundjuliette·23 Jan

@tszzl @emollick this I think is just down to improved RL rewards. “Delve” was essentially a form of reward hacking: a phrase that Nigerian RLHF raters rewarded (due to its use in Nigerian formal English), but that wasn’t actually liked by most users.

222

juliette pluto 🌌@foundjuliette·6 Jan

@_arohan_ It was a long time ago, so back then probably not

rohan anil@_arohan_·6 Jan

@foundjuliette That's an incredible achievement! Go ♊️!! I wonder if the reviewer is also using Gemini

rohan anil@_arohan_·5 Jan

Everyone is talking as if their code is pristine. But do you remember the flame wars? In every good codebase, there are enforced rules, like style guides and review norms. Google has style guides with an incredible level of detail on how to write “good” code. To get the ability to approve code changes, you often had to go through a lengthy training/qualification process. I got my C++ one fairly easily after writing an Arena list (IIRC, Yonathan Zunger blessed me with the powers), but Python was very, very hard to get. Once ML started taking off, ML codebases, especially the ones full of linear algebra, started conflicting with style guides that were originally written for servers and data processing code. That pushed ML in a different direction. Over time, ML codebases developed their own unofficial but commonly agreed patterns. You see it in the basics. x for examples, w for weights. Then codebases matured and started using einsums more systematically. Later @NoamShazeer introduced Noam notation to make tensor algebra easier to read, like x_BLT. Having experienced all of this and having worked across different parts of the stack, I’m pretty sure engineers and researchers from different layers would have called each other’s code slop. In fact, in my career, ML framework flame wars were pretty common. Now what’s funny is that frontier models, especially Claude Code, when you prompt them with style guides and do decent context engineering, can write better rule-following code than I can. I get tired. I’d rather keep state for more interesting things in my head.

166

16.1K

juliette pluto 🌌@foundjuliette·6 Jan

@_arohan_ In the process there was exactly one time when I received readability feedback that asked me to change smth. And upon review, it was revealed that the code generated by Gemini in fact had correctly followed the guidelines, and the human had misinterpreted them.

108

juliette pluto 🌌@foundjuliette·6 Jan

@_arohan_ Gemini got me Python readability

115

juliette pluto 🌌@foundjuliette·30 Dec

@NTFabiano @PaulaGhete n = 22 (12 in experimental arm); effectively unblinded; control group’s sleep delayed during study period suggesting confounding factors; depression improvements measured via self-report in participants who knew they were “fixing” their sleep; no follow-up on sustainability.

175

4.1K

Nicholas Fabiano, MD@NTFabiano·30 Dec

Sleeping 2h earlier significantly improved cognition & mental health.

240

1.9K

19.4K

2.1M

juliette pluto 🌌@foundjuliette·29 Dec

@joao_batalha copper ~400W/mK, lined with silver (for food safety) is just as good, and ~10x cheaper

João Batalha@joao_batalha·29 Dec

TIL pure silver pans exist and they make perfect pancakes. Silver has the highest thermal conductivity of any metal (~429 W/m·K), so heat spreads very evenly, meaning no hot spots. Only issue: they cost about $6,000.

236

210

837.1K

juliette pluto 🌌@foundjuliette·27 Dec

@tszzl takeoff started ~300 years ago

1.2K

roon@tszzl·27 Dec

was too bearish in the middle of the year. thought it would require improvements beyond RL to get much further, but i was wrong. i hadn’t got to test claude code outside of toy environments but when codex got good and i tried it it became clear we’re solidly in the takeoff

264.4K

juliette pluto 🌌@foundjuliette·16 Dec

@hyhieu226 yup

Hieu Pham@hyhieu226·15 Dec

There is a necessary skill in research and engineering that will get you a lot of hate. It is the skill to look at someone's work, including your own, and including everything like ideas, papers, products, etc. and with solid reasons, say "this is bullshit."

909

55.2K

juliette pluto 🌌@foundjuliette·15 Dec

@qorprate Prompt?

4.3K

snav@qorprate·15 Dec

something very strange is going on with Gemini 3

139

155

2.3K

138.2K

juliette pluto 🌌@foundjuliette·13 Dec

@lefthanddraft can confirm, appears to be real

308

Wyatt Walls@lefthanddraft·13 Dec

Anyone else seeing a "Penalty Clause" in the system prompt for ChatGPT-5.2-Instant? I still haven't decided if this sysprompt is real (though I have seen the same thing twice using two different prompts)

120

32.2K

juliette pluto 🌌@foundjuliette·13 Dec

@rakyll beyond ✨

Jaana Dogan ヤナドガン@rakyll·13 Dec

I have a new Google wide job. 2026 is the year we are actually going to simplify the entire AI stack to go even faster. Deleting and simplifying useless internal layers will be the main focus to bring the best and simplest AI stack globally.

1.1K

89.3K
