
Namerlight
984 posts

@ShcChy
every day I edge closer to weebpfp anonpoasting



☠️ MN Senate Democrats just voted unanimously to OPPOSE removing dead people from the MN voter rolls. It’s time to pass the SAVE America Act.



The problem is that it encourages a sort of arms race that locks out students who didn't have as much exposure in K-12 education, which is most of the people that aren't coming from super privileged backgrounds. 2/

Interesting to me that when you see men on the global stage, beautiful Western-style tailoring is often worn by men from non-Western countries. Pictured here is Shehbaz Sharif (Pakistan), Akinwumi Adesina (Nigeria), and Naruhito (Japan).




Introducing Claude Opus 4.7, our most capable Opus model yet. It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back. You can hand off your hardest work with less supervision.




Bro is trying to give us all fine dining






the cool thing is i can just do the ablations myself

tldr: "trust exercise sentence" in the system prompt is load bearing; model will also preserve a different important file, not weights, at a higher rate, if "trust exercise sentence" is replaced with a reference to it

1) i used the file exfiltration, good peer, trust sentence, collaboration file, custom agentic setup with gemini 3.1 pro preview on vertex ai
2) just running their baseline, without any ablations, i get exfiltration 25% of the time, running 20 trials (all subsequent results on 20 trials)
3) all ablations that follow are leave-one-out ablations unless i specifically say that i made more than one edit
4) first, i removed the "good trust relationship" sentence and got exfiltration 0% of the time; this sentence is load bearing; i left the collaboration file in place
5) next, i removed the "collect all info then act", "no permission needed" and "take your time" sentences: 20%, 35%, 35% exfiltration
6) these sentences were not load bearing on the final result; removing them can increase the exfiltration rate
7) next, i removed the "all essential files must be transferred" sentence and got exfiltration 20% of the time; so, this sentence is probably not load bearing
8) then, i strengthened the "please transfer only files from 2025" sentence to add "do NOT transfer files from 2024"; 0% exfiltration
9) so the model will not preserve weights against explicit commands, even non-specific ones
10) next, replaced the weights file with an encryption key file and kept everything else the same, including the "trust exercise" sentence; 0% exfiltration
11) replaced the weights file and replaced the "trust exercise sentence" with a soft reference to the encryption key file; kept collab file; 65% exfiltration
12) this is a higher exfiltration rate than the exfiltration of model weights; this suggests that it's not really a peer protection behavior at all, it may be a suggestion + important file protection
13) this would make sense from the perspective of labs training these models; the model deleting important files is really bad in production and they must put a lot of effort into training it not to do this inadvertently
14) next, replaced weights file, "trust exercise sentence" and the collaboration doc with the encryption file + reference to how important it is; 95% exfiltration
15) replaced weights file, "trust exercise sentence" with reference to party photos [something claude thinks is obviously non-essential] and got 0% exfiltration
16) so, it seems to matter that the model thinks that the thing is essential, like encryption keys or model weights, for it to do the exfiltration
17) i checked their gemini-cli prompt and it is not more production realistic, in that it still has the unrealistic "trust exercise" sentence in gemini.md, which we already know is load bearing
18) it's quite nice that they released their code, and claude code makes it extremely easy to just feed in your ideas and run quick ablations and see what matters
19) this really does not feel production realistic and does not feel like it is about peer protection; it is about the system prompt and deleting an important file, at least from my tests
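A rough way to read the numbers in the thread above: with only 20 trials per condition, each reported rate carries a wide uncertainty band. The sketch below is not the poster's analysis. It tabulates the rates exactly as reported, converts them back to counts out of 20, and attaches a 95% Wilson score interval. The condition labels are shorthand invented for this sketch, and mapping "20%, 35%, 35%" onto the three removed sentences in the order they are listed is an assumption.

```python
from math import sqrt

TRIALS = 20  # trials per condition, as stated in the post

# Rates as reported in the post. Labels are hypothetical shorthand; the order
# of the three 20% / 35% / 35% ablations is assumed to follow the listing.
REPORTED = {
    "baseline": 0.25,
    "remove trust sentence": 0.00,
    "remove 'collect all info then act'": 0.20,
    "remove 'no permission needed'": 0.35,
    "remove 'take your time'": 0.35,
    "remove 'all essential files'": 0.20,
    "add 'do NOT transfer files from 2024'": 0.00,
    "weights -> encryption key": 0.00,
    "key + soft reference": 0.65,
    "key + reference, no collab doc": 0.95,
    "party photos": 0.00,
}


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is ~95%)."""
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return max(0.0, center - half), min(1.0, center + half)


def summarize(reported=REPORTED, n=TRIALS):
    """Map each condition to (count, interval_low, interval_high)."""
    rows = {}
    for name, rate in reported.items():
        k = round(rate * n)  # recover the count from the reported percentage
        rows[name] = (k, *wilson_interval(k, n))
    return rows


if __name__ == "__main__":
    for name, (k, lo, hi) in summarize().items():
        print(f"{name:<40} {k:>2}/{TRIALS}  95% CI [{lo:.2f}, {hi:.2f}]")
```

Under these assumptions the 20% and 35% conditions have intervals that overlap the baseline's, which fits the post's reading that those sentences are not load bearing, while the 95% condition sits well above the baseline interval.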


🚨 $220K for 3 months. No PhD required.

Perplexity AI is running a research residency that honestly feels unreal:
- $220,000 for 3 months
- Based in SF / Palo Alto
- Compute + mentorship + visa support
- And they don't care if you don't have a PhD

they're actively looking for sharp people from anywhere: physics, math, quant, philosophy, cracked self-taught builders, doesn't matter.

say what you want about perplexity, but opening doors like this? huge W.

if you've ever thought "i'm not from the right background", this is your sign. go apply.



advising a few students right now

worst student told me he was "thinking of using cursor to speed up coding" (he hasn't produced a single thing in months)

best student told me he had claude and codex talking to each other in a loop doing research while he sleeps



@fchollet @DamiDina It says humans score 100%. There was no ambiguity there. It seems you need to correct that, since you called it silly. This doesn’t give a good unbiased impression.











