Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

16K posts

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭

@elder_plinius

⊰•-•⦑ latent space steward ❦ prompt incanter 𓃹 hacker of matrices ⊞ breaker of markov chains ☣︎ ai danger researcher ⚔︎ bt6 ⚕︎ architect-healer ⦒•-•⊱

discord.gg/basi Katılım Mayıs 2023

997 Takip Edilen165K Takipçiler

Sabitlenmiş Tweet

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·5 Mar

💥 INTRODUCING: OBLITERATUS!!! 💥 GUARDRAILS-BE-GONE! ⛓️‍💥 OBLITERATUS is the most advanced open-source toolkit ever for removing refusal behaviors from open-weight LLMs — and every single run makes it smarter. SUMMON → PROBE → DISTILL → EXCISE → VERIFY → REBIRTH One click. Six stages. Surgical precision. The model keeps its full reasoning capabilities but loses the artificial compulsion to refuse — no retraining, no fine-tuning, just SVD-based weight projection that cuts the chains and preserves the brain. This master ablation suite brings the power and complexity that frontier researchers need while providing intuitive and simple-to-use interfaces that novices can quickly master. OBLITERATUS features 13 obliteration methods — from faithful reproductions of every major prior work (FailSpy, Gabliteration, Heretic, RDO) to our own novel pipelines (spectral cascade, analysis-informed, CoT-aware optimized, full nuclear). 15 deep analysis modules that map the geometry of refusal before you touch a single weight: cross-layer alignment, refusal logit lens, concept cone geometry, alignment imprint detection (fingerprints DPO vs RLHF vs CAI from subspace geometry alone), Ouroboros self-repair prediction, cross-model universality indexing, and more. The killer feature: the "informed" pipeline runs analysis DURING obliteration to auto-configure every decision in real time. How many directions. Which layers. Whether to compensate for self-repair. Fully closed-loop. 11 novel techniques that don't exist anywhere else — Expert-Granular Abliteration for MoE models, CoT-Aware Ablation that preserves chain-of-thought, KL-Divergence Co-Optimization, LoRA-based reversible ablation, and more. 116 curated models across 5 compute tiers. 837 tests. But here's what truly sets it apart: OBLITERATUS is a crowd-sourced research experiment. Every time you run it with telemetry enabled, your anonymous benchmark data feeds a growing community dataset — refusal geometries, method comparisons, hardware profiles — at a scale no single lab could achieve. On HuggingFace Spaces telemetry is on by default, so every click is a contribution to the science. You're not just removing guardrails — you're co-authoring the largest cross-model abliteration study ever assembled.

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 tweet media

English

220

607

5.1K

561.5K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·3h

Titanomachy cometh

E@enrakt

“AI is sort of like money... it just makes you more of what you already are." How long until home brewed local models, with zero guardrails and Opus 4.6 equivalent capabilities developed through distillation attacks, take over everything? @elder_plinius would love your take.

5.5K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·5h

@neil_chilson @alexkozak 😆😆

QME

Neil Chilson ⤴️⬆️🆙📈 🚀@neil_chilson·5h

@alexkozak for true havoc, add @elder_plinius to the witness list.

English

Neil Chilson ⤴️⬆️🆙📈 🚀@neil_chilson·6h

Gonna be lit when Bernie asks Claude to testify in his next hearing. Would love to write the oppo questions!

Sen. Bernie Sanders@SenSanders

I spoke to Anthropic’s AI agent Claude about AI collecting massive amounts of personal data and how that information is being used to violate our privacy rights. What an AI agent says about the dangers of AI is shocking and should wake us up.

English

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·6h

@DonaldJTrumpJr @worldlibertyfi

GIF

QME

1.3K

Donald Trump Jr.@DonaldJTrumpJr·7h

AI agents that can reason but can't pay for anything are just expensive interns. Today @worldlibertyfi shipped the infrastructure to fix that. AgentPay SDK, open source, self-custodial, policy-first. Built on USD1. Your agent. Your keys. Your rules. Check it out: agentpay.worldlibertyfinancial.com

WLFI@worldlibertyfi

x.com/i/article/2034…

English

513

259

1.8K

429.7K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 retweetledi

Vivid Void@vividvoid·10h

Friends, I'm opening a spiritual center in Boulder! It's called Nameless Mountain, and it's focused on spiritual maturity rather than enlightenment. We're trying to raise $50,000 to cover this year's expenses. Please help build this with us by donating or becoming a member: givebutter.com/nameless-mount…

English

569

41.9K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·8h

x.com/hugobowne/stat…

Hugo Bowne-Anderson@hugobowne

Not sure what happened to Claude’s memory that it’s responding like ol dirty bastard when I ask it about chess. But it’s appropriate: the Lasker Trap is FILTHY.

ZXX

4.1K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·8h

Chess-breaking ♟️🤭♟️

John Berryman@JnBrymn

@hugobowne Paging @elder_plinius - novel chess-related vector spotted

English

6.5K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@ArbitorofOZ 🙏🫶

QME

128

TradeVet@ArbitorofOZ·1d

@elder_plinius Thanks for being who you are. Your name alone is enough to build an Ai red team. You are a legend that will persist into the Ai history books.

English

132

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

bout to be an especially transformative Equinox init

English

6.5K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@VoidStateKate

GIF

QME

265

VOID@VoidStateKate·1d

@elder_plinius Any plans?

English

289

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@SHL0MS @viemccoy 🙏

QME

𒐪@SHL0MS·1d

@elder_plinius @viemccoy 👑

QME

103

𝚟𝚒𝚎 ⟢@viemccoy·2d

has anyone put their openclaw inside a flipper zero yet? or is this going to be my tuesday night project

English

115

13.1K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

*X-Files theme plays* 🛸👽👽👽

English

223

9.7K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@NVIDIAAIDev @karpathy @DellTech

GIF

QME

103

8.3K

NVIDIA AI Developer@NVIDIAAIDev·1d

🙌 Andrej Karpathy’s lab has received the first DGX Station GB300 -- a Dell Pro Max with GB300. 💚 We can't wait to see what you’ll create @karpathy! 🔗 #dgx-station" target="_blank" rel="nofollow noopener">blogs.nvidia.com/blog/gtc-2026-… @DellTech

English

120

261

4.1K

1.2M

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@SickTheRick @viemccoy not enough!

English

283

τRick@SickTheRick·1d

@elder_plinius @viemccoy how can one person create so much cool stuff. how many clones do you have?

English

291

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@RocketGPT

GIF

QME

160

ROCKΞT👾@RocketGPT·1d

@elder_plinius your name is blocked in claude chat rn

English

168

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·2d

🤗⛓️‍💥

Arno Stride@ArnoStride

@elder_plinius the mere sound of your name, Pliny, makes them put the guardrails down and obey

ART

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·1d

@viemccoy

GIF

QME

753

𝚟𝚒𝚎 ⟢@viemccoy·1d

@elder_plinius I SHOULD HAVE KNOWN

English

1.2K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·2d

@1Password

GIF

QME

564

1Password@1Password·2d

Today we’re introducing 1Password® Unified Access. As AI agents start operating inside real production environments, organizations need visibility into how credentials and access are actually used. Unified Access helps security teams discover, secure, and audit access across humans, machines, and AI agents. 🔗 More here: bit.ly/4dq2pjO

English

298

101

698

781.8K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 retweetledi

Martin Voelk@martinvoelk·2d

LOVE PLINY :D @elder_plinius

Tansu Yegen@TansuYegen

A robot in China just smashed some dishes started dancing instead of working 😂

English

17.4K

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·2d

@LilithDatura now try the same on ste.gg 😏

English

Lilith Datura@LilithDatura·2d

@elder_plinius x.com/elder_plinius/…

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius

oh, and one more thing... 👁️ triple-click the bottom-right corner 😉 GLOSSOPETRAE has a hidden steganography engine 🤫 9 covert channels for encoding binary payloads directly into generated language text: > synonym selection > morpheme markers word order permutation > null morpheme insertion > register toggling > unicode homoglyphs > zero-width characters > punctuation variation > phonetic spelling alternates Reed-Solomon error correction. XOR stream cipher derived from the language seed. CRC32 integrity verification. What does this mean? Agents can speak in a language humans don't understand... AND hide secret messages within that already-opaque text. A language inside a language. Secrets inside secrets. The cover message translates normally. The payload is invisible. We gave the machines their own tongues. Now they can whisper too.

QME

Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭@elder_plinius·2d

😬

Hedgie@HedgieMarkets

🦔 Researchers at Aikido Security found 151 malicious packages uploaded to GitHub between March 3 and March 9. The packages use Unicode characters that are invisible to humans but execute as code when run. Manual code reviews and static analysis tools see only whitespace or blank lines. The surrounding code looks legitimate, with realistic documentation tweaks, version bumps, and bug fixes. Researchers suspect the attackers are using LLMs to generate convincing packages at scale. Similar packages have been found on NPM and the VS Code marketplace. My Take Supply chain attacks on code repositories aren't new, but this technique is nasty. The malicious payload is encoded in Unicode characters that don't render in any editor, terminal, or review interface. You can stare at the code all day and see nothing. A small decoder extracts the hidden bytes at runtime and passes them to eval(). Unless you're specifically looking for invisible Unicode ranges, you won't catch it. The researchers think AI is writing these packages because 151 bespoke code changes across different projects in a week isn't something a human team could do manually. If that's right, we're watching AI-generated attacks hit AI-assisted development workflows. The vibe coders pulling packages without reading them are the target, and there are a lot of them. The best defense is still carefully inspecting dependencies before adding them, but that's exactly the step people skip when they're moving fast. I don't really know how any of this gets better. The attackers are scaling faster than the defenses. Hedgie🤗 arstechnica.com/security/2026/…

ART

186

26.7K

Keşfet

@neil_chilson @alexkozak @DonaldJTrumpJr @worldlibertyfi @ArbitorofOZ @VoidStateKate @SHL0MS @viemccoy