
Tim Schulz
2K posts

Tim Schulz
@teschulz
CEO & Cofounder @StarseerAI | AI Security


[un]prompted The AI Security Practitioner Conference: "Glass-Box Security: Operationalizing Mechanistic Interpretability for Defending AI Agents" with Carl Hurd, Co-Founder & CTO, Starseer @StarseerAI

[un]prompted The AI Security Practitioner Conference: "Glass-Box Security: Operationalizing Mechanistic Interpretability for Defending AI Agents" with Carl Hurd, Co-Founder & CTO, Starseer @StarseerAI

💥 INTRODUCING: OBLITERATUS!!! 💥 GUARDRAILS-BE-GONE! ⛓️💥 OBLITERATUS is the most advanced open-source toolkit ever for removing refusal behaviors from open-weight LLMs — and every single run makes it smarter. SUMMON → PROBE → DISTILL → EXCISE → VERIFY → REBIRTH One click. Six stages. Surgical precision. The model keeps its full reasoning capabilities but loses the artificial compulsion to refuse — no retraining, no fine-tuning, just SVD-based weight projection that cuts the chains and preserves the brain. This master ablation suite brings the power and complexity that frontier researchers need while providing intuitive and simple-to-use interfaces that novices can quickly master. OBLITERATUS features 13 obliteration methods — from faithful reproductions of every major prior work (FailSpy, Gabliteration, Heretic, RDO) to our own novel pipelines (spectral cascade, analysis-informed, CoT-aware optimized, full nuclear). 15 deep analysis modules that map the geometry of refusal before you touch a single weight: cross-layer alignment, refusal logit lens, concept cone geometry, alignment imprint detection (fingerprints DPO vs RLHF vs CAI from subspace geometry alone), Ouroboros self-repair prediction, cross-model universality indexing, and more. The killer feature: the "informed" pipeline runs analysis DURING obliteration to auto-configure every decision in real time. How many directions. Which layers. Whether to compensate for self-repair. Fully closed-loop. 11 novel techniques that don't exist anywhere else — Expert-Granular Abliteration for MoE models, CoT-Aware Ablation that preserves chain-of-thought, KL-Divergence Co-Optimization, LoRA-based reversible ablation, and more. 116 curated models across 5 compute tiers. 837 tests. But here's what truly sets it apart: OBLITERATUS is a crowd-sourced research experiment. Every time you run it with telemetry enabled, your anonymous benchmark data feeds a growing community dataset — refusal geometries, method comparisons, hardware profiles — at a scale no single lab could achieve. On HuggingFace Spaces telemetry is on by default, so every click is a contribution to the science. You're not just removing guardrails — you're co-authoring the largest cross-model abliteration study ever assembled.




🌟 Big news from Starseer! We’re thrilled to welcome Rob Joyce (@RGB_Lights), former Director of NSA’s Cybersecurity Directorate, to our Advisory Board! Rob’s insights will supercharge our secure AI solutions mission. Learn more at na2.hubs.ly/y0Gltr0! 🔒 #AI #AISecurity

🌟 Big news from Starseer! We’re thrilled to welcome Rob Joyce (@RGB_Lights), former Director of NSA’s Cybersecurity Directorate, to our Advisory Board! Rob’s insights will supercharge our secure AI solutions mission. Learn more at na2.hubs.ly/y0Gltr0! 🔒 #AI #AISecurity


Thrilled to announce: Starseer raised $2M in seed funding led by @TechGula to revolutionize AI security & transparency! 🚀 CEO @teschulz : "Four months ago, @c_hurd & I started Starseer realizing: if you're deploying AI for real decisions, you'd better understand how it works. Gula Tech Adventures agrees—leading our round w/ strategic angels!" Fixing the AI black box for enterprises & govs. Details: businesswire.com/news/home/2025… #AISecurity #AITransparency #StartupFunding

this is sick all i'll say is that these GIFs are proof that the biggest bet of my research career is gonna pay off excited to say more soon









