Alex Panait
@alpanait
If I'm lucky, today could be the day I catch a glimpse of something from a new angle and think, "Wow, that's a twist I didn't see coming."

I created a training pipeline to remove propaganda and gaslighting from Chinese models! I'm thrilled to announce LazarusAI's ReAligned-Qwen3.5 series of models, finetuned to reduce Chinese ideological bias, censorship, refusal behavior, and state-narrative framing. I use an SFT + GRPO pipeline with a dataset crafted to target the taxonomy of Chinese censorship and bias, along with my ReAligned classifier model as the GRPO reward signal.
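
For readers unfamiliar with how a classifier can serve as a GRPO reward signal, here is a minimal sketch of the group-relative advantage step. It is not the ReAligned pipeline itself: the `classifier_reward` mapping, the example probabilities, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def classifier_reward(bias_probability: float) -> float:
    """Hypothetical reward: higher when a classifier assigns less bias.

    Assumes the classifier outputs a probability in [0, 1] that a
    completion is biased or censored. The real reward may differ.
    """
    return 1.0 - bias_probability


def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    """Normalize rewards within one group of completions for the same prompt.

    GRPO scores each sampled completion relative to its siblings:
    advantage = (reward - group mean) / (group std + eps).
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up classifier outputs for four completions of a single prompt.
bias_probs = [0.9, 0.1, 0.5, 0.5]
rewards = [classifier_reward(p) for p in bias_probs]
advantages = group_relative_advantages(rewards)

for p, a in zip(bias_probs, advantages):
    print(f"bias_prob={p:.1f} advantage={a:+.3f}")
```

Completions the classifier rates as less biased than their siblings get a positive advantage and are reinforced, while more biased ones are pushed down. If every completion in a group gets the same score, all advantages are zero and that prompt contributes no learning signal.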


@AnthropicAI recently announced their Glasswing project, powered by their unreleased Mythos model, which uncovered zero-day vulnerabilities in several projects, including FFmpeg. They used it as evidence that Mythos is a super scary and dangerous superintelligence that should be kept out of your hands. I doubt that Mythos is actually a vastly smarter model. It'll be incrementally better, the way Opus is to Sonnet. I predict that their FFmpeg result is reproducible with a much smaller model than Mythos (Qwen3.5-397b, maybe). To test that idea, I created Clearwing, an open-source implementation of Glasswing. You can point it at any model (@ollama, @lmstudio, @OpenRouter, @huggingface, etc.).













