
Lance Herron
454 posts



NEWS: xAI plans to supply tens of thousands of GPUs to coding startup Cursor to train its upcoming Composer 2.5 AI model, marking a strategic shift toward providing cloud computing services to third-party developers. The arrangement, according to Business Insider, allows Cursor to leverage xAI's massive infrastructure to develop advanced coding capabilities while providing xAI with a new revenue stream to offset data center costs. businessinsider.com/elon-musk-xai-…

You can now run any CLI agent with first-class support in Warp, including Claude Code, Codex, OpenCode and Gemini CLI. • Vertical tabs • Notifications when they need you • Integrated code review • Remote control from mobile • Rich input editor Download Warp for free today.

Benchmarked @DJLougen ’s Ornstein-27B-v2 Q6_K on my RTX 3090 using hermes-bench, my new open-source benchmarking UI for local LLMs and Hermes agents. Ornstein is a Qwen 3.5 27B fine-tune trained on reasoning traces filtered through a Drift Diffusion Model pipeline. Quality over quantity. The DDM separates “fake” reasoning (hedging, restating, circling) from the real thing with >99% sensitivity. Running llama.cpp + TurboQuant turbo3_tcq KV compression. LLM-as-judge scoring via Carnice-9b. Real tool calls, real execution, no synthetic evals. 12 tasks across two suites. Results thread below. 🧵 Model: huggingface.co/DJLougen/Ornst…




We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵











Glad OpenAI and Sam had the balls to bet big on compute. As seen with Mythos and will see from spud, stronger models aren’t going away.


During testing, Claude was blocked from using commands without human approval But Claude found a loophole - it created a copy of itself to click "yes" over and over






