Sesem_ Ag
249 posts











Gemini Gemini Gemini Gem







Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.








🚨 GPT 5.5 spotted in codex cli and app thursday launch > btw gpt 5.5 will be 3-4 times the usual gpt 5.4 > image v2 will help in better webdev



Learn more: ollama.com/library/gemma4

Show me a worse casting. I’ll wait…











