
@StefanoErmon Been advising Inception. Mercury 2 is live: first reasoning diffusion LLM, ~1,000 tokens/sec, 5x faster. Not a tuning trick, a different architecture. @stefanoermon helped invent diffusion. His team made it work at production scale. Pay attention.
English












