Xre




watch gemma 4 12b q8 dancing on a single rtx 3090 at 33 tokens a second average. google dropped this two days ago and it's the kind of thing that quietly moves the floor. a fully multimodal model, text image and audio in one net, 256k context, apache licensed, running entirely on one consumer gpu, no one metering your tokens. what you're watching is the whole loop live: the server streaming tokens top left, the gpu pegged bottom left, the answer landing on the right. all local, all mine. a year ago this needed someone else's datacenter. today it's a card you can buy. open source isn't catching up anymore, it's setting the pace. how fast does yours run?

I just bought a Tesla Model Y. Full Self Driving is one of the most impressive pieces of technology I’ve ever experienced. I now wish I’d got it a year ago.



Proof of the Grok block is in this video. Under "2 replies" it shows that they have replied, yet I see nothing. Also, going to Grok's page shows that I am blocked and cannot follow because of it.




























