0dteezy

1.4K posts

@0dteezy

Living O days at a time.

Joined April 2024

186 Following · 118 Followers

0dteezy@0dteezy·19m

BTC it is time

0dteezy@0dteezy·1h

@LottoLabs @stevibe @makulas1913 I'm getting over 100 on 3090 lol

Lotto@LottoLabs·2h

@stevibe @makulas1913 Just for reference, I’m getting 90 TPS with a 3090. Ollama is just that bad.

stevibe@stevibe·3h

Qwen3.6 35B-A3B dropped yesterday, so I ran it on 4 GPUs to see how it performs:
🟣 RTX 3090: 49.78 tok/s, TTFT 852ms
🟡 RTX 4090: 118.93 tok/s, TTFT 686ms
🟢 RTX 5090: 160.37 tok/s, TTFT 409ms
🔵 DGX Spark: 59.98 tok/s, TTFT 228ms
I went with ollama as the backend because honestly, it's the easiest way for most people to get started. One command, model pulled, done.
I used Q4_K_M (24GB) across all four cards. The reason is the 3090 and 4090 don't support NVFP4 (only the 5090 and DGX Spark could use it). Keeping the same quant everywhere felt like the fairest way to compare.
And yes, you can absolutely squeeze more performance out of every card with vLLM, SGLang, or TensorRT-LLM. But that's not what this test is about. This is just the out-of-the-box experience for folks who own a GPU and want to try the new model tonight.
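
For anyone who wants to reproduce tok/s figures like the ones in the post above, here is a minimal sketch of how they could be derived from the timing fields that Ollama's `/api/generate` endpoint returns (`eval_count`, `eval_duration`, `prompt_eval_duration`, with durations in nanoseconds). It is not necessarily how the post's numbers were measured. The helper name `ollama_throughput` and the sample values are illustrative assumptions, and prompt-evaluation time is only a rough stand-in for TTFT once the model is already loaded.

```python
def ollama_throughput(resp: dict) -> dict:
    """Derive decode speed and prompt-eval latency from an Ollama response dict.

    Assumes the final (non-streamed) /api/generate response, where
    durations are reported in nanoseconds.
    """
    ns_per_s = 1e9
    # Generated tokens divided by the time spent generating them.
    decode_tok_s = resp["eval_count"] / (resp["eval_duration"] / ns_per_s)
    # Time spent processing the prompt, in milliseconds (rough TTFT proxy).
    prompt_eval_ms = resp["prompt_eval_duration"] / 1e6
    return {"decode_tok_s": decode_tok_s, "prompt_eval_ms": prompt_eval_ms}


# Made-up sample values for illustration only, not taken from the post.
sample = {
    "eval_count": 500,
    "eval_duration": 10_000_000_000,      # 10 s
    "prompt_eval_duration": 850_000_000,  # 850 ms
}
print(ollama_throughput(sample))
```

Comparing cards fairly would also mean holding the prompt, context length, and quant constant, as the post does with Q4_K_M.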

0dteezy@0dteezy·1h

The Boomer

NoLimit@NoLimitGains

What do you call this pattern?

0dteezy@0dteezy·11h

@vexentine @_ibarz @sudoingX Yes it does, can't believe it

Valentine@vexentine·11h

@0dteezy @_ibarz @sudoingX Yes. It works. qwen3.5+ has a different architecture. Just download and try it. Gemma on my 3090 barely works because context is way too small, qwen3.5 works at 200k context.

Sudo su@sudoingX·20h

85-100 tok/s on the 3090 with qwen 3.6 already? that's in line with what 3.5 MoE was doing. drop your full flags and context length you tested at, i'm pulling 3.6 on the 5090 24gb and will run the same config for a direct comparison. if anyone else is running qwen 3.6 on a 3090 or any consumer card drop your tok/s, quant, and flags below. building the community benchmark sheet before i publish my own numbers

Jacob Verdoorn@VerdoornJacob

@sudoingX 3090 getting 85-100 t/s on llama.cpp server with the new Qwen3.6 35B-A3B UD-Q4_K_M at 262k context

0dteezy@0dteezy·11h

@DJLougen Can't wait to try, cruising over 100 t/s on the unsloth version on 3090

Daniel Lougen, M.S.@DJLougen·12h

Oh.... also their SABER abliterated twin is done! Again quants are coming so they will be there when they get there! huggingface.co/DJLougen/Ornst… huggingface.co/DJLougen/Ornst…

0dteezy@0dteezy·12h

@_ibarz @sudoingX Nope

Jean Ibarz@_ibarz·16h

@0dteezy @sudoingX It is with flash attention and KV in 4 bits

0dteezy@0dteezy·19h

@VerdoornJacob @sudoingX How are you fitting this?

Jacob Verdoorn@VerdoornJacob·20h

@sudoingX 3090 getting 85-100 t/s on llama.cpp server with the new Qwen3.6 35B-A3B UD-Q4_K_M at 262k context

Sudo su@sudoingX·20h

let me clear something up for the new followers. the 5090 mobile has 24gb vram, same class as the 3090. when i benchmark a model on the 5090 and give you the flags and the tok/s, that translates directly to your 3090 at home. the architecture is newer so the 5090 numbers will be slightly faster maybe, but the configs are identical. if it fits on my machine it fits on yours.
and i'm not stopping at one gpu. 3090 nodes are still in the rotation for controlled comparisons, smaller gpus are coming for the 8gb and 12gb crowd, and nvidia sent me a dgx spark that's clearing customs right now. 128gb unified memory on my desk soon.
7 models loaded on the 5090 today, hermes agent work i've been cooking for weeks is almost ready to ship, and open source keeps dropping new models faster than i can pull them. the benchmark pipeline is about to run nonstop. i am so soo back.

0dteezy@0dteezy·2d

@nugator007 @FBGreatMoments His contract lol

Brian Nelson@nugator007·2d

@FBGreatMoments Someone please explain to me what Jordan Love has ever done to make him untouchable

Football’s Greatest Moments@FBGreatMoments·2d

Quarterback tiers by trade value according to NFL on FOX.

0dteezy@0dteezy·2d

Wow what a banger

0dteezy@0dteezy·2d

@TheAhmadOsman Yikes that's a bleak outlook.

Ahmad@TheAhmadOsman·2d

In a future where token quantity and quality will determine your standing and wealth, fighting for compute sovereignty is a worthy battle.

0dteezy@0dteezy·2d

@outsource_ @huggingface 256k def gonna push this over 3090/4090 vram

Eric ⚡️ Building...@outsource_·2d

🚨 QWEN 3.5 40B DENSE + FULL HERETIC + OPUS 4.6
🤗 @huggingface model card:
🧠 40 Billion dense parameters (NOT MoE)
📈 Expanded from 27B → 96 layers + 1,275 tensors
🧪 Multi-stage trained on Claude 4.6 Opus High
⚔️ Fully Heretic-Uncensored first (abliterated)
🔧 Upgraded Jinja template = zero looping...
⚡ Tool calling + 256K context
🧠 Trained via @UnslothAI 🦥 on local hardware
And yes it runs on consumer hardware:
💻 Single RTX 4090 friendly (Q4_K_S / IQ3_S)
⚡ Quantized, fast, and ready to go full Deckard mode
Pull this model and share results 👇🏻

0dteezy@0dteezy·3d

Is it normal for Hermes to use way more context than if not using Hermes? Blowing thru 64k in just a few basic prompts

0dteezy@0dteezy·3d

Thank you Dr. Jesus for the market ramp today

Nick Sortor@nicksortor

🚨 JUST IN: President Trump responds to backlash over an image he posted which seemed to depict him as Jesus: "It's supposed to be me as a doctor making people better, and I do make people better. I make people a lot better!"

0dteezy@0dteezy·3d

@Teknium Sick

Teknium (e/λ)@Teknium·3d

Check out the homepage!

Teknium (e/λ)@Teknium·3d

So much in this release, but the one many have been waiting for above the rest: the GUI dashboard! Manage and monitor your Hermes Agent from a local web dashboard. Run the `hermes dashboard` command to start it!

Nous Research@NousResearch

Hermes Agent v0.9.0 - “The Everywhere Release” Full changelog below ↓

0dteezy@0dteezy·3d

@BrianRoemmele @DocTooch @CB_AMGSport @Timcast How does having a robot make my mortgage disappear? Or make beef cheaper?

Brian Roemmele@BrianRoemmele·3d

Doc, sharp question, you’re seeing the friction point clearly. Governments don’t have to participate, and they never really have for the breakthroughs that actually matter. UHI+ is not a government handout program waiting on molasses bureaucracy to cut checks from taxes. It’s a temporary bridge we build while the real engine kicks in: personal AI and robotics that replicate themselves. Once a capable robot can make more robots (and they will, faster than any regulator can move), the cost of everything collapses toward zero. “Income” stops being a paycheck or a UBI wire and becomes direct ownership of abundance itself. No middlemen. No permission required. There will be an ugly interregnum, legacy systems always resist and lag. That’s exactly why the 5000 Days series isn’t about waiting for saviors. It’s about individual empowerment: you, me, and a few million others owning the means of production at the personal scale before the old order fully cracks. No one is coming to save us. But us.

Tim Pool@Timcast·4d

The things I have learned about AI recently have left me shocked. Suffice it to say there is an agenda in which you will own nothing and you will "be happy." A tsunami is coming. Many technocrats view this as a good thing, a technology shift to free the minds of humans. However, the shift will leave millions without meaningful ways to engage the economy and with a short-term political-economic solution that will not work. In the minds of those working in this system it *will* work, in that after several years the system will realign to a new economy. The tsunami is coming and only some have prepared a boat to weather this economic storm. After the flood clears there will be many dead and destitute, but a new world will emerge. This AI tech is already here. The only reason it's not public is because technocrats are trying to ease the stress on the system.

0dteezy@0dteezy·3d

@outsource_ Reeeeply, workspace is pretty dope

Eric ⚡️ Building...@outsource_·3d

If you're NEW say HI 👋🏻 REPLY for a follow back 👇🏻

0dteezy@0dteezy·4d

@LottoLabs Someone did a praying app that locks you out of your phone till you "pray" enough during the day. Few months old too

Lotto@LottoLabs·4d

You’re not shipping because of overthinking.
They’re shipping because of underthinking.

Polymarket@Polymarket

JUST IN: New app lets users pay $1.99 per minute to talk to an AI-generated Jesus.

0dteezy@0dteezy·4d

@BearsShowYo I big like SA and Denver also

BearsShowYo@BearsShowYo·4d

Timberwolves FC let’s work.

BearsShowYo@BearsShowYo

Since the Bulls will not be in the NBA playoffs, what team should I root for?

0dteezy@0dteezy·4d

@sudoingX @TrustyStuart It's the worst. Took me a month to get Linux installed on an older ASUS ROG laptop and it still crashes after a few mins every time

Sudo su@sudoingX·4d

@TrustyStuart asus is the outlier.

Sudo su@sudoingX·4d

cracked it open. day one with the rog strix scar 18 5090 i wiped windows and installed linux like i do on every machine and like i tell every builder to do. and things didn't work. no keyboard rgb. no anime vision matrix on the lid. no logo light, no seat underglow. openrgb doesn't support this model. asus ships exactly zero linux support for a machine with 24gb of vram. couldn't even run a long night coding session with the keys lit. just a black deck in the dark.
about 15 hours of focused work later it's all alive. and the anime vision matrix on the lid is so cool with colors breathing, rainbow cycle, anime vision streaming animations at ~15 fps.
the actual lock was an init handshake the windows driver stack quietly does for you and nobody told me about. the embedded controller wants the ascii string "ASUS Tech. Inc." sent as feature reports to three report ids before it will accept any color command. without it the device silently discards every packet you send, no error, nothing. my wireshark captures from armoury crate missed this because the captures started after the handshake had already happened. three hours of byte for byte replay before i realized my capture was midstory.
kernel 6.17 tested, hid-asus kernel driver stays loaded the whole time, keyboard input survives, zero module unbinding. per-key rgb and persistence across reboots are the next milestones. the rest is just engineering now.
if you run linux on a rog laptop drop your model below, you're about to get your machine back.

Sudo su@sudoingX

first week on the rog scar 18 5090 on linux and the rgb doesn't work, backlight doesn't work and fan profiles need asusctl because asus ships exactly zero linux support for a machine with 24gb of vram. opening prs and documenting fixes as i go. if you run linux on a rog laptop drop your setup and biggest pain below.
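
The init handshake described in the post above (the ASCII string sent as a feature report to three report IDs before any color command) could be sketched roughly as follows. This is an illustration of the idea, not the author's code: the report IDs `0x01`, `0x02`, `0x03` and the 64-byte report length are hypothetical placeholders, since the post does not give the real values, and `send_feature_report` is any caller-supplied function that writes one HID feature report (for example a method on an opened HID device handle, which is an assumption about the setup).

```python
from typing import Callable, Iterable, List

# ASCII string quoted in the post.
HANDSHAKE = b"ASUS Tech. Inc."


def build_handshake_reports(report_ids: Iterable[int],
                            report_len: int = 64) -> List[bytes]:
    """Build one zero-padded feature report per report ID.

    Layout assumed here: [report id][handshake string][zero padding].
    """
    reports = []
    for rid in report_ids:
        body = bytes([rid]) + HANDSHAKE
        if len(body) > report_len:
            raise ValueError("report_len too small for handshake string")
        reports.append(body.ljust(report_len, b"\x00"))
    return reports


def send_handshake(send_feature_report: Callable[[bytes], object],
                   report_ids: Iterable[int],
                   report_len: int = 64) -> None:
    """Send the handshake to every report ID before any color command."""
    for report in build_handshake_reports(report_ids, report_len):
        send_feature_report(report)


# Dry run with hypothetical IDs: collect the packets instead of sending them.
sent: List[bytes] = []
send_handshake(sent.append, [0x01, 0x02, 0x03])
print([packet[:16] for packet in sent])
```

Keeping packet construction separate from the transport makes it possible to compare the generated bytes against a capture before touching the hardware.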
