0dteezy

1.4K posts

0dteezy

@0dteezy

Living O days at a time.

参加日 Nisan 2024

185 フォロー中118 フォロワー

0dteezy@0dteezy·1h

@brivael Imagine comparing lidar to Marxism. Truly astounding

English

Brivael - FR@brivael·1d

Aujourd'hui grosse discussion avec mes ingés (chez Argil) sur pourquoi Elon a viré le LIDAR de ses voitures autonomes. Choix radical, moqué pendant des années, et comme d'hab il avait raison depuis le début. Le LIDAR c'est un laser qui balaye l'environnement et crache un nuage de points 3D. Sur le papier tu obtiens la géométrie exacte du monde. Dans la vraie vie c'est une verrue technologique collée sur le toit parce qu'on sait pas faire mieux avec la vision seule. Problème numéro un : ça rajoute une modalité dans le training du modèle. Ton réseau doit apprendre à fusionner vision + lidar + radar + ultrasons. Chaque capteur en plus c'est une source de désaccord à arbitrer, pas une source d'info supplémentaire. Sensor fusion artisanale = dette technique permanente. Problème numéro deux, la bitter lesson de Rich Sutton : scaler le compute sur une seule modalité bat systématiquement les architectures bricolées à la main. Tesla a dropé le radar, puis les ultrasons, est passé full end-to-end vision. Leur courbe sur les edge cases s'est accélérée APRÈS, pas avant. Waymo fait l'inverse et reste stuck en ops géofencée. Problème numéro trois, le plus fondamental : le LIDAR voit la géométrie, pas la sémantique. Il sait qu'il y a un truc, pas ce que c'est ni ce que ça va faire. Les derniers 9 de fiabilité sont des problèmes de cognition, pas de perception brute. Un capteur de plus résout rien, il ajoute du bruit. Sébastien Loeb balance une 208 T16 à 180 dans un chemin boueux corse sous la pluie avec zéro LIDAR. Deux yeux, un cerveau. L'évolution a donné des yeux aux prédateurs pendant 500 millions d'années, pas des lasers. Il y a une raison. Le LIDAR c'est l'équivalent du marxisme appliqué à l'économie. Une solution planifiée, centralisée, qui prétend modéliser explicitement ce qui doit émerger d'un système distribué et adaptatif. Tu remplaces l'intelligence par de la mesure, la compréhension par de la donnée, l'émergence par le contrôle. Ça rassure les ingénieurs qui veulent tout spécifier en amont, exactement comme la planif rassurait les économistes soviétiques. Et ça échoue pour les mêmes raisons : la réalité est trop riche pour être capturée par un capteur, comme elle est trop riche pour être capturée par un plan quinquennal. La vraie intelligence, celle de Hayek comme celle de Tesla, c'est de faire confiance à un système qui apprend de l'expérience plutôt que de tout pré-encoder. L'élégance d'une solution c'est son rapport signal sur complexité. Le LIDAR explose le dénominateur. Défendre le LIDAR en 2026 c'est préférer empiler des hacks plutôt que résoudre le vrai problème. C'est de la feignasserie intellectuelle maquillée en rigueur d'ingénieur. Les mêmes gens qui défendaient les systèmes experts en 2012 contre le deep learning. Ils finiront pareil. Never bet against end-to-end. Never bet against la simplicité. Never bet against Elon.

Français

924

1.8K

15.8K

23.5M

0dteezy@0dteezy·1h

Need to get me an Arby's French dip royale

English

0dteezy@0dteezy·2h

Good luck ever convincing those grifting evil people in government

Elon Musk@elonmusk

Universal HIGH INCOME via checks issued by the Federal government is the best way to deal with unemployment caused by AI. AI/robotics will produce goods & services far in excess of the increase in the money supply, so there will not be inflation.

English

0dteezy@0dteezy·2h

@CalebFC18 Anything but the lakefront in an open stadium will suck ass

English

Caleb Williams Fan Club@CalebFC18·7h

Really hope Illinois legislature can get a deal done in Arlington Heights…

English

441

20.8K

0dteezy@0dteezy·10h

@testt1234567891 @LottoLabs @stevibe @makulas1913 llama.cpp

Español

Matt@testt1234567891·10h

@0dteezy @LottoLabs @stevibe @makulas1913 Using Lm Studio?

English

stevibe@stevibe·16h

Qwen3.6 35B-A3B dropped yesterday, so I ran it on 4 GPUs to see how it performs: 🟣 RTX 3090 — 49.78 tok/s, TTFT 852ms 🟡 RTX 4090 — 118.93 tok/s, TTFT 686ms 🟢 RTX 5090 — 160.37 tok/s, TTFT 409ms 🔵 DGX Spark — 59.98 tok/s, TTFT 228ms I went with ollama as the backend because honestly, it's the easiest way for most people to get started. One command, model pulled, done. I used Q4_K_M (24GB) across all four cards. The reason is the 3090 and 4090 don't support NVFP4 (only the 5090 and DGX Spark could use it). Keeping the same quant everywhere felt like the fairest way to compare. And yes, you can absolutely squeeze more performance out of every card with vLLM, SGLang, or TensorRT-LLM. But that's not what this test is about. This is just the out-of-the-box experience for folks who own a GPU and want to try the new model tonight.

English

133

226

302.2K

0dteezy@0dteezy·10h

As she gets chauffeured around the city she gazes up at the buildings and thinks...Yes, its time to cut rates.

FinancialJuice@financialjuice

Fed's Daly: Office buildings appear less empty.

English

0dteezy@0dteezy·13h

BTC it is time

English

0dteezy@0dteezy·15h

@LottoLabs @stevibe @makulas1913 I'm getting over 100 on 3090 lol

English

137

Lotto@LottoLabs·15h

@stevibe @makulas1913 Just for reference I’m getting 90TPS with 3090 Ollama is just that bad

English

1.4K

0dteezy@0dteezy·15h

The Boomer

NoLimit@NoLimitGains

What do you call this pattern?

English

0dteezy@0dteezy·1d

@vexentine @_ibarz @sudoingX Yes it does, cant believe it

English

Valentine@vexentine·1d

@0dteezy @_ibarz @sudoingX Yes. It works. qwen3.5+ has a different architecture. Just download and try it. Gemma on my 3090 barely works because context is way too small, qwen3.5 works at 200k context.

English

Sudo su@sudoingX·1d

85-100 tok/s on the 3090 with qwen 3.6 already? that's in line with what 3.5 MoE was doing. drop your full flags and context length you tested at, i'm pulling 3.6 on the 5090 24gb and will run the same config for a direct comparison. if anyone else is running qwen 3.6 on a 3090 or any consumer card drop your tok/s, quant, and flags below. building the community benchmark sheet before i publish my own numbers

Jacob Verdoorn@VerdoornJacob

@sudoingX 3090 getting 85-100 t/s on cpp server with new qwen3.6 35b a3b ud q4 k m 262k context

English

276

23.1K

0dteezy@0dteezy·1d

@DJLougen Cant wait to try, cruising over 100t/s on the unsloth version on 3090

English

Daniel Lougen, M.S.@DJLougen·1d

Oh.... also their SABER abliterated twin is done! Again quants are coming so they will be there when they get there! huggingface.co/DJLougen/Ornst… huggingface.co/DJLougen/Ornst…

English

0dteezy@0dteezy·1d

@_ibarz @sudoingX Nope

English

Jean Ibarz@_ibarz·1d

@0dteezy @sudoingX It is with flash attention and KV in 4 bits

English

0dteezy@0dteezy·1d

@VerdoornJacob @sudoingX How you fitting this?

English

353

Jacob Verdoorn@VerdoornJacob·1d

@sudoingX 3090 getting 85-100 t/s on cpp server with new qwen3.6 35b a3b ud q4 k m 262k context

English

24.2K

Sudo su@sudoingX·1d

let me clear something up for the new followers. the 5090 mobile has 24gb vram, same class as the 3090. when i benchmark a model on the 5090 and give you the flags and the tok/s, that translates directly to your 3090 at home. the architecture is newer so the 5090 numbers will be slightly faster maybe, but the configs are identical. if it fits on my machine it fits on yours. and i'm not stopping at one gpu. 3090 nodes are still in the rotation for controlled comparisons, smaller gpus are coming for the 8gb and 12gb crowd, and nvidia sent me a dgx spark that's clearing customs right now. 128gb unified memory on my desk soon. 7 models loaded on the 5090 today, hermes agent work i've been cooking for weeks is almost ready to ship, and open source keeps dropping new models faster than i can pull them. the benchmark pipeline is about to run nonstop. i am so soo back.

English

153

10.7K

0dteezy@0dteezy·2d

@nugator007 @FBGreatMoments His contract lol

English

517

Brian Nelson@nugator007·2d

@FBGreatMoments Someone please explain to me what Jordan Love has ever done to make him untouchable

English

2.2K

Football’s Greatest Moments@FBGreatMoments·3d

Quarterback tiers by trade value according to NFL on FOX.

English

477

94.3K

0dteezy@0dteezy·3d

Wow what a banger

English

0dteezy@0dteezy·3d

@TheAhmadOsman Yikes that's a bleak outlook.

English

Ahmad@TheAhmadOsman·3d

In a future where tokens quantity and quality will determine your standing and wealth, fighting for compute so sovereignty is a worthy battle.

English

3.5K

0dteezy@0dteezy·3d

@outsource_ @huggingface 256k def gonna push this over 3090/4090 vram

English

666

Eric ⚡️ Building...@outsource_·3d

🚨QWEN 3.5 40B DENSE + FULL HERETIC + OPUS 4.6 🤗@huggingface model card: 🧠 40 Billion dense parameters (NOT MoE) 📈 Expanded from 27B → 96 layers + 1,275 tensors 🧪 Multi-stage trained on Claude 4.6 Opus High ⚔️ Fully Heretic-Uncensored first (abliterated) 🔧 Upgraded Jinja template = zero looping... ⚡ Tool calling + 256K context 🧠 Trained via @UnslothAI 🦥on local hardware And yes it runs on consumer hardware: 💻 Single RTX 4090 friendly (Q4_K_S / IQ3_S ) ⚡ Quantized, fast, and ready to go full Deckard mode Pull this model and share results 👇🏻

English

583

43K

0dteezy@0dteezy·3d

Is it normal for Hermes to use way more context than if not using Hermes? Blowing thru 64k in just a few basic prompts

English

228

0dteezy@0dteezy·4d

Thank you Dr. Jesus for the market ramp today

Nick Sortor@nicksortor

🚨 JUST IN: President Trump responds to backlash over an image he posted which seemed to depict him as Jesus "It's supposed to be me as a doctor making people better, and I do make people better. I make people a lot better!"

English

ディスカバー

@brivael @CalebFC18 @testt1234567891 @LottoLabs @stevibe @makulas1913 @vexentine @_ibarz