Justin Thyme 🇺🇸🐿️

36.9K posts

@looking5452

Just looking to set the record straight
Anti-celebrity

Joined March 2020
897 Following · 3.4K Followers
Pinned Tweet
Justin Thyme 🇺🇸🐿️ @looking5452
He's holding a sign that literally just says, "The right to openly discuss ideas must be defended." Let that sink in.
82
230
627
0
sdmat @sdmat123
@looking5452 @kdaigle @github I know someone who prefers low end models because they are fast, literal and unambitious. If you know exactly what you want and express it with precision the technical scribe may be as good as an ersatz engineer. This in no way means haiku is as capable as opus.
1
0
1
16
Kyle Daigle @kdaigle
Hot take from looking at @github Copilot telemetry: benchmarks make coding models look wildly different. Production workflows make them look much more similar. 👀 We looked at 23M+ Copilot requests and examined one simple metric: code survivability.
27
40
308
58.6K
sdmat @sdmat123
@kdaigle @github This is like studying percentage of food sent back to the kitchen vs. knife brand
1
0
1
252
Justin Thyme 🇺🇸🐿️
@ProofOfCash @iMilnb I did long context evals in all major quants and kv quants for this model and q8_0 definitely is NOT a free lunch
BEST result at q8_0 was 57% recall; f16 on most quants was easily >97%; major fail at q8_0
1
0
1
18
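A minimal sketch of how a recall figure like the 57% vs. >97% above could be scored. The post does not say how the evals were run, so this assumes a needle-in-a-haystack style test where each expected fact is searched for in the model's answer; the function name and the sample data are hypothetical.

```python
def recall(expected: list[str], answers: list[str]) -> float:
    """Fraction of expected facts that appear in the matching answer.

    Assumption: a case-insensitive substring match counts as a hit.
    """
    if not expected:
        raise ValueError("need at least one expected fact")
    hits = sum(
        1 for fact, answer in zip(expected, answers)
        if fact.lower() in answer.lower()
    )
    return hits / len(expected)


# Hypothetical data for illustration only; not results from the post.
expected = ["7421", "blue heron", "gate 12"]
answers = ["The code is 7421.", "I could not find it.", "Gate 12"]
score = recall(expected, answers)
print(f"recall: {score:.0%}")
```

Running the same prompts once per KV cache type and comparing the scores would give the kind of side-by-side numbers quoted in the post.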
ProofOfCash ⚔️ URSF @ProofOfCash
@iMilnb It's very handwavey when it comes to the KV cache quantization. Of course short sequences won't show degradation at q8_0. I don't think that it's a free lunch if you use the model for agentic coding.
1
0
3
329
National Security Division, U.S. Dept of Justice
Three Charged with Conspiring to Unlawfully Divert Cutting Edge U.S. Artificial Intelligence Technology to China
“The indictment unsealed today details alleged efforts to evade U.S. export laws through false documents, staged dummy servers to mislead inspectors, and convoluted transshipment schemes, in order to obfuscate the true destination of restricted AI technology—China,” said John A. Eisenberg, Assistant Attorney General for National Security. “These chips are the product of American ingenuity, and NSD will continue to enforce our export-control laws to protect that advantage.”
🔗: justice.gov/opa/pr/three-c…
260
1.5K
4.7K
4.2M
Justin Thyme 🇺🇸🐿️
@EricRichards22 copilot cli is much more coherent
the application itself is trash but it... does what it's supposed to (at least for rust, bash, and python)
0
0
1
13
Lotto @LottoLabs
Qwen 3.5 models ranked on 3090 W/ hermes agent.
0.8b: for fun, cpu usage, don’t expect much but it runs on anything
2b: starting to be usable, can do small tool calls (not super reliably), drifts from tasks easily, major steering required
4b: actually usable, follows tool calls reliably, follows skills reliably (major bonus), doesn’t drift from tasks as bad as 2b.
9b: all of 4b but more capable w/ more complex tasks, still needs steering, still not 1 shoting tasks but more intelligent than the smaller models
A3b: fast, more general intelligence, feels like the 9b speed but the reasoning closer to 27b, follows tool calls and complex skills well, minimal drift, just lacks big model coding abilities.
27b: the 3090 goat imo, no drift, tool calls for days, writes and follows skills very well, feels like sonnet 3.6-4 range of knowledge level with less glazing, code is usable and can deal w/ multiple files in projects. General knowledge level just feels higher. Only downside is it is slower than A3b and 9b obviously.
35
36
548
36.1K
Justin Thyme 🇺🇸🐿️
@Mike562389 during use I set around 160k because I don't trust models over 128k, tho I have no measurable reason in this case... just a habit to only trust a little more than half max context
1
0
2
17
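The "a little more than half max context" habit can be checked with quick arithmetic. This sketch assumes the maximum window is the 262144 tokens passed to --ctx-size in another post in this feed, and reads "around 160k" as 160,000 tokens; both readings are assumptions.

```python
# Assumption: max window taken from the --ctx-size value in another post.
MAX_CONTEXT = 262_144
# Assumption: "around 160k" read as 160,000 tokens.
WORKING_CONTEXT = 160_000

fraction = WORKING_CONTEXT / MAX_CONTEXT
print(f"{fraction:.0%} of the maximum window")
```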
Justin Thyme 🇺🇸🐿️
unsloth Qwen3.5-122B-A10B IQ2_M works EXTREMELY well on modest modern hardware at about 19-21 tps (benches at 23 tps gen, 224 prompt), VERY strong long context, all other standard benches show minimal (<3-5%, many <1%) degradation from baseline; extremely usable, smarter than 35B (50% generation speed but approx == wall clock time to solution due to fewer thinking tokens)
rtx 5060 ti 16gb, ryzen 7 9700x (at 85W limit), 64gb ddr5 at 6000mt/s
--threads 11 \
--threads-batch 13 \
--gpu-layers 99 \
--n-cpu-moe 45 \
--ctx-size 262144 \
--predict 32768 \
--batch-size 512 \
--ubatch-size 512 \
--parallel 1 \
--kv-offload \
--cache-type-k f16 \
--cache-type-v f16 \
--flash-attn on \
--fit on
something magical about this quant; keep token probability tight
2
0
3
131
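A rough sketch of the wall-clock claim in the post above: a model that generates at half the speed finishes in about the same time if it needs about half as many tokens. The 20 tps figure is the middle of the 19-21 tps range in the post and 40 tps follows from the "50% generation speed" remark; the token counts are hypothetical and chosen only to show the arithmetic.

```python
def wall_clock_seconds(tokens_generated: int, tokens_per_second: float) -> float:
    """Time to produce a response at a steady generation rate."""
    return tokens_generated / tokens_per_second


# Hypothetical token counts for illustration; they are not from the post.
small_model = wall_clock_seconds(tokens_generated=8000, tokens_per_second=40.0)
large_model = wall_clock_seconds(tokens_generated=4000, tokens_per_second=20.0)
print(small_model, large_model)
```

Prompt processing time is ignored here, so this only covers the generation side of the comparison.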
Justin Thyme 🇺🇸🐿️
I witnessed a chase when I was a kid
the culprit drove behind some bushes, which my friends and I didn't quite understand (yet) because the police were so far behind; when the cops arrived 20 seconds later we put 2 and 2 together and all pointed at the bushes screaming, which the cop took note of; he reversed, went behind them, and flushed him out; very exciting for little me!
0
0
0
6
Kensetsu @Kensetsu6
I saw a slow speed chase today, spike strips and all. I was just trying to get lunch. Interesting to see. I hope they’re all okay. The cops had to stack up with their guns drawn. A tough day for at least a couple people there.
1
0
8
76
Justin Thyme 🇺🇸🐿️
@ID_AA_Carmack my wife thought I was weird for getting an aux cable for the car until I demonstrated the difference between aac+sbc and flac+aux
now she thinks I'm REALLY weird
0
0
2
76
John Carmack @ID_AA_Carmack
When you stream Spotify to Bluetooth speakers or headphones, the audio comes over the network lossily compressed with Vorbis or AAC codecs, is then decoded on your device to 48 kHz raw samples, then the Bluetooth stack lossily re-compresses it with SBC or AAC codecs before sending it over the airwaves to the speakers. I don’t have “golden ears” to pick apart audio quality like I can with, say, missing gamma correction on texture filtering, but that still hurts my system optimization soul. It is likely over-optimization, but it would be cleaner if there were a way to send Bluetooth-ready, compressed audio directly.
272
241
5.8K
438.2K
Justin Thyme 🇺🇸🐿️
@rah_66_comanche I think there's a good argument to be made about the guy's uncanny valley comment, but probably not in the way he meant it
one takeaway from the demo is that the highly photorealistic appearance contrasts VERY hard against unrealistic movement, esp in the keyframe animated games
1
0
1
11
RAH-66 COMANCHE @rah_66_comanche
Yeah you know what it absolutely was not fucking delivering? Real lighting. That's why the last of us looks like everything was shot in a light box lit by 1 LED, and the death stranded chick looks like her skin is made of soft touch lightbulbs.
Nyx @justnyxs

We don’t need DLSS filter slop. Gaming was already delivering real emotion, real performances, and real immersion long before AI upscaling became a crutch... and it’ll keep doing that for decades to come.

2
0
10
352
Justin Thyme 🇺🇸🐿️
@rah_66_comanche nothing that I said conflicts with what he said
I agree, it's not post-processing; it's rendering with the richness of a deep knowledge of what's present underneath; this is a spin-off of their AI textures; same concept, bigger application
1
0
0
16
RAH-66 COMANCHE @rah_66_comanche
@looking5452 "It’s not post-processing, it’s not post-processing at the frame level, it’s generative control at the geometry level," he said.
1
0
1
24
Justin Thyme 🇺🇸🐿️
for one as big as this, it definitely is solid; obviously a dense model would be less affected, but out of curiosity I just took all the lower quants that I could get to run at all and ran them through several evals, and for some reason this specific quant was like magic; the 3-bit quants were all garbage, same with the other 2-bit quants; there's something special about this specific quant
0
0
4
145
Lotto @LottoLabs
@looking5452 Cool I should know better than to shit on smaller quants than Q4, all that matters is if it works
1
0
1
148