Dean Sheppard

2K posts

Dean Sheppard

Dean Sheppard

@SheppaDean

Engineer Republican 2A believer 1A defender

New Jersey, USA Katılım Kasım 2023
15 Takip Edilen111 Takipçiler
Dean Sheppard
Dean Sheppard@SheppaDean·
One of the weakest models for coding tasks: terrible tool discipline and reasoning you can’t turn off. It’s the only model I’ve managed to push into a looping hallucination just by asking a technical question in the WebUI, repeated three times with a full model reload between prompts. Impressively strong (and wrong) behavior.
English
1
0
1
110
witcheer ☯︎
witcheer ☯︎@witcheer·
GPT-OSS-20B deep dive: 8 coding tasks, 8 passes, 1.8 GB VRAM. >setup: RTX 4060 Ti 8GB, WSL2, llama-server with ncmoe=30, Pi coding agent. model uses 1.8 GB VRAM, 10 GB host RAM. >result: 8/8 pass. every task produced working, tested code. combined with the original benchmark (portscout + logpulse), that is 10/10 completed agentic tasks on consumer hardware. (find all prompts in HF). >what I found: context efficiency: 6-48% of the 32K window used per task. no task exhausted context. the hardest prompt (multi-module with topological sort) used 47.6%. self-correction works: the model found and fixed its own bugs 7 times across 8 tasks. topological sort direction was backwards, fixed. printf format string missing %s, fixed. sed quoting wrong, fixed. no hallucinated APIs: prompt 5 was a trap, "use only the standard library." model used real modules (http.client, json, time, urllib.parse). no fake convenience wrappers. >weaknesses found: edit tool struggles: exact string matching for the edit tool fails repeatedly. model needs 3-4 attempts before falling back to full file rewrites. this is partly an agent-side problem (Pi's edit tool requires exact match), not purely a model problem. directory scan waste: runs "ls -R ~" from home directory, dumping 11K+ lines into context. happened on 2 of 8 prompts. wastes context budget for zero value. rumination: the model debates obvious things internally. "should I use 127.0.0.1 or localhost?" "is name == 'main' correct?" burns tokens but never causes failures. >why it works: 21B MoE with only 3.6B active params. native MXFP4 FFN weights hold quality under Q4_K_M quant. OpenAI's instruction tuning handles tool call JSON cleanly. 1.8 GB VRAM with expert offload leaves massive headroom on an 8GB card. >recipe: - model: gpt-oss-20b Q4_K_M (11 GB on disk) - agent: Pi coding agent (read/write/edit/bash tools) - server: llama-server --jinja -ncmoe 30 -c 32768 -n 8192 - hardware: any 8GB GPU + 16GB system RAM
witcheer ☯︎ tweet media
English
6
7
68
9K
Dean Sheppard
Dean Sheppard@SheppaDean·
8x A100, multi instance GPUs, cost around $20k each for 80GB version. Each A100 can be partitioned into 7 instances, so up to 56 instances max from this only server. Each instance rental is from $2.50 to $3.50 per hour, let’s say $3 *56 =$168 per hour or about $4k per day. $120k per month. Electricity and all, it’s still very lucrative business.
English
0
0
0
88
starmex
starmex@starmexxx·
> 8 GPUs in one server rig > dude went homeless to build it > electrical bill costs more than rent now > while everyone else pays $400/month to openai > a 2 GPU desktop kills the api bill forever > rtx 4080 super + rtx 5060 ti = 32gb vram > runs qwen 3.6 with 100k context locally > no rate limits, no api keys, no data leaving the room > agents loop 400 times for free > claude opus still wins on hard reasoning > but local handles 90% of daily work > $1,200 setup pays itself off in 4 months > bookmark this and read the article below
leopardracer@leopardracer

x.com/i/article/2055…

English
43
68
561
165.2K
Dean Sheppard
Dean Sheppard@SheppaDean·
@RWPhysics Sure, and this “simplification” has nothing to do with a 50 MB bare Python runtime just to execute this printf😂
English
0
0
0
20
Dean Sheppard
Dean Sheppard@SheppaDean·
Automotive engineer here. Despite appearances, any part that involves motors is pretty complicated because of the required safety guardrails. It doesn’t need an advanced MCU, but you guys (customers) came with lawsuits about your kids and pets getting injured by closing windows, so we had to add motor current monitoring and the associated anti-pinch logic.
English
0
0
2
19
LⒶVENDER
LⒶVENDER@stonewall1312·
no damn reason a window switch needs this much computer, and thats why its already broken. all the switchgear in my 40 year old acura is OE and WORKS.
LⒶVENDER tweet media
English
22
8
134
4.9K
Dean Sheppard
Dean Sheppard@SheppaDean·
@leopardracer Comparing Qwen3.6-27B to frontier models is like comparing apples and oranges. I’m running a local Qwen 35B-A3B as an experiment, but 95% of the real coding work is done with ChatGPT and the $200/month plan.
English
1
0
3
2.3K
leopardracer
leopardracer@leopardracer·
THIS DEVELOPER HASN’T PAID AN API BILL IN 3 MONTHS. HIS AGENTS RAN 10,000 TIMES FOR FREE he built a local AI lab under his desk two GPUs 32GB VRAM zero rate limits his agents loop 400 times if they want to his coworkers are still watching the usage dashboard the only thing separating them: llama.cpp + llama-swap every prompt stays on his machine every experiment costs $0 zero invoices bookmark & like this before your next API bill hits
leopardracer@leopardracer

x.com/i/article/2055…

English
85
164
1.8K
531.1K
Dean Sheppard
Dean Sheppard@SheppaDean·
It’s not recommended for two reasons. First, it’s a nightmare to rework because you have to apply a lot of heat to compensate for the solid copper heat sinking. Second, without thermal reliefs, your pad shape is defined only by the solder mask. Any solder mask defect around the pads will result in a failed board, since there’s nothing to prevent the melting solder from flowing away from the pad. There are also some annoying defects, like tombstoning caused by irregular heat distribution across copper pours, but your board is probably too small to experience those. Removing thermal reliefs won’t make the ground connection better - it will just create unnecessary complications. Basically, every component should use the default thermal reliefs, and then, only when necessary, you apply an exception rule and use a solid pad connection instead of inherited one.
English
0
0
0
23
IDV
IDV@IDV_FPV·
@SheppaDean Fair point but since the board is relatively small I rather have good ground connections.
English
1
0
0
77
IDV
IDV@IDV_FPV·
Layout done, managed to cram everything onto a 25x30mm board. Heat isolation of the SHT40 is important on a small board like this, no copper pour + cutout slot so ESP32-C6's heat doesn't skew the readings. Next the fun part, 3D renders and case design!
IDV tweet media
English
12
8
191
6.8K
Dean Sheppard
Dean Sheppard@SheppaDean·
@DabsMalone No, it won’t work. There are multiple issues, but the major one is that once loaded, there’s nothing to keep the motors horizontal to produce lift. With a loaded net, they will flip 90 degrees, and the whole setup will fall like a rock.
English
3
0
5
262
Dabs🩸
Dabs🩸@DabsMalone·
Do you think this could work?🧐
Dabs🩸 tweet media
English
76
0
88
8.3K
Dean Sheppard
Dean Sheppard@SheppaDean·
@svpino Most of the forced and artificial hype around OpenClaw was about selling you online courses on “how to make money with OpenClaw.”
English
0
0
0
276
Santiago
Santiago@svpino·
The “buy a Mac mini or you’ll never make it” crew moved on already. The whole “I’m making $10,000/mo with OpenClaw in a Mac Mini” was a grift, and it’s now dead. What’s the latest now? DGX Spark?
English
92
25
360
116.2K
Dean Sheppard
Dean Sheppard@SheppaDean·
@TheWapplehouse How do $15k side steps sound okay to anyone? Am I the only one too poor for that?
English
1
0
14
2K
Dean Sheppard
Dean Sheppard@SheppaDean·
@PR0GRAMMERHUM0R Oh, the famous 4o! You’re using it wrong! Instead of building a romantic relationship and debating existential topics, you’re just asking stupid math questions.
English
1
0
214
11.9K
Programmer Humor
Programmer Humor@PR0GRAMMERHUM0R·
floatingPointArithmetic
Programmer Humor tweet media
English
89
149
4.6K
376.6K
Dean Sheppard
Dean Sheppard@SheppaDean·
@repairfan_com The tip isn’t missing, it’s stuck inside the fuse holder. Disconnect the power before trying to pull it out.
English
1
0
2
85
REPAIRFAN公式
REPAIRFAN公式@repairfan_com·
電源が入らないと言う機器のヒューズを確認すると先っぽが無い😳
REPAIRFAN公式 tweet media
日本語
7
2
65
2.5K
Behnam
Behnam@OrganicGPT·
If you wanna run AI models locally, the best option is an RTX 6000 Pro. DON'T get a 5090/4090. And DON'T listen to people who hype the 3090; those cards are beat at this point. Get the RTX with education discount through Nvidia. These used to be $8000, now they're +$9000.
Behnam tweet media
English
64
5
74
31.2K
Dean Sheppard
Dean Sheppard@SheppaDean·
@witcheer I’d say, look into how the PCIe slots are spaced if you decide to add another GPU one day (it will eventually happen). Most motherboards leave no room for GPU airflow when two are installed.
English
1
0
0
37
witcheer ☯︎
witcheer ☯︎@witcheer·
I'm about to make my next computer hardware purchase. If you know your way around computers, please help me make sure this is a wise choice. it's a significant investment, and this will be my first time buying computer hardware. I currently have: >CPU: AMD Ryzen 5 7600X (6 cores, Zen 4, AM5) >Cooler: Arctic Liquid Freezer II 360 A-RGB + Arctic MX-4 paste >Motherboard: ASUS TUF Gaming A620M-PLUS WIFI (mATX, A620 chipset) >RAM: 32GB DDR5-6000 CL36 (2×16GB Corsair Vengeance) >GPU: MSI GeForce RTX 4060 Ti VENTUS 3X OC 8GB >Storage: Kingston KC3000 1TB NVMe >PSU: MSI MAG A650GL (650W, 80+ Gold) I want to buy: >Used RTX 4090 24GB >PSU 850W 80+ Gold (e.g. Corsair RM850x) >RAM 64GB DDR5-6000 (2×32GB)
English
12
0
10
2.9K
Dean Sheppard
Dean Sheppard@SheppaDean·
@eevblog Their whole business is based on hiding buyers and sellers from each other and making money from transactions. Of course, any link outside eBay triggers alarms. They became paranoid about it a while ago.
English
0
0
3
76
Dave Jones
Dave Jones@eevblog·
You aren't allowed to put PDF datasheet links in ebay listings it seems.
Dave Jones tweet media
English
10
1
43
2.5K
Dean Sheppard
Dean Sheppard@SheppaDean·
@fishPointer And this is actual scalper reasoning. They bought RAM at a fixed (lower) price, then sell it at “market” value, expecting to squeeze every dollar out of the shortage. It’s a greedy practice, nothing more.
English
4
0
44
2.6K
fish
fish@fishPointer·
RAM (Market Price)
fish tweet media
English
47
60
1.2K
54.1K
Dean Sheppard
Dean Sheppard@SheppaDean·
@loktar00 Gemma is too chatty - like an Instagram girl - while Qwen is an introverted handyman dude.
English
2
0
7
489
Loktar 🇺🇸
Loktar 🇺🇸@loktar00·
I love Qwen 3.6 but the more I test between Qwen and Gemma the more I respect Gemma's ability to not overthink it and just spit an answer out.
English
23
3
107
7.3K
Dean Sheppard
Dean Sheppard@SheppaDean·
@sirius_kpc They had to strike a balance between having no components at all (which looks suspicious) and using something inexpensive like a 0-ohm resistor. In all fairness, 99% of people don’t even know why a 1W 0-ohm resistor (which looks like a 2512 package) shouldn’t be there.
English
0
0
1
180