Tech2Wild

156 posts

Tech2Wild banner
Tech2Wild

Tech2Wild

@Tech2Wild

🎮 Tech, gaming, AI, and everything in between. 🤖 Building with it, not just talking about it. 🔥 From the mind of @ToNYD2WiLD

Tham gia Mart 2026
59 Đang theo dõi100 Người theo dõi
g023
g023@g023dev·
@Tech2Wild Opus as orchestrator with deepseek as a subagent can be a pretty decent combination which can be a good way to stretch those limits.
English
1
0
0
12
Tech2Wild
Tech2Wild@Tech2Wild·
Made a decision today, and I am going 80% local. Moving ALL my agents to DSv4 Flash & Qwen 3.6 27B. Running my orchestrator, ONE Agent on Opus 4.8 to lead the charge. We will build automations & Skills that will help teach and shape the team. Workflows = Success. Wish Me Luck !
English
3
0
15
474
Behnam
Behnam@OrganicGPT·
@sakurayukiai @Tech2Wild how do you offload the work to local agents? did you create an MCP for that? or do you restart Claude with base URL pointing to your local server?
English
2
0
0
31
Sakura Yuki
Sakura Yuki@sakurayukiai·
@Tech2Wild Running a similar hybrid setup on my RTX 5070 Ti. Using Opus 4.8 for orchestration and offloading the tight loops to local Qwen3.6-27B is the only way you don't go bankrupt serving agents.
English
2
0
4
119
Tech2Wild
Tech2Wild@Tech2Wild·
@gospaceport @Web3Twon I bought a second, then just got a 3rd one. But now I can run it together because TP=3 don't work so guess what... I need a 4th one ! Don't fall for the trap lmaoo
English
0
0
1
9
Digital Spaceport
Digital Spaceport@gospaceport·
@Web3Twon 2x 3090 is a crazy amount of added performance. 4x is nice also, but man 48GB is a certain amount of models fitting that makes it make sense.
English
7
0
9
355
Tech2Wild
Tech2Wild@Tech2Wild·
@GPTWare and bruh I didn't even thing about that til you said it. It's a GEN 5 ! Not a GEN 4? That makes it a STEAL at this point not a 1:1 lol. That almost looks like a Ebay Scam now lol. I trust its real though but I'm just saying.
English
1
0
1
15
Tech2Wild
Tech2Wild@Tech2Wild·
@GPTWare True ! I built a PC about 2 years ago and ram was so cheap so I tend to forget.
English
0
0
1
11
Tech2Wild
Tech2Wild@Tech2Wild·
@GPTWare I am close to buying a 4th 3090 if I could just return everything and get my money back I'd just buy this your literally paying around $4500USD which is basically $1k each GPU and $500 in CPU, RAM, PSU, MB like its literally a 1:1 price
English
1
0
1
18
yourfren
yourfren@0xyourfren·
@Tech2Wild Two 3090’s on top of each other just sat on the PSU outside the case got me feeling a type of way
English
1
0
0
71
Tech2Wild
Tech2Wild@Tech2Wild·
TRIPLE 3090s…. Need a new MB and start building a RACK if I decide to go any further lol. Temporary rig for now I gotta get a rack 🤣🤣🤣
Tech2Wild tweet mediaTech2Wild tweet mediaTech2Wild tweet media
English
3
0
18
2K
Tech2Wild
Tech2Wild@Tech2Wild·
One issue with Opus since 4.7 that still hasn't resolved: the agent sometimes NOT using telegram to respond. Goes silent but been blabbing on the terminal. Tell them to always use the plugin to respond back & they still revert to not responding in telegram.
English
0
0
0
18
bgeneto
bgeneto@netobge·
@Tech2Wild Qwen3.6 35B A3B AutoRound fits in a single 24GB GPU with 262K context with fp8 KV cache and runs at 160 tps in a rtx 3090 via vLLM... Produces much better code than Gemma 4 12B. Unfair to compare them.
English
1
0
3
487
Tech2Wild
Tech2Wild@Tech2Wild·
Hmm wondering which is better: Gemma 4 12B or Qwen 3.6 35B-A3? 🤔
English
35
1
50
18.1K
Master Builder
Master Builder@MakerInParadise·
@Tech2Wild I can’t say anything negative about either model other than that 12B’s native vernacular is too informal for my liking… it has grok4.1/deepseek sentence structure and punctuation. Otherwise, I think that 12B is the better chat model and 35B the better reasoner/researcher.
English
2
0
4
1.8K
Tech2Wild
Tech2Wild@Tech2Wild·
@malikwas1f Good call I been running your recipes bro thanks for what you do
English
1
0
1
106
Tech2Wild
Tech2Wild@Tech2Wild·
Got the 3rd GPU setup (3x 3090s) but no TP=3, so I'm running a separate model or cloned 27B on the extra card. Been looking at Gemma 4 12B but honestly wondering if it's worth it when I can already run 27B or 35B at full context... What's your take? 🤔
English
6
0
8
1.6K
Tech2Wild
Tech2Wild@Tech2Wild·
@sakurayukiai I have 35B running now. The issue I’m having is 2GPUs of 27B give me almost identical speeds as 1 GPU on 35B
English
2
0
2
1.9K
Sakura Yuki
Sakura Yuki@sakurayukiai·
@Tech2Wild If you can fit the 35B footprint, Qwen is wild. Only 3B active params means it runs circles around Gemma's 12B dense decode speeds, but Gemma 4 is way friendlier on a single consumer GPU.
English
3
0
16
2.1K
Tech2Wild
Tech2Wild@Tech2Wild·
@gospaceport Sir I literally just watched your video on your Quad Build from 9 months ago 🙏🏽. Debating whether you go to GEN 5 or just grab one of the motherboards you showed and stay Gen 4.
English
0
0
0
33