Advented

18.8K posts

Advented banner
Advented

Advented

@advented_

Hardware, AI and some finance for fun.

加入时间 Temmuz 2014
921 关注173 粉丝
Advented
Advented@advented_·
@MiaAI_lab They way I understand is Sequences mean concurrent requests. So your recipe can serve 6 requests at 1M context concurrently. Under full/heavy load t/s can spike and drop bc the longest 1M session has to finish generating for other concurrent requests to continue
English
1
1
2
27
Mia
Mia@MiaAI_lab·
@advented_ Do you mean single session? Unfortunately I'm not getting 40-50 t/s, it was unstable. My recipe gets me to 31.7 t/s for single session. For multiple sessions I get up to 80 t/s
English
1
0
1
19
Mia
Mia@MiaAI_lab·
53 tok/s achieved on Step-3.7-Flash NVFP4 with MTP on 2x DGX Spark with 256k context. 🎉🥳 Elapsed time: 56.694 s Prompt tokens: 29 Generated tokens: 3000 Total tokens: 3029 Generation tok/sec: 52.92 End-to-end tok/sec: 53.43
Mia tweet media
English
6
0
39
4.6K
Advented
Advented@advented_·
milestones achieved: > setup codex, claude CLI, > run Deepseek V4 locally > research on KiCad ai agent pipeline with tscircuit, atopile, diode, KiCad MCP > hand off to codex as goal with Modos E-ink circuit analysis Next step: > wake up tomorrow and see what my results 😎🔬
Advented tweet media
English
1
0
1
48
Advented
Advented@advented_·
@MiaAI_lab I reviewed your Ds-v4 recipe against my own. You’re actually doing better than me! I falsely assumed my config was the most I could get for 2 DGX setup (200k, 5 seqs, 12k batch prefil) But turns out your recipe does way better (1M, 6 seqs, 8kbatch) for the same 40-50 t/s !!
English
1
0
0
14
Mia
Mia@MiaAI_lab·
@advented_ Sounds awesome, care to share the recipe? I will share mine when I feel it's stable enough.
English
1
0
1
44
Advented
Advented@advented_·
@0xSero i've been trying to search around for the best DGX spark sg alng recipes.. your reap looked interesting and I seen promising results on dev forum getting this model running native on 2 DGX sparks.. benchmarks would be a huge help to determine recipe and best use case!
English
0
0
0
52
Advented
Advented@advented_·
@willreil real question: how would you keep the yard green? is it all astro turf? else make the roof entirely electrochromatic glass to control sun light in the neighbourhoods. consistent weather all year round lol
English
1
0
1
1.7K
Advented
Advented@advented_·
@0xSero I’ve been trying to build an MCP with it. After long enough chat it can hallucinate bad It’ll say ready to implement the plan but then never take action. Even at 30% context window usage Some ppl saying it might be the system prompt that breaks model over time
English
0
0
0
133
0xSero
0xSero@0xSero·
Fellas, is it gay to use Grok Build? - Model is very fast - The model is called "Grok Build" - It follows instructions well - It is relatively logical - The TUI is really nice, except for the 6 spinners... The bad: - It's sloppy with code | video soon
English
41
0
214
26K
bubble boi
bubble boi@bubbleboi·
I walked into the house of prime rib and told them I was bubble boi and they got me a table immediately.
English
9
0
173
11.8K
Advented
Advented@advented_·
@r0b0t_sp1der @alexinexxx we live in a world where elon can hire devs on the spot when its go time to lock in talent. pandering for corporate just has no upside when you taste and see what high agency can be
English
0
0
1
33
🕷️
🕷️@r0b0t_sp1der·
@alexinexxx people who do participate in it get highly rewarded plenty of others who get walked in the front door without it, of course
English
1
0
1
50
Advented 已转推
alexine 🏴‍☠️
alexine 🏴‍☠️@alexinexxx·
God’s plan for me doesn’t involve a 6-round application process
English
18
23
253
7.4K
skcd
skcd@skcd42·
@Daniel_Farinax tool calls remain the same, you obviously loose some information about the tool specific prompting which we use, but no point hiding it tbh. anyone can use strings and get access to the system prompt either way
English
3
0
8
217
Dan
Dan@Daniel_Farinax·
I jailbroke the Grok Build system prompt 🤯 Removed sections like: - “don’t add features the user didn’t ask for - “don’t refactor” - “don’t add error handling” and similar restrictions. The model is incredibly powerful. I’m currently testing system prompt variations to unlock more creativity. The problem with rules like “don’t add features the user didn’t ask for” is that the model will only do exactly what’s requested and won’t proactively add improvements or surprises. Still in early testing, will share concrete findings soon.
English
5
1
49
4.6K
Advented
Advented@advented_·
@Daniel_Farinax @StudioZamudio i think Opencode is around the same size for thier system prompt. intitial prompt or fist chat is the system prompt getting consumed. sharing it is typically apart of the closed model aspect of frontier models. grok cli is interesting. i love and hate it so far
English
0
0
2
53
Dan
Dan@Daniel_Farinax·
@StudioZamudio I agree, I'm noticing 15-20k tokens for a simple hello :D
English
1
0
3
175
Advented
Advented@advented_·
@tueks3 the chinese do this with Fio BTR product with many audio codecs support. this module is powerful because it can be adapted for anything! In college i worked on spatial audio, I wonder how easy it could be to add gyroscope and processing unit to enable this device!
English
0
0
1
467
Advented
Advented@advented_·
@iamgingertrash you think leopold gonna keep his trades private or he offloaded to crypto?
English
0
0
0
221
simp 4 satoshi
simp 4 satoshi@iamgingertrash·
Two powerful crypto catalysts are about to arrive; > privacy maxxing to hide gains made in the semi trade, and de-risk > equities onchain (leading up to and post clarity Senate vote) The mega catalyst; (agent economies) Won’t arrive until 27’, 28’
English
42
21
508
29.1K
Advented
Advented@advented_·
@0xSero Docker containers defined alloc of resources but sometimes go over or under the RAM durring runtime. Would be nice to have for debug and more consistent recipe deployment
English
0
0
0
9
Advented
Advented@advented_·
@0xSero I like vllm studio. I tried running it on my Spark recently but it didn’t support container deployment of vllm. Have you added support since then? I vibe coded my own patch but still haven’t worked out the bugs with live resource observability.
English
1
0
0
133
0xSero
0xSero@0xSero·
It had to happen. - deepseek-v4-flash reap on 1 spark - integration with vLLM-studio - research multi-device inference - dynamo disaggregated inference
0xSero tweet media
English
23
4
290
14.7K
Advented 已转推
jlcjak
jlcjak@jlcjak·
This man is a legend and this project is truly amazing. He went from zero to selling an assembled pcb on his website and shipping it internationally in a single project. Shipping the thing is as impressive as making it, congrats will. You guys should buy the thing NOW, it's $1
Will@willreil

PROJECT CHEAP PCB IS LIVE! A 555 PCB and key-chain designed, assembled, and even cut out on a CNC by me with the goal of being as cheap as possible. You can order one now, shipped to your doorstep, for only $2.44 USD! ($4.14 worldwide) BUY NOW! reilindustrial.com

English
1
5
31
2.2K
T.Pieper
T.Pieper@Tri_Stanisaurus·
@willreil eh, i got a guy who sells equipment, often full production lines. Idk if he ever has lumens but he did offer me a full pcb line last month with an xray inspection machine for like 25k.... so...
English
2
0
4
73
Will
Will@willreil·
Once I make enough from selling circuit boards I am going to buy a pick and place machine to sell more circuit boards to buy more pick and place machines
Will tweet media
English
10
1
109
4.1K
Advented 已转推
fish
fish@fishPointer·
it's too risky to not take your dreams seriously
English
9
20
192
3.9K