TNG Technology Consulting GmbH

1.6K posts

@tngtech

TNG, aka "The Nerd Group", is a consulting partnership focused on high-end information technology, particularly AI. 926 employees, 99.9% academics, ~53% PhDs.

Unterföhring, Deutschland · Joined December 2010
174 Following · 2.2K Followers
Pinned Tweet
TNG Technology Consulting GmbH
Today we release DeepSeek-TNG R1T2 Chimera. This new Chimera is a Tri-Mind Assembly-of-Experts model with three parents, namely R1-0528, R1 and V3-0324. R1T2 operates at a sweet spot in intelligence vs. output token length. It appears to be...
* about 20% faster than R1, and more than twice as fast as R1-0528
* significantly more intelligent than R1 in benchmarks such as GPQA Diamond and AIME-24/25, albeit not quite on R1-0528 level
* much more intelligent than our first R1T Chimera, and also think-token consistent, which is a major improvement
We perceive it as generally well-behaved and a nice persona to talk to. The weights are on @huggingface under the MIT licence. We are looking forward to your experiments and feedback!
Thanks to @deepseek_ai for giving their models to the world, to @chutes_ai and @openrouter for hosting R1T, to @WolframRvnwlf for benchmarking it, to @xlr8harder for beta-testing the new Chimera, and to @natolambert for constructive discussions at @aiDotEngineer.
English
21
88
393
127.1K
TNG Technology Consulting GmbH
Thanks for doing interesting and intriguing research! "Wir müssen wissen, wir werden wissen" ("We must know, we will know"; David Hilbert)
Deutsch
0
0
2
99
Yuchen Jin @Yuchenj_UW
GPU shortage is worse than ever. H100s cost more today than they did 3 years ago, and you cannot get them on-demand. The big AI labs have locked up most of the supply for years. I’m worried university researchers and individual developers simply won’t be able to get GPUs.
English
116
105
1.7K
255.8K
0xSero @0xSero
I believe this guy is onto something really interesting and valuable. I will commit $500 for compute if I can get any of the clouds to match me (Lambda, Prime Intellect, Hotaisle). He’s working on continuous learning and has given lots of valuable advice. 🙏
ar0cket1 @ar0cket1

x.com/i/article/2054…

English
7
10
223
26.8K
Ai2 @allen_ai
Today we’re releasing EMO, a new mixture-of-experts (MoE) model trained so modular structure emerges directly from data without human-defined priors. EMO can use a small subset of its experts for a given task while keeping near full-model performance. 🧵
English
13
57
404
85.5K
TNG Technology Consulting GmbH
@tugot17 GB300, yes, supposedly fast delivery until July 1st - but is that true? VR200, yes, supposedly January 27 delivery - if VR does not have the amount of teething troubles that GB200 had
English
1
0
2
146
Lawrence Chan @justanotherlaw
A recent viral paper claims to reverse-engineer the parameter counts of frontier models: GPT-5.5 = 9.7T, Opus 4.7 = 4.0T, o1 = 3.5T, etc. @ben_sturgeon and I investigated and found serious issues in the paper; fixing them gives GPT-5.5 as ~1.5T (90% CI: 256B-8.3T).
English
29
96
956
209.2K
TNG Technology Consulting GmbH
If you know where to buy B300 servers for $0.5 million "in the West", give us a call. The street prices that we get quoted in Europe are already in the range of $0.8-1.0M, both air- and water-cooled, without the rack, the InfiniBand switch, and the CDU. There seems to be very strong B300 demand in the US, the EU, et al., too.
English
0
1
9
338
Andrew Curran @AndrewCurran_
The price of B300 servers in China has risen to the point that it is now double the price in the West. The crackdowns on smuggling appear to be having an effect. Mythos has changed things.
English
8
19
196
9.8K
TNG Technology Consulting GmbH
Experiments using @Zai_org's GLM-5.1 on Hopper and Blackwell, using @opencode with 50k input tokens and 500 output tokens on average. The RTX 6000 Pro x 8 node does not seem to scale well with concurrency, while the B200 x 8 is just very strong. Are these measurement errors or reality? With luck and new kernels, how much could the RTX be improved?
TNG Technology Consulting GmbH tweet media
English
2
0
21
1.8K
0xSero @0xSero
Nvidia, why?
0xSero tweet media
English
3
0
24
3K
TNG Technology Consulting GmbH retweeted
MuniHac @MuniHac
Registration for #MuniHac 2026 is open! munihac.de/2026.html#registration #MuniHac will take place October 9–11, 2026 at the @tngtech office in Munich. Keynote by @TacticalGrace confirmed, more to come. Want to contribute? Check out our CfC munihac.de/2026.html#cfc! See you in Munich!
English
0
5
10
570