Maximilian Bode

70 posts

@mxpbode

Associate Partner @tngtech

Munich, Bavaria · Joined June 2018
148 Following · 87 Followers
Maximilian Bode retweeted
TNG Technology Consulting GmbH
G1PO, our @UnitreeRobotics humanoid robot, was in a playful showrace against the Munich @Motorworld_de's #GT3RS. Of course, he still got helping hands from his human friends to steer him and thus the kart. @Porsche cars and TNG managers narrowly escaped, and the oil drums got only slightly bumped (yt link in reply ;-).
Maximilian Bode retweeted
TNG Technology Consulting GmbH
News from the Aider discord regarding DeepSeek-TNG R1T2 Chimera's performance in the Aider Polyglot benchmark, courtesy of benchmark wizard neolithic5452 and the magic @UnslothAI quantizations:
- 2 bit UD-IQ2_M: 60.0%
- 4 bit Q4_K_XL: 62.7%
- 8 bit: 64.4%
This seems to be the second-highest open-weights result, after @deepseek_ai's R1-0528, which scored 71.4%. It appears to be ahead of Kimi K2 (59.1%) and also @Alibaba_Qwen's Coder-480B, which scored 60.9% in the 4 bit UD-Q4_K_XL version. On @openrouter, R1T2 is currently the 11th most popular model for Aider. Across all applications, R1T2 processed 3.12B tokens yesterday on OR. On @chutes_ai, as of today it is the tenth most popular model.
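To put those quantization levels in context, here is a minimal sketch of the rough weight-file size a 671B-parameter model implies at each bit width. The function name is mine, and the formula is a lower bound that ignores per-block quantization scales and file metadata:

```python
def quantized_size_gib(n_params, bits_per_param):
    # Approximate weight-file size for a quantized model;
    # ignores per-block quantization scales and file metadata.
    return n_params * bits_per_param / 8 / 2**30

# A 671B-parameter model at different bit widths (rough lower bounds):
for bits in (2, 4, 8):
    print(f"{bits} bit: ~{quantized_size_gib(671e9, bits):.0f} GiB")
```

This is why the 2-bit UD-IQ2_M result matters: it shrinks the weights by a factor of four versus 8 bit while, per the numbers above, giving up only a few benchmark points.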
Maximilian Bode retweeted
TNG Technology Consulting GmbH
DeepSeek-TNG-R1T2-Chimera is currently the #1 trending model on @openrouter, the platform which @karpathy called the "transfer switch of AI". Our Assembly-of-Experts method pushes the Pareto frontier between model intelligence and inference cost. Thanks again to the Open Weights community: @deepseek_ai, @huggingface, @AIatMeta, @openrouter, @chutes_ai, @UnslothAI, and @jon_durbin, @ping_toven, @xlr8harder, @reach_vb, @alexatallah, to name a few.
Deedy @deedydas

Karpathy at YC startup school calls this the transfer switch of AI. It's hard to keep up with all the LLMs out there. OpenRouter is the go-to place for 2.5M developers to choose from 400+ models with one API. They're serving 100T tokens/yr! Excited to back this special company!

Maximilian Bode retweeted
TNG Technology Consulting GmbH
Today we release DeepSeek-TNG R1T2 Chimera. This new Chimera is a Tri-Mind Assembly-of-Experts model with three parents, namely R1-0528, R1 and V3-0324. R1T2 operates at a sweet spot in intelligence vs. output token length. It appears to be...
* about 20% faster than R1, and more than twice as fast as R1-0528
* significantly more intelligent than R1 in benchmarks such as GPQA Diamond and AIME-24/25, albeit not quite on R1-0528 level
* much more intelligent than our first R1T Chimera, and also think-token consistent, which is a major improvement
We perceive it as generally well-behaved and a nice persona to talk to. The weights are on @huggingface under the MIT licence. We are looking forward to your experiments and feedback! Thanks to @deepseek_ai for giving their models to the world, to @chutes_ai and @openrouter for hosting R1T, to @WolframRvnwlf for benchmarking it, to @xlr8harder for beta-testing the new Chimera, and to @natolambert for constructive discussions at @aiDotEngineer.
Maximilian Bode retweeted
TNG Technology Consulting GmbH
We are posting our new paper "Assembly of Experts: Linear-time construction of the Chimera LLM variants with emergent and adaptable behaviors" on @huggingface while waiting for @arxiv. We explain how we constructed the 671B R1T Chimera child model from the great @deepseek_ai V3-0324 and R1 parent models (thank you!) in less than one hour of CPU time. The Chimera research prototype is currently the 4th most popular LLM on @chutes_ai with about 3.5B tokens/day, 0.9-1.0B of which flow through @openrouter. Over 160B tokens have been processed since release on April 26th.
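The "linear-time construction" claim can be illustrated with a toy sketch of weight-space merging: each child tensor is built as a direct interpolation of the corresponding parent tensors, so cost scales linearly with model size and needs no training. This is my own simplified illustration, not TNG's actual recipe; the tensor names, merge ratio, and filter are all hypothetical:

```python
# Toy sketch of weight-space model merging: a "child" state dict is
# built by linearly interpolating two parent state dicts tensor by
# tensor, so construction cost is linear in the number of parameters.
def assemble(parent_a, parent_b, ratio=0.5, merge_filter=None):
    """Interpolate two state dicts (name -> list of floats).
    Tensors whose names fail merge_filter are copied from parent_a."""
    child = {}
    for name, ta in parent_a.items():
        tb = parent_b[name]
        if merge_filter is None or merge_filter(name):
            # Elementwise interpolation: (1 - ratio) * a + ratio * b
            child[name] = [(1 - ratio) * a + ratio * b for a, b in zip(ta, tb)]
        else:
            child[name] = list(ta)
    return child

# Hypothetical example: merge only "expert" tensors, keep the rest
# from parent_a (illustrative names, not the real DeepSeek layout).
v3 = {"expert.w": [0.0, 2.0], "embed.w": [1.0, 1.0]}
r1 = {"expert.w": [2.0, 4.0], "embed.w": [9.0, 9.0]}
chimera = assemble(v3, r1, ratio=0.5, merge_filter=lambda n: "expert" in n)
```

The filter is the interesting design lever: selectively merging only some tensor groups is what lets different "Chimera" variants trade off behaviors inherited from each parent.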
Maximilian Bode retweeted
TNG Technology Consulting GmbH
Assembly of Experts: Our linear-time 671B Chimera LLM construction paper should soon appear on arxiv.org. We are at the @aiDotEngineer fair in SFO until tomorrow, so for a chat -> DM ;-)
Maximilian Bode retweeted
TNG Technology Consulting GmbH
More evidence for the effectiveness of the Chimera construction method: Taking DeepSeek's R1-0528 release, we started benchmarking new Chimera variants on AIME-24 and SimpleQA.
R1-0528 significantly improves AIME performance from 79.8 to 91.4 while doubling the amount of output tokens compared to R1. It appears to be a great model overall.
Our Chimera variants of R1-0528 and V3-0324 seem to improve math performance too, with results up to 83.3. Positive aspect: more compact reasoning, with an output-token count below that of R1. This could be an interesting trade-off for real-world applications. On SimpleQA, R1T-0528-Chimera is close to the V3-0324 results and seems to fare better than R1-0528.
Caveat: these numbers are just a preliminary indicator, since they stem from single benchmark runs and are subject to statistical fluctuation. We'll continue searching for variants with interesting features and beneficial behavior combinations.
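The caveat about single-run fluctuation can be quantified with a standard binomial error bar on a pass rate. This is a generic sketch of mine, assuming a small, 30-problem AIME-24-style benchmark set; it is not a statement about TNG's actual evaluation setup:

```python
import math

def single_run_stderr(accuracy, n_items):
    # Binomial standard error of a pass rate measured from a single
    # run over n_items independent problems.
    return math.sqrt(accuracy * (1 - accuracy) / n_items)

# Assuming a 30-problem AIME-24-style set at 83.3% accuracy:
# stderr ≈ 0.068, i.e. roughly ±7 percentage points per run.
err = single_run_stderr(0.833, 30)
```

At that error bar, single-run scores like 79.8 vs. 83.3 overlap substantially, which is exactly why the tweet calls these numbers a preliminary indicator.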