Monk Zero

1.9K posts

Monk Zero

@NoCommas

@antigma_labs, prev: @awsCloud, @Meta, @Mysten_Labs. A Turing Complete mind, wandering the world of Gödel Incompleteness.

Latent Space Katılım Temmuz 2012

1K Takip Edilen1.4K Takipçiler

Sabitlenmiş Tweet

Monk Zero@NoCommas·27 Şub

The only way we human are able to communicate and understand each other, is that across space and time, we are one and all Inspired by <The Egg> by Andy Weir galactanet.com/oneoff/theegg_…

English

Monk Zero@NoCommas·2d

@ludwigABAP @Yuchenj_UW Yep, actually I think somehow this become of better signal to know candidate has good fundamentals. Public github projects and profiles means a lot less now. In my experience working in most top tech companies , this remains one of top indicators regardless of what people say

English

ludwig@ludwigABAP·3d

@Yuchenj_UW x.com/ludwigabap/sta…

ludwig@ludwigABAP

how is it possible to not be able to invert a binary tree on a whiteboard though? regardless of what you think of tech interviews, this isn’t an insane HFT whiteboard brain melter, it’s literally just inverting a binary tree

QME

10.2K

Yuchen Jin@Yuchenj_UW·3d

I’m so glad AI killed LeetCode interviews. For 10 years, tech companies made every engineer grind the same puzzles and prove they could invert a binary tree from memory. Today, the dumbest AI model can walk in and one-shot the entire interview. Thank you, AI.

English

221

153

2.9K

663.1K

Monk Zero@NoCommas·2d

@mycharmspace 啊才知道你也用twitter😂. 祝贺🎉! Search is indeed my most used grok feature ❤️

中文

Tianyi Zhang@mycharmspace·3d

Today is my last day at xAI. I joined xAI a year ago and had the pleasure of leading the search and factuality post-training team. Over time, we developed so many recipe and engineering co-optimizations, making Grok the best AI for search and real-time agent. I am also particularly proud of working with a small group of talented people delivering the recent iterations of the instant mode of Grok - the one I personally liked and used the most. My thanks to all the friends and teammates for their support and help over the past year. They are among the brightest minds I’ve met in my career. I am sure the team will continue the mission to make better Grok and understand the universe.

English

648

82.9K

Monk Zero@NoCommas·2d

@antirez This is the way 🫡. Next is bidirectional multi-stream.

English

155

antirez@antirez·2d

Now DS4 implements the OpenAI Responses API and attempts to match the IDs in order to continue from the live KV cache without doing the efforts required in the chat completion API code path.

English

166

10.5K

Monk Zero@NoCommas·2d

@badlogicgames @ShopifyDevs x.com/nocommas/statu…

Monk Zero@NoCommas

`auto-research` and `goal` is the same thing with different level of details

QME

Mario Zechner@badlogicgames·4d

looked into /goal in both cofex and claude codr and all i'm seeing are inferior versions of autoresearch. what am i missing? available from the @ShopifyDevs folks as a pi extension. shopify.engineering/autoresearch

English

293

18.3K

Monk Zero@NoCommas·2d

@shafu0x Member of Technical Support

English

shafu@shafu0x·2d

forward deployed engineer just means the guy is not fucking autistic

English

153

7.3K

565.7K

Monk Zero@NoCommas·2d

@jonasgeiping Got a feeling some of Thinky and OpenAI realtime api already does something like this. Great work, this direction feels right

English

Jonas Geiping@jonasgeiping·3d

Finally, we find that models with many internal streams allow us to more easily monitor their thinking, for example concerning evaluation awareness. With many parallel internal streams, it would be my hope that the model continues to subvocalize concerns in side-streams, even if the main CoT/thinking stream is occupied with solving a particular task.

GIF

English

3.7K

Jonas Geiping@jonasgeiping·3d

We’re training models wrong and it’s due to chatGPT. Even the modern coding agents used daily still use message-based exchanges: They send messages to users, to themselves (CoT) and to tools, and receive messages in turn. This bottlenecks even very intelligent agents to a single stream. The models cannot read while writing, cannot act while thinking and cannot think while processing information. In our new paper, see below, we discuss LLMs with parallel streams. We show that multi-stream LLMs can … 🔵Be created by instruction-tuning for the stream format 🔵Simplify user and tool use UX removing many pain points with agents and chat models (such as having to interrupt the model to get a word in) 🔵Multi-Stream LLMs are fast, they can predict+read tokens in all streams in parallel in each forward pass, improving latency 🔵 LLMs with multiple streams have an easier time encoding a separation of concerns, improving security 🔵 LLMs with many internal streams provide a legible form of parallel/cont. reasoning. Even if the main CoT stream is accidentally pressured or too focused on a particular task to voice concerns, other internal streams can subvocalize concerns that would otherwise not be verbalized. Does this sound related to a recent thinky post :) - Yes, but I don’t feel so bad about being outshipped with such a cool report on their side by 23 hours. I’ll link a 2nd thread below with a more direct comparison. I actually think both are complementary in interesting ways.

GIF

English

168

1.4K

150.7K

Monk Zero@NoCommas·2d

@iridescence_dev @ST_Automation @ptr_to_joel @joshmo_dev Yes and we already did. And it is beautiful github.com/AntigmaLabs/an…

English

Iridescence@iridescence_dev·2d

It's worth it, even if the network is the ultimate bottleneck. Users should have high quality software. Taking 0.5 seconds to load and 270-300MB at startup for a TUI is completely unacceptable when their competition, Codex CLI (Rust), can do it in a fraction of the time and half the RAM. Software Engineers have a duty to demand higher quality software that people can actually love using and stop making excuses.

English

Joel 🇦🇺@ptr_to_joel·2d

holy wow they merged it

English

138

189

4.4K

820.1K

Monk Zero@NoCommas·3d

@heyandras Doing gods work

English

Andras Bacsai@heyandras·3d

We made a fake repo with fake bounties, and the bots are applying fake PRs, so we know who is fake, and we can ban them from the Coolify repo. IQ over 1000

English

194

499

10.6K

497.6K

Monk Zero@NoCommas·3d

@badlogicgames @MaksShamihulau Haha I held the same strong opinion against GoLang; the only language I dislike with some passion

English

Mario Zechner@badlogicgames·3d

@MaksShamihulau but then i'd have to use rust, and i dislike rust a ton.

English

871

Mario Zechner@badlogicgames·3d

uhm i sort of disagree :p

Armin Ronacher ⇌@mitsuhiko

Pi wouldn’t make any sense in rust or go. Extensibility is key to it. That leaves ruby, python, js, php for the most part unless you want to ship an interpreter. None of those languages have any benefit over node.

English

221

45.2K

Monk Zero@NoCommas·3d

@badlogicgames @hjanuschka Lua was always the most lovely one. But right now bash is the sweet spot.

English

Mario Zechner@badlogicgames·3d

@hjanuschka i don't want to push lua onto anyone.

English

786

Monk Zero@NoCommas·3d

@mitsuhiko Yep. Rust based agent should focus on resource footprint and reliability; there is room for both.

English

129

Armin Ronacher ⇌@mitsuhiko·3d

English

555

146.9K

Monk Zero@NoCommas·10 May

@satory_ua @ThePrimeagen It has got too good recently since gpt-5.2. Miss the time when it is super easy to spot filter slop rust

English

166

🇺🇦🍉 Geopolitics expert 🍉🇺🇦@satory_ua·10 May

@ThePrimeagen > Rust > Looks inside > Arc::new(Some struct::new(param.clone())) everywhere > CLAUDE.md in the root of the repository

English

ThePrimeagen@ThePrimeagen·10 May

Current meta

Español

1.8K

71.6K

Monk Zero@NoCommas·10 May

@SIGKITTEN @thdxr interesting how different team has very different priorities; building sdk was actually where we started and everything grows from how to best interact with LLM; never occurred to me to use any 3rd party client sdk library

English

107

SIGKITTEN@SIGKITTEN·10 May

@thdxr finally big enough to remove ai-sdk!

English

1.8K

dax@thdxr·10 May

we're working on a library to abstract over all the llm providers there's very few teams that have dealt with the quirks between providers at the scale we have it's written in effect but will also have a vanilla api progress is in the opencode repo under packages/llm

English

111

1.7K

219.3K

Monk Zero@NoCommas·10 May

@badlogicgames @rebelcrayon “Inline this” is probably my most typed words recently. I feed the entire deep module lecture into both and still see this shit from time to time

English

174

Mario Zechner@badlogicgames·9 May

calling it slopex from now on so it can join its sibling slopus.

English

1.5K

121.4K

Monk Zero@NoCommas·8 May

@tenderizzation him and Linus really built the pillars of our current digital world

English

697

tender (mlsys 5/18-21)@tenderizzation·7 May

wow

4.3K

123K

Monk Zero@NoCommas·8 May

@thdxr there is alway an ancient Chinese proverb for these kind of things: 守正出奇

English

dax@thdxr·7 May

i pay attention to: 99%: using our product in dumb/simple ways and will never change behavior 0.01%: aliens who are showing us the distant future ignore the reminder: "pro" users who think they invented some clever workflow every week but get less done than the 99% group

English

873

63K

Monk Zero@NoCommas·8 May

@orzxh97 @badlogicgames honestly the first one reads like AI reply lol

English

142

Kiana@orzxh97·8 May

@badlogicgames Oh I’m not saying to you. I’m agreeing you. Sorry that wasn’t clear.

English

1.6K

Mario Zechner@badlogicgames·8 May

but it's cool that frontier models are now basically regressing. maybe all this madness will come to an end soon.

English

392

34K

Monk Zero@NoCommas·7 May

@antirez Makes sense thanks!

English

465

antirez@antirez·7 May

@NoCommas Unfortunately this si what DeepSeek official API returns, I don't know why they are masked / place-holder-ed. So the test checks that the continuation matches without being able to check the logits values. But it's enough to spot issues given that we test with long contexts.

English

2.7K

antirez@antirez·7 May

Welcome to DS4, a specialized inference engine for DeepSeek v4 Flash. github.com/antirez/ds4 This project would have been impossible without the existence of llama.cpp and GGML and the work of @ggerganov and all the other contributors. Thanks!

English

214

1.5K

190.8K

Monk Zero@NoCommas·7 May

@eshear 🫡 you built something outlived your own fame