antirez

42.1K posts

antirez
@antirez

Reproducible bugs are candies. I like programming too much not to like automatic programming.

Sicily, Italy · Joined May 2007
770 Following · 55.4K Followers

Pinned Tweet
antirez @antirez
My second short story release in English is ready: Tales of Illustrious Computer Scientists: Iola Varga, nun and computer scientist. invece.org/iola.html
antirez retweeted
lcamtuf @lcamtuf
The coreutils Rust rewrite story is pretty funny. Coreutils are tools like rm, mv, mkdir, etc. Unlike binutils, this isn't a fertile ground for memory safety bugs. But, the rewrite was completed, and in the spirit of progress, Canonical decided to switch. 🡇
antirez @antirez
@p_mbanugo GPT started to be good with the 5.2 series but 5.3 for me was a jump.
Peter Mbanugo @p_mbanugo
@antirez Does it matter if it's GPT 5.3 codex or GPT 5.4 or later? I'm curious if there's a minimum version that you think worked very well or was good enough.
antirez retweeted
Alexandru Ică @vg_head
@antirez I am working on a knowledge base full of legislation, and IIUC this is _precisely_ what I would want. Markdown files, where the agent can grep through everything trivially. Thank you for this. I was searching for a solution for a long time.
antirez @antirez
There are projects that I develop without looking at the code, but by looking at and owning the concepts, algorithms, ideas, and product. But not Redis, not yet at least. When, in the future, this becomes possible, server software as it is developed today will be over; there will still be projects, I believe, but developed in a very different way. Programmers will mainly do what Linus has done so far for the kernel.
Vittorio Romeo @supahvee1234
@antirez Closely matches my own experiences with current SOTA AI. Extremely useful collaborator, far from being a replacement for human intelligence and creativity.
antirez @antirez
ARGREP was the *last* command I added to the specification. I realized only very late during development that the Array type was perfect for storing text files :) But I believe it is going to be my main use case in the short run.
antirez @antirez
I believe that the fact that Redis is so well understood by LLMs and people, and that it is remote, together with this new support, will allow the creation of knowledge bases for agents that are not centralized, do not need to live in the filesystem, and are trivial to update / access.
antirez @antirez
One thing to understand about the new Array type of Redis, and the support of ARGREP, is that you can store, in Redis keys, different markdown documents (skills) that are collectively used and updated by a multitude of remote agents.
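The idea above — markdown "skill" documents stored under Redis keys as arrays of lines, grep-able by many remote agents — can be sketched in a few lines of plain Python. ARGREP's actual syntax has not been published, so `argrep` below (and the key name `skill:deploy`) is only an illustration of the concept, not the real command.

```python
import re

# Toy in-memory stand-in for the new Redis Array type: each key holds an
# array of lines (here: one markdown "skill" document per key).
store = {}

def rpush_doc(key, text):
    """Store a markdown document as an array of lines under `key`."""
    store.setdefault(key, []).extend(text.splitlines())

def argrep(key, pattern):
    """Server-side-grep sketch: (index, line) pairs matching `pattern`."""
    rx = re.compile(pattern)
    return [(i, line) for i, line in enumerate(store.get(key, []))
            if rx.search(line)]

rpush_doc("skill:deploy",
          "# Deploy\nRun `make deploy` to ship.\nRollback with `make rollback`.")
print(argrep("skill:deploy", r"[Rr]ollback"))
# → [(2, 'Rollback with `make rollback`.')]
```

The point is that the grep happens where the data lives, so any agent with a Redis connection can search and update the shared skills without a filesystem.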
Gian Giovani @g_giovanii
@antirez Is this the new data type that was teased many times before?
antirez @antirez
@pupposandro Basically DeepSeek v4 Lightning Indexer but as a component of existing models (even if with limitations compared to the DS4 architecture of course). Interesting idea.
Sandro @pupposandro
We just released something new: Luce PFlash.

Long-context prefill is a silent killer for throughput. llama.cpp takes ~257 seconds to prefill 128K tokens of Qwen3.6-27B on a single RTX 3090. So we tried to solve the problem.

A small Qwen3-0.6B drafter loads in-process, scores token importance across the whole prompt, and the heavy 27B target only prefills the spans that matter. 128K prompt in 24.8 seconds, ~10.4x faster TTFT, NIAH retrieval preserved at every measured context.

It is a clean C++/CUDA port of FlashPrefill wired through Block-Sparse Attention, with a custom Qwen3-0.6B BF16 forward so drafter and target share one ggml allocator. The whole thing is a single daemon command (compress) in front of the existing dflash spec-decode stack.

More details here: github.com/Luce-Org/luceb…
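The selective-prefill idea above boils down to: score every prompt token with a cheap drafter, cut the prompt into blocks, and have the expensive target prefill only the blocks that score well. A minimal sketch follows; the function name, block size, and keep-ratio rule are invented for illustration (the real system uses a Qwen3-0.6B drafter and block-sparse attention kernels, not this toy scoring).

```python
# Toy block selection for selective prefill: keep the top fraction of
# fixed-size blocks ranked by mean per-token importance score.

def select_blocks(scores, block=4, keep_ratio=0.5):
    """Return sorted indices of the top `keep_ratio` blocks by mean score."""
    blocks = [scores[i:i + block] for i in range(0, len(scores), block)]
    means = [sum(b) / len(b) for b in blocks]
    k = max(1, int(len(blocks) * keep_ratio))
    ranked = sorted(range(len(blocks)), key=lambda i: means[i], reverse=True)
    return sorted(ranked[:k])

# 16 tokens; the high-importance "needle" sits in tokens 8-11 (block 2).
scores = [0.1] * 8 + [0.9] * 4 + [0.1] * 4
print(select_blocks(scores))  # → [0, 2]
```

Block 2 survives because it holds the needle; block 0 wins the tie for the second slot since the sort is stable. The target model would then attend/prefill only those spans, which is where the ~10x TTFT win comes from.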
antirez @antirez
What I was able to achieve today was a *much* better slope in the prefill rate, which stays at 200 t/s even with very long contexts. This already makes the game a fairer one, since it does not degrade noticeably as you continue to work. On the M3 Ultra it is much faster, btw; I tested there too: 2x speed in prefill.
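A quick back-of-the-envelope shows why a flat prefill slope matters: at a constant 200 t/s, time grows linearly with context, while a rate that decays with context length blows up. The decay model below is invented purely for illustration, not measured on any machine.

```python
# Prefill time under a constant vs. a context-dependent rate.

def prefill_time(tokens, rate_fn, step=1024):
    """Total seconds to prefill `tokens`, with rate sampled per block."""
    t, done = 0.0, 0
    while done < tokens:
        n = min(step, tokens - done)
        t += n / rate_fn(done)  # rate depends on how much context exists
        done += n
    return t

flat = lambda ctx: 200.0                         # constant 200 t/s
decaying = lambda ctx: 200.0 / (1 + ctx / 32768)  # hypothetical slowdown

print(round(prefill_time(131072, flat)))      # 131072 / 200 ≈ 655 s
print(round(prefill_time(131072, decaying)))  # roughly 3x worse
```

With the flat rate, a 128K prompt is a fixed, predictable cost; with the decaying rate, every additional chunk of context makes the next prefill slower, which is exactly the "degrades as you continue to work" problem.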
Mario Zechner @badlogicgames
@antirez Wonder what magic could be done to improve the prefill rate. I think that's mostly what's holding things back at the moment.
antirez @antirez
DeepSeek v4 small KV cache + MacBook fast SSD disks = the idea that the disk is not a good target for KV cache is, in this context, totally obsolete. It works *great*. The session you see is opencode using my inference engine for DS4, saving and loading sessions from disk.
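The disk-as-KV-cache idea above can be sketched with a simple session file: dump each layer's flattened key/value buffer to disk, and read it back on resume so a long context never has to be re-prefilled. Everything here is illustrative (file layout, names); a real engine would write quantized blocks and likely mmap them rather than copy.

```python
# Minimal KV-cache session persistence: one length-prefixed float32
# buffer per layer, written with the stdlib only.
import os
import struct
import tempfile
from array import array

def save_kv(path, layers):
    """layers: list of float lists (one flattened K/V buffer per layer)."""
    with open(path, "wb") as f:
        f.write(struct.pack("<I", len(layers)))
        for buf in layers:
            f.write(struct.pack("<I", len(buf)))
            array("f", buf).tofile(f)  # raw float32, no re-prefill needed

def load_kv(path):
    """Read the session back into per-layer float lists."""
    with open(path, "rb") as f:
        (n,) = struct.unpack("<I", f.read(4))
        layers = []
        for _ in range(n):
            (m,) = struct.unpack("<I", f.read(4))
            buf = array("f")
            buf.fromfile(f, m)
            layers.append(list(buf))
        return layers

path = os.path.join(tempfile.mkdtemp(), "session.kv")
save_kv(path, [[0.0, 1.5], [2.5]])
print(load_kv(path))  # → [[0.0, 1.5], [2.5]]
```

The economics in the tweet follow from this: a small KV cache (DeepSeek-style) keeps the file modest, and a fast NVMe makes the load cost trivial compared to redoing the prefill.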
antirez @antirez
@lucastech 128 GB, with space for generous contexts. 2-bit asymmetric quantization, where shared experts, routing, and projections are kept at full quality.
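The quantization scheme mentioned above — 2-bit asymmetric for routed experts, full precision for sensitive tensors — can be illustrated with a per-block quantizer: each weight block gets its own scale and zero point, and values snap to one of 4 levels. The block size and rounding choices below are illustrative, not the actual format used by the engine.

```python
# Toy 2-bit asymmetric quantization: 4 levels per block, with a per-block
# scale and minimum ("zero point"). Sensitive tensors (shared experts,
# routing, projections) would simply skip this path and stay full precision.

def quantize_block(w):
    lo, hi = min(w), max(w)
    scale = (hi - lo) / 3 or 1.0  # 4 levels -> 3 steps; avoid div-by-zero
    q = [min(3, max(0, round((x - lo) / scale))) for x in w]
    return q, scale, lo           # 2-bit codes + per-block parameters

def dequantize_block(q, scale, lo):
    return [lo + c * scale for c in q]

w = [-0.9, -0.3, 0.1, 0.8]
q, s, z = quantize_block(w)
print(q)  # → [0, 1, 2, 3]: each weight mapped to a 2-bit code
print([round(x, 3) for x in dequantize_block(q, s, z)])
```

"Asymmetric" means the grid is anchored at the block minimum rather than centered at zero, so a block of all-positive weights does not waste levels on values it never uses; keeping routing and projections at full quality limits the accuracy damage from the aggressive 2-bit experts.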
Lucas Tech @lucastech
@antirez Apple NVMe drives are dramatically faster than most SSDs, which makes disk access much more tenable than it would be on most other systems. How much RAM do you need for that to run locally, though?
antirez @antirez
@thought_sync Yep I'll make all of it MIT licensed. It will take some time as I believe in the AI space we see too many rushed things, so I want to make sure it works well before releasing it.
Vyacheslav @thought_sync
@antirez Is it possible to try out your inference engine!?
antirez @antirez
@danveloper Yep consider that this is an M3 Max not an M3 Ultra, in the Ultra I get 2x prefill speed, and the same speed with 4 bit quants instead of 2 bit (only for routed experts, all the other weights are as released by @deepseek_ai).
Dan Woods @danveloper
@antirez There are so many caches and hardware accelerators in the Apple Fabric, I'm sure you could make that even faster if you wanted to cut the ~6s off prefill. But, completely usable anyway!