
Roni Dover 💔
2.7K posts

Roni Dover 💔
@doppleware
Infovore and board game geek. co-founder of https://t.co/KNnoqs4hDG


מיזם כלא התנינים מתקדם: קציני שב״ס ב״סיור למידה״ בחוות התנינים בחמת גדר


LLMs process text from left to right — each token can only look back at what came before it, never forward. This means that when you write a long prompt with context at the beginning and a question at the end, the model answers the question having "seen" the context, but the context tokens were generated without any awareness of what question was coming. This asymmetry is a basic structural property of how these models work. The paper asks what happens if you just send the prompt twice in a row, so that every part of the input gets a second pass where it can attend to every other part. The answer is that accuracy goes up across seven different benchmarks and seven different models (from the Gemini, ChatGPT, Claude, and DeepSeek series of LLMs), with no increase in the length of the model's output and no meaningful increase in response time — because processing the input is done in parallel by the hardware anyway. There are no new losses to compute, no finetuning, no clever prompt engineering beyond the repetition itself. The gap between this technique and doing nothing is sometimes small, sometimes large (one model went from 21% to 97% on a task involving finding a name in a list). If you are thinking about how to get better results from these models without paying for longer outputs or slower responses, that's a fairly concrete and low-effort finding. Read with AI tutor: chapterpal.com/s/1b15378b/pro… Get the PDF: arxiv.org/pdf/2512.14982




באיזה כלי אתם משתמשים לתמלל שיחות / פגישות בעברית? ועד כמה הוא טוב?

We just made a $200,000,000 AI movie in just one day. Yes, this is 100% AI.























