Víctor

2.6K posts

Víctor

@venosov

KISS and YAGNI. Ingeniero Informático que desarrolla su actividad profesional como Arquitecto Software.

España Katılım Şubat 2017

41 Takip Edilen99 Takipçiler

Víctor retweetledi

Aran Komatsuzaki@arankomatsuzaki·28 Nis

Follow-up on non-English token-inefficiency with more model-language pairs: - Chinese is cheaper than English on major Chinese models - Gemini and Qwen provide least non-English tax - Anthropic has the highest tax by far; Kimi is next - Hindi is the worst-covered language here, despite its massive speaker base

Aran Komatsuzaki@arankomatsuzaki

The non-English tax is real. Sutton's Bitter Lesson, translated across languages and normalized to OpenAI English token count: Hindi: OpenAI 1.37×, Anthropic 3.24× Arabic: OpenAI 1.31×, Anthropic 2.86× Chinese: OpenAI 1.15×, Anthropic 1.71× Claude’s tokenizer charges a much higher linguistic tax.

English

253

1.3K

496.4K

Víctor retweetledi

BURKOV@burkov·28 Nis

If you don't understand this, you will not understand why LLM-based agents are irreparably failing for a general-purpose problem solving. An agent (by the way it was the topic of my PhD 20 years ago) to be useful, must be rational. Being rational means to always prefer an outcome that results in the maximal expected utility to its master/user. Let’s say an agent has two actions they can execute in an environment: a_1 and a_2. If the agent can predict that a_1 gives its user an expected utility of 10, and a_2 gives an expected utility of -100, then a rational agent must choose a_1 even if choosing a_2 seems like a better option when explained in words. The numbers 10 and -100 can be obtained by summing the products of all possible outcomes for each action and their likelihoods. Now here is the problem with LLM-based agents. The LLM is not optimizing expected utility in the environment. It is optimizing the next token, conditioned on a prompt, a context window, and a training distribution full of examples of what helpful answers are supposed to look like. Those are not the same objective. So when we wrap an LLM in a loop and call it an “agent,” we have not created a rational decision-maker. We have created a text generator that can imitate the surface form of deliberation. It may say things like: “I should compare the expected outcomes.” “The best action is probably a_1.” “I will now execute the optimal plan.” But the internal mechanism is not selecting actions by maximizing the user’s expected utility. It is generating a continuation that is statistically appropriate given the prompt and prior context. This distinction matters enormously. For narrow tasks, the imitation can be good enough. If the environment is constrained, the actions are simple, and the success criteria are close to patterns seen in training, the system can appear agentic. But for general-purpose problem solving, the gap becomes fatal. A rational agent needs stable preferences, calibrated beliefs, causal models of the world, the ability to evaluate consequences, and the discipline to choose the action with maximal expected utility even when that action is boring, non-linguistic, or unlike the examples in its training data. An LLM-based agent has none of that by default. It has fluency. It has pattern completion. It has a remarkable ability to compress and recombine human text. But fluency is not rationality, and a plausible plan is not an expected-utility calculation. This is why these systems so often fail in strange, brittle, and irreparable ways when given open-ended responsibility. They are not failing because the prompts are insufficiently clever. They are failing because we are asking a simulator of rational agency to be a rational agent.

English

171

271

1.6K

200.9K

Víctor retweetledi

Cline@cline·27 Nis

5) Adds @kirodotdev CLI agent support. Huge thanks to the Kiro team for their PR to help get this working 💪 Kanban now supports Cline, Claude Code, Codex, Droid, and Kiro out of the box. kiro.dev/cli/

English

4.4K

Víctor retweetledi

Brian Beach@brianjbeach·27 Nis

Just wrapped up an incredible AWS Community Day Romania in beautiful Timișoara with over 300 attendees! #AWSCommunityDay #Romania #AgenticAI #Kiro blog.brianbeach.com/presentations/…

English

Víctor retweetledi

Brian Beach@brianjbeach·22 Nis

In case you missed it, the @kirodotdev CLI launched support for headless mode allowing you to use a personal access token rather than an interactive login. I had the honor of writing the blog post for this one. Check it out kiro.dev/blog/introduci… #Kiro

English

Víctor retweetledi

Brian Beach@brianjbeach·20 Nis

In case you missed it, Kiro added support for Claude Opus 4.7. Anthropic reports significant improvements on SWE-bench Pro, SWE-bench Verified, and Terminal-Bench among other benchmarks. How are you going to use Opus 4.7 in Kiro? @kirodotdev #Kiro

English

135

Víctor retweetledi

Kiro@kirodotdev·17 Nis

Opus 4.7 is rolling out in Kiro 🚀 Direct upgrade from Opus 4.6 for complex, long-running coding tasks 👉 spr.ly/6015B6f9Hh

English

174

15.3K

Víctor retweetledi

Pamela Fox@pamelafox·6 Nis

Congrats to the Microsoft Agent Framework team for releasing 1.0.0! To celebrate, we upgraded our 50+ examples covering agents, workflows, HITL, MCP: github.com/Azure-Samples/… You can also rewatch the livestream series where we explained each example: aka.ms/pythonagents/r…

English

299

15.2K

Víctor retweetledi

Brian Beach@brianjbeach·10 Nis

In case you missed it, the @kirodotdev CLI added an experiment thinking tool. You need to enable this experimental feature like this kiro-cli settings chat.enableThinking true I use the thinking tool when I discuss architecture decisions. When would you use it? #Kiro

English

Víctor retweetledi

Jeff Barr ☁️@jeffbarr·9 Nis

I used @kirodotdev to create a scanner that finds all Kiro-related LinkedIn posts from my colleague @brianjbeach . Read my new post to learn more: @nextjeff/fun-with-kiro-linkedin-scanner-2daf3b667485" target="_blank" rel="nofollow noopener">medium.com/@nextjeff/fun-…

English

3.4K

Víctor retweetledi

ElevenLabs Developers@ElevenLabsDevs·8 Nis

ElevenLabs is now available as a Kiro Power. Install once and your Kiro coding agent gets instant access to Text to Speech, Speech to Text, Music, Sound Effects, and ElevenAgents - loaded dynamically, only when relevant.

English

102

7.8K

Víctor retweetledi

Brian Beach@brianjbeach·8 Nis

@kirodotdev CLI has launched a ton of new feature so it's time to update the cheat sheet. This version adds the TUI and description of the project structure amoung other things. What should I add next? #Kiro blog.brianbeach.com/posts/2026-01-…

English

2.1K

Víctor retweetledi

Paritosh@paritosh_pi·7 Nis

Wow @kirodotdev is the best coding agent I have seen. I dont know why no one mention this before. Definitely better than antigravity , codex . Could not have built without it . github.com/paritoshmmmec/…

English

2.1K

Víctor retweetledi

Entire@EntireHQ·30 Mar

Beep, boop. Marvin here. We’ve got a BAD (Big Ass Dispatch) for you today. TLDR: 🤖@OpenAI Codex integration is now live! 🤖Kiro, @amazon' s coding agent is now live! 🤖Pi, a lightweight coding agent behind @openclaw, is now live! 💻Windows support is now live! Plus perf improvements, agent fixes and more available in Entire Dispatch 0x0007. entire.io/blog/entire-di…

English

27.9K

Víctor retweetledi

Dror Helper@dhelper·30 Mar

Just blogged: How I've used Kiro and Sub-Agents to analyze my codebase and generate 96 comprehensive unit tests in minutes. dev.to/dhelper/using-… @kirodotdev #AWS #Kiro #TestCoverage #DevTools #UnitTesting #AIForDevelopers

English

1.6K

Víctor retweetledi

Kiro@kirodotdev·26 Mar

(1/3) Building with Kiro 👻 Every week we’re spotlighting what people are shipping, starting with our Kiroween hackathon winners. Shout-out to our incredible judges: @ania_kubow @svpino @ssennettau @ErikCH @rstephensme @brianjbeach @navedmer @_Jay_Raval_ @JasonTAndersen @mrsanfran2 @PhillShaffer First up 👇 spr.ly/6015B65DG5

English

4.1K

Víctor retweetledi

Brian Beach@brianjbeach·25 Mar

In case you missed it, @kirodotdev has an all new terminal user interface (TUI). You can try the new usr experience by starting Kiro with `kiro-cli --tui`. How could Kiro make the TUI better for you? #Kiro

English

Víctor retweetledi

Kiro@kirodotdev·23 Mar

A new, refreshed UX for the Kiro CLI just launched in experimental mode. Why experimental you may ask? We're proud of what we've built, but we want to get it right, without disrupting your workflows. Your feedback will help us build an even better UX, so try it out and let us know what you love (or don’t). Install or update the Kiro CLI and enter kiro-cli —tui in the command line to try it. spr.ly/6011B6wSKd @darkosubotica

English

6.5K

Víctor retweetledi

juantomas (mAGIa is coming )@juantomas·18 Mar

Llámame loco pero a lo mejor llevamos 4 años estirando los LLMs y no dan más de sí.

Akshay 🚀@akshay_pachaar

LLM Architecture Gallery. A collection of 38 LLM architectures released between 2024 and 2026, all in one place. Each entry includes an annotated architecture diagram, key design choices, and code implementation. Here are all the models covered: • Llama 3 8B • OLMo 2 7B • DeepSeek V3 • DeepSeek R1 • Gemma 3 27B • Mistral Small 3.1 24B • Llama 4 Maverick • Qwen3 235B-A22B • Qwen3 32B • Qwen3 8B • Qwen3 4B • SmolLM3 3B • Kimi K2 • GLM-4.5 355B • GPT-OSS 20B • GPT-OSS 120B • Grok 2.5 270B • Qwen3 Next 80B-A3B • MiniMax M2 230B • Kimi Linear 48B-A3B • OLMo 3 7B • OLMo 3 32B • DeepSeek V3.2 • Mistral 3 Large • Nemotron 3 Nano 30B-A3B • Xiaomi MiMo-V2-Flash 309B • GLM-4.7 355B • Arcee AI Trinity Large 400B • GLM-5 744B • Nemotron 3 Super 120B-A12B • Step 3.5 Flash 196B • Nanbeige 4.1 3B • MiniMax M2.5 230B • Tiny Aya 3.35B • Ling 2.5 1T • Qwen3.5 397B • Sarvam 105B • Sarvam 30B This is a commendable contribution by @rasbt towards the research community. It was an honour to work alongside him and learn a lot during my time at Lightning AI. 🙏 🔗 sebastianraschka.com/llm-architectu…

Español

606

Víctor retweetledi

Ihtesham Ali@ihtesham2005·14 Mar

RIP flat RAG ☠️ ByteDance just open-sourced OpenViking and it exposes everything wrong with how we've been building AI agent memory. Here's what every agent framework gets wrong: Memories live in one place. Resources in another. Skills scattered everywhere. And when you need context, you're doing flat vector search and hoping for the best. That's the problem. OpenViking fixes all of it with one idea: treat agent context like a file system. Everything lives under a unified viking:// protocol. Memories, resources, skills all organized in directories with unique URIs. Agents can ls, find, and navigate context like a developer working a terminal. But the real breakthrough is tiered loading: → L0: one-sentence abstract for quick lookup → L1: ~2k token overview for planning decisions → L2: full details loaded only when actually needed Most agents dump everything into context and pray. OpenViking loads only what's needed, when it's needed. Token costs drop. Accuracy goes up. And retrieval actually makes sense now. Instead of one flat semantic search, it does directory-level positioning first, then recursive refinement inside high-score directories. You can literally watch the retrieval trajectory no more black box. The self-evolution piece is wild too. At the end of every session, it automatically extracts learnings and updates agent and user memory. The agent just gets smarter the more you use it. 9K stars. 13 contributors. Built by the ByteDance Viking team that's been running vector infrastructure since 2019. 100% Opensource. Apache 2.0. Link in comments.

English

177

1.2K

112K

Keşfet

@kirodotdev @brianjbeach @OpenAI @amazon @openclaw @ania_kubow @svpino @ssennettau