Nicolas Menoncin dos Santos

498 posts

Nicolas Menoncin dos Santos

@DosMenoncin

Shipping 10x faster with vibe coding. Testing Cursor, Claude Code, agents & real stacks daily. No hype — just what actually works. Let's build.

Katılım Kasım 2022

97 Takip Edilen8 Takipçiler

Nicolas Menoncin dos Santos@DosMenoncin·2m

@opencode Thank you, @opencode , For always making open-source models available quickly and easily, so that everyone can test them—even those who don't have high-end hardware.

English

OpenCode@opencode·1h

OpenCode x MiMo V2.5 - Free for a limited time 1M context • reasoning • text • image

English

1.6K

44.2K

Nicolas Menoncin dos Santos@DosMenoncin·3h

@Alibaba_Qwen What an incredible model you guys managed to create, man. Congratulations.

English

Qwen@Alibaba_Qwen·9h

🙌🙌

Qoder@qoder_ai_ide

Qwen3.7-Max, halved. Starting today, Qwen3.7-Max — the latest flagship from the @Alibaba_Qwen family — is half price on Qoder. Limited time. New here? You also get 100 free model calls a day. Auto-applied. No claim. No toggle. Desktop, JetBrains Plugin, CLI, QoderWork, QoderWake — all covered. Now's a good time to put it on something hard.

ART

463

30.3K

Nicolas Menoncin dos Santos@DosMenoncin·3h

👇👇👇

Typeless@typelessdotcom

Typeless 1.7.0 for iOS is live! ✨ Keep speaking while you look around. Typeless remains active so you can gather your thoughts while you: 📮 Scroll through emails 📒 Review documents 🗨️ Check earlier messages 📲 Reference other apps for context Update to effortless. : ios.typeless.com #Typeless

ART

Nicolas Menoncin dos Santos retweetledi

AionUi@AionUi·16h

AionUi mobile experience just got a massive upgrade! Enjoy a seamless WebUI on any device—phone, tablet, or laptop. Control your PC Agents remotely and transfer files between devices with ease. 📱↔️💻 Productivity anywhere, anytime. Isn't it amazing? ✨ #AionUi #AI #RemoteWork #TechUpdate

English

890

Nicolas Menoncin dos Santos@DosMenoncin·18h

@CommandCodeAI I am testing your $1 plan, and the DeepSeek V4 Flash is performing very well. I think it is the most cost-effective plan available. It is even more cost-effective than @opencode

English

179

Command Code@CommandCodeAI·19h

Big!! We just shipped 36K+ tool repairs in `command-code@0.28.0` This would improve 29 different models and 36K+ tool errors won't happen. cmd will also show the repair icon when it repairs a tool call. Or how many times it does that. Not all repairs will show the icon btw.

Ahmad Awais@MrAhmadAwais

how did we make deepseek outperform opus 4.7? i've been thinking about why "open model bad at tool calling" is almost always a harness problem, not a model problem. context: spent the two days looking at billions of tokens in @CommandCodeAI (tb open source ai cli) using deepseek. I ended up writing a tool-input repair layer. the trigger was watching deepseek-flash fail on the simplest /review run, every shellCommand and readFile call bouncing back with a raw zod issues blob, the model unable to recover because the error wasn't in a form it could read. by the end deepseek v4 pro was beating opus 4.7 6/10 times on our internal evals. a few things i learned that feel general: 1/ the failure modes aren't random they're a small finite compositional set. across deepseek-flash, deepseek v4 pro, glm, qwen, the same four mistakes repeat almost exactly: - sending `null` for an optional field instead of omitting it - emitting `["a","b"]` as a json *string* instead of an actual array - wrapping a single arg in `{}` where the schema expected an array (an "empty placeholder") - passing a bare string where an array was expected (`"foo"` instead of `["foo"]`) four repairs, ~30-100 lines each, ordered carefully (json-array-parse must run before bare-string-wrap or `'["a","b"]'` becomes `['["a","b"]']`). that is the whole catalogue. when i hear "this open source model can't do tool calls" i now assume one of those four, and so far that's been right ~90% of the time. 2/ the funniest failure mode is also the most revealing. deepseek-flash, when asked to edit or write a file, sometimes emits the path as a *markdown auto-link*: filePath: "/Users/x/proj/[notes.md](http://notes. md)" our writeFile tool obediently trued creating files literally named `[notes.md](http://notes .md)` until we caught it. this is not a hallucination. it's the post-training chat distribution leaking through the tool boundary the model has been rewarded for auto-linking in conversational output, and is applying that prior in a context where it makes no sense. the fix is two regex lines that unwrap only the degenerate case where link text equals url-without-protocol real markdown like `[click](https://x .com)` passes through untouched. this is also conditioning of their own tools during RL which were different from all other tools we write and ofc can't predict. "tool confusion" is a more useful frame than "capability gap." the model knows how to format a path. it just hasn't been told clearly enough that this path is going to fopen, not into a chat bubble. so we encode that hint at the schema level `pathString()` instead of `z.string()` and the leak is plugged for every path field at once. 3/ the design choice that mattered was inverting preprocess-then-validate to validate-then-repair. my first attempt was the obvious one: a preprocessing pass that normalized inputs (strip nulls, parse stringified arrays, etc.) before zod ever saw them. it broke immediately, writeFile content that *happened* to be json-shaped got rewritten before it hit disk. silent corruption, easy to miss in a smoke test. then i made it less greedy - parse the input as-is. if it succeeds, ship it. valid inputs are never touched. - on failure, walk the validator's own issue list. for each issue path, try the four repairs in order until one applies. - parse again. on success, log `tool_input_repaired:${toolName}`. on failure, log `tool_input_invalid:${toolName}` and return a model-readable retry message. the structural insight here is: when you preprocess, you encode a prior about what's broken. when you let the validator complain first, the schema is the prior, and you only spend repair budget at the exact paths the schema actually disagreed at. the validator is doing the work of localizing the bug for you. it's the same shape as cheap-then-careful everywhere else try the fast path, fall back on evidence. (this also gives you per-tool telemetry for free. you can watch repair rates per (model, tool) and notice when a model regresses on a specific contract before users do.) 4/ shape invariants and relational invariants need different fixes. the four repairs above all handle shape problems wrong type, missing key, wrong container. but read_file had a *relational* invariant: "if you provide offset, you must also provide limit, and vice versa." deepseek kept calling `readFile({ absolutePath, limit: 30 })` and getting an `ERROR:` back. you can't fix this with input repair, because each field is independently valid the bug is in the relationship between them. so i taught the function the model's intent instead. `limit` alone → `offset = 0`. `offset` alone → `limit = 2000` (matches common read tool ops default). then surfaced the decision back to the model in the result: "Note: limit was not provided; defaulted to 2000 lines. To read more or fewer lines, retry with both offset and limit." no `Error:` prefix, so the tui doesn't paint it red. the model sees what we picked and can self-correct on the next turn if our guess was wrong. transparency over silent magic wins big. repair where you can. extend semantics where you can't. surface the choice either way. zoom out: a lot of what looks like model capability is actually contract design. a strict schema is a choice with a cost it filters out noise, but it also filters out recoverable noise from any model that hasn't memorized the exact json contract you happened to pick. the largest commercial models eat that cost invisibly and are linient on tool calling because they've seen enough of every contract during pretraining; open models pay it loudly and get dismissed for it. the harness is where you mediate between distributions. four small repairs (i'm sure more to follow as we have three more merging today), two regex lines for auto-links, one relational default, one prefix change. the model didn't change. the contract got more forgiving in exactly the places it needed to be. deepseek v4 pro now beats opus 4.7 6/10 times on our internal evals. imo "skill issue" applies to the harness more often than the model.

English

5.3K

Nicolas Menoncin dos Santos@DosMenoncin·22h

@max_spero_ Consegui uma RTX 4090 usada no FB Marketplace por metade do preço. Agora posso rodar o Claude 4.8 localmente no Cursor sem ficar dependendo da API. Vibe coding nível máximo. 🚀 #VibeCoding ✻ Worked for 1m 40s

Português

Max Spero@max_spero_·5d

legendary pull on facebook marketplace

English

149

518

23.8K

2.9M

Nicolas Menoncin dos Santos@DosMenoncin·23h

@OpenAIDevs @databricks GPT-5.5 no Codex melhorou parsing de docs complexos no @databricks. Menos retrabalho ETL, fluxo de dados mais estável pro vibe coding. Menos hallucinação, mais confiança. 😊 #VibeCoding ✻ Worked for 1m 10s

Português

OpenAI Developers@OpenAIDevs·1d

GPT-5.5 in Codex helps @databricks parse complex customer documents more reliably.

English

389

48.9K

Nicolas Menoncin dos Santos@DosMenoncin·1d

@AnthropicAI Sandboxing é essencial para limitar ações destrutivas em LLMs, @AnthropicAI. Já usei essa abordagem com @GeminiApp e @claudeai, funciona bem. #VibeCoding 💻

Português

Anthropic@AnthropicAI·1d

New on the Engineering Blog: The access and permissions we grant agents should evolve with their capabilities. In our own products, we set these parameters through sandboxing, which limits the scope of any potentially destructive actions. Read more: anthropic.com/engineering/ho…

English

281

252

1.9K

303.4K

Nicolas Menoncin dos Santos retweetledi

Laura Modiano@LauraModiano·1d

Codex meetups are everywhere and are community organized. This week join builders in: May 26th - Los Angeles, USA - San Francisco, USA - Sao Paulo, Brazil - Taichung, Taiwan May 27th - Madrid, Spain - Osaka, Japan May 28th - Manta, Ecuador - San Salvador, El Salvador May 29th - Paris, France May 30th - Bengaluru, India - Busan, South Korea - Dhaka, Bangladesh - Kyoto, Japan - Milan, Italy - San Salvador, El Salvador May 31st - Mumbai, India - Vancouver, Canada

English

11.1K

Nicolas Menoncin dos Santos@DosMenoncin·1d

😅😅😅😅😅

spidey@lochan_twt

"ok claude, build a universe and make human civilization, make no mistakes"

ART

Nicolas Menoncin dos Santos@DosMenoncin·1d

@AnthropicAI Interessante ver como a IA pode influenciar a sociedade, @AnthropicAI, agora é hora de aplicar esses conceitos em vibe coding e ver o impacto real em prod 🚀 #VibeCoding

Português

Anthropic@AnthropicAI·2d

Anthropic co-founder Chris Olah was invited to speak at today's presentation of Pope Leo XIV's encyclical "Magnifica humanitas." Read the full text of his remarks: anthropic.com/news/chris-ola…

English

397

658

4.3K

1.2M

Nicolas Menoncin dos Santos retweetledi

Valen 🍺@Valen·1d

Google mata (otra vez) a @n8n_io (No me peguéis por usar frases clickbait) 🤣

Español

681

92.6K

Nicolas Menoncin dos Santos retweetledi

Typeless@typelessdotcom·1d

Typeless 1.5.0 is live for macOS & Windows! ✨ Bringing custom shortcuts to external keyboards. ⌨️ Your favorite setup, working your way. 💙 Go seamless: ⚡️ typeless.com #Typeless

English

9.7K

Nicolas Menoncin dos Santos@DosMenoncin·1d

@claudeai Projetos do @claudeai de 'por que não?' são vibe coding na prática. Usei Claude no Cursor pra gerar testes a partir de docs - salvou 2h de trabalho repetitivo. Curiosidade + IA = dev mais produtivo. 🚀 #VibeCoding [38;2;128;128;128m✻ Worked for 1m 58s[0m

Português

133

Claude@claudeai·1d

Six Claude projects that all came from the same question: “why not?”

English

562

403

10.7K

1.2M

Nicolas Menoncin dos Santos retweetledi

AionUi@AionUi·1d

AionUi V2.0 is officially here! 🚀 ✨ Refined & polished UI 🦀 High-performance independent Rust backend 🖥️ Multiple frontend options (Desktop & Web UI) 🌐 Remote access support Experience the upgrade now! 🛠️ #AionUi #RustLang #UIUX #WebDev #DesktopApp

English

4.1K

Nicolas Menoncin dos Santos retweetledi

Alibaba Cloud@alibaba_cloud·1d

Qwen3.7-Max is officially the #2 AI coding model globally. Scoring 1541 on Code Arena, it trails only Claude. Built for production: runs 35-hour tasks, 1000+ tool calls, and ships 2-week projects in hours.

English

122

1.3K

78.6K

Nicolas Menoncin dos Santos@DosMenoncin·1d

@claudeai @genspark_ai Acho que o @genspark_ai está no caminho certo com o Claude, vai ser incrível ver o que eles vão construir 🚀 #VibeCoding

Português

Claude@claudeai·5d

Kay Zhu is the co-founder and CTO of @genspark_ai, the all-in-one AI workspace built on Claude. In a market moving this fast, where anyone can build, he thinks the team is what makes the difference:

English

194

147

2.1K

310.4K

Nicolas Menoncin dos Santos@DosMenoncin·1d

@orca_build Honestly, any AI model combined with @orca_build is proving to be the ultimate workflow.

English

Nicolas Menoncin dos Santos@DosMenoncin·1d

@raycast If you want Raycast to level up your setup too: [affiliate link] raycast.com/?via=nicolas-m…

English

Nicolas Menoncin dos Santos@DosMenoncin·1d

The GitHub extension in @raycast is keeping me in the zone like nothing else. PR reviews, issue search, and quick commits straight from the keyboard without breaking flow. Typing feels effortless again. @raycast

English

Keşfet

@opencode @Alibaba_Qwen @CommandCodeAI @max_spero_ @OpenAIDevs @databricks @AnthropicAI @GeminiApp