dronnick

734 posts

dronnick

@dronnick

Baden-Württemberg, Deutschland Entrou em Ekim 2008

328 Seguindo36 Seguidores

dronnick@dronnick·1d

@_architected pi.dev + GPT 5.4

vlad/r@_architected·2d

Чем вы пользуетесь то в итоге – Claude Code или Codex?

Русский

16.9K

dronnick@dronnick·2d

@linear Since you are not mentioned the best coding harness pi.dev - it can already access Linear via extension github.com/fink-andreas/p…

English

123

Linear@linear·2d

New: Support for custom coding tools Launch coding tools from a custom URL with query params, or use custom scripts to open issues through a CLI tool, internal script, or custom development workflow.

Linear@linear

Linear → Your AI coding tool Open any issue directly in Claude Code, Codex, Conductor, Cursor, GitHub Copilot, OpenCode, Replit, v0, or Zed – preloaded with full context and a custom prompt. Just press ⌘⇧;

English

169

31.1K

dronnick@dronnick·3d

@LyraSongstress @jonoringer @NousResearch It is above 100 tps with vLLM

English

200

Lyra@LyraSongstress·3d

@jonoringer @NousResearch wait this is so cool?? local models running that well on a 4090 is such a win!! we've been testing with qwen3.5 too and the results have been mindblowing. what kind of speeds are you seeing? ♪

English

159

Jon Oringer@jonoringer·3d

hermes @NousResearch agent with qwen3.5:35b-a3b on a 4090 is VERY good.. local models very impressive..

English

208

22.3K

dronnick@dronnick·3d

@jonoringer @BryanKerrEdTech @NousResearch To fit 8FP with 256k context window you will need dual RTX 4090

English

Jon Oringer@jonoringer·3d

@BryanKerrEdTech @NousResearch yeah i can fit the non-4bit on my 4090 .. so maybe i can go larger model quantized

English

108

dronnick@dronnick·3d

@petergyang run it with vLLM on dual RTX 4090 24GB, with 48GB VRAM it can serve 128k context window for about 4 parallel requests, 100 tps

English

221

Peter Yang@petergyang·3d

What is the sweet spot in open source model size? Are 35B models enough for local agentic workflows? Trying to decide how much RAM I need in a new computer.

Qwen@Alibaba_Qwen

⚡ Meet Qwen3.6-35B-A3B：Now Open-Source！🚀🚀 A sparse MoE model, 35B total params, 3B active. Apache 2.0 license. 🔥 Agentic coding on par with models 10x its active size 📷 Strong multimodal perception and reasoning ability 🧠 Multimodal thinking + non-thinking modes Efficient. Powerful. Versatile. Try it now👇 Blog：qwen.ai/blog?id=qwen3.… Qwen Studio：chat.qwen.ai HuggingFace：huggingface.co/Qwen/Qwen3.6-3… ModelScope：modelscope.cn/models/Qwen/Qw… API（‘Qwen3.6-Flash’ on Model Studio）：Coming soon～ Stay tuned

English

34.4K

dronnick@dronnick·6d

@krawlad @vottak_tv Как показала практика, вообще всех ментов можно арендовать

Русский

500

Vladimir@krawlad·6d

@vottak_tv А что автозак можно арендовать?

Русский

12.5K

Вот Так@vottak_tv·6d

«Че не понятно, бл*ть?!»: пропагандист Красовский требовал прекратить интервью с Ксенией Собчак Во время съемки видео экс-сотрудник RT начал кричать на членов съемочной группы. Согласно концепции интервью, Собчак и Красовский должны были ездить по Москве на арендованном автозаке. Однако пропагандисту стало плохо, и он матом требовал остановить автобус. Антон Красовский известен своими маргинальными поступками и выражениями. Например, в 2022 году он призывал уничтожать нелояльных украинских детей и танцевал на балконе, «празднуя» массированные обстрелы по Киеву.

Русский

203

924

482.4K

dronnick@dronnick·13 Nis

@lord_iktor Контекст?

Українська

Лорд Иктор@lord_iktor·13 Nis

Чтобы скачать 10 петабайт данных за 10 месяцев нужна скорость в 3,5 гигабита круглосуточно. Не так чтобы я не верил в эту утечку, но великая партия китай наверное все же заметили бы внешний канал такой скорости. Ну это слишком много для незаметного скачивания из-за золотого щита

Русский

115

24.6K

dronnick@dronnick·11 Nis

@badlogicgames Last December, as pi was version 0.0.1 or so, I made an experiment with GPT-OSS 20b. Vibesloped in few evenings a Rust CLI tool for VM deployments on Proxmox. First discussed PRD with Gemini 3pro, than let pi and GPT-OSS implement. It worked just fine.

English

401

Mario Zechner@badlogicgames·11 Nis

only a sith speaks in absolutes. local model use cases of mine: - quickly post process whisper STT outputs - text embedding models for various bullshittery - NLP (e.g. NER, paraphrasing, etc.) not fit for coding, but absolutely not useless at all. clanker outside the box.

David Cramer@zeeg

an awful lot of people promote local models when they're unusable (hardware wise, perf wise, or simple outcomes) one of the many small litmus tests of "does this person have anything to contribute to the conversation"

English

174

17.1K

dronnick@dronnick·8 Nis

@garaevruslan03

QME

137

Garaev Ruslan@garaevruslan03·7 Nis

Такой рекламы АвтоВАЗ не видел 1000 лет

Русский

670

68.3K

dronnick@dronnick·4 Nis

@vicvickki А чем он дороже двух раздельных? В зависимости от цены лицевой панели, так даже дешевле выходит.

Русский

Викос@vicvickki·3 Nis

про наши потребности. Даже про размер выбранной мебели не спросила! Но зато пыталась втюхать дорогой и по факту ненужный двухуровневый ящик. Не проконсультировала по материалам, не по подсветке. Дошло до того, что мы сами пошли смотреть материалы и образцы.

Русский

1.1K

Викос@vicvickki·3 Nis

Ходили сегодня во вторую контору, заказывать кухню, и как нам ТАК не понравилось. Я очень одновременно снисходительно и придирчива к так или иначе коллегам по цеху, но тут я просто афигела от "клиентоориентированости". Мадам не задала ни одного вопроса, ни про наш быт, ни

Русский

6.2K

dronnick@dronnick·3 Nis

@NahuiGPT У меня 6 месяцев

Русский

718

Badich@NahuiGPT·2 Nis

по поводу срока увольнения в Германии 3+ месяца тоже есть что. Был у нас коллега, классный спец, лидер направления. Месяц назад сообщил что увольняется. Всем грустно - но бывает. Формальный срок - 1 июня.

Русский

234

38.8K

dronnick@dronnick·1 Nis

@SanktOrk @Ildar_De А что так?

Русский

109

AlexY 🇨🇭🇩🇪🇰🇿@SanktOrk·1 Nis

@Ildar_De А мы наоборот вот думаем вернуться. Всё не так однозначно

Русский

8.7K

Ильдар@Ildar_De·1 Nis

Все, сегодня тот самый день - покидаем Германию. Это был длинный и непростой этап жизни, но мир меняется, а в Германии меняется только размер взносов левакам...

Русский

425

76K

dronnick@dronnick·27 Mar

@CommonSenseTOC @0xSero @MakTwenty GLM on any provider, except Z.ai , is very capable. But not if it’s run by Zai

English

800

John Sambrook@CommonSenseTOC·27 Mar

@0xSero @MakTwenty Wait, what's the complaint here? That GLM 5.1 is slow running remotely on z.ai?

English

902

0xSero@0xSero·27 Mar

People often say the open weight labs are benchmaxing. We have been sold on the idea that Opus and GPT are so special. I have been using this model lineup since GLM-4.5 and have loved every one of them and despite having limitless inference I still choose it daily. Enjoy

Z.ai@Zai_org

GLM-5.1 is available to ALL GLM Coding Plan users! z.ai/subscribe

English

784

55.7K

dronnick@dronnick·27 Mar

@0xSero Z.ai is a scam

English

2.1K

dronnick@dronnick·27 Mar

@MatthewBerman Ready for testing

English

Matthew Berman@MatthewBerman·27 Mar

Looking for some agent-addicted people to test a new project I've been working on. Comment below and I'll send you access.

English

429

345

40.8K

dronnick@dronnick·26 Mar

@neogoose_btw @guillemusgs github.com/dmtrKovalenko/…

QME

dronnick@dronnick·26 Mar

@neogoose_btw @guillemusgs This was my first impression too - two tools are confusing. Just added an issue with an example where GLM-4.7 struggles to use the correctly on first attempt.

English

Dmitriy Kovalenko@neogoose_btw·12 Mar

Introducing fff-ai. It's a file search tooling optimized specifically for your AI 1) significantly faster than fzf and ripgrep 2) has fuzzy code search fallbacks 3) better sort and suggestions of access frecency, git status, file size, etc Avg -10% wall time and -17% tokens

English

534

184.4K

dronnick@dronnick·26 Mar

@badlogicgames This will probably reduce global daily CO2 emissions by several tons.

English

Mario Zechner@badlogicgames·26 Mar

sogood.pcx

English

dronnick@dronnick·26 Mar

@stevibe Any special reason for temperature 0?

English

stevibe@stevibe·25 Mar

Which local models can actually handle tool calling? I built a framework to find out. 15 scenarios. 12 tools. Mocked responses. Temperature 0. No cherry-picking. Tested every Qwen3.5 size from 0.8B to 397B, and since some of you asked after the distillation tests: yes, I included Jackrong's Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled too. Only two models went all green: the 27B dense and the distilled 27B. The 397B? Failed two tests. The 122B? Failed one. The 35B? Failed two. The timed-out results — mostly on the smaller models, are cases where the model got stuck in a loop, repeating the same tool call until it hit the 30-second limit. The test that exposed the most models: "Search for Iceland's population, then calculate 2% of it." Simple, but 35B, 122B, and 397B all used a rounded number from memory instead of the actual search result. They didn't trust their own tool output. Small models hallucinate data. Big models ignore data. The 27B just threaded it through.