Victor Kiselev

630 posts

Victor Kiselev

@Gaploid

Developing https://t.co/EIST1EFqAP, ex @Microsoft, @AWS, @yandexcom

Munich, Germany Katılım Mart 2008

206 Takip Edilen124 Takipçiler

Victor Kiselev@Gaploid·14h

@Miss_Lilindra Это кажись самый дешевый/нормальный электрический вариант от bmw. Остальное там сразу x2

Русский

317

Dog. Man. Bavarian village.@Miss_Lilindra·15h

Кто нибудь может мне объяснить феномен BMWi3 которая через 4 года после прекращения производства все еще стоит дохера и все ее хотят? :)

Русский

4.4K

Victor Kiselev@Gaploid·15h

@likeamothtoaf кажется это то во что превратся junior developer

Русский

808

Nothing mew@likeamothtoaf·19h

сегодня был собес на позицию senior AI developer первый вопрос: ЗНАЕТЕ ЛИ ВЫ ЧТО ТАКОЕ ВАЙБ-КОДИНГ, ГИТХАБ КОПАЙЛОТ, КЛАД КОД БЛЯЯЯЯЯЯЯ

Русский

405

40.3K

Victor Kiselev@Gaploid·2d

@KYKYPY3A_B Там что-то явно не связанное со старшипом - Слишком быстро и траектория из вне.

Русский

Кукуруза как у томакруза 🌽@KYKYPY3A_B·2d

@Gaploid Скорее всего что-то из грузового отсека, либо один из макетов. Но корбаль вроде не менял ориентацию, чтобы они за ним оказались

Русский

Кукуруза как у томакруза 🌽@KYKYPY3A_B·4d

Быстрые итоги 12 тестового запуска Starship из нового поколения V3 — лучше дебюта V2 в прошлом году, но хуже идеального сценария, который был нужен этому пуску. Ключевые этапы Flight 12: 🟢 Запуск 33 двигателей Raptor 3 (1 отказал позже); 🟢 Старт с Pad 2 и прохождение зоны Max Q через 45 секунд; 🟢 Горячее разделение Starship S39 и Super Heavy B19; 🟡 Запуск 6 двигателей на S39 и перезапуск 8 двигателей (5 будут работать) B19 для первого тормозного манёвра; 🟢 Вход Super Heavy B19 в атмосферу с прохождением зоны максимальных нагрузок; 🔴 Перезапуск 13 двигателей на Super Heavy B19 для второго тормозного манёвра; 🔴 Симуляция посадки B19 на виртуальную башню на воду с последующей утилизацией; 🟢 Выход корабля Starship S39 на плановую незамкнутую орбиту; 🟢 Открытие шлюза грузового отсека; 🟢 Демонстрация выгрузки 22 макетов спутников Starlink V3; 🟢 Трансляция с камер на двух Starlink V3 для анализа теплозащиты Starship S39; 🔴 Тест перезапуска 1 двигателя Raptor 3 на орбите; 🟢 Вход Starship S39 в атмосферу, и прохождение зоны максимального нагрева и нагрузок; 🟢 Теплозащита Starship S39 выдерживает вход в атмосферу, плавники и функциональный механизм посадки не расплавляются; 🟢 Перезапуск 2 двигателей на S39 с симуляцией мягкой посадки в океан в нужной точке.

Русский

328

11.8K

Victor Kiselev@Gaploid·2d

here is a link cloudprice.net/models and api details cloudprice.net/models/api

English

Victor Kiselev@Gaploid·2d

We’ve added pricing and specification data for 2,500+ LLM models from ~100 providers, so you can quickly see which AI provider offers the lowest price for a given model, compare models side by side, and identify which models offer the best intelligence-to-price ratio. We’ve also added a simple API with no registration required: just send the model name and number of tokens to get the actual cost.

English

Victor Kiselev@Gaploid·3d

@mathis_aa @vgermaniu - Stripe для снятия денег, счета клиентам, подписки, все там автоматизировано. - getsorted.de бесплатный сервис для учета расходов, храню там счета расходные -taxfix для подачи налогов

Русский

200

матхис@mathis_aa·3d

Фрилансеры немецкие Расскажите, пожалуйста, как вы ведете свою бухгалтерию. С бухгалтером или без, и используете какой либо сервис для контроля своих счетов и финансов? Знакомая создает счета в pages и сама все делает, но мне хотелось бы более удобный способ найти

Русский

Victor Kiselev@Gaploid·19 May

@zeeg @mikejulian Thats why we've created that free API with up to date pricing for ~100 providers cloudprice.net/models/api no proxy needed

English

123

David Cramer@zeeg·18 May

is there a rational argument for why LLM vendors arent returning token costs as HTTP headers this is a plague

English

1.1K

144.3K

Victor Kiselev@Gaploid·13 May

@QuinnyPig Seems like an opportunity for duckbill service here

English

468

Corey Quinn@QuinnyPig·13 May

Anthropic’s pricing is super clear here. You just need to be following the right Twitter accounts, be comfortable with “step 7, master the wolf” pricing complexity, and can absorb a 5-figure oopsie as part of the learning process.

ClaudeDevs@ClaudeDevs

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

English

632

51.5K

Victor Kiselev@Gaploid·11 May

@ArtificialAnlys Junie CLI from Jetbrains would be great to see. Based on other comparisons its on top of charts

English

427

Artificial Analysis@ArtificialAnlys·11 May

Announcing the Artificial Analysis Coding Agent Index! Our new coding agent benchmarks measure how combinations of agent harnesses and models perform on 3 leading benchmarks, token usage, cost and more When developers use AI to code they’re choosing a model, but also pairing it with a specific harness. It makes sense to benchmark that combination to understand and compare performance. The Artificial Analysis Coding Agent Index includes 3 leading benchmarks that represent a broad spectrum of coding agent use: ➤ SWE-Bench-Pro-Hard-AA, 150 realistic coding tasks that frontier models struggle with, sampled from Scale AI’s SWE-Bench Pro ➤ Terminal-Bench v2, 84 agentic terminal tasks from the Laude Institute and that range from system administration and cryptography to machine learning. 5 tasks were filtered due to environment incompatibility ➤ SWE-Atlas-QnA, 124 technical questions developed by Scale AI about how code behaves, root causes of issues, and more, requiring agents to explore codebases and give text answers Analysis of results: ➤ Opus 4.7 and GPT-5.5 lead the Index: Opus 4.7 in Cursor CLI scores 61, followed closely by GPT-5.5 in Codex and Opus 4.7 in Claude Code at 60. GPT-5.5 in Cursor CLI follows at 58. ➤ Open weights models are competitive, but still trail the leaders: GLM-5.1 in Claude Code is the top open-weight result at 53, followed by Kimi K2.6 and DeepSeek V4 Pro in Claude Code at 50. These are strong results, but still meaningfully behind the top proprietary models. ➤ Gemini 3.1 Pro in Gemini CLI underperforms: Gemini 3.1 Pro in Gemini CLI scores 43, well below where Gemini 3.1 Pro sits on our Intelligence Index, highlighting that Gemini’s performance in Gemini CLI remains a relative weak spot for Google’s offering. ➤ Cost per task (API token pricing) varies >30x: Composer 2 in Cursor CLI is cheapest at $0.07/task, followed by DeepSeek V4 Pro in Claude Code at $0.35/task and Kimi K2.6 in Claude Code at $0.76/task. At the high end, GPT-5.5 in Codex costs $2.21/task, while GLM-5.1 in Claude Code costs $2.26/task. For both models this was contributed to by high token usage, and in GPT-5.5’s case by a relatively higher per token cost. ➤ Token usage varies >3x: GLM-5.1 in Claude Code uses the most tokens at 4.8M/task, followed by Kimi K2.6 at 3.7M/task and DeepSeek V4 Pro at 3.5M/task. GPT-5.5 in Codex uses 2.8M tokens/task, substantially more than Opus 4.7 in Claude Code at 1.7M/task. In GLM-5.1’s case, higher token usage, cost and execution time were partly driven by the model entering loops on some tasks. ➤ Cache hit rates remain high but vary materially: Cache hit rates range from 80% to 96% across combinations. Provider routing, harness prompt structure and cache behavior can materially change the economics of running the same model given cached inputs are typically <50% the API price of regular input tokens. ➤ Time per task varies >7x: Opus 4.7 in Claude Code is fastest at ~6 minutes/task, while Kimi K2.6 in Claude Code is slowest at ~40 minutes/task. This is contributed to by differences in average turns per task, token usage and API serving speed. Opus 4.7 had materially lower amount of turns to complete a task than all other models while Kimi K2.6 had the most. ➤ Cursor made real progress with Composer 2: Composer 2 in Cursor CLI scores 48, near the leading open-weight model results, while being the cheapest combination measured at $0.07/task. Cursor has stated Composer 2 is built from Kimi K2.5, showcasing they have made substantial post-training gains. This is just the start. We are planning to add additional agents (both harnesses and models). Let us know what you would like to see added next.

English

126

170

1.5K

Victor Kiselev@Gaploid·23 Nis

@belidor А кто-то видел бенчмарки задач с generic скилами и без? У меня есть гипотеза что скилы больше всего влияют на спокойствие и ощущение контроля человека чем на качество выходного результата.

Русский

811

Artem@belidor·22 Nis

А накидайте мне скиллов для claude code которыми вы регулярно пользуетесь

Русский

310

85.6K

Victor Kiselev@Gaploid·18 Nis

@bodryachog @dutch_dispatch air же это codex app или claude code desktop. Lovable и replit это совем другая история.

Denis Esakov@bodryachog·18 Nis

О это мой срач был! Я там писал попробуй к ним в поддержку обратиться, сам дурак нам лучше знать. У меня в продукте в энтерпрайзе когда мне навязывают фичу мы снимаем метрики использования, я бы вот с удовольствием посмотрел бы на метрики всего того что туда затащили за последние пять лет. А эйр решил поконкурировать с реплитом и лаваблом которые на венчурном бабле. Есть гдето в интернете история про то как асана убила похожий джетбрейнс в тикет менежменте. Там причем фаундеры встречались и формер фб гай сказал мы убьем твой бизнес. Чел потом написал эссе что конкурировать с венчурными деньгами не реально. Необучаемые -)

Русский

3.2K

Victor Kiselev@Gaploid·17 Nis

@ZloyDevOps Офигеть, у меня ровно так же поцарапали и в этом месте. Два года назад, 1000 евро на все было в мюнхене

Русский

424

Zloy DevOps@ZloyDevOps·17 Nis

Для всех интересующихся, По результатам экспертизы: Оценка затрат на ремонт: €6.085,02 Компенсация потери стоимости: €900,00 Экспертиза: €1.221,18 Юридическое сопровождение: €756,30 Компенсация за время ремонта: €316.00 Тотал: €9.278,50

Zloy DevOps@ZloyDevOps

Имеются следующие повреждения. Сколько по вашему насчитал страховой оценщик для устранения повреждений и сколько стоит экспертиза в 🇩🇪

Русский

210

95.7K

Victor Kiselev@Gaploid·3 Mar

@bodryachog @moishe_ee Я послушал последнее(2дня назад) где он про конфликт с военными рассказывал но не услышал этого стейтмента

Русский

Denis Esakov@bodryachog·3 Mar

@Gaploid @moishe_ee Нет не дает. Максимум в клаудах под себя. Его интервью последнее, мысль про открытая модель закрытая модель и что модель это не веса, а все что вокруг.

Русский

Mooiše@moishe_ee·2 Mar

Запустил на днищебродском М3 Pro/16Gb через lm studio. Первый промт читает довольно долго, потом раздупляется и выдает где-то 20tps на окне в 24К токенов. В целом с простыми вещами справляется легко, с тулзами работает. На большом контекстном окне тупит правда

Ahmad@TheAhmadOsman

not only does the Qwen 3.5 9B beat the GPT OSS 20B it BEATS the 120B INCREDIBLE stuff

Русский

8.9K

Victor Kiselev@Gaploid·3 Mar

@moishe_ee У мелких моделей обычно плохо с структурированными ответами которые важны для тулов.

Русский

Victor Kiselev@Gaploid·3 Mar

@bodryachog @moishe_ee А можно ссылку где он такое говорил? Антропик дает возможность развернуть модели у себя на железе?

Русский

Denis Esakov@bodryachog·2 Mar

@moishe_ee Я кстати тут чето смотрел разбор интервью амодея. Он там правильно сказал. Не важна опенсорс не опенсорс, важно дает ли провайдер или локальный сетап инфраструктуру для использования в бизнесе.

Русский

682

Victor Kiselev@Gaploid·3 Mar

@Bensign Hah, something what we've started to do as well here: track pricing, benchmarks for llm models cloudprice.net/models

English

Ben Schaechter@Bensign·2 Mar

Pretty interesting.

Miles@miles_matthias

Today, we're expanding access to billing for LLM tokens. Pick your models, set your markup, and route calls through @stripe’s LLM proxy (or supported partners). We sync popular model prices, configure advanced usage-based billing for your margin, and record usage automatically.

English

908

Victor Kiselev@Gaploid·15 Şub

@softwarevlogger Isengard для автоматизации создания/закрытия многих экаунтов и гардрейлов для них - это не много не то. Для того что говоришь это вот это больше подходит aws.amazon.com/controltower/ но этотне решает всю эту канифоль с сетевой связанностью

Русский

453

Dima@softwarevlogger·15 Şub

Использовать кучу аккаунтов если ты не AWS очень затратно и крайне неудобно. Сразу обрастаешь VPC туннелями и начинаешь платить за кучу инфраструктуры, которая тебе вообще говоря не нужна. Уж не говоря о том, что AWS давно стоило бы выпустить наружу внутренний тул для управления аккаунтами Isengard. Сейчас хоть что-то появилось, а раньше такой сетап тут же рендерил тебя в ССЗБ.

Evgeny Kot@bunopus

Самое стрёмное конечно в инциденте от supabase это то, что они крутили всё на одном AWS аккаунте 🤯 supabase.com/blog/supabase-…

Русский

13.2K

Victor Kiselev@Gaploid·15 Şub

@bunopus Вроде регион один. Под одним экаунтом много регионов может быть

Русский

987

Evgeny Kot@bunopus·14 Şub

Русский

24.8K

Victor Kiselev@Gaploid·13 Şub

@vgermaniu - Налог на не реализованную жизнь. Продолбал жизнь и средний доход меньше то плати за не до реализацию, ведь мог же. - Не до реализуемый доход от недвидимости, - не до реализуемый ...

Русский

461

В Германию@vgermaniu·13 Şub

Предлагаю голландцам к налогу на нереализованную прибыль добавить также налог на неосуществлённые инвестиции ☺️

Русский

258

10K

Keşfet

@Miss_Lilindra @likeamothtoaf @KYKYPY3A_B @mathis_aa @vgermaniu @zeeg @mikejulian @QuinnyPig