Andrey Kolesnikov

251 posts

Andrey Kolesnikov

Andrey Kolesnikov

@minviable_org

Dad, husband, immigrant, nerd. Built, bought and sold companies. Default to code, law degree is a bonus.

San Francisco, CA Bergabung Mayıs 2026
95 Mengikuti57 Pengikut
Serenity
Serenity@aleabitoreddit·
At $NVDA GTC/Computex in Taipei: I think we’ll hear about the next AI bottleneck. That’s owned by a .6 P/B potato farming company in Japan, with a 180 year history. Their owner cooks those potatoes in night markets for 160 yen a piece. But that same potato farming equipment used to grow potatoes with optimal sunlight. Is now required for optical alignment requirements for CPO. And their unique cooking technique is mandatory to address thermal requirements for Rubin. Can anyone guess?
English
409
34
996
249.3K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@asaio87 I’m one of those idiots I guess. 1M MAU app. Most of our growth and feature innovation came after AI usage spread throughout the company. Exited too, made investors happy. All AI.
English
0
0
0
34
andrei saioc
andrei saioc@asaio87·
Only idiots think AI has changed everything when it comes to building successful apps.
English
43
1
51
3.9K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@DanielSmidstrup They are capacity constrained. The rest of the world is fine using Nvidia, for Google it’s declaring an L in GPU race. Hence the focus on TPU8
English
2
1
3
255
Daniel Smidstrup
Daniel Smidstrup@DanielSmidstrup·
Seriously, how can Google still not be leading the AI race with all it's resources
English
202
8
175
16.4K
Chris Bakke
Chris Bakke@ChrisJBakke·
For the last 6 years I’ve been buying well-run small businesses for 5x earnings. In the first 30 days, I take the websites offline, move the companies to sad office parks with drop ceilings, install fax machines at the front desk, and bringing in 75 year old actors to pose as the CEO. I then sell the companies to people with MBAs for 10x revenue so that they can feel useful “turning the company around”
English
47
112
4.5K
247K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@TheChiefNerd BG is spot on with his Frankenstein take. Loudest case for local AI - freedom of intelligence.
English
0
0
1
1.2K
Chief Nerd
Chief Nerd@TheChiefNerd·
🚨 BILL GURLEY: “I would encourage people to read as much as they can about Anthropic … I don't think they think they're writing software. I think they're midwifing a deity.” JASON: “I know some of these folks … They believe they're so powerful, that they can create God.”
English
369
1.2K
8.4K
1M
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@antirez How can you forget when reminders are everywhere. I’m keep cancelling subscriptions, banks and other services that haven’t evolved. Some are still running JSX, ASP and other ancient frameworks.
English
0
0
6
1.3K
antirez
antirez@antirez·
Many of you forgot too fast the insane amount of shitty software we had to see and suffer in the pre-AI era.
English
122
97
1.7K
83.3K
Olivia Moore
Olivia Moore@omooretweets·
Self-driving cars are fun because you never see competing SaaS products having a literal standoff in the street
English
282
820
13.2K
1M
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@witcheer @NousResearch 27b is not really designed for multi-step and context recall. Something bigger needs to feed it isolated chunks of bound context, it rips.
English
0
0
2
31
witcheer
witcheer@witcheer·
Which local LLM best drives an agent? I built a benchmark for pairing models with Hermes Agent (@NousResearch) - a CodeAct agent that writes Python to call its tools, not JSON function calls. 4 models, RTX 5090, tested under Hermes's real system prompt. ~~ here is the final leaderboard: 🥇 Qwopus-18B — 92.7 🥈 Qwen3.6-27B — 92.4 🥉 Nemotron-Cascade-2-30B — 90.5 4️⃣ Hermes-4.3-36B — 84.3 ~~ no model wins all four axes: - Qwen 27B = perfect multi-step loops + instruction-following, but weakest long-context recall (~70%) - Nemotron + Qwopus = flawless long-context (100%) but worst at multi-step (50%) - Hermes 36B = solid, but OOMs at 64K context on 32GB → that 0 tanks its score the "best agent model" genuinely depends on your workload. ~~ methodology most "function-calling" benchmarks score JSON tool calls. Hermes is code-as-action, which means that the model writes Python. I tested that, under the real ~3.5K-token agent prompt.
English
19
5
59
4.9K
Lotto
Lotto@LottoLabs·
Anthropic is right in the verge of getting lost in the sauce
English
6
1
58
2.7K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@LottoLabs 27b-written code will power the software innovation of the next few years. Talk about a model punching way above its weight. Pun intended.
English
1
0
6
1.2K
Lotto
Lotto@LottoLabs·
The biggest dark horse in all of ai right now Is qwen 27b on a 3090 going 70+ TPS Literally 6 year old tech that can think
English
31
10
564
34.1K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@dakshgup cpu compute that generates training data for subsequent gpu evolution
English
0
0
3
379
Daksh Gupta
Daksh Gupta@dakshgup·
all coding is just turning gpu compute into cpu compute
English
9
4
131
10.1K
Hikari∣LocalLLM⚡
Hikari∣LocalLLM⚡@Hikari_07_jp·
To maximize throughput, it's necessary to run LLMs in parallel. However, when n increases to around 30, the harness can't handle it. I want to solve this problem on my own, so I'm planning to fork Hermes.
English
4
0
18
2.1K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
I have 64Gb low cas non-ECC and it’s fine. Color me uneducated, I honestly don’t know why have more RAM (unless it’s Mac). CPU with large cache is much more consequential, X3D edges noticeably on Ryzen builds. Choke is cross-card tensor parallelism over PCIe and not having NVLink. If 6ks had NVLink it would cannibalize a lot of their DC market.
English
2
0
3
117
Hikari∣LocalLLM⚡
Hikari∣LocalLLM⚡@Hikari_07_jp·
My setup has two RTX RPO 6000 cards. In this case, what RAM do you think would be ideal? I'm planning to upgrade in the near future and I'm torn between 512GB and 256GB. Please share your opinions.
English
13
0
12
1.9K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
Catching up on latest @theallinpod. It seems like local AI is becoming mainstream. I feel seen.
English
0
0
1
160
Yash
Yash@YashHustle_22·
Can you call yourself a founder if your entire product was built by Codex?
English
82
2
33
4.7K
Andrey Kolesnikov
Andrey Kolesnikov@minviable_org·
@rohitdotmittal What do you mean by AI? Camera roll, noise cancelling and sms code from messages are AI and we cant function without those.
English
1
0
2
462
Rohit Mittal
Rohit Mittal@rohitdotmittal·
It’s insane to think that Apple has zero AI execution. Absolutely zero. Nothing they did worked, and they are not even trying. Still, it’s a $4.6 trillion company.
English
204
16
1.3K
82.3K
Roy
Roy@usr_bin_roygbiv·
Getting a job in SF: > recruiter phone call - 15m > hiring manager - 30m > talk to owner/founder - 1h > offer Getting a job in NY: > fill out our workday application > on teams > 3 HR screens > hiring manager shows up hungover > cheat lc > you look like swes ex, don't get job
English
6
0
101
11.4K
Ion
Ion@ionthedev·
The best part of the day as a founding engineer
Ion tweet media
English
2
0
27
1.2K