Andrey Kolesnikov

251 posts

@minviable_org

Dad, husband, immigrant, nerd. Built, bought and sold companies. Default to code, law degree is a bonus.

San Francisco, CA · Joined May 2026

95 Following · 57 Followers

Andrey Kolesnikov@minviable_org·3h

@aleabitoreddit ajinomoto

10K

Serenity@aleabitoreddit·4h

At $NVDA GTC/Computex in Taipei, I think we’ll hear about the next AI bottleneck. It’s owned by a 0.6 P/B potato farming company in Japan with a 180-year history. Their owner cooks those potatoes in night markets for 160 yen apiece. But the same equipment used to grow potatoes with optimal sunlight is now required for optical alignment in CPO, and their unique cooking technique is mandatory to address thermal requirements for Rubin. Can anyone guess?

385

870

201.3K

Andrey Kolesnikov@minviable_org·15h

@asaio87 I’m one of those idiots I guess. 1M MAU app. Most of our growth and feature innovation came after AI usage spread throughout the company. Exited too, made investors happy. All AI.

andrei saioc@asaio87·19h

Only idiots think AI has changed everything when it comes to building successful apps.

3.8K

Andrey Kolesnikov@minviable_org·15h

@DanielSmidstrup They are capacity constrained. The rest of the world is fine using Nvidia; for Google, that would mean declaring an L in the GPU race. Hence the focus on TPU8.

254

Daniel Smidstrup@DanielSmidstrup·1d

Seriously, how can Google still not be leading the AI race with all its resources?

201

175

16.3K

Andrey Kolesnikov@minviable_org·16h

@steipete You’ll like the weather.

Peter Steinberger 🦞@steipete·19h

Finally got my visa sorted out and moving to San Francisco, just in time for MS Build and OpenClaw’s after hours! luma.com/OpenClaw-GitHub

121

2.2K

96.8K

Andrey Kolesnikov@minviable_org·16h

@ChrisJBakke 5x rev is very rich for SMB.

327

Chris Bakke@ChrisJBakke·1d

For the last 6 years I’ve been buying well-run small businesses for 5x earnings. In the first 30 days, I take the websites offline, move the companies to sad office parks with drop ceilings, install fax machines at the front desk, and bring in 75-year-old actors to pose as the CEO. I then sell the companies to people with MBAs for 10x revenue so that they can feel useful “turning the company around.”

111

4.4K

244.2K

Andrey Kolesnikov@minviable_org·16h

@TheChiefNerd BG is spot on with his Frankenstein take. Loudest case for local AI - freedom of intelligence.

1.2K

Chief Nerd@TheChiefNerd·1d

🚨 BILL GURLEY: “I would encourage people to read as much as they can about Anthropic … I don't think they think they're writing software. I think they're midwifing a deity.” JASON: “I know some of these folks … They believe they're so powerful, that they can create God.”

369

1.2K

8.3K

Andrey Kolesnikov@minviable_org·16h

@antirez How can you forget when reminders are everywhere? I keep cancelling subscriptions, banks and other services that haven’t evolved. Some are still running JSX, ASP and other ancient frameworks.

1.3K

antirez@antirez·20h

Many of you forgot too fast the insane amount of shitty software we had to see and suffer in the pre-AI era.

121

1.7K

81.4K

Andrey Kolesnikov@minviable_org·1d

@omooretweets Just like Sam and Dario in India.

1.8K

Olivia Moore@omooretweets·1d

Self-driving cars are fun because you never see competing SaaS products having a literal standoff in the street

281

815

13K

998.1K

Andrey Kolesnikov@minviable_org·1d

@witcheer @NousResearch 27b is not really designed for multi-step work and context recall. Something bigger needs to feed it isolated chunks of bounded context; then it rips.

witcheer@witcheer·1d

Which local LLM best drives an agent? I built a benchmark for pairing models with Hermes Agent (@NousResearch) - a CodeAct agent that writes Python to call its tools, not JSON function calls. 4 models, RTX 5090, tested under Hermes's real system prompt.
~~
here is the final leaderboard:
🥇 Qwopus-18B - 92.7
🥈 Qwen3.6-27B - 92.4
🥉 Nemotron-Cascade-2-30B - 90.5
4️⃣ Hermes-4.3-36B - 84.3
~~
no model wins all four axes:
- Qwen 27B = perfect multi-step loops + instruction-following, but weakest long-context recall (~70%)
- Nemotron + Qwopus = flawless long-context (100%) but worst at multi-step (50%)
- Hermes 36B = solid, but OOMs at 64K context on 32GB → that 0 tanks its score
the "best agent model" genuinely depends on your workload.
~~
methodology
most "function-calling" benchmarks score JSON tool calls. Hermes is code-as-action, which means that the model writes Python. I tested that, under the real ~3.5K-token agent prompt.

4.9K

Andrey Kolesnikov@minviable_org·1d

@TheAhmadOsman traffic-driven quant downcast. PoS as the inverse of QoS.

Ahmad@TheAhmadOsman·3d

Opus 4.8 could be the same nerfed Opus 4.6 in 4-bit rather than 1.58-bit 🤡 I don't trust those clowns. Don't waste your money on a Claude Max subscription; they will keep rugpulling you.

Ahmad@TheAhmadOsman

Claude Code is so good at night/early morning before they start serving it quantized at 1.58-bit for the masses 🤡

104

9.9K

Andrey Kolesnikov@minviable_org·1d

@LottoLabs @ntbrown01 It is wicked fast, but they need to polish edges around reliability.

Lotto@LottoLabs·1d

@ntbrown01 Interesting

255

Lotto@LottoLabs·1d

Anthropic is right on the verge of getting lost in the sauce

2.7K

Andrey Kolesnikov@minviable_org·1d

@LottoLabs 27b-written code will power the software innovation of the next few years. Talk about a model punching way above its weight. Pun intended.

1.2K

Lotto@LottoLabs·1d

The biggest dark horse in all of AI right now is Qwen 27B on a 3090 going 70+ TPS. Literally 6-year-old tech that can think.

564

34K

Andrey Kolesnikov@minviable_org·1d

@dakshgup cpu compute that generates training data for subsequent gpu evolution

378

Daksh Gupta@dakshgup·1d

all coding is just turning gpu compute into cpu compute

129

10K

Andrey Kolesnikov@minviable_org·1d

@Hikari_07_jp Try @papercliping, it helped me regain my sanity. It supports Hermes, which I only use directly for esoteric hand surgeries now.

Hikari∣LocalLLM⚡@Hikari_07_jp·1d

To maximize throughput, it's necessary to run LLMs in parallel. However, when n increases to around 30, the harness can't handle it. I want to solve this problem on my own, so I'm planning to fork Hermes.

2.1K

Andrey Kolesnikov@minviable_org·1d

I have 64GB of low-CAS non-ECC and it’s fine. Color me uneducated, but I honestly don’t know why you’d want more RAM (unless it’s a Mac). A CPU with a large cache is much more consequential; X3D has a noticeable edge on Ryzen builds. The choke point is cross-card tensor parallelism over PCIe without NVLink. If the 6000s had NVLink, it would cannibalize a lot of their DC market.

117

Hikari∣LocalLLM⚡@Hikari_07_jp·1d

My setup has two RTX PRO 6000 cards. In this case, what RAM do you think would be ideal? I'm planning to upgrade in the near future and I'm torn between 512GB and 256GB. Please share your opinions.

1.8K

Andrey Kolesnikov@minviable_org·1d

Catching up on latest @theallinpod. It seems like local AI is becoming mainstream. I feel seen.

159

Andrey Kolesnikov@minviable_org·1d

@YashHustle_22 Codex doesn’t get a TIN or file an 83(b).

Yash@YashHustle_22·2d

Can you call yourself a founder if your entire product was built by Codex?

4.7K

Andrey Kolesnikov@minviable_org·1d

@rohitdotmittal What do you mean by AI? Camera roll, noise cancelling and SMS codes from Messages are AI, and we can’t function without those.

462

Rohit Mittal@rohitdotmittal·2d

It’s insane to think that Apple has zero AI execution. Absolutely zero. Nothing they did worked, and they are not even trying. Still, it’s a $4.6 trillion company.

204

1.3K

82.3K

Andrey Kolesnikov@minviable_org·1d

@usr_bin_roygbiv Always run if the HM interview is 30 min. That is exactly how much time they’d invest in you.

244

Roy@usr_bin_roygbiv·2d

Getting a job in SF:
> recruiter phone call - 15m
> hiring manager - 30m
> talk to owner/founder - 1h
> offer
Getting a job in NY:
> fill out our workday application
> on teams
> 3 HR screens
> hiring manager shows up hungover
> cheat lc
> you look like swes ex, don't get job

101

11.4K

Andrey Kolesnikov@minviable_org·1d

@ionthedev The best. Answers all the Whys instantly.

Ion@ionthedev·2d

The best part of the day as a founding engineer

1.2K
