Nilay

317 posts

Nilay

@localm_tuts

AI Researcher | 2x Native AI Startups | Follow for tutorials, podcasts, and hacks to keep your skills sharp. For Hindi follow @localm_hi

London | San Jose Katılım Aralık 2025

25 Takip Edilen15 Takipçiler

Nilay@localm_tuts·22h

@HemantDotDev vscode - never support whol lock the door.

English

Hemant@HemantDotDev·1d

Be honest, which IDE is the best in this AI era ?

English

2.1K

Nilay@localm_tuts·22h

@mr_r0b0t @xyster @MemoryReboot_ @intel @NVIDIAAI tobe honest, tokenmaxxing has no meaning unlese prefixed with `local` - localtokenmaxx I am at 260M tokens, and only $8 (premium - external) ;) 3xGB10 (one cluster) – various models + 1xGB10 (Qwen 27B or 35B A3B)

English

mr-r0b0t@mr_r0b0t·1d

We made it to my "Today's News" @xyster @MemoryReboot_ 🤣🤣 Looks like we need to do more testing!

English

794

Nilay@localm_tuts·22h

@mr_r0b0t @Tono_Ken3 Yes, I assume for the size of minimax and supporting reasonable context for multiturn (with a 20% token increase turn by turn to support up to 4-6 turns with 64K context), you are looking at 396GB VRAM for 4-6 parallel streams.

English

mr-r0b0t@mr_r0b0t·1d

@Tono_Ken3 You need 256GB for this, I was nearly OOM 😭

English

775

TonoKen3🌏文明航海士©｜とのけん3@Tono_Ken3·1d

Imagine this 8x RTX Pro 2000s with 128GB of VRAM have aggregate throughput greater than 2x GB10s... MiniMax 3.0 is about to be released.

mr-r0b0t@mr_r0b0t

16 local AI agents streaming at once! MiniMax M2.7 NVFP4 — 2x GB10, no cloud APIs.

English

9.2K

Nilay@localm_tuts·2d

@mr_r0b0t @NVIDIAAI nice one ;) lets get the ball rolling.

English

mr-r0b0t@mr_r0b0t·2d

Are you considering a @NVIDIAAI DGX Spark or GB10? Looking for tips and best practices for the one you have? Want to show off your new projects? Join the newly formed DGX (GB10) User Group! I'll be there and happy to help as best I'm able ♥️ x.com/i/chat/group_j…

English

7.7K

Nilay@localm_tuts·2d

@SpaceTimeViking @NVIDIAAI I think next round of NEMOs on way ;) that's my best hunch.

English

ÆON FORGE ✨@SpaceTimeViking·2d

@NVIDIAAI is cooking up something “Ultra” and this could be their big break. The post training model has so much potential distilling the weights and data down to the purest form possible. Isolating the signal and removing the noise. Scaling that up could be a big deal.

NVIDIA AI@NVIDIAAI

@TheAhmadOsman 👀 "Ultra" ⏳️

English

440

Nilay@localm_tuts·2d

@thegenioo if you have one - great, if you haven't that's not blockers. key is multiturn accuracy. one can have image agent.

English

Hamza@thegenioo·3d

@localm_tuts why not??

English

275

Hamza@thegenioo·3d

The moment DeepSeek gets vision You absolutely don’t need any other chinese model if you need reliability, performance, cost effectiveness and speed

DeepSeek@deepseek_ai

We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀

English

208

17.4K

Nilay@localm_tuts·2d

FYI. this is experimental for dgx sparks only - optimised with modelopt from BF16 image, and post quant-optimised. I haven't benchmark to validate loss (but my hunch is it is lossless) - as NVFP4 was < 0.02 it use llamacpp's nvfp4 underlaying... huggingface.co/nilayparikh/Qw…

English

Nilay@localm_tuts·2d

I am so impressed with llamacpp and especially when combined mtp and ngram-mod - qwen 3.6 27b - fly up to 30 toks/s @ 192K context size, on single node DGX spark. With far better accuracy. FYI. I am running my own experimental model - details in thread

English

Nilay@localm_tuts·2d

@KaiXCreator everyother. only nonserious actors use that

English

Kaito@KaiXCreator·2d

Can you name a programming language better than JavaScript ?

English

159

107

14K

Nilay@localm_tuts·2d

@alexabelonix thanks

English

Alexa Web3 (e/acc)@alexabelonix·2d

@localm_tuts really good build.

English

Nilay@localm_tuts·2d

I get this question often: Why are you capturing all your in/out via proxy even when you are a lone person using this? Lone or not lone, it doesn't matter; an agentic conversation with an LLM, along with user inputs, is a gold mine. Next 2 years are all about RLHF.

English

Nilay@localm_tuts·3d

@ash_twtz They have no value, instead they need to keep mastering interface. Yes they should build and invest into mid and small models, as it will increase demand for their prosumer based.

English

Mr Ash@ash_twtz·3d

What’s stopping NVIDIA from creating the world’s best AI model?

English

154

165

11.4K

Nilay@localm_tuts·3d

@gregpr07 Can it meet my acceptance criteria? Yes than rest is noise as legally can buy a service.

English

Gregor Zunic@gregpr07·3d

I think I know why deepseek is so good

English

272

153

5.9K

441.5K

Nilay@localm_tuts·3d

@ylecun Because there will be midterms in few months !

English

Yann LeCun@ylecun·3d

Why?

The Wall Street Journal@WSJ

Most green-card applicants will need to go abroad to apply for permanent residency at an American consulate, rather than filing from within the U.S. as they do now, the Trump administration announced Friday. on.wsj.com/4v2Fqkr

QST

401

152

3.8K

783.4K

Nilay@localm_tuts·3d

Most beautiful image, when comes to agentic solutions :D

English

Nilay@localm_tuts·3d

I am token rich! I have ratio of 700:1 (input to output) ratio for input to output.

DeepSeek@deepseek_ai

We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀

English

Nilay@localm_tuts·3d

@wieslawsoltes do you know `kill switch` - gh copilot has turned on

English

Wiesław Šoltés@wieslawsoltes·3d

Gemini 3.5 Flash in GitHub Copilot has 14x multiplier basically on par with Claude Opus 4.7 🤡

English

1.2K

Nilay@localm_tuts·3d

When coding agents do retrospective at end of week, what went well, what was off. Now updating agents.md and agent definition. that's my claw moment, when they are in discussion what skill will improve.

English

Nilay@localm_tuts·3d

I am without OPUS since 29 days I am without GPT 5.5 since 29 days I am without GPT 5.4 since a week And I survived - so can you! API (NVIDIA++) + Local 👏

DeepSeek@deepseek_ai

We are making our discount permanent! 🎉 Enjoy building with DeepSeek-V4-Pro and bring your innovative ideas to life! 🚀

English

Keşfet

@HemantDotDev @mr_r0b0t @xyster @MemoryReboot_ @intel @NVIDIAAI @Tono_Ken3 @SpaceTimeViking