Rahul

1.4K posts

Rahul

@guddur

Video games, East Bengal, Chelsea, Machine Learning @ a fmcg

Bengaluru Katılım Haziran 2009

198 Takip Edilen59 Takipçiler

Rahul@guddur·3d

Strait of chingrighata finally open.

Sreyashi Dey@SreyashiDey

First in my bloodline to see #ChingrihataMetro construction work 😍

English

193

Rahul retweetledi

le.hl@0xleegenz·3 Nis

Walking alone through a foreign city at night and realizing how far you’ve come has to be a top 3 peak moment of all time

English

938

22.6K

110.2K

19M

Rahul retweetledi

Fabrizio Romano@FabrizioRomano·1 Nis

🚨 BREAKING: Italy are OUT of the 2026 World Cup. Third World Cup missed in a row. ❌🇮🇹

English

7.3K

16.7K

177.9K

8.8M

Rahul@guddur·12 Mar

@naveen_venk Let's gooo! : )

English

Naveen Venkat@naveen_venk·11 Mar

Launching something we’ve been working for a while - AI agents to build chips faster. Applied to YC. Let’s see where this goes 🚀 archgen.tech

hari_haran@HariAyapps

@naveen_venk and I submitted our @ycombinator to build the next generation of AI chips with agents. Do you think we'll hear back? @garrytan @snowmaker @t_blom

English

2.8K

Rahul@guddur·27 Şub

@shantanugoel I will prefer reranking + semantics any day but actual data we deal with is shit. And don't have option to train our own embedding, which would solve 90% of issue. Long run much less token costs and better retrieval. But heh!

English

Rahul@guddur·27 Şub

@shantanugoel Yes. But only if your doc are clean. I tried this a memory for a agent I am working on. Failed horribly because the it is getting bs from vector search as it contains lingo that make no sense to the model (bad score).so moved back to this. I guess it work great if self train emb

English

Shantanu Goel@shantanugoel·27 Şub

Also trying an experiment with oxydra which renders "SKILLS" obsolete as mere context bloat. I'm trying a mode so when the model tries something new and makes errors and finally stumbles upon the right way, it would remember in long term memory what fails and what works, thus upskilling it self. With the awesome vector search baked into libsql, it can easily recall its past skills without bloating big skill sets into its context unnecessarily.

Shantanu Goel@shantanugoel

Updated oxydra last night so that it adapts to model capabilities. - If the model has ability to parse audio, it will send it to model directly otherwise the model will try to convert it to text (not baking in special tools by myself) - same for docs/images/videos etc

English

755

Rahul@guddur·27 Şub

@shantanugoel Another thing I found out building and storing a function by agent itself and write to file. And later recall that function from a function registery. Saves a lot of token on regular tasks. Where you don't have stored procedure/function.

English

Rahul@guddur·27 Şub

@shantanugoel You ideally can have subagent that gets triggered on failure with a job of searching skill and feed tha back to main. Generally very little context bloat here. But again depending on how you are extracting and storing skills. My vector search failed horribly because shit data stc

English

Rahul@guddur·23 Şub

Man it took a lot of effort write those books, codes for training 😌

Anthropic@AnthropicAI

We’ve identified industrial-scale distillation attacks on our models by DeepSeek, Moonshot AI, and MiniMax. These labs created over 24,000 fraudulent accounts and generated over 16 million exchanges with Claude, extracting its capabilities to train and improve their own models.

English

Rahul@guddur·21 Şub

@shantanugoel As a side effect of llm being markdown maniacs the documentation became standardized. Took a few years 😌

English

164

Shantanu Goel@shantanugoel·21 Şub

Remember when having these md files in your github repo was a taboo, and people used to gitignore *.md 😁

English

102

7.4K

Rahul@guddur·18 Şub

this is excellent. Training from zero token for indic langauge is huge huge job, cant wait to play with these @SarvamAI Also need a blog on that tokenizer please 🥺

English

Rahul retweetledi

Unsloth AI@UnslothAI·10 Şub

You can now train MoE models 12× faster with 35% less VRAM via our new Triton kernels (no accuracy loss). Train gpt-oss locally on 12.8GB VRAM. In collab with @HuggingFace, Unsloth trains DeepSeek, Qwen3, GLM faster. Repo: github.com/unslothai/unsl… Blog: unsloth.ai/docs/new/faste…

English

196

1.5K

212.3K

Rahul retweetledi

Neil Zeghidour@neilzegh·20 Oca

Me defending my O(n^3) solution to the coding interviewer.

English

416

49.3K

Rahul@guddur·9 Ara

@IndiGo6E wasn't the flight fare was capped by going. Or you got an extention that @DGCAIndia

English

Rahul@guddur·9 Ara

@MOO_THE_COW_MOO @Battlefield A copy of the game!

English

Vendetta@MOO_THE_COW_MOO·9 Ara

@guddur @Battlefield @guddur what did you get

English

Battlefield@Battlefield·9 Ara

Frozen solid out there 🥶 Squad up in the cold-weather fits. Winter Offensive arrives tomorrow in #Battlefield6 and #REDSEC. Don’t miss the comments 👀 there’s something buried in the snow ❄️

English

306

4.1K

351.2K

Rahul retweetledi

Anthropic@AnthropicAI·13 Kas

We disrupted a highly sophisticated AI-led espionage campaign. The attack targeted large tech companies, financial institutions, chemical manufacturing companies, and government agencies. We assess with high confidence that the threat actor was a Chinese state-sponsored group.

English

3.3K

21.2K

7.5M

Rahul retweetledi

elie@eliebakouch·30 Eki

Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…

English

126

6.8K

1.9M

Rahul@guddur·30 Eki

@ICICIBank_Care how can suddenly the account type and status can be changed, without informing the customer. How do you even manage this

English

Keşfet

@naveen_venk @shantanugoel @SarvamAI @HuggingFace @IndiGo6E @DGCAIndia @MOO_THE_COW_MOO @Battlefield