Rahul

1.4K posts

Rahul

Rahul

@guddur

Video games, East Bengal, Chelsea, Machine Learning @ a fmcg

Bengaluru Katılım Haziran 2009
198 Takip Edilen59 Takipçiler
Rahul retweetledi
le.hl
le.hl@0xleegenz·
Walking alone through a foreign city at night and realizing how far you’ve come has to be a top 3 peak moment of all time
English
938
22.6K
110.2K
19M
Rahul retweetledi
Fabrizio Romano
Fabrizio Romano@FabrizioRomano·
🚨 BREAKING: Italy are OUT of the 2026 World Cup. Third World Cup missed in a row. ❌🇮🇹
Fabrizio Romano tweet media
English
7.3K
16.7K
177.9K
8.8M
Rahul
Rahul@guddur·
@shantanugoel I will prefer reranking + semantics any day but actual data we deal with is shit. And don't have option to train our own embedding, which would solve 90% of issue. Long run much less token costs and better retrieval. But heh!
English
0
0
1
11
Rahul
Rahul@guddur·
@shantanugoel Yes. But only if your doc are clean. I tried this a memory for a agent I am working on. Failed horribly because the it is getting bs from vector search as it contains lingo that make no sense to the model (bad score).so moved back to this. I guess it work great if self train emb
English
1
0
1
10
Shantanu Goel
Shantanu Goel@shantanugoel·
Also trying an experiment with oxydra which renders "SKILLS" obsolete as mere context bloat. I'm trying a mode so when the model tries something new and makes errors and finally stumbles upon the right way, it would remember in long term memory what fails and what works, thus upskilling it self. With the awesome vector search baked into libsql, it can easily recall its past skills without bloating big skill sets into its context unnecessarily.
Shantanu Goel@shantanugoel

Updated oxydra last night so that it adapts to model capabilities. - If the model has ability to parse audio, it will send it to model directly otherwise the model will try to convert it to text (not baking in special tools by myself) - same for docs/images/videos etc

English
2
0
6
755
Rahul
Rahul@guddur·
@shantanugoel Another thing I found out building and storing a function by agent itself and write to file. And later recall that function from a function registery. Saves a lot of token on regular tasks. Where you don't have stored procedure/function.
English
0
0
1
13
Rahul
Rahul@guddur·
@shantanugoel You ideally can have subagent that gets triggered on failure with a job of searching skill and feed tha back to main. Generally very little context bloat here. But again depending on how you are extracting and storing skills. My vector search failed horribly because shit data stc
English
1
0
1
26
Rahul
Rahul@guddur·
@shantanugoel As a side effect of llm being markdown maniacs the documentation became standardized. Took a few years 😌
English
0
0
1
164
Shantanu Goel
Shantanu Goel@shantanugoel·
Remember when having these md files in your github repo was a taboo, and people used to gitignore *.md 😁
Shantanu Goel tweet media
English
12
0
102
7.4K
Rahul
Rahul@guddur·
this is excellent. Training from zero token for indic langauge is huge huge job, cant wait to play with these @SarvamAI Also need a blog on that tokenizer please 🥺
English
0
0
0
26
Rahul retweetledi
Neil Zeghidour
Neil Zeghidour@neilzegh·
Me defending my O(n^3) solution to the coding interviewer.
English
416
5K
49.3K
4M
Rahul
Rahul@guddur·
@IndiGo6E wasn't the flight fare was capped by going. Or you got an extention that @DGCAIndia
Rahul tweet media
English
0
0
0
7
Battlefield
Battlefield@Battlefield·
Frozen solid out there 🥶 Squad up in the cold-weather fits. Winter Offensive arrives tomorrow in #Battlefield6 and #REDSEC. Don’t miss the comments 👀 there’s something buried in the snow ❄️
Battlefield tweet media
English
2K
306
4.1K
351.2K
Rahul retweetledi
Anthropic
Anthropic@AnthropicAI·
We disrupted a highly sophisticated AI-led espionage campaign. The attack targeted large tech companies, financial institutions, chemical manufacturing companies, and government agencies. We assess with high confidence that the threat actor was a Chinese state-sponsored group.
English
1K
3.3K
21.2K
7.5M
Rahul retweetledi
elie
elie@eliebakouch·
Training LLMs end to end is hard. Very excited to share our new blog (book?) that cover the full pipeline: pre-training, post-training and infra. 200+ pages of what worked, what didn’t, and how to make it run reliably huggingface.co/spaces/Hugging…
elie tweet media
English
126
1K
6.8K
1.9M
Rahul
Rahul@guddur·
@ICICIBank_Care how can suddenly the account type and status can be changed, without informing the customer. How do you even manage this
English
1
0
0
10