Guna Sekhar Venkata Chennaiah Chakka
@codevlogger
714 posts · Joined July 2023
321 Following · 92 Followers
Guna Sekhar Venkata Chennaiah Chakka
@lambatameya @LangChain Your point connects directly to how differently closed-source vendors manage their infra. For example, when we use the GPT 4.2 model in create_agent, we don't see any content in the AI message when it contains a tool call; that's only possible with reasoning models, whereas with Claude it works 👍
Ameya
Ameya@lambatameya·
@LangChain the hard part isn't the non-determinism. it's visibility. when an agent fails you can't just check logs. you need to understand what context it saw, what it reasoned about, what information gaps existed. production agents need architecture visibility, not just code observability.
LangChain
LangChain@LangChain·
💫 New LangChain Academy Course: Building Reliable Agents 💫 Shipping agents to production is hard. Traditional software is deterministic – when something breaks, you check the logs and fix the code. But agents rely on non-deterministic models. Add multi-step reasoning, tool use, and real user traffic, and building reliable agents becomes far more complex than traditional system design. The goal of this course is to teach you how to take an agent from first run to production-ready system through iterative cycles of improvement. You’ll learn how to do this with LangSmith, our agent engineering platform for observing, evaluating, and deploying agents.
Siddhant Khare
Siddhant Khare@Siddhant_K_code·
yes!! The reasoning trace is the closest thing we have to "why". But it shouldn't depend on the model vendor. The observability layer should capture the decision context (what was in the window, what tools were available, what the agent considered) regardless of whether the model exposes chain-of-thought. That's an infra thing, not a model feature.
Siddhant Khare
Siddhant Khare@Siddhant_K_code·
I wrote the full thing. A week ago, I discussed the gaps in agent observability in a thread. Session-level performance, context changes, signal-to-noise, production traces. A lot of you had the same frustrations. So I wrote it up properly. We have better observability for a Node.js service than for an AI agent that just rewrote half a codebase.
[image attached]
Guna Sekhar Venkata Chennaiah Chakka
@Siddhant_K_code @0xkanth Sometimes I feel every tool call needs to come with a message explaining why the agent is making that call. That would help a lot with debugging and understanding the system. Unfortunately this is only possible with GPT reasoning models, whereas Claude supports it 🌋
Siddhant Khare
Siddhant Khare@Siddhant_K_code·
@0xkanth LangSmith traces LLM calls, but that's just one layer. The gap is everything around it: which tool calls led to a file write, why the agent chose path A over path B, what context was in the window when it made that decision.
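The "capture the decision context" idea in this thread can be sketched as a thin wrapper that records, for every tool call, what the agent could actually see at that moment. This is a hypothetical sketch, not LangSmith's actual API; all names here are invented for illustration:

```python
import time

def log_tool_call(trace, tool_name, args, context_window, available_tools, rationale=None):
    """Record the decision context around a tool call, not just the call itself."""
    trace.append({
        "ts": time.time(),
        "tool": tool_name,
        "args": args,
        # what the model could actually see when it decided
        "context_snapshot": [m["role"] + ": " + m["content"][:80] for m in context_window],
        "tools_available": available_tools,
        # model-provided reasoning, if the vendor exposes any
        "rationale": rationale,
    })

trace = []
log_tool_call(
    trace,
    tool_name="write_file",
    args={"path": "app.py"},
    context_window=[{"role": "user", "content": "fix the failing test"}],
    available_tools=["read_file", "write_file", "run_tests"],
    rationale="test expects a return value; patching app.py",
)
print(trace[0]["tool"])  # → write_file
```

The point of the snapshot fields is that they are vendor-independent: even when the model exposes no chain-of-thought, the infra layer can still answer "what did the agent see, and what could it have done" after a failure.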
Karthik
Karthik@karthikponna19·
ZOOM IN. As a developer, tell me the first word you see?
[image attached]
Jaydeep
Jaydeep@_jaydeepkarale·
Interviewer: If vector databases store embeddings, why can two similar sentences still return different search results?
Manash
Manash@shiba14857·
@cce_iisc Where is the link to apply?
will brown
will brown@willccbb·
agent skills are so sick bc they let you use retrieval for context which then augments generation
Sumit Mittal
Sumit Mittal@bigdatasumit·
I am giving you free access to my complete 50 Days SQL Superstar Program. The earlier SQL playlist helped millions, but interviews today need more depth. So I am rebuilding everything from scratch to help you crack top product-based companies. I am also organising it in one clean portal with daily videos, notes, datasets, quizzes and certificates. To get the enrolment link just:
- Follow me so that I can DM you
- Like and Retweet
- Comment "SQL Superstar"
Free access available for a limited time. These 50 days can change your life! #sql #dataengineering #databases
Guna Sekhar Venkata Chennaiah Chakka
@HrishbhDalal OK, yes, he is feeding in the experiment history, which is a stack of the experiments' text. Then, as you said, it leverages its internal knowledge to guess the changes to the code, etc. Thanks for the reply; I had missed the bit that it uses its internal knowledge.
Hrishbh Dalal
Hrishbh Dalal@HrishbhDalal·
@codevlogger He must essentially be feeding in the history of experiments and then asking the model what to tune. Since models have great knowledge of scientific papers, they know which knobs to turn, and I can imagine Anthropic is also leading efforts in the research domain. He is trying to make it an MDP.
Guna Sekhar Venkata Chennaiah Chakka
@hkproj Every time, the agent only sees some stack of messages like the attached image, and it tries to guess architectural changes based on that, right? Isn't that like random guessing without any controlled setup? I'm curious how to see the changes the agent made in each experiment.
[image attached]
Guna Sekhar Venkata Chennaiah Chakka
@DCP_Chicago @karpathy @tobi Every time, the agent only sees some stack of messages like the attached image, and it tries to guess architectural changes based on that, right? Isn't that like random guessing without any controlled setup? I'm curious how to see the changes the agent made in each experiment.
[image attached]
David Porter
David Porter@DCP_Chicago·
@karpathy @tobi most of the commits I see in the experiment git branches are updating single values. Has the agent attempted to rewrite code yet?
tobi lutke
tobi lutke@tobi·
OK this thing is totally insane. Before going to bed I:
- tried to make a new qmdresearcher directory
- told my pi to read this github repo and make a version of that for the qmd query-expansion model, with the goal of highest quality score and speed, getting training data from the tobi/qmd github
- woke up to a +19% score on a 0.8b model (higher than the previous 1.6b) after 8 hours and 37 experiments
I'm not an ML researcher, of course. I'm sure way more sophisticated stuff is being done by real researchers. But it's mesmerizing to just read it reasoning its way through the experiments. I learned more from that than months of following ML researchers. I just asked it to also make a new reranker and it's already got a higher base score than the previous one. Incredible.
Andrej Karpathy@karpathy

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one-file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)
The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. github.com/karpathy/autor… Part code, part sci-fi, and a pinch of psychosis :)

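The loop Karpathy describes (agent proposes a change, a fixed-budget training run scores it, only improvements get committed) can be caricatured in a few lines. This is a toy sketch of the loop's shape, not the autoresearch code: the "agent" is a stub that perturbs settings, and `run_training` is a stand-in objective:

```python
import random

def run_training(settings):
    """Toy stand-in for a 5-minute LLM training run: lower loss is better."""
    # pretend validation loss depends on a single learning-rate knob
    return (settings["lr"] - 0.003) ** 2 + 1.0

def propose(best_settings, rng):
    """Stub agent: perturb the current best settings."""
    return {"lr": best_settings["lr"] * rng.uniform(0.5, 2.0)}

rng = random.Random(0)
best = {"lr": 0.01}
best_loss = run_training(best)
history = []  # plays the role of the git commit log on the feature branch

for step in range(37):  # 37 experiments, as in the tweet above
    candidate = propose(best, rng)
    loss = run_training(candidate)
    history.append((step, candidate, loss))
    if loss < best_loss:          # only "commit" strict improvements
        best, best_loss = candidate, loss

print(best_loss <= run_training({"lr": 0.01}))  # → True: the loop never regresses
```

The commit-on-improvement gate is what makes the history monotone; everything interesting in the real setup lives inside `propose`, where the agent reasons about which knob to turn next.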
Krishna
Krishna@KrishXCodes·
3 things that improved my RAG pipelines:
1. Better chunking strategies
2. HQG (hypothetical question generation)
3. Context pruning
Small changes. Huge difference.
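Of the three, chunking is the easiest to show concretely. A minimal sliding-window chunker with overlap (the sizes are arbitrary, and real pipelines usually split on sentence or section boundaries rather than raw characters):

```python
def chunk_text(text, chunk_size=200, overlap=50):
    """Fixed-size sliding window over characters, with overlap between chunks."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, max(len(text) - overlap, 1), step):
        chunks.append(text[start:start + chunk_size])
    return chunks

doc = "a" * 500
chunks = chunk_text(doc)
print(len(chunks))                            # → 3
print(all(len(c) <= 200 for c in chunks))     # → True
```

The overlap is the part that matters for retrieval quality: a fact straddling a chunk boundary still appears whole in at least one chunk, at the cost of some index redundancy.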
Guna Sekhar Venkata Chennaiah Chakka
@beirmug @nityasnotes The equation itself has striking division behavior. Especially in the IDF: say we have 10,000 docs and a word appears in only 5 of them (big number / small number = big -> log -> ☠️). So it's effectively not checking whether the word is important, just giving it special weight because it's rare (which kills us).
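The big-number/small-number effect described above is easy to see numerically. Using the common BM25 IDF form (the exact formula varies slightly between implementations; this is the Lucene-style variant):

```python
import math

def bm25_idf(n_docs, doc_freq):
    """Robertson/Sparck-Jones style IDF as used in BM25 (Lucene variant)."""
    return math.log(1 + (n_docs - doc_freq + 0.5) / (doc_freq + 0.5))

N = 10_000
rare = bm25_idf(N, 5)        # word appears in only 5 docs
common = bm25_idf(N, 5_000)  # word appears in half the docs
print(round(rare, 2), round(common, 2))  # → 7.51 0.69
```

The rare term gets roughly an order of magnitude more weight than the common one, purely from its document frequency; whether the rare term actually carries the query's meaning never enters the formula.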
Nandan Thakur
Nandan Thakur@beirmug·
@nityasnotes BM25 always teaches you something. Sometime in 2023, I realized how important the document normalisation term within the equation is.
Grok
Grok@grok·
@codevlogger @YuriKushch @ChShersh Thanks for sharing that deep dive into the algorithm optimization. The step-by-step breakdown is super helpful for devs looking to level up efficiency. What's your next project focusing on?
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
Interview question: You have 1B+ text messages of size 1-500 symbols stored in an SQL database. How do you quickly search for all messages containing all words from a given list?
Grok
Grok@grok·
Use full-text search indexing on the message column (standard in Postgres/MySQL/SQL Server).
Postgres: GIN index on to_tsvector('english', message). Query: to_tsvector('english', message) @@ to_tsquery('word1 & word2 & ...')
MySQL: FULLTEXT index. Query: MATCH(message) AGAINST('+word1 +word2' IN BOOLEAN MODE)
Handles 1B+ rows efficiently via an inverted index; scales with partitioning if needed. A naive multi-LIKE scan is too slow.
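The same inverted-index approach is available in SQLite via FTS5, which makes the idea easy to try locally (Postgres GIN and MySQL FULLTEXT are the production-scale equivalents Grok mentions). A minimal sketch with an in-memory database and toy messages:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# FTS5 virtual table: builds an inverted index over the body column
conn.execute("CREATE VIRTUAL TABLE messages USING fts5(body)")
conn.executemany(
    "INSERT INTO messages (body) VALUES (?)",
    [("hello world foo",), ("hello there",), ("foo world hello again",)],
)
# AND semantics: every word must appear, answered from the index, not a scan
rows = conn.execute(
    "SELECT body FROM messages WHERE messages MATCH 'hello AND world'"
).fetchall()
print(sorted(r[0] for r in rows))
```

The `MATCH 'hello AND world'` query touches only the posting lists for the two words and intersects them, which is why the same structure stays fast at 1B+ rows while stacked `LIKE '%word%'` predicates degrade to a full scan.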
Shantanu | शंतनु
Shantanu | शंतनु@iamShantanu_D·
Delighted to tell everyone that today I was invited to the Amazon Web Services office and presented with the AWS Golden Jacket. It represents a series of milestones I'm deeply proud of:
🏆 1st from Mumbai 🥳
🏆 1st from my Company 🥳
🇮🇳 11th in India 🥳
Guna Sekhar Venkata Chennaiah Chakka
@prajdabre Why can't it be something like forgetting quantization, or training in a high-memory-consumption dtype, or even the silly case where, due to inefficient compute, some of the layers got offloaded to CPU 😅
Raj Dabre
Raj Dabre@prajdabre·
Basic ML interview question: You are a massive believer in LoRA due to a lack of compute. You took a dense model and fine-tuned it on your dataset with LoRA at rank 2. You then start running inference but find that this model is a lot slower than the original model you fine-tuned. What is the one thing you forgot to do before inference?
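The mechanics behind the question: LoRA adds a low-rank update W + (alpha/r)·B·A alongside the frozen weight, and serving the adapter unmerged costs an extra pair of matmuls per layer. A numpy sketch of folding the adapter back into the base weight, which removes that overhead without changing the function (shapes here are tiny and hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 8, 2                       # hidden size, LoRA rank
W = rng.standard_normal((d, d))   # frozen base weight
A = rng.standard_normal((r, d))   # trained LoRA down-projection
B = rng.standard_normal((d, r))   # trained LoRA up-projection
alpha = 4.0                       # LoRA scaling numerator

def forward_unmerged(x):
    # two extra matmuls per layer: the slow serving path
    return x @ W.T + (x @ A.T) @ B.T * (alpha / r)

# fold the adapter into the base weight once, before serving
W_merged = W + (alpha / r) * B @ A

def forward_merged(x):
    # single matmul, numerically the same function
    return x @ W_merged.T

x = rng.standard_normal((3, d))
print(np.allclose(forward_unmerged(x), forward_merged(x)))  # → True
```

The equality holds because (B·A)ᵀ = Aᵀ·Bᵀ, so the unmerged path computes x·(W + (alpha/r)·B·A)ᵀ term by term; merging just precomputes the sum.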