Saumitra Srivastav

147 posts

@_saumitra_

Bangalore, India · Joined February 2012
149 Following · 108 Followers
Saumitra Srivastav@_saumitra_·
Wrote a tutorial on how to create a lakehouse-based AI evaluation platform using an open-source stack.
Blog: saumitra.me/2026/2026-03-0…
Code: github.com/saumitras/ai-e…
We will see how to solve typical scale problems like:
1. Fragmented tooling: each team builds its own eval tooling, schemas, and scoring logic.
2. No shared standard: model, prompt, retriever, and dataset versions are tracked inconsistently, making cross-team governance and knowledge sharing hard.
3. Weak lineage: teams can see a score change but cannot reliably answer what exact configuration caused it.
4. Poor observability: traces and metrics are often separated from run metadata, which slows root-cause analysis.
5. Replay gaps: failures found in production cannot be deterministically reproduced for safe comparisons.
6. Throughput limits: simple eval pipelines cannot keep up with enterprise-scale experiment volume.
7. BI disconnect: analytics teams cannot easily query cross-app eval data through a single pane.
8. Failure patterns stay hidden: teams see individual failed cases, but without clustering they miss recurring failure modes and cannot prioritize fixes effectively.
Technologies: AWS #S3, @ApacheIceberg, @apachepolaris, @ApacheAirflow, @deepeval, @raydistributed, @apachekafka, @ApacheSpark, @PostgreSQL, @trinodb, @apachesuperset, Google Agent Development Kit, @OpenAI, #llm, #mcp
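As an illustration of the weak-lineage problem in item 3, here is a minimal sketch of one possible approach: fingerprint each eval run's configuration so that a score change can be traced to the exact field that differed. This is not taken from the linked tutorial; the function names (`config_fingerprint`, `changed_fields`) and the version strings are hypothetical.

```python
import hashlib
import json


def config_fingerprint(config: dict) -> str:
    """Return a stable SHA-256 hex digest for an eval run configuration.

    Keys are sorted before hashing, so two configs with the same
    content always produce the same fingerprint.
    """
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def changed_fields(old: dict, new: dict) -> list:
    """List the config keys whose values differ between two runs."""
    return sorted(k for k in old.keys() | new.keys() if old.get(k) != new.get(k))


# Hypothetical version labels, for illustration only.
run_a = {
    "model": "model-v1",
    "prompt": "prompt-v3",
    "retriever": "retriever-v2",
    "dataset": "dataset-v5",
}
run_b = dict(run_a, prompt="prompt-v4")

if config_fingerprint(run_a) != config_fingerprint(run_b):
    print("Config changed in:", changed_fields(run_a, run_b))
```

Storing the fingerprint next to each score would let a query group results by exact configuration, under the assumption that every version that affects the result is captured in the config dict.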
Saumitra Srivastav@_saumitra_·
@gunnarmorling Tried it. After 3-4 months of handling prod edge cases, we eventually ended up with something similar to a schema registry, so we dropped it in the next release and switched back to Confluent's 🙂 Not worth pursuing IMO
Gunnar Morling 🌍@gunnarmorling·
Question for the Kafka community: has anyone ever explored a (de-)serializer which would keep JSON/Avro schemas within a Kafka topic? I.e. in this model, there'd be no registry whatsoever, provided all the schemas could be kept in memory for efficient access. Worth exploring?
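The idea in the question above can be sketched roughly as follows. This is a toy model only: a plain Python list stands in for the Kafka topic, no real Kafka client or (de-)serializer API is used, and every name here (`schema_id`, `InTopicSchemaProducer`, `consume`) is hypothetical.

```python
import hashlib
import json


def schema_id(schema: dict) -> str:
    """Derive a short, stable ID from the schema content."""
    canonical = json.dumps(schema, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:16]


class InTopicSchemaProducer:
    """Writes each new schema into the topic once, ahead of the data that uses it."""

    def __init__(self, topic: list):
        self.topic = topic        # stand-in for a Kafka topic (append-only list)
        self.published = set()    # schema IDs already written to the topic

    def send(self, schema: dict, payload: dict) -> None:
        sid = schema_id(schema)
        if sid not in self.published:
            self.topic.append({"type": "schema", "id": sid, "schema": schema})
            self.published.add(sid)
        self.topic.append({"type": "data", "schema_id": sid, "payload": payload})


def consume(topic: list) -> list:
    """Replay the topic from the start, caching schemas in memory as they appear."""
    schemas = {}
    decoded = []
    for record in topic:
        if record["type"] == "schema":
            schemas[record["id"]] = record["schema"]
        else:
            schema = schemas[record["schema_id"]]
            # Project the payload onto the fields the schema declares.
            decoded.append({f: record["payload"].get(f) for f in schema["fields"]})
    return decoded
```

In this sketch a consumer that starts reading after the schema record would hit a `KeyError`, since it never saw the schema. Handling cases like that is the kind of work that tends to push such a design back toward a registry.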
Saumitra Srivastav@_saumitra_·
@manjunath_t_m @rockthejvm I use Scala as the primary language wherever available, but with Java implementing ideas from Scala and poor ecosystem support for Scala 3, I am concerned about Scala's future. Especially in the ML ecosystem, it's now much easier to use Python or Java because of library support
Manjunath T M@manjunath_t_m·
@rockthejvm Scala is capable of more than just writing Spark jobs. I would like to see Scala used more for building rock-solid products
Rock the JVM@rockthejvm·
Just finished a live 2-day #Scala training session with Microsoft (!). Apparently they needed it for Spark. They had no idea what Scala is capable of. Left the training blown away.
Saumitra Srivastav@_saumitra_·
@moxie Any non-blockchain engineer willing to get into web3 should start by learning about the currently available L1 & L2 chains, and not just focus on bitcoin and eth. Even @opensea is taking steps in the right direction by adding @0xPolygon to reduce or get rid of gas fees... 3/n
Saumitra Srivastav@_saumitra_·
@moxie Anyone new coming into the space sees hyped NFT/metaverse projects in their current form as web3, but they are not. a16z-backed OpenSea, when using eth-1.0, is a misleading example for explaining to someone what web3 is/will be... 2/n
Saumitra Srivastav@_saumitra_·
@tlberglund this is brilliant @tlberglund. I would pay to watch a series "The legend of Bare Metalsson" where he leads a cult of Metalsson(s) in a (losing??) battle against the evil cloud. Or perhaps an origin story🦹‍♂️ More Bare Metalsson, please!😂
Saumitra Srivastav@_saumitra_·
@gwenshap I try to maintain a knowledge base of "what brought my X service down". I have a few items for Kafka too🙂 It would be great to have a centralized wiki of "what brought my Kafka cluster down" as a troubleshooting guide for beginners. Is there one already where I can add mine too? 2/2
Saumitra Srivastav@_saumitra_·
@gwenshap That's because things that are obvious to experts, who understand the underlying architecture, code and config, are not known to beginners, and hence they don't mind messing with those. It's not that hard to severely degrade the performance of a Kafka cluster🙂 1/2
Gwen (Chen) Shapira@gwenshap·
Still amazed at how it sometimes takes us weeks to try and reproduce specific workloads that bring Kafka to its knees, while our least knowledgeable customers do this effortlessly.
Saumitra Srivastav@_saumitra_·
Congratulations @nehanarkhede to you and the entire @confluentinc team! It's inspirational, and I feel proud to see a fellow Indian at the forefront of a company and technology that will undoubtedly power the whole world in the coming years. 🇮🇳🚀🎉
Neha Narkhede@nehanarkhede

It's @confluentinc's 5th birthday and I got a chance to inaugurate our first office in my home country 🇮🇳 with our amazing team in Bangalore. This goes pretty high up in the list of highlights on this immigrant founder journey 🌟 Happy 5th birthday, Confluent! 🧡💙

Kelly Sommers@kellabyte·
Since it takes around 10 years for a programming language or database to gain mass popularity I wonder which ones today are going to boom tomorrow. What are your guesses?
Saumitra Srivastav retweeted
Neha Narkhede@nehanarkhede·
Really looking forward to my first @apachekafka meetup keynote in India. I will be speaking at one of the largest Kafka meetup groups in the world today at 2pm. Thanks for hosting us @hotstartweets and hope to meet the amazing tech community in Bangalore! meetup.com/Bangalore-Apac…