Gaurav Kumar

1.2K posts

Gaurav Kumar banner
Gaurav Kumar

Gaurav Kumar

@cereal_learner

Data, AI, Rust, Linux | Architect in @JioHotstar

Katılım Mart 2010
1K Takip Edilen244 Takipçiler
Sabitlenmiş Tweet
Gaurav Kumar
Gaurav Kumar@cereal_learner·
Excellent Opportunity to learn how @JioHotstar scales up 📈 in this 1st edition of ScaleUP, I will be presenting how we do Data@WC scale in 8-bit style 🕹️
Sunny R Gupta 🐰@sunnykgupta

🚀 ScaleUp by JioHotstar | Deep Tech Meetup Hey everyone! I’m super excited to share that we’re hosting ScaleUp by JioHotstar, an in-office tech meetup where our engineers (including @PracSharma, @jiten, @cereal_learner) will talk about how we build and operate systems that power millions of concurrent users across the globe. This is where we’ll go behind the scenes of video streaming at scale… the systems, the decisions, the trade-offs, and a few battle stories from production 😄 📍 BLR | JioHotstar – PTP Office 📆 16th January 2026, 5:30 pm 🎟️ Limited seats | No walk-ins - Registrations are open for engineers, builders, and tech enthusiasts who love deep tech conversations. - Profiles will be screened before we send out the final invites. 💡 Why this matters: If you’ve ever been curious about how large-scale streaming, personalized recommendations, or dynamic ad systems work under the hood… this is your chance to learn, connect, and discuss with the team that builds it! 👉 Register to attend: luma.com/y7g1oxf6 Let’s meet, learn, and geek out together — this one’s going to be special! 🙌

English
0
1
4
434
Asfi
Asfi@AsfiShaheen·
I get it. Here’s what I did with tool calls 1. After writing my gold set I treat tool definitions similar to schema ie fact. So there’s a tool in one of my signatures for exploring nearest neighbor of a target page. The description I take as fact 2. What gets optimized is the second signature which has access to this tool. My gold set has already specified situations on when to call this tool. All GEPA now does is morph the instruction text of the signature which decides when to call this tool that’s why I find the fact vs instruction difference helpful. Worth mentioning tool definitions are a sort of hybrid. I have actually played with running GEPA on tool definitions themselves but wasn’t as useful.
English
1
0
1
51
Asfi
Asfi@AsfiShaheen·
GEPA learnings: optimize instructions, freeze facts A few months ago I bumped into GEPA. Was overjoyed because it helped me tag close to a million PDF pages accurately with a low cost LLM. So I pushed data set on psxGPT, a version worked, got to a 1,000 users. But then my serious users started asking for structured data and ability to model more easily. I added those, hand crafted some rules to handle routing complexity, worked for a while but reliability sucked so I reverted to older commit. As I started debugging and re creating my backend in a single Jupyter notebook and re running evals, gaps became clearer. Signature boundaries weren’t tightly defined. As I closed those one question remained: What should GEPA optimize and what should it leave alone? Where i ended: freeze facts, optimize all else. Previously I never touched Signature docstring. But I notice letting GEPA change docstring is super effective (DSPy Signatures have a little docstring instruction in addition to input and output). For me what vs how distinction isn’t helpful. Fact vs instruction is clearer. So schema is fact, column name fact, ticker set fact, list of canonical fact. Everything else: instruction and therefore should be optimized. This has also led to me designing my context folder by Signature and separated by schema.md (which I don’t optimize) and a bunch of others which I do. This process has raised another important question When is the right time to run GEPA? Only when signature boundaries are super tight and you’re very sure no obvious instruction is left out. Meaning whatever error is arising is just model drift and not a function of your own instruction sloppiness. So for instance: if a stage requires inserting canonical names of a bank’s balance sheet and you didn’t insert those only to see model getting lost, fix that first. Writing the spec, tests, defining signature boundaries, manually checking code blocks in Jupyter notebook is tedious yes but essential. I like to rebuild until cognitive load vanishes. That’s when I know that I actually “know”. Otherwise I’m winging it and that feels unsettling. GEPA now has an API which is very easy to use. I prefer it over using the DSPy library. There are some similar optimizers out there too. I don’t understand the underlying genetic Pareto stuff well enough but this API works very well for me. Swapping DeepSeek V3.2 for Gemini 3.1 flash lite was a breeze.
English
3
7
112
5.4K
Gaurav Kumar
Gaurav Kumar@cereal_learner·
@AsfiShaheen I have good experience with DSPy and some with GEPA but felt that the current way of improving tool call usages/descriptions is bit hand wavy.
English
1
0
0
62
Asfi
Asfi@AsfiShaheen·
No blog or code to share sir, it’s deeply embedded in a private repo. In any case all you need for every signature (including tool calls) is ~25 gold set examples of what correct looks like. After that it’s plug and play. Give your coding program their api docs and your gold set examples, rest will be very straight forward.
English
1
0
2
190
Gaurav Kumar
Gaurav Kumar@cereal_learner·
Hyrum's Law: with a sufficient number of users of an API, it does not matter what you promise in the contract: all observable behaviors of your system will be depended on by somebody. Making Postgres connections faster broke implicit rate limiters!
Clerk@clerk

We are incredibly sorry for our downtime this week. The incident is now fully resolved and we have published a postmortem, including root cause and remediations. clerk.com/blog/2025-09-1…

English
1
0
2
402
Neelesh Salian 💻
Neelesh Salian 💻@nssalian·
Confluent CFLT Stock is way down. Flink might not be working out.
Neelesh Salian 💻 tweet media
English
1
0
5
753
DHH
DHH@dhh·
Not a day has gone by recently where I haven't woken up to a cool new Omarchy theme created by someone in the community. This Gold Rush look is just right for the morning sun. Thanks @tahayvr! github.com/tahayvr/omarch…
DHH tweet media
English
30
18
614
48K
Gaurav Kumar retweetledi
Glauber Costa
Glauber Costa@glcst·
We at @tursodatabase have just done something unheard of in the startup world: we hired a person in prison. Not an ex-con: he is actually serving time right now. If you follow us you may remember when we announced our full rewrite of SQLite in Rust, Project Limbo. It got a tremendous amount of attention, but by far, my favorite story was that of @PThorpe92: Preston is currently incarcerated for things he regret doing a decade ago. A model reformed inmate, Preston was given the beautiful opportunity by the Maine Department of Corrections to access the internet. He then found our project, and since it's not like there are many entertainment options competing for his attention in prison, he decided to pour his heart and soul into helping us rewrite SQLite. I immediately fell in love with his story. Reading his old blogs, his Github profile. I prayed we would be able to hire him as soon as he was out, and make him a part of our team. But thankfully, things have happened in such way, that we were able to do it *even before* he was released. He's now given an opportunity to make a dent in the world, from prison. Preston claims he is immensely thankful for this opportunity. It is certainly not an opportunity that comes often, and I understand him. But when I read his story in his personal blog, where he mentioned how he clearly saw his life going downhill, had this sudden epiphany, saw clearly that he could do differently, and then started being showered with blessings, I saw the unmistakable markings of how our Lord Jesus Christ, the Lamb of God, operates in the world. I was an obnoxious Richard Dawkins-style atheist for more than two decades. I am not an experiential person, and I came to a change through the rational evaluation of the historical context of His resurrection. I am not the kind of person that feels the hand of God in my life, but over the years I learned to recognize his patchwork through those sudden, unexpected changes, where everything suddenly "just works". And because of that, I am the one who's thankful. The Lord decided to operate such a drastic change in Preston's life. Clearly He has a great plan for him. I am immensely thankful for the opportunity to play a small part in that story. God, in his infinite power, doesn't need me to do any of that, and yet he gave me the opportunity to act. I feel thankful and blessed. I am also very thankful to @LabsUnlocked , and entity helping people like Preston find a better life once they're out. If you feel inspired by their mission, you should reach out and help. And if you want to hear from Preston in his own words, read the article he just published on the Turso blog: turso.tech/blog/working-o…
English
39
74
894
84.2K
Xuanwo
Xuanwo@OnlyXuanwo·
Talk is cheap. Show me the prompt.
English
3
2
12
1.4K
Gaurav Kumar
Gaurav Kumar@cereal_learner·
@d4m1n Deepseek had a lot of thinking in Chinese actually. When doing deep thinking, it would turn to think in Chinese and then finally comes out to give the conclusion in English again. Likely because it was trained on such a training set.
English
0
0
2
49
Dan ⚡️
Dan ⚡️@d4m1n·
galaxy brain question 🧠 is using ChatGPT in Chinese cheaper because it's using fewer tokens?
English
22
0
33
6.5K
Gaurav Kumar
Gaurav Kumar@cereal_learner·
@jasonfried Let's move this to backlog and we'll prioritize it later ... and other lies we tell ourselves
English
1
0
3
290
Jason Fried
Jason Fried@jasonfried·
There's no such thing as low priority. It's either high priority, or no priority ("it just is"). Multi-level priority systems instituted in software is software doing what it does worst.
English
60
72
825
58.3K
Gaurav Kumar
Gaurav Kumar@cereal_learner·
Can someone recommend a good UUID? I've searched a lot and a8098c1a-f86e-11da-bd1a-00112444be1e seems pretty random. Want to make sure no one else has used it before!
English
0
3
3
203
Gaurav Kumar
Gaurav Kumar@cereal_learner·
@shuttle_dev End to End it takes a lot of time. I mean, deploying is easy but first we have to rewrite it in Rust
English
0
0
2
27
Shuttle
Shuttle@shuttle_dev·
What’s the biggest pain point you’ve faced trying to deploy a backend application?
English
1
0
3
556