Mayank Pant

278 posts

Mayank Pant banner
Mayank Pant

Mayank Pant

@soyank_76

Tech 💻@getonecardin Who the heck decides what your limit is anyway

Pune🔁Noida Katılım Temmuz 2017
574 Takip Edilen51 Takipçiler
Mayank Pant
Mayank Pant@soyank_76·
@cortisoul_ Are you guys going to add word puzzles, games crosswords as well or just sticking to maths ?
English
0
0
0
17
Sudhanshu | Matiks
Sudhanshu | Matiks@cortisoul_·
International chess players use Matiks but Zack from reddit messages daily that your app will fail what should be the ideal reply to this guy ?
English
9
1
34
3.6K
Mayank Pant retweetledi
Andrej Karpathy
Andrej Karpathy@karpathy·
A few random notes from claude coding quite a bit last few weeks. Coding workflow. Given the latest lift in LLM coding capability, like many others I rapidly went from about 80% manual+autocomplete coding and 20% agents in November to 80% agent coding and 20% edits+touchups in December. i.e. I really am mostly programming in English now, a bit sheepishly telling the LLM what code to write... in words. It hurts the ego a bit but the power to operate over software in large "code actions" is just too net useful, especially once you adapt to it, configure it, learn to use it, and wrap your head around what it can and cannot do. This is easily the biggest change to my basic coding workflow in ~2 decades of programming and it happened over the course of a few weeks. I'd expect something similar to be happening to well into double digit percent of engineers out there, while the awareness of it in the general population feels well into low single digit percent. IDEs/agent swarms/fallability. Both the "no need for IDE anymore" hype and the "agent swarm" hype is imo too much for right now. The models definitely still make mistakes and if you have any code you actually care about I would watch them like a hawk, in a nice large IDE on the side. The mistakes have changed a lot - they are not simple syntax errors anymore, they are subtle conceptual errors that a slightly sloppy, hasty junior dev might do. The most common category is that the models make wrong assumptions on your behalf and just run along with them without checking. They also don't manage their confusion, they don't seek clarifications, they don't surface inconsistencies, they don't present tradeoffs, they don't push back when they should, and they are still a little too sycophantic. Things get better in plan mode, but there is some need for a lightweight inline plan mode. They also really like to overcomplicate code and APIs, they bloat abstractions, they don't clean up dead code after themselves, etc. They will implement an inefficient, bloated, brittle construction over 1000 lines of code and it's up to you to be like "umm couldn't you just do this instead?" and they will be like "of course!" and immediately cut it down to 100 lines. They still sometimes change/remove comments and code they don't like or don't sufficiently understand as side effects, even if it is orthogonal to the task at hand. All of this happens despite a few simple attempts to fix it via instructions in CLAUDE . md. Despite all these issues, it is still a net huge improvement and it's very difficult to imagine going back to manual coding. TLDR everyone has their developing flow, my current is a small few CC sessions on the left in ghostty windows/tabs and an IDE on the right for viewing the code + manual edits. Tenacity. It's so interesting to watch an agent relentlessly work at something. They never get tired, they never get demoralized, they just keep going and trying things where a person would have given up long ago to fight another day. It's a "feel the AGI" moment to watch it struggle with something for a long time just to come out victorious 30 minutes later. You realize that stamina is a core bottleneck to work and that with LLMs in hand it has been dramatically increased. Speedups. It's not clear how to measure the "speedup" of LLM assistance. Certainly I feel net way faster at what I was going to do, but the main effect is that I do a lot more than I was going to do because 1) I can code up all kinds of things that just wouldn't have been worth coding before and 2) I can approach code that I couldn't work on before because of knowledge/skill issue. So certainly it's speedup, but it's possibly a lot more an expansion. Leverage. LLMs are exceptionally good at looping until they meet specific goals and this is where most of the "feel the AGI" magic is to be found. Don't tell it what to do, give it success criteria and watch it go. Get it to write tests first and then pass them. Put it in the loop with a browser MCP. Write the naive algorithm that is very likely correct first, then ask it to optimize it while preserving correctness. Change your approach from imperative to declarative to get the agents looping longer and gain leverage. Fun. I didn't anticipate that with agents programming feels *more* fun because a lot of the fill in the blanks drudgery is removed and what remains is the creative part. I also feel less blocked/stuck (which is not fun) and I experience a lot more courage because there's almost always a way to work hand in hand with it to make some positive progress. I have seen the opposite sentiment from other people too; LLM coding will split up engineers based on those who primarily liked coding and those who primarily liked building. Atrophy. I've already noticed that I am slowly starting to atrophy my ability to write code manually. Generation (writing code) and discrimination (reading code) are different capabilities in the brain. Largely due to all the little mostly syntactic details involved in programming, you can review code just fine even if you struggle to write it. Slopacolypse. I am bracing for 2026 as the year of the slopacolypse across all of github, substack, arxiv, X/instagram, and generally all digital media. We're also going to see a lot more AI hype productivity theater (is that even possible?), on the side of actual, real improvements. Questions. A few of the questions on my mind: - What happens to the "10X engineer" - the ratio of productivity between the mean and the max engineer? It's quite possible that this grows *a lot*. - Armed with LLMs, do generalists increasingly outperform specialists? LLMs are a lot better at fill in the blanks (the micro) than grand strategy (the macro). - What does LLM coding feel like in the future? Is it like playing StarCraft? Playing Factorio? Playing music? - How much of society is bottlenecked by digital knowledge work? TLDR Where does this leave us? LLM agent capabilities (Claude & Codex especially) have crossed some kind of threshold of coherence around December 2025 and caused a phase shift in software engineering and closely related. The intelligence part suddenly feels quite a bit ahead of all the rest of it - integrations (tools, knowledge), the necessity for new organizational workflows, processes, diffusion more generally. 2026 is going to be a high energy year as the industry metabolizes the new capability.
English
1.6K
5.6K
40.5K
7.8M
Mayank Pant
Mayank Pant@soyank_76·
@manthanguptaa But is this context management applies to their pro models as well. Summary does not give the whole context. Reasoning with summary will definetly produce inferior inference compared to giving whole context.
English
0
0
0
94
Manthan Gupta
Manthan Gupta@manthanguptaa·
randomly got curious about context management by cursor, and now I am reverse engineering it
English
22
1
194
16.8K
Mayank Pant retweetledi
Rishit Jhunjhunwala
Rishit Jhunjhunwala@rishj·
Every family has a tech-savvy person, the family "CTO". @Truecaller will now empower that person to protect elders and other less tech-savvy members in the family. Truecaller Family Protect is live in select markets with more countries coming soon! * Get alerted when a family member is about to get scammed and intervene rightaway * Share blocked numbers with the entire family * Keep on eye on Truecaller settings on your family’s phones and ensure it's setup properly for maximum protection Read more here: corporate.truecaller.com/newsroom/press…
Rishit Jhunjhunwala tweet media
English
13
46
336
92.1K
Mayank Pant
Mayank Pant@soyank_76·
@abhi9u Hmm, what could be the use case of concurrent interpreters ?
English
0
0
0
53
Abhinav Upadhyay
Abhinav Upadhyay@abhi9u·
Python 3.14 came out a few days ago. I wrote about my top 5 favorite features. Here's a quick summary.
English
5
9
112
11.7K
Mayank Pant
Mayank Pant@soyank_76·
@TrueIndology @grok Why did IMF approve a loan to Pakistan despite clear evidence it harbors terrorists? Can India challenge this in international court and hold Pakistan accountable? How can India push for its inclusion on the FATF blacklist?
English
2
0
1
330
True Indology
True Indology@TrueIndology·
During Kargil war, West and its proxies strongly pressurized India to De-escalate. At the same time, they provided a huge $ 1.56 billion loan that helped Pakistan survive. On July 29 1999, just a few weeks after Kargil war, IMF provided this loan to Pakistan. Pakistan was economically ruined after Kargil & would have broken up if not for western loans. Purpose of De-escalation is not peace but protecting Pakistan and keeping India in check. Pakistan is a dagger west maintains to stab India in back.
English
206
5.9K
20K
470.7K
Mayank Pant retweetledi
Prasanna S
Prasanna S@myprasanna·
My name is Prasanna, who previously founded Rippling (worth $10B); I'm going through a divorce. I'm now on the run from the Chennai police hiding outside of Tamil Nadu. This is my story.
English
3.3K
16.6K
74.4K
22.5M
Mayank Pant retweetledi
sneha
sneha@itspsneha·
alright, let me tell y'all why i'm literally feeling 💔 after visiting court today. this was supposed to be a quick little side quest: meet @_adi18_ , chat with a few lawyers, check out what nyayanidhi is building, get my full-time job work done, and head home. you know, just another day. but nope, today wasn’t normal at all. i deeply understood the cost of justice ⚖️ x.com/itspsneha/stat…
English
25
74
796
217.5K
Mayank Pant retweetledi
Ishan
Ishan@ishanagarwal24·
There’s never a day that Indian companies won’t embarrass you 🤦‍♂️ They removed a whole country larger than India from the chart to make OLA the “4th” largest EV company 😭
Ishan tweet media
English
246
529
7.7K
1.1M
Mayank Pant
Mayank Pant@soyank_76·
@batmanyata I arrange text on screens and try to make sense of it also build legos in my free time. Full stack dedicated to building. Hire me!!
English
0
0
0
73
rishabh
rishabh@rishxbhh·
Invites for CRED Money up for grabs✌️
rishabh tweet media
English
8
1
35
8.8K
Mayank Pant
Mayank Pant@soyank_76·
@arpit_bhayani Will you blame zomato if you find a cockroach in your food? I would say yes. Windows being a platform, and provider of its kernel apis should be held accountable too.
English
0
0
0
37
Arpit Bhayani
Arpit Bhayani@arpit_bhayani·
I saw many engineers blaming the outage on Microsoft 🤦‍♂️ SWEs blaming without knowing the root cause is concerning. It is not Microsoft, it is Crowdstrike who released an update for Windows that had a bug. The patch runs in Kernel mode to monitor system activity at a low level. Because it was running in Kernel mode, the buggy code was trying to access an invalid memory location that triggered a panic and which showed Blue Screen of Death. The name of the driver file that had the buggy update is "C-00000291.sys", deleting it fixes the issue and unfortunately this needs to be done manually. Microsoft has nothing to do with it.
English
269
372
3.5K
489.1K
Rushab Jain
Rushab Jain@rushabtated4·
Since last 2 years I've seen 10+ founders identifying an arbitrage and scaling to millions very fast (few months) with a very small team (1-3) and 0 funding I am thinking to make a doc with these case studies explaining arbitrage and their primary marketing channels and some upcoming arbitrage (according to me) you can build around would you be interested to read something like this? Comment "👋" and I will send it to you as soon as it is ready
English
1.4K
45
1.7K
288.9K
Mayank Pant
Mayank Pant@soyank_76·
@shantanugoel Order ID duplication isn't the only reason. They casually ignored the schema choices that led to this issue. There's no point in showing the order and folio if the underlying transaction failed.
English
0
0
0
103
Shantanu Goel
Shantanu Goel@shantanugoel·
Groww reached out with this RCA of the current issue. Seems like a reasonable explanation.
Shantanu Goel tweet media
English
42
31
504
86.9K
Mayank Pant
Mayank Pant@soyank_76·
Does chatGPT 4o has hey google like trigger or can be done. I have gemini advanced and it is a bliss to just trigger it with hey google, talk, ask and micromanage your stuff @ChatGPTapp
English
0
0
1
125