Ajeya

1.1K posts

Ajeya

@ajeyamk

ML engineer | curious

Caifornia Katılım Ekim 2015

369 Takip Edilen36 Takipçiler

Ajeya retweetledi

Kimi.ai@Kimi_Moonshot·18 Nis

We push Prefill/Decode disaggregation beyond a single cluster: cross-datacenter + heterogeneous hardware, unlocking the potential for significantly lower cost per token. This was previously blocked by KV cache transfer overhead. The key enabler is our hybrid model (Kimi Linear), which reduces KV cache size and makes cross-DC PD practical. Validated on a 20x scaled-up Kimi Linear model: ✅ 1.54× throughput ✅ 64% ↓ P90 TTFT → Directly translating into lower token cost. More in Prefill-as-a-Service: arxiv.org/html/2604.1503…

English

345

2.9K

682.6K

Ajeya retweetledi

alphaXiv@askalphaxiv·10 Mar

If you're using Claude Code for research: stop making it read directly from PDFs We've introduced a SKILL.md that fetches structured, AI-friendly paper overviews from alphaXiv 👀

English

377

3.3K

244.9K

Ajeya retweetledi

sysls@systematicls·27 Ara

x.com/i/article/2004…

ZXX

4.5K

26K

16.3M

Ajeya retweetledi

Jeffrey Wang@jeffzwang·26 Eki

Vibe coding productivity boosts are not evenly distributed. My estimates for Exa are: - Full stack product engineering: 1.5x-2.5x - Frontend, internal tooling: 5x-10x - Hard, low levels systems programming: 1.2x-1.5x - Reliability and infrastructure: 0.5x (mistakes here cost a lot lol) Overall, the cost of different types of engineering is now different, which rejiggers cost-benefit analyses of different types of work and everyone has to adjust accordingly. A crazy example is that once when we were managing an incident, one of our engineers spent the first 5 minutes of the incident vibe coding a full custom incident dashboard using Streamlit. This is the kind of thing you'd never think to do before AI, but is now the right thing to do.

English

103

1.3K

243.8K

Ajeya@ajeyamk·18 Eki

Telling a major employer to ‘fix it yourself’ when raising infrastructure concerns is governance failure. Creating an enabling environment is the government’s job.

ANI@ANI

Bengaluru | On Biocon Chairperson Kiran Mazumdar-Shaw slamming Bengaluru infra, Karnataka Dy CM DK Shivakumar says, "If she wants to develop them, let her do it. If she comes and asks, we will give her the roads.”

English

Ajeya@ajeyamk·27 Ağu

@BJP4India Telling people "don't buy foreign" without improving domestic quality/innovation is backwards economics. Government should focus on building competitive advantages through infrastructure & R&D investment, not consumer guilt.

English

BJP@BJP4India·25 Ağu

I appeal to the citizens of our country to prioritize purchasing goods that are Made in India. Whether it's decorative items or gifts, let us choose products manufactured within our own nation. I also encourage businesses to refrain from selling items sourced from other countries. These small yet impactful steps can play a significant role in driving our nation's progress and prosperity. -PM @narendramodi

English

3.8K

4.1K

26.1K

3.1M

Ajeya retweetledi

Omar Khattab@lateinteraction·8 Haz

Some people say LLMs exhibit "human-level intelligence", others say they don't. But the funny thing is that most people are actually discussing whether LLMs adhere to people's mental model of, uh, COMPUTER-level intelligence. Let me explain. It's clear that people *really* want AI systems to be reliable. That's what they've come to expect from computers and programs in general. But the thing is, humans are super unreliable. In every way possible. So much so that we've never met a reliable intelligence in the first place. But then the keyword "intelligence" confuses experts and laypeople alike and they enter long debates that have the form "well, if LLMs were intelligent, how come they can't execute this simple process without hallucinating?" and then the other side replying "but humans fail at this too!" But really, it's clear that we all have a pretty similar set of criteria from what we want from AI systems. That criteria has very little to do with humans or human-level anything. It's just about: Can we (i) "automate tasks" that are (ii) "hard to specify procedurally at a mechanical (or mathematical) level of abstraction", with a (iii) "similar degree of reliability to the average non-AI computer program"? That's the question. Leave humans and AGI and all that stuff out of it, please. They're just distracting both sides.

English

252

33.5K

Ajeya retweetledi

ben hylak@benhylak·7 Ağu

GPT-5: Welcome to the Stone Age We wrote up exactly how you should think about and use GPT-5: whether that's in Cursor, or building your own agent. (link below)

English

248

43.5K

Ajeya retweetledi

Leonie@helloiamleonie·3 Tem

Today I'm celebrating 2 years at @weaviate_io! 🎉 Here are 37 things I learned about retrieval so far. 🔗leoniemonigatti.com/blog/what_i_le…

English

702

53.9K

Ajeya retweetledi

Alex Vacca@itsalexvacca·18 Haz

BREAKING: MIT just completed the first brain scan study of ChatGPT users & the results are terrifying. Turns out, AI isn't making us more productive. It's making us cognitively bankrupt. Here's what 4 months of data revealed: (hint: we've been measuring productivity all wrong)

English

37.9K

167.8K

23.7M

Ajeya retweetledi

Wade Foster@wadefoster·5 Haz

How do we measure AI fluency at Zapier? Here are some role-by-role examples 🧵

Wade Foster@wadefoster

We’re setting a new standard at Zapier. 100% of new hires must be fluent in AI.

English

253

2.3K

721.1K

Ajeya@ajeyamk·5 Haz

"You're absolutely right!" - Please stop 🥴

English

Ajeya retweetledi

Jo Kristian Bergum@jobergum·30 May

Yes, you can call it RAG even if you don’t use embeddings in the retriveal phase

English

241

19.8K

Ajeya retweetledi

Rohan Paul@rohanpaul_ai·28 May

Anthropic’s Claude 4 system prompt (leaked in full, ~10,000 words) shows an LLM orchestrated through strict internal scaffolding. It’s not just “a prompt”—it’s a control program. → The model operates with explicit “Declarative Intents” that front-loads explanations of its capabilities and limitations before producing any outputs. This acts as a soft interlock to shape behavior expectations. → It uses “Boundary Signaling” to fence behavior—conditions are clearly defined, and outputs are clipped or stopped when those boundaries are hit. No vague fallback. It's conditional logic that enforces refusal. → Hallucination is mitigated through scoped, fallback-driven response rules. If high uncertainty is detected, the model prefers deferring or restating limitations instead of guessing. → Tools like web search and APIs are invoked using strict XML-like tags. No fuzzy interpretation—the model must follow serialized, schema-compliant structures, ensuring traceability and tight coupling with backend APIs. → “Positional Reinforcement” is applied throughout the system prompt. Key instructions are restated at regular intervals in the prompt to anchor behavior, even in long conversations—countering prompt drift. → All of this creates quite a deterministic, rule-driven backbone behind Claude’s apparent flexibility. The structure resembles a rules engine with an LLM executor.

English

616

59.2K

Ajeya retweetledi

Simon Taylor@sytaylor·26 May

Visa wants to give AI Agents "tokens" so they can pay without you ever seeing a checkout page. Visa's CEO told investors this is their #1 priority. Here's how it will work 👇

English

164

1.4K

281.4K

Ajeya@ajeyamk·22 May

It’s so hard to get credible and quality information on social media. It’s gets worse with incentives from engagement farming. Would love to see @X having something similar to ‘Proof of stake’ (from Blockchain). Reward the right(especially- news), and penalize the wrong

English

Ajeya retweetledi

Deedy@deedydas·21 May

🚨 This was the BEST Google I/O that I can remember. Google launched over 12 different insane things. Here is every single one of the launches and the best tweets about them: 1/12

English

808

10K

2.1M

Ajeya retweetledi

Engineer Girlfriend@enggirlfriend·15 May

hot take: working at a late stage startup in your 20s is a scam i know so many smart, hardworking people who left high paying FAANG job in their early 20s to pursue “the startup life” by joining a buzzy late stage startup now, theyre in their late 20s faced with the reality that their equity is likely worth zero, leadership is chaos, and the promised “career growth” was not there sticking out your FAANG job for your 20s can unlock low millions of networth. instead these people have made <50% of their expected comp there’s obviously exceptions but you should expect these companies to be quite rare i see more successful paths created for people who either: 1. took on full risk early and started their own biz/joined very early stage 2. built up a seniority at a public company than joined startup world in their 30s

English

166

3.3K

570.5K

Ajeya retweetledi

Tivadar Danka@TivadarDanka·7 May

Behold one of the mightiest tools in mathematics: the camel principle. I am dead serious. Deep down, this tiny rule is the cog in many methods. Ones that you use every day. Here is what it is, how it works, and why it is essential.

English

260

1.9K

246.6K

Ajeya retweetledi

Aadit Sheth@aaditsh·2 May

Jeff Bezos literally dropped the best advice on how to build a trillion dollar company.

English

289

2.6K

314.8K

Keşfet

@BJP4India @narendramodi @weaviate_io @X @elonmusk @BarackObama @taylorswift13 @cristiano