Akshay Ram

437 posts

Akshay Ram

@akshay5995

Pet parent | Tech Lead

Bangkok, Thailand Katılım Şubat 2010

1.9K Takip Edilen243 Takipçiler

Akshay Ram retweetledi

Mario Zechner@badlogicgames·1d

Due to popular demand, you can now browse all my recommended reading/viewing suggestions here: mariozechner.at/recommended-re… Sadly, Space Karen's hard core engineered Twitter API is absolutey garbage, and I can only get tweets back to April. Will backfile with an export. Enjoy.

English

933

30.4K

Akshay Ram retweetledi

Ben Holmes@BHolmesDev·5d

You should use Codex differently than how you use Claude Code. I've found success with no plans and more involvement as an engineer. Here's a 1 min rundown of my process:

English

288

24.8K

Akshay Ram retweetledi

Yohei@yoheinakajima·11 May

great article, mostly focused on coding agents but applies elsewhere impo. aligns w a lot of my prior thoughts: - agents need scaffolding, not just smarter models - failures should become durable improvements - tools/functions should be small, reusable, and logged - execution environments matter - tests and deterministic checks are essential - memory should live outside the model - agent systems should improve through use

Addy Osmani@addyosmani

x.com/i/article/2050…

English

172

30.7K

Akshay Ram retweetledi

Harrison Chase@hwchase17·5 May

x.com/i/article/2051…

ZXX

211

58.1K

Akshay Ram retweetledi

Himanshu@himanshutwtxs·4 May

x.com/i/article/2035…

ZXX

131

12.5K

Akshay Ram retweetledi

Mario Zechner@badlogicgames·1 May

felt cute, did some @karpathy style cozy coding. now i can run GPT 2 124M in pure TypeScript at 7 tps. played with implementing the GEMV via C/WASM, but that only got me a 1.7x speed up.

English

182

29.2K

Akshay Ram retweetledi

Andrej Karpathy@karpathy·30 Nis

This is the the quote I've been citing a lot recently.

kache@yacineMTB

you can outsource your thinking but you cannot outsource your understanding

English

819

4.3K

45.9K

2.4M

Akshay Ram retweetledi

sunil pai@threepointone·29 Nis

didn't expect to get here, but 5.5 is just better than 4.7 now for me. faster, smarter, and doesn't try to do too much. I don't need xhigh or whatever, I prefer the shorter hops and letting me participate.

English

785

44.6K

Akshay Ram retweetledi

Matt Pocock@mattpocockuk·23 Nis

A talk I gave a few weeks ago. Software fundamentals matter more than ever. Here's why: youtube.com/watch?v=v4F1gF…

YouTube

English

144

1.3K

314.4K

Akshay Ram retweetledi

Zain Shah@zan2434·22 Nis

Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see. @eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)

English

1.1K

3.7K

28.4K

5.9M

Akshay Ram retweetledi

Thariq@trq212·16 Nis

x.com/i/article/2044…

ZXX

287

8.5K

2.4M

Akshay Ram retweetledi

Matt Pocock@mattpocockuk·14 Nis

I just ran an AI coding course for ~2,000 people One massive piece of feedback was how dissatisfied people are with frameworks like BMAD, GSD, Spec-Kit Turns out that giving away control of context to a framework makes things a lot harder to debug My advice: own the process

English

659

49.4K

Akshay Ram@akshay5995·14 Nis

The only good coding agents on mobile experience I’ve had is with @cursor_ai.

English

Akshay Ram retweetledi

Viv@Vtrivedy10·14 Nis

Harness Engineering Derived from what Models can’t do alone: It feels like a good time to step back and reshare some basic mental models for why harnesses exist in the first place - working backwards from The Model Why the Harness Exists: Harnesses exist to augment and shape models to do useful work for us. Models are fundamentally token input and output machines They cannot do useful things in the world without a harness. This is an important but under appreciated truth about the agentic systems we’re building The whole point of a harness is to extend the abilities of models to do useful work on our behalf Primitives the World Finds Useful Today: The diagrams below describe what capability we want and we work backwards from that capability to design a component of the harness. Filesystems, bash, compaction and Ralph Loops are all derived by working backwards from a goal to what a model can’t do today. We add those to the harness because they’re useful and help our agents more reliably complete work Model-Harness Training Loop: As we find incorporate a useful harness primitive into the model, model labs often directly train on that capability so the next version of the model is better at leveraging that tooling. This isn’t the end all be all, models are good at generalizing to other tools via in-context-learning. But this is a tried and true recipe we’ve seen repeat over the last 3 years and likely into the future. Different labs have different priors on useful harness capabilities. This is the entire reason they share prompting guides that differ dramatically down to their tool design. There’s harness mania happening right now which is cool to see as we’ve been investing in this for a while! :) It helps to step back and think about “why does this thing exist in the first place?”

English

339

19.6K

Akshay Ram retweetledi

Harrison Chase@hwchase17·9 Nis

most of the agents we see being built do this the main cases where we see people using "agent in a sandbox" is when they are using claude agent sdk (which is poorly designed for "harness outside sandbox")

Nasir Shadravan@n4Cr

@hwchase17 Do you also move the harness it outside the sandbox like managed agents?

English

8.6K

Akshay Ram@akshay5995·9 Nis

looks like they discovered responses api ? openai.com/index/equip-re…

Anthropic@AnthropicAI

New on the Engineering Blog: Building Managed Agents—our hosted service for long-running agents—meant solving an old problem in computing: how to design a system for “programs as yet unthought of.” Read more: anthropic.com/engineering/ma…

English

Akshay Ram retweetledi

Akshay 🚀@akshay_pachaar·6 Nis

x.com/i/article/2040…

ZXX

424

2.7K

1.3M

Akshay Ram@akshay5995·5 Nis

README.md CLAUDE.md AGENTS.md SKILLS.md … Can’t wait for IDEA.md standard

Andrej Karpathy@karpathy

Wow, this tweet went very viral! I wanted share a possibly slightly improved version of the tweet in an "idea file". The idea of the idea file is that in this era of LLM agents, there is less of a point/need of sharing the specific code/app, you just share the idea, then the other person's agent customizes & builds it for your specific needs. So here's the idea in a gist format: gist.github.com/karpathy/442a6… You can give this to your agent and it can build you your own LLM wiki and guide you on how to use it etc. It's intentionally kept a little bit abstract/vague because there are so many directions to take this in. And ofc, people can adjust the idea or contribute their own in the Discussion which is cool.

English

215

Akshay Ram retweetledi

Viv@Vtrivedy10·4 Nis

ok hot take, who (dis)agrees? The general purpose agent/harness doesn’t exist the best harnesses are deeply Task specific and when we use a “default harness” out-of-the-box, we’re just making a tradeoff between - acceptable task performance - time+money spent designing around our task(s) that’s a totally fair tradeoff to make, maybe we’re happy with the out of box perf! what we call a “general purpose” harness is just one that’s reasonably good at a relatively large portion of tasks but there’s a reason why teams that want top 1% agent performance obsessively tweak the harness per Task+Model it’s because you can squeeze out a lot by building bespoke harness tooling for a Task. For a high value task, it’s totally worth the investment Your entire company might be predicated on that investment this effect is pretty clear when we try to swap models “models are non-fungible in their harness” - so the suck if we just drop in codex into the Claude Code harness but if you use the models together in a joint harness and design around the specific problem, you can get great perf I’ve mentioned before but i think the most exciting future is just-in-time harness creation per task idk if that’s a very popular take vs “one model will do everything” but it’s a current mental model and exciting thing i’m messing around with

English

278

70.3K

Akshay Ram retweetledi

Armin Ronacher ⇌@mitsuhiko·26 Mar

This is the talk I gave in San Francisco at PyAI on how you can figure out what present and future models are good at when it comes to writing your own agents.

Pydantic@pydantic

"How do agentic coding tools work? I have no idea. And you probably also don't." That was @mitsuhiko opening his talk at PyAI. What follows is one of the most useful 20 minutes on working with agents we've ever seen. youtu.be/8RHYyRUxVrA

English

222

37.9K

Keşfet

@karpathy @eddiejiao_obj @drewocarr @cursor_ai @elonmusk @BarackObama @taylorswift13 @cristiano