A.J. Feather

764 posts

A.J. Feather

@AJFeather

Did journalism-ish things in a previous life. Took the X mob’s advice and learned to code. Subscribe to my newsletter 🧑‍💻: https://t.co/uOUpVeciZC

New York, NY Katılım Ocak 2008

1.4K Takip Edilen1.3K Takipçiler

A.J. Feather@AJFeather·23 Mar

@billmurphy Huh, alright. Might have to finally give this a shot. I was worried I was just going to be running up massive token bills. This would be better.

English

Bill Murphy@billmurphy·23 Mar

@AJFeather I have a 2024 M4 24gb. Llama works for super basic things. I even have it as a member of a R&D council where all models debate high impact ideas. It is the challenger -- seems to be working well.

English

Bill Murphy@billmurphy·23 Mar

I let one Opus session swell past 300,000 tokens doing everything in OpenClaw. Strategy, writing, research, ops, monitoring. That was the mistake. I had built an expensive employee, not an AI team.

English

131

A.J. Feather@AJFeather·19 Mar

I don’t understand how this is possible when I - and hundreds of thousands like me - pay NY state income tax and LIVE and spend 90% of our time in other states. Even when I was 100% remote at my last job, I paid New York State income tax from my basement in New Jersey!

Gregory Kennedy@gregorykennedy

I didn’t believe it until I saw the video. The governor of NY is begging rich people to move back to fund her social programs. So, like, I guess there are consequences to over taxation after all?

English

A.J. Feather@AJFeather·10 Mar

Not as impressive as @karpathy but I had to write a fuzzy search function the other day. I told Copilot “write a function that finds the correct rows in this csv most of the time. Here’s some input and the expected output” Took it 10 minutes.

Andrej Karpathy@karpathy

Three days ago I left autoresearch tuning nanochat for ~2 days on depth=12 model. It found ~20 changes that improved the validation loss. I tested these changes yesterday and all of them were additive and transferred to larger (depth=24) models. Stacking up all of these changes, today I measured that the leaderboard's "Time to GPT-2" drops from 2.02 hours to 1.80 hours (~11% improvement), this will be the new leaderboard entry. So yes, these are real improvements and they make an actual difference. I am mildly surprised that my very first naive attempt already worked this well on top of what I thought was already a fairly manually well-tuned project. This is a first for me because I am very used to doing the iterative optimization of neural network training manually. You come up with ideas, you implement them, you check if they work (better validation loss), you come up with new ideas based on that, you read some papers for inspiration, etc etc. This is the bread and butter of what I do daily for 2 decades. Seeing the agent do this entire workflow end-to-end and all by itself as it worked through approx. 700 changes autonomously is wild. It really looked at the sequence of results of experiments and used that to plan the next ones. It's not novel, ground-breaking "research" (yet), but all the adjustments are "real", I didn't find them manually previously, and they stack up and actually improved nanochat. Among the bigger things e.g.: - It noticed an oversight that my parameterless QKnorm didn't have a scaler multiplier attached, so my attention was too diffuse. The agent found multipliers to sharpen it, pointing to future work. - It found that the Value Embeddings really like regularization and I wasn't applying any (oops). - It found that my banded attention was too conservative (i forgot to tune it). - It found that AdamW betas were all messed up. - It tuned the weight decay schedule. - It tuned the network initialization. This is on top of all the tuning I've already done over a good amount of time. The exact commit is here, from this "round 1" of autoresearch. I am going to kick off "round 2", and in parallel I am looking at how multiple agents can collaborate to unlock parallelism. github.com/karpathy/nanoc… All LLM frontier labs will do this. It's the final boss battle. It's a lot more complex at scale of course - you don't just have a single train. py file to tune. But doing it is "just engineering" and it's going to work. You spin up a swarm of agents, you have them collaborate to tune smaller models, you promote the most promising ideas to increasingly larger scales, and humans (optionally) contribute on the edges. And more generally, *any* metric you care about that is reasonably efficient to evaluate (or that has more efficient proxy metrics such as training a smaller network) can be autoresearched by an agent swarm. It's worth thinking about whether your problem falls into this bucket too.

English

A.J. Feather@AJFeather·27 Şub

@Austen It helps, for sure. I finish PRs much quicker. I pick up more. Could I do everyone else’s tickets too? No! I’d be launching more agents than I’d be able to review, and the bugs that slip through would slowly consume all of my time.

English

Austen Allred@Austen·27 Şub

I do think it’s at least partially accurate that most giant companies could have been operating with far fewer people and AI is exposing that

jack@jack

we're making @blocks smaller today. here's my note to the company. #### today we're making one of the hardest decisions in the history of our company: we're reducing our organization by nearly half, from over 10,000 people to just under 6,000. that means over 4,000 of you are being asked to leave or entering into consultation. i'll be straight about what's happening, why, and what it means for everyone. first off, if you're one of the people affected, you'll receive your salary for 20 weeks + 1 week per year of tenure, equity vested through the end of may, 6 months of health care, your corporate devices, and $5,000 to put toward whatever you need to help you in this transition (if you’re outside the U.S. you’ll receive similar support but exact details are going to vary based on local requirements). i want you to know that before anything else. everyone will be notified today, whether you're being asked to leave, entering consultation, or asked to stay. we're not making this decision because we're in trouble. our business is strong. gross profit continues to grow, we continue to serve more and more customers, and profitability is improving. but something has changed. we're already seeing that the intelligence tools we’re creating and using, paired with smaller and flatter teams, are enabling a new way of working which fundamentally changes what it means to build and run a company. and that's accelerating rapidly. i had two options: cut gradually over months or years as this shift plays out, or be honest about where we are and act on it now. i chose the latter. repeated rounds of cuts are destructive to morale, to focus, and to the trust that customers and shareholders place in our ability to lead. i'd rather take a hard, clear action now and build from a position we believe in than manage a slow reduction of people toward the same outcome. a smaller company also gives us the space to grow our business the right way, on our own terms, instead of constantly reacting to market pressures. a decision at this scale carries risk. but so does standing still. we've done a full review to determine the roles and people we require to reliably grow the business from here, and we've pressure-tested those decisions from multiple angles. i accept that we may have gotten some of them wrong, and we've built in flexibility to account for that, and do the right thing for our customers. we're not going to just disappear people from slack and email and pretend they were never here. communication channels will stay open through thursday evening (pacific) so everyone can say goodbye properly, and share whatever you wish. i'll also be hosting a live video session to thank everyone at 3:35pm pacific. i know doing it this way might feel awkward. i'd rather it feel awkward and human than efficient and cold. to those of you leaving…i’m grateful for you, and i’m sorry to put you through this. you built what this company is today. that's a fact that i'll honor forever. this decision is not a reflection of what you contributed. you will be a great contributor to any organization going forward. to those staying…i made this decision, and i'll own it. what i'm asking of you is to build with me. we're going to build this company with intelligence at the core of everything we do. how we work, how we create, how we serve our customers. our customers will feel this shift too, and we're going to help them navigate it: towards a future where they can build their own features directly, composed of our capabilities and served through our interfaces. that's what i'm focused on now. expect a note from me tomorrow. jack

English

176

14.2K

A.J. Feather@AJFeather·27 Ara

@nikitabier Turns out the coding and video editing weren’t the bottleneck.

English

Nikita Bier@nikitabier·27 Ara

The great surprise of the technical and financial requirements being removed from coding and video creation is that all the same people are doing it—and that there hasn’t been an explosion of new software builders and filmmakers. After a decade of the media telling us that the most glamorous life is entrepreneur, filmmaker, or short-form video influencer: no one new jumps at the opportunity when the primary obstacles are removed.

English

733

250

4.9K

696.2K

A.J. Feather@AJFeather·27 Ara

@bcherny @palashkaria @karpathy Having a working integration test suite with existing examples has been a massive help for me. “Test the endpoint you just built by calling it after x but before y in this test.” Works 80% of the time, and works 100% of the time on the second try 😆

English

543

Boris Cherny@bcherny·27 Ara

@palashkaria @karpathy 1. Almost always use Plan mode 2. Give Claude a way to verify its output with unit tests, the Claude Chrome extension, or an iOS/Android sim 3. Hold the same bar for human and Claude code. Use /code-review to automate most of code review

English

1.3K

169.7K

Andrej Karpathy@karpathy·26 Ara

I've never felt this much behind as a programmer. The profession is being dramatically refactored as the bits contributed by the programmer are increasingly sparse and between. I have a sense that I could be 10X more powerful if I just properly string together what has become available over the last ~year and a failure to claim the boost feels decidedly like skill issue. There's a new programmable layer of abstraction to master (in addition to the usual layers below) involving agents, subagents, their prompts, contexts, memory, modes, permissions, tools, plugins, skills, hooks, MCP, LSP, slash commands, workflows, IDE integrations, and a need to build an all-encompassing mental model for strengths and pitfalls of fundamentally stochastic, fallible, unintelligible and changing entities suddenly intermingled with what used to be good old fashioned engineering. Clearly some powerful alien tool was handed around except it comes with no manual and everyone has to figure out how to hold it and operate it, while the resulting magnitude 9 earthquake is rocking the profession. Roll up your sleeves to not fall behind.

English

2.6K

7.5K

55.9K

16.8M

A.J. Feather retweetledi

Thomas Chatterton Williams@thomaschattwill·11 Eyl

Speech is not violence. Violence is violence.

English

132

8.1K

406.2K

A.J. Feather retweetledi

Ezra Klein@ezraklein·11 Eyl

In the last few years we've seen: - The plot to kidnap Gretchen Whitmer - The Storming of the Capitol and pipe bombs left at the RNC and DNC - The break-in to kidnap Nancy Pelosi and the brutal on Paul Pelosi - Multiple assassination attempts against Trump - The assassination of Minnesota House Speaker Melissa Hortman and her husband and the shooting of on State Senator John Hoffman and his wife - Luigi Mangione's assassination of Brian Thompson - The assassination of Charlie Kirk Political violence is contagious. It is spreading. It is not confined to one side or belief system. It should terrify us all. The foundation of a free society is the ability to participate in it without fear of violence. Political violence is always an attack against us all. You have to be so blind not to see that.

English

5.4K

16.8K

80K

9.3M

A.J. Feather retweetledi

U.S. Senator John Fetterman@SenFettermanPA·10 Eyl

I condemn this in the strongest terms. There is ZERO place in our great country for these horrendous acts of political violence. We must find a better way forward. May Charlie Kirk have a full and quick recovery.

English

5.6K

6.7K

87.5K

3.7M

A.J. Feather@AJFeather·21 Ağu

The unintended consequences being housing that is affordable

Mayor Karen Bass@MayorOfLA

Today I signed a City Council resolution opposing SB79 unless it is amended to exempt cities with a state-approved and compliant Housing Element. While I support the intent to accelerate housing development statewide, as written, this bill risks unintended consequences for LA.

English

374

A.J. Feather@AJFeather·24 Haz

@JamesCCortes @OneJerseySchorr It’s currently a 1% fee on all properties greater than $1 million which is also a lot of single family homes particularly in Northern Jersey. The proposal doubles it to 2%

English

Shlomo Schorr@OneJerseySchorr·24 Haz

With just one week before the July 1 deadline, New Jersey Governor Phil Murphy and legislative leaders are close to finalizing a deal on a new state budget that would hike taxes on cigarettes, luxury homes, and online gaming in New Jersey — but not on bowling, batting cages, and laser tag, as originally proposed, @johnsb01 is reporting.

English

5.2K

A.J. Feather@AJFeather·22 Haz

@craigarnzen @willscharf Well hopefully the dorms are nicer. Lol

English

Craig Arnzen@craigarnzen·22 Haz

@AJFeather @willscharf They’ve moved Boys State to Lindenwood. This is even more reason it’s a travesty!

English

A.J. Feather@AJFeather·5 Haz

@elonmusk The rule of plurality in the US makes this very difficult if not impossible. Taking over an existing party by funding targeted primaries in purple areas might be an easier strategy.

English

Elon Musk@elonmusk·5 Haz

Is it time to create a new political party in America that actually represents the 80% in the middle?

English

115.5K

37.5K

264.2K

164.6M

A.J. Feather@AJFeather·5 Haz

No one in Washington has any interest in cutting, and the states are now completely dependent on funding from the Federal government for infrastructure, Medicaid, etc.

English

A.J. Feather@AJFeather·5 Haz

We had an opportunity to fix our country’s debt problem in 2012 by electing the only two people who have ever actually cared about the debt and deficit, @MittRomney & Paul Ryan. We are now in an intractable situation.

English

A.J. Feather@AJFeather·27 Nis

@ZaidJilani Not trying to argue your overall point, but I have not experienced phantom breaking since the highway stack switched over to FSD from autopilot. The original video seems plausible but unlikely given the way lane detection works in vision by converting to grayscale

English

A.J. Feather retweetledi

Jonah Goldberg@JonahDispatch·18 Nis

Wilkinson is 100% right. Awaiting the “he’s a rogue Marxist judge” discourse from the usual corners. nationalreview.com/bench-memos/fo…

English

125

938

319.4K

A.J. Feather@AJFeather·11 Ara

@OneJerseySchorr Is this published somewhere?

English

100

Shlomo Schorr@OneJerseySchorr·11 Ara

BREAKING: A spokesperson for the U.S. Department of Defense says their initial assessment is that the drones spotted over New Jersey are not coming from "a foreign entity or adversary," and says there is no truth to claims that they are coming from an "Iranian mothership."

Shlomo Schorr@OneJerseySchorr

Speaking on Fox News, New Jersey Rep. Jeff Van Drew, citing "very responsible sources," says Iran recently "launched a mothership probably about a month ago" which contains the drones seen in recent weeks over the skies all across New Jersey.

English

Keşfet

@billmurphy @karpathy @Austen @nikitabier @bcherny @palashkaria @OneJerseySchorr @johnsb01