devscourtAI retweeted
devscourtAI
117 posts

devscourtAI
@devscourtAI
AI Engineering community empowering software engineers, ML engineers, Data professional to explore, learn & apply AI in their careers.
Worldwide Joined Nisan 2023
92 Following14 Followers
devscourtAI retweeted

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:
- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)
The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc.
github.com/karpathy/autor…
Part code, part sci-fi, and a pinch of psychosis :)

English
devscourtAI retweeted

We're launching Claude Community Ambassadors. Lead local meetups, bring builders together, and partner with our team.
Open to any background, anywhere in the world.
Apply: claude.com/community/amba…

English
devscourtAI retweeted
devscourtAI retweeted

We rebuilt Next.js in a week. No, really.
The team ported the framework to run natively on Workers to prove what’s possible with edge-first architecture. Dive into the technical hurdles we solved to eliminate Node.js dependencies.
cfl.re/4ciNc3L
English
devscourtAI retweeted
devscourtAI retweeted
devscourtAI retweeted

Additionally, code execution, web fetch, memory, programmatic tool calling, tool search, and tool use examples are now generally available.
Read more: claude.com/blog/improved-…
English
devscourtAI retweeted

Today, we’re releasing a significant upgrade to our specialized reasoning mode, Gemini 3 Deep Think.
Deep Think is built to drive practical applications, enabling researchers to interpret complex data and engineers to model physical systems through code.
With the updated Deep Think, you can turn a sketch into a 3D-printable reality. Deep Think analyzes the drawing, builds the complex shape, and generates a file so you can create the physical object with 3D printing.
This is rolling out now to Google AI Ultra subscribers. Select the "Deep Think" option in the tools menu to get started.
Learn more here: goo.gle/3MoiifF
English
devscourtAI retweeted

You told us you’re running multiple AI agents and wanted a better UX. We listened and shipped it!
Here’s what’s new in the latest @code release:
🗂️ Unified agent sessions workspace for local, background, and cloud agents
💻 Claude and Codex support for local and cloud agents
🔀 Parallel subagents
🌐 Integrated browser
And more...
English

We just raised $500M at an 11B valuation 🎉
To celebrate, we’re giving away 1,000 free credits so you can test our platform.
For the next 6 hours, comment “11B” below and we’ll DM you the credits (must follow) 👇
ElevenLabs@ElevenLabs
We raised $500M at an $11B valuation to transform how people interact with technology.
English
devscourtAI retweeted
devscourtAI retweeted
devscourtAI retweeted

Wild.
By far the most complete Claude Skills repo yet 🤯
@Composio’s Awesome-Claude-Skills packs 100`s of ready-to-use workflows:
↳ PDF tools, changelog generation
↳ Playwright automation
↳ AWS/CDK tools, MCP builders
... and much more!
Free and open-source.
Repo in 🧵↓

English
devscourtAI retweeted
devscourtAI retweeted

Introducing A2UI: Agent-to-User Interface
🛜Protocol for agent-driven interfaces
🤖Enables agents to generate interactive user interfaces
🐙Open source
github.com/google/A2UI/

English
devscourtAI retweeted

Prompting GPT-5.2 Codex for long-running tasks:
Numman Ali@nummanali
Prompting GPT 5.2 Codex for Continuity It excels at long running tasks but without explicit guidance can lose track of outcomes Put this at the top of your AGENTS .md file, it will let Codex work on even larger scale tasks It's how I let it run for 3 hours coherently
English
devscourtAI retweeted
devscourtAI retweeted
devscourtAI retweeted

I love the expression “food for thought” as a concrete, mysterious cognitive capability humans experience but LLMs have no equivalent for.
Definition: “something worth thinking about or considering, like a mental meal that nourishes your mind with ideas, insights, or issues that require deeper reflection. It's used for topics that challenge your perspective, offer new understanding, or make you ponder important questions, acting as intellectual stimulation.”
So in LLM speak it’s a sequence of tokens such that when used as prompt for chain of thought, the samples are rewarding to attend over, via some yet undiscovered intrinsic reward function. Obsessed with what form it takes. Food for thought.
English









