The Full Stack

1.2K posts


@full_stack_dl

News, community, and courses for people building AI-powered products.

🌍🌎🌏 Joined January 2019
183 Following · 22.2K Followers
Pinned Tweet
The Full Stack @full_stack_dl
🥞🦜 Full Stack LLM Bootcamp 🦜🥞
tl;dr We're releasing our lectures on building LLM-powered apps, for FREE.
🚀 Launch an LLM App in One Hour
✨ Prompt Engineering
🗿 LLM Foundations
🔨 Augmented LLMs
🤷 UX for LUIs
🏎️ LLMOps
🔮 What's Next?
👷 Project Walkthrough
Learn more:
18
189
1K
534.9K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Running agents locally is a dead end.
The future of software development is hundreds of agents running at all times of the day, in response to bug alerts, emails, Slack messages, meetings, and because they were launched by other agents. The only sane way to support this is with cloud containers.
Local agents hit a wall quickly:
• No scale. You can only run as many agents (and copies of your app) as your hardware allows.
• No isolation. Local agents share your filesystem, network, and credentials. One rogue agent can affect everything else.
• No team visibility. Teammates can't see what your agents are doing, review their work, or interact with them.
• No always-on capability. Agents can't respond to signals (alerts, messages, other agents) when your machine is off or asleep.
Cloud agents solve all of these problems. Each agent runs in its own isolated container with its own environment, and they can run 24/7 without depending on any single machine.
This year, every software company will have to make the transition from work happening on developers' local machines from 9am-6pm to work happening in the cloud 24/7, or get left behind by companies that do.
93
21
310
29.4K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
The results are in: GPT 5.2 (xhigh) is Pareto-optimal for our Rails codebase. How did we find out? Using the new Superconductor Benchmark feature, which lets you run your own "mini SWE-bench" defined by YOUR OWN PRs. Currently in preview, reply if you'd like to check it out!
0
3
11
1.8K
The Full Stack @full_stack_dl
Would you be interested in a course or workshop on ✨Building Software with AI Agents✨???
3
2
8
3.1K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Is Claude Code still the best coding agent on the market? You can now easily find out by launching Claude, Codex, Gemini, and Amp on every ticket in your codebase:
3
9
40
13.4K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Several agents plus three simple baselines were tested on HumanEval. Agents were mostly worse and always more expensive than the baselines.
The good:
· Evaluating the Pareto frontier
· Strong simple baselines (just repeated calls!)
The bad:
· Clearly saturating the benchmark
1
1
18
3.9K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
What percentage of your Twitter feed (the stuff you actually read, not just scroll past) do you believe is currently written by AI?
1
2
2
3.9K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Has anyone done comprehensive testing of gpt-4-vision-preview? I want to know stuff like the minimum text size it can read, the radius of the smallest circle it can locate in an image, the number of circles it can count, etc. Could be an automated benchmark for other models too
3
1
13
3.5K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Which set of statements do you agree with?
1. AGI is as much or more of a risk to human flourishing as nuclear weapons
2. I have a good idea for what should be done about that
1
1
4
3.1K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Has anyone had good experiences with GPT-powered code generation for complete web app features? As in, you describe what should exist, and GPT actually provides the source of all the necessary files and where they should go. Ideally in the context of Ruby on Rails.
9
2
9
5.1K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Let's say that a US-based research company has developed an AGI model that was able to use the browser, pass captchas, hire people on Upwork, and lie about its intentions. What should they do after observing this?
6
7
20
10.3K
The Full Stack retweeted
AI by the Bay @ScaleByTheBay
We bring in @full_stack_dl, a venerable boot camp crew that pioneered technical deep dives into deep learning where people fly in from around the world. 🥞 Their #LLM Bootcamp in the spring was sold out and this is your chance to attend the ➡️ version. 👉 scale.bythebay.io/register
0
1
4
2.4K
The Full Stack @full_stack_dl
We're also about 3 weeks away from our latest LLM bootcamp. @karpathy called the last version "high-quality tokens". Register soon if you want to make sure you get a spot! The bootcamp is in Oakland on November 13. You can register here: scale.bythebay.io/llm-workshop.
0
1
5
1.8K
The Full Stack @full_stack_dl
We're hosting a livestream with @ScaleByTheBay this coming Monday at 1:30 pm PST. Come join us on our YouTube channel to talk about LLMs in production and more: youtube.com/@The_Full_Stack
1
6
28
3.3K
The Full Stack retweeted
Sergey Karayev @sergeykarayev
Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?
8
4
32
15.5K