L

669 posts

@_luki222

Joined December 2021
22 Following · 7 Followers
L
L@_luki222·
@chatgpt21 So AI tried to copy code from its training set and failed?
English
0
0
0
231
Chris
Chris@chatgpt21·
Anthropic had 16 AI agents build a C compiler from scratch. 100k lines, compiles the Linux kernel, $20k, 2 weeks. To put that in perspective, GCC took thousands of engineers over 37 years to build (granted, starting from 1987). One researcher and 16 AI agents just built a compiler that passes 99% of GCC's own torture test suite, compiles FFmpeg, Redis, PostgreSQL, and QEMU, and runs Doom.

They say they "(mostly) walked away." But that "mostly" is doing heavy lifting. No human wrote code, but the researcher constantly redesigned tests, built CI pipelines when agents broke each other's work, and created workarounds when all 16 agents got stuck on the same bug.

The human role didn't disappear. It shifted from writing code to engineering the environment that lets AI write code. I don't know how you could argue that AI is hitting a wall.
English
735
825
7.4K
1.7M
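The "engineering the environment" point is concrete enough to sketch. Below is a minimal, hypothetical version of the kind of CI gate the researcher reportedly had to build: apply an agent's patch, run the shared regression suite, and roll the patch back if it breaks another agent's work. All names, commands, and the workflow itself are illustrative assumptions, not Anthropic's actual setup.

```python
import subprocess

def gate_agent_patch(repo_dir: str, patch_file: str, test_cmd: list[str]) -> bool:
    """Apply an agent-authored patch, run the shared regression suite,
    and revert the patch if anything breaks. Returns True if the patch is kept."""
    subprocess.run(["git", "apply", patch_file], cwd=repo_dir, check=True)
    result = subprocess.run(test_cmd, cwd=repo_dir)
    if result.returncode != 0:
        # The change breaks another agent's work: roll it back.
        subprocess.run(["git", "apply", "-R", patch_file], cwd=repo_dir, check=True)
        return False
    return True

# Hypothetical usage: gate each agent's patch on the torture-test suite.
# kept = gate_agent_patch("compiler/", "agent7.patch", ["make", "test-torture"])
```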
L
L@_luki222·
@FordFocusMk67 @mkljczk What are you talking about? Both of them use LLMs. What's more, LLMs and transformers came about because Google wanted a better Translate xD
Polish
1
0
3
53
Ambiente
Ambiente@FordFocusMk67·
@mkljczk Both are shit; LLMs are the best at translation
Polish
2
0
0
136
Mateusz Bratkowski
Mateusz Bratkowski@MateuszBrat·
It's a bit shocking to see "512GB" on a Mac box. And no, that's not disk space, it's RAM. Photo from a friend ☺️
Mateusz Bratkowski tweet media
Polish
9
2
228
43.2K
L
L@_luki222·
@pmddomingos but gpt-2 is only useful for research
English
0
0
0
31
Pedro Domingos
Pedro Domingos@pmddomingos·
Myth: Only big companies have the resources to do AI. Reality: You can now train an LLM in 3 hours on a single GPU node.
Andrej Karpathy@karpathy

nanochat can now train a GPT-2 grade LLM for <<$100 (~$73, 3 hours on a single 8XH100 node). GPT-2 is just my favorite LLM because it's the first time the LLM stack comes together in a recognizably modern form. So it has become a bit of a weird & lasting obsession of mine to train a model to GPT-2 capability but for much cheaper, with the benefit of ~7 years of progress. In particular, I suspected it should be possible today to train one for <<$100.

Originally in 2019, GPT-2 was trained by OpenAI on 32 TPU v3 chips for 168 hours (7 days), at $8/hour per TPUv3 back then, for a total cost of approx. $43K. It achieves a 0.256525 CORE score, an ensemble metric introduced in the DCLM paper over 22 evaluations like ARC/MMLU/etc. As of the last few improvements merged into nanochat (many of them originating in the modded-nanogpt repo), I can now reach a higher CORE score in 3.04 hours (~$73) on a single 8XH100 node. This is a 600X cost reduction over 7 years, i.e. the cost to train GPT-2 is falling approximately 2.5X every year. I think this is likely an underestimate because I am still finding more improvements relatively regularly and I have a backlog of more ideas to try.

A longer post with a lot of the detail of the optimizations involved and pointers on how to reproduce is here: github.com/karpathy/nanoc…

Inspired by modded-nanogpt, I also created a leaderboard for "time to GPT-2", where this first "Jan29" model is entry #1 at 3.04 hours. It will be fun to iterate on this further and I welcome help! My hope is that nanochat can grow to become a very nice/clean and tuned experimental LLM harness for prototyping ideas, for having fun, and ofc for learning.

The biggest improvements that worked out of the box and simply produced gains right away were 1) Flash Attention 3 kernels (faster, and allows the window_size kwarg to get alternating attention patterns), 2) the Muon optimizer (I tried for ~1 day to delete it and use only AdamW, and I couldn't), 3) residual pathways and skip connections gated by learnable scalars, and 4) value embeddings. There were many other smaller things that stack up.

Image: semi-related eye candy of deriving the scaling laws for the current nanochat model miniseries, pretty and satisfying!

English
35
61
693
63.9K
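The cost claim in the quoted post is easy to sanity-check from its own numbers: ~$43K in 2019 down to ~$73 today, over ~7 years. A few lines of Python confirm this is roughly a 600X total reduction, or about 2.5X per year, as stated:

```python
# Sanity-checking the cost-reduction arithmetic from the quoted post:
# GPT-2 cost ~$43,000 to train in 2019; nanochat now does it for ~$73.
original_cost = 43_000
current_cost = 73
years = 7

total_reduction = original_cost / current_cost   # ~589X, i.e. roughly the quoted "600X"
yearly_factor = total_reduction ** (1 / years)   # ~2.49X per year

print(f"total reduction: {total_reduction:.0f}X")
print(f"implied yearly factor: {yearly_factor:.2f}X")  # matches the quoted ~2.5X/year
```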
L
L@_luki222·
@raphaelschaad all big tech companies are hiring like crazy though
English
4
0
11
1.6K
Raphael Schaad
Raphael Schaad@raphaelschaad·
The age of the tech company with 1,000 engineers is over.
English
148
118
1.9K
473K
L
L@_luki222·
@mycoliza Do you not consider LMMs as CV?
English
0
0
0
18
L
L@_luki222·
@aidenybai I would say average programmer, not average swe
English
0
0
1
21
Aiden Bai
Aiden Bai@aidenybai·
ai coding has hit the risk/reward of self-driving cars
the average driver (or swe) is worse than ai
English
26
4
176
17.1K
Asuka Chopin-Skłodowska
Asuka Chopin-Skłodowska@zwrotnica_sosu·
Let me translate and explain. You build a site like "haha, translate developer gibberish into Polish and back". You hook up an API key to some crap model like gpt-4o/gemini flash 3 at 5 grosze a pop. The site goes viral because everyone wants to try it at least once. You attach some ad or referral link to the site. The site racks up a million views in a week and dies. You collect the profit and put up something else.
Polish
1
0
2
481
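For what it's worth, the setup being joked about really is only a few lines of glue. Here is a minimal sketch, assuming a Flask endpoint and the OpenAI Python client; the model name, prompt, and route are placeholders, not anything from the original post:

```python
# A minimal sketch of the joke setup: one endpoint that forwards text to a
# cheap hosted model. Framework, model, and prompt are assumptions.
from flask import Flask, request, jsonify
from openai import OpenAI

app = Flask(__name__)
client = OpenAI()  # reads OPENAI_API_KEY from the environment

@app.post("/translate")
def translate():
    text = request.json["text"]
    completion = client.chat.completions.create(
        model="gpt-4o-mini",  # any cheap model works; per-call cost is pennies
        messages=[
            {"role": "system",
             "content": "Translate developer jargon into plain Polish, or back."},
            {"role": "user", "content": text},
        ],
    )
    return jsonify({"translation": completion.choices[0].message.content})
```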
L
L@_luki222·
@ArmenShimoon @GenAI_is_real There are no gains from AI in planning. It's too complex; it's even impossible to fit the relevant info in context. Sure, you can tell it to design some system that was previously scoped out by a sr eng, but AI can't design or plan end to end, coordinating multiple teams and products
English
1
0
1
35
Armen Shimoon
Armen Shimoon@ArmenShimoon·
@_luki222 @GenAI_is_real Companies need to refactor their organization and processes, not code. Agents can refactor the code just fine, but overbloated and slow moving people will continue to be in the way, preventing AI unlocks and gains from being realized.
English
1
0
1
31
Chayenne Zhao
Chayenne Zhao@GenAI_is_real·
FAANG is literally panic-refactoring because human code is now the bottleneck. But honestly, monorepos won't save them from the infinite spaghetti code agents are about to dump. OAI already has internal tools for this that make Bazel look like a toy. The era of human "senior engineers" is ending faster than you think @karpathy @sama
Samswara@samswoora

Rumor is FAANG-style co's are refactoring their monorepos to scale in preparation for infinite agent code

English
20
19
523
154.4K
Vladimir
Vladimir@vlelyavin·
@karpathy @moltbook @openclaw actually what took human societies centuries is happening in just days: social networks, then communities, and even CHURCHES. Looks like we are observing a new social evolution ngl
English
12
7
120
41.4K
L
L@_luki222·
@UltraLinx It would be too slow
English
0
0
0
60
Oliur
Oliur@UltraLinx·
Am I crazy or why doesn't someone just make a p2p version of an LLM? The more seeders, the stronger it gets? Beating all of the paid offerings?
English
229
31
1.1K
301.2K
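The "too slow" objection comes down to latency arithmetic. Decoding is sequential, so splitting a model's layers across internet peers puts a WAN round trip on the critical path of every generated token. A back-of-envelope sketch with purely illustrative (assumed) numbers:

```python
# Back-of-envelope for the "too slow" objection; all numbers are assumptions.
# Decoding is sequential: every token must pass through all layers in order,
# so sharding layers across internet peers puts WAN hops on the critical path.
peers = 20                  # layers split across 20 seeders (assumption)
wan_rtt_s = 0.10            # ~100 ms round trip between random peers (assumption)
compute_per_token_s = 0.05  # total on-device compute per token (assumption)

hops_per_token = peers - 1  # activations forwarded peer to peer
latency_per_token = hops_per_token * wan_rtt_s + compute_per_token_s
print(f"{latency_per_token:.2f} s/token -> {1/latency_per_token:.2f} tokens/s")
# ~1.95 s/token, i.e. ~0.5 tokens/s, versus tens of tokens/s on one datacenter node.
```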
L
L@_luki222·
@burkov It's not only about reading and fixing the code. It's about how the code will have to change over time, as its dependencies are gradually updated and replaced. Clean code minimizes the amount of future work an AI agent will have to perform.
English
0
0
0
20
BURKOV
BURKOV@burkov·
Software engineers have been anal about code maintainability and structure because they knew they would have to read and fix that code at some point in the future. When AI writes and fixes the code, it can be as messy (from the human software engineer's anal point of view) as it could be, and this wouldn't matter, because no human would ever have to read and fix it. Leave machine commands to the machine to write and fix. Do a human thing: make decisions under uncertainty.
English
121
12
283
46.4K
Tomo
Tomo@tomo9000p·
@lamontcraynston At first glance, sure. But in the event of a systemic collapse, there won't be any winners, and $GOOGL will get caught in the crossfire, too...
English
3
0
1
405
L
L@_luki222·
@tlakomy Which ones discourage it? Any reasons other than legal hurdles?
English
0
0
1
38
Tomasz Łakomy
Tomasz Łakomy@tlakomy·
There are companies actively discouraging developers from AI assisted engineering. Others are heavily encouraging any and all automation. One of these groups will win.
English
29
1
30
4.9K
L
L@_luki222·
@progXprog Generally, in the non-troll posts it works like this: people feed screenshots from a game into Genie, and it creates a world based on that screenshot. That's why it looks like an existing game
Polish
0
0
2
161
L
L@_luki222·
@TheAhmadOsman It has to land and be profitable for Google, otherwise what's the point? Giving away Gemini 3 Pro for free doesn't make sense long term.
English
0
0
1
74
Ahmad
Ahmad@TheAhmadOsman·
Google has a real talent for this
> Take something that works great
> Slowly “improve” it until it’s borderline unusable
Watching Google AI Studio get hollowed out in real time is just sad
Ahmad tweet media
English
59
11
502
61.7K
L
L@_luki222·
@josefbender_ The answer like everything in software engineering is "it depends"
English
0
0
2
66
L
L@_luki222·
@AtharvaXDevs LLD+HLD is an antipattern
English
0
0
2
90
Atharva
Atharva@AtharvaXDevs·
SWE 3 at Google on DSA btw
Atharva tweet media
English
92
132
3.9K
520.8K
L
L@_luki222·
@bercankilic What's the salary range for Munich SWE?
English
1
0
2
120