Alex

777 posts

Alex

@Fax3l

it’s just tweaking weights to minimize loss

Присоединился Mart 2010

1.6K Подписки77 Подписчики

Закреплённый твит

Alex@Fax3l·13 Mar

ZXX

Alex@Fax3l·3d

@pbeyssac Je viens de le faire

Français

280

Pierre Beyssac 🏴‍☠️🇫🇷🇪🇺🇺🇦@pbeyssac·3d

@Fax3l Ok mais faut m'écrire car de votre côté c'est fermé.

Français

728

Pierre Beyssac 🏴‍☠️🇫🇷🇪🇺🇺🇦@pbeyssac·3d

Pour info, l'attaque est toujours en cours, l'attaquant se permet même de relancer. *Ne cliquez pas* Quand je vois le nombre de gens pourtant pas nés de la dernière pluie qui ont cliqué sur le lien d'attaque, je suis effaré. On peut critiquer l'ANTS, on n'est pas rendus.

Français

10.4K

Alex@Fax3l·28 Mar

@dnhkng Was the vocabulary shared/multilingual for each individual training and did it fully cover all 8 languages?

English

David@dnhkng·27 Mar

1/7 I fed the same sentences in 8 languages through 4 different LLMs and extracted the hidden states at every layer. The result: in the reasoning layers, language disappears. The models organize thought by meaning, not by language. Then I tested code and math. Same thing. 🧵

GIF

English

128

7.2K

Alex@Fax3l·25 Mar

@karpathy @KenWattana This reminds me the first time I talk to gemini and it asked me “how’s the wether in <exact city where i was> “

English

Andrej Karpathy@karpathy·25 Mar

@KenWattana yes exactly! a bit like i'm being manipulated in some creepy way. "please like me, look how much i know about you, we are good friends".

English

521

37K

Andrej Karpathy@karpathy·25 Mar

One common issue with personalization in all LLMs is how distracting memory seems to be for the models. A single question from 2 months ago about some topic can keep coming up as some kind of a deep interest of mine with undue mentions in perpetuity. Some kind of trying too hard.

English

1.8K

1.1K

21.2K

2.7M

Alex@Fax3l·21 Mar

@theo @BlazeisCoding Ahhahha microslop

Nederlands

226

Theo - t3.gg@theo·21 Mar

@BlazeisCoding It means you’re probably using windows and your codebase has file names/paths that are too long for windows

English

386

32.4K

Nikhil Rathore@BlazeisCoding·21 Mar

Hii @theo , What's his error supposed to mean?

English

111

34.3K

Alex ретвитнул

Christos Tzamos@ChristosTzamos·12 Mar

1/4 LLMs solve research grade math problems but struggle with basic calculations. We bridge this gap by turning them to computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds solving even the hardest Sudokus with 100% accuracy

English

250

816

6.1K

1.8M

Alex@Fax3l·13 Mar

@steipete @ebukagaus Agreed. Though « submit yolo » is very fun

English

380

Peter Steinberger 🦞@steipete·13 Mar

@ebukagaus Not sure if you are joking or not, but there'a big difference between a thoughtful conversation with an agent, then it writing code, then testing it and "yo fix issue x and submit yolo"

English

1.1K

79.6K

Ebuka l Socrates🦙🐍🦜🦀@ebukagaus·13 Mar

Are you a Grifter??

Peter Steinberger 🦞@steipete

@AskPerplexity Big no for spamming our repo with a slop PR. Account banned.

English

207

141K

Alex ретвитнул

Han Xiao@hxiao·6 Mar

Seeing a lot of hype around ANE reverse engineering breakthroughs lately. Cool progress, but we benchmarked jina-v5-nano (12-layer 768d transformer) on M3 Ultra: • MLX GPU: 4ms • ANE hybrid (baked weights): 49ms • ANE hybrid (dynamic weights): 236ms ANE's value is low-power inference. It is not necessarily faster than MLX on desktop. MLX GPU does in 4ms what ANE does in 236ms on the same chip. Unified memory doesn't help when ANE must shuttle data through IOSurface kernel calls while GPU reads it directly via Metal with zero copy. People claiming ANE beats MLX for transformer inference have simply never used MLX.

English

247

20.8K

Alex ретвитнул

Beto@betomoedano·5 Mar

😂

QME

514

4.7K

499.4K

Alex ретвитнул

Steve the Beaver@beaversteever·4 Mar

incredible that we built all this RAG and vector database stuff and it turns out that grep from 1973 works better than all that

English

181

359

8.5K

508.1K

Alex ретвитнул

Lukasz Olejnik@lukOlejnik·4 Mar

Google has identified an iOS exploit kit named Coruna. 5 full exploit chains, 23 vulnerabilities, documentation in native English, modular architecture. Full professionalism. It must have cost millions of dollars. Who built it? Google doesn’t say, but the evidence points to US government tools. The kit also contains components previously used in a cyber operation that Russia attributed to the NSA. Coruna traveled. First, an anonymous “company client”, then used by a Russian cyber espionage group, which hid the code on Ukrainian websites inside a visitor-counter script, delivering it only to selected users from a specific geolocation. Later a financially motivated actor “operating from China” deployed it (infecting over 42,000 devices). The malware added to the ready-made kit was lower quality than the original suggesting the tools were acquired and modified by someone else. One US government subcontractor, Peter Williams, just received a 7-year prison sentence for selling tools to Russian broker Operation Zero. The US government spent millions on a tool that now steals cryptocurrency. A good return on investment, just not for themselves. One more detail: Coruna did not attack devices with Lockdown Mode enabled. cloud.google.com/blog/topics/th…

English

217

808

82.5K

Alex@Fax3l·2 Mar

@powl_d @oooooooorion @ropau_ai 400k token dans le contexte c est 160 milliards d elements pour que le systeme d attention choisisse ce qu il se passe. Sans « tricher » avec des flop16 c est 300 gb + de ram. Donc l attention est diluée…

Français

Powlisher@powl_d·2 Mar

Franchement c’est une facon de faire pour du dev solo sur Claude Code ou Cursor. Mais nous on build Orchestria : 5 à 10 agents (Opus, Sonnet, Gemini…) qui se challengent ( On désactive actuellement des agents et on dit a Orchestria de se limiter), se review, font de la QA visuelle avec Playwright et se corrigent entre eux. Un run complet c’est 200 à 400k tokens easy. Avec 1M de contexte tout tient en mémoire, zéro troncature, zéro résumé foireux, zéro perte d’info. L’empoisonnement c’est un problème de dev solo en mode chat. En multi-agents avec quality gates intégrés, ils se démontent entre eux. El famoso /clear c’est bien quand tu codes solo. 1M de contexte, ça change la game quand tu manages une équipe d'agents

Français

1.7K

Powlisher@powl_d·2 Mar

On a débloqué l’option 1M context sur Opus 4.6 Windows pour @ropau_ai 🚀 Objectif : aller plus vite, scaler plus fort. Pas juste regarder mon argent brûler en 4K 😅🔥

Français

80.7K

Alex@Fax3l·1 Mar

@bluetouff A croire qu’il aurait ete dresser pour surtout ne pas paraitre woke

Français

Alex@Fax3l·1 Mar

@bluetouff 😂 mais wtf Bluetooth noooon

English

232

☠ Bluetouff@bluetouff·1 Mar

J'ai vraiment trop hâte que Grok soit utilisé par le Pentagone 😂

Français

6.4K

Alex@Fax3l·1 Mar

@robertgraham state = table[state][column[c]] This so smart i ll now have to live with that knowledge and probably try to put this everywhere

English

980

Robert Graham@robertgraham·1 Mar

Here was my answer to the problem, a faster implementation of the 'wc' program that the interviewer wouldn't be able to understand how it works.

Raj Dabre@prajdabre

Technical interview question: Suppose you have 5 TB worth of text data and you want to count the total number of words, how will you do this?

English

1.1K

153.8K

Alex@Fax3l·28 Şub

@mfranz_on Python -m venv venv

Eesti

Marco Franzon@mfranz_on·27 Şub

A new toy is arrived. Sooo what should I install first?😂

English

Alex@Fax3l·26 Şub

@KieranCrown @ipla03 Dude!’ Thanks!

English

Kieran Crown@KieranCrown·25 Şub

@ipla03 That 13 mini bug is so annoying. For anyone seeing it go into Display > Screen zoom set to large and then back to normal 👌🏻

English

488

Fabrizio@ipla03·25 Şub

That moment when you spin up these three guys to check if your UI is responsive enough

English

115

15.3K

Alex@Fax3l·23 Şub

@SanhEstPasMoi “Senior Slop Janitor” 😂

Română

114

Victor Sanh@SanhEstPasMoi·22 Şub

On my timeline: - “Engineers haven’t written code in 3 months.” - “SaaS apocalypse.” - “PMs are obsolete.” - “If you’re not running 10 coding agents in parallel, you’re ngmi.” - Crab costumes In our repo: - “You are absolutely right!!! Great catch!!!” - Agent deletes half the file and rewrites it right away. - Hallucinated API endpoints. - 47 new tests. 2 are actually testing something non-superficial. - README .md README_SUMMARY .md FINAL_SUMMARY_V2_ACTUAL_FINAL .md Goal Architect. But in the meantime… Senior Slop Janitor. If you feel that tension, we’re hiring builders in NYC.

English

107

3.2K

263.1K

Alex ретвитнул

Jacob Bartlett@jacobtechtavern·21 Şub

blog.jacobstechtavern.com/p/metal-in-swi…

ZXX

844

Открыть

@pbeyssac @dnhkng @karpathy @KenWattana @theo @BlazeisCoding @steipete @ebukagaus