Sabitlenmiş Tweet
Maxime 🥛
640 posts

Maxime 🥛
@maxime0033
Rends pas fou https://t.co/b0oSJylah9
Katılım Ocak 2014
251 Takip Edilen69 Takipçiler
Maxime 🥛 retweetledi

@TwitchGauchiste On vit quand même dans un pays où une meuf dont le " travail " est de jouer à la switch et d'être vaguement sympa a le droit ( enfin avait mdr ) de décider comment mon argent est dépensé
Enième rituel d'humiliation pour Nicolas

Français
Maxime 🥛 retweetledi
Maxime 🥛 retweetledi

@Cacamann_ trop bien ce meme qd je vais l'utiliser quand quelqu'un dira un truc du genre "imagine inox danser en jouant à lol devant ava mind qui mix" et là je pourrais placer le meme je pense
Français
Maxime 🥛 retweetledi

Je l'ai perdue... Sans blague j'ai l'impression de perdre 80% des stats que je trouve, c'est effrayant.
Yoonns.@YoonnsOfficiel
J'ai une stat effrayante
Français
Maxime 🥛 retweetledi

@QuadMerad Anyme ministre de la cb12 🤣🤣🤣🤣🤣🤣🤣🤣🤣 et billy ministre du donuts sucré o sucre 😹😹😹😹😹😹😹😹😹
Français
Maxime 🥛 retweetledi
Maxime 🥛 retweetledi

She dumped me last night.
Not because I don't listen.
Not because I'm always on my phone.
Not even because I forgot our anniversary (twice).
But because,
in her exact words:
"You only pay attention to the parts of what I say that you think are important."
I stared at her for a moment and realized...
She just perfectly described the attention mechanism in transformers.
Turns out I wasn't being a bad boyfriend. I was being mathematically optimal.
See, in conversations (and transformers), you don't give equal weight to every word. Some words matter more for understanding context. Attention figures out exactly HOW important each word should be.
Here's the beautiful math:
Attention(Q, K, V) = softmax(QK^T / √d_k)V
Breaking it down:
Q (Query): "What am I looking for?"
K (Key): "What info is available?"
V (Value): "What is that info?"
d_k: Key dimension (for scaling)
Think library analogy:
You have a question (Query). Books have titles (Keys) and content (Values). Attention finds which books are most relevant.
Step-by-step with "The cat sat on the mat":
Step 1: Create Q, K, VEach word → three vectors via learned matrices W_Q, W_K, W_V
For "cat":
Query: "What should I attend to when processing 'cat'?"
Key: "I am 'cat'"
Value: "Here's cat info"
Step 2: Calculate scoresQK^T = how much each word should attend to others
Processing "sat"? High similarity with "cat" (cats sit) and "mat" (where sitting happens).
Step 3: Scale by √d_kPrevents dot products from getting too large, keeps softmax balanced.
Step 4: SoftmaxConverts scores to probabilities:
"cat": 0.4 (subject)
"sat": 0.3 (action)
"mat": 0.2 (location)
"on": 0.1 (preposition)
"the": 0.1 (article)
Step 5: Weight valuesMultiply each word's value by attention weight, sum up. Now "sat" knows it's most related to "cat" and "mat".
Multi-Head Magic:Transformers do this multiple times in parallel:
Head 1: Subject-verb relationships
Head 2: Spatial ("on", "in", "under")
Head 3: Temporal ("before", "after")
Head 4: Semantic similarity
Each head learns different relationship types.
Why This Changed Everything:
Before: RNNs = reading with flashlight (one word at a time, forget the beginning)
After: Attention = floodlights on entire sentence with dimmer switches
This is why ChatGPT can:
Remember 50 messages ago
Know "it" refers to something specific
Understand "bank" = money vs river based on context
The Kicker:Models learn these patterns from data alone. Nobody programmed grammar rules. It figured out language structure just by predicting next words.
Attention is how AI learned to read between the lines.
Just like my therapist helped me understand my focus patterns, maybe understanding transformers helps us see how we decide what matters.
Now if only I could implement multi-head attention in dating... 🤖
Still waiting for "scaled dot-product listening" to be invented.
English

@aplmoatonton Merci de remettre les pendules à l’heure et j’ajoute même que Xeu est top 10 albums de la decennie
Français

Je vous jure que ça se soigne
zaï ☆@shinigamind
un cambriolage de riche c’est juste de la redistribution de richesse dans mon langage
Français

Dire barbe à papa sans bégayer
L'Eau En Poudre💧🇨🇮@LEauEnPoudre_
C’est quoi le truc hyper basique que tout le monde sait faire, mais sauf vous ?
Français

@rueidff Il faut divorcer et la rayé de sa vie il faut pas frapper rien du tout il faut juste aller au tribunal et demander le divorce et casser la gueule au mec
Français













