Carlos Garrido ๐Ÿฟ

1.3K posts

Carlos Garrido ๐Ÿฟ

Carlos Garrido ๐Ÿฟ

@shouldomythesis

Drinking coffee. One deadline at a time. Lacking coherence

Lisbon, Portugal ๊ฐ€์ž…์ผ Aralฤฑk 2015
2.1K ํŒ”๋กœ์ž‰215 ํŒ”๋กœ์›Œ
๊ณ ์ •๋œ ํŠธ์œ—
Carlos Garrido ๐Ÿฟ
Carlos Garrido ๐Ÿฟ@shouldomythesisยท
I use error bars
Carlos Garrido ๐Ÿฟ tweet media
English
4
1
24
0
Yegor Denisov-Blanch
Yegor Denisov-Blanch@yegordbยท
I want to give you a full set of @Stanford course materials on how to measure AI. I'm on the teaching team for a new class: CS 321M - AI Measurement Science. Comment or reshare and I'll send you everything: slides, readings, textbook. Why does this class exist? Because most AI benchmarks are already broken. Saturated, partly memorized, or disconnected from real use. Yet we still use them to pick models, justify budgets, and shape policy. AI moves fast and measurement hasn't kept up. This class covers what goes wrong with benchmarks, how to design evaluations that hold up, and what valid measurement looks like. If we can't measure AI we can't improve it. I think this material matters way beyond one classroom. Help me get it out there.
Yegor Denisov-Blanch tweet media
English
8
8
18
1.1K
Alexander Doria
Alexander Doria@Dorialexanderยท
@V4ldeLund No donโ€™t really exist yet. My immediate problem right now is math problem discovery but most synth pipelines are broadly combining constraints/digging through a search space and we donโ€™t have the right language for it yet.
English
2
0
7
227
Alexander Doria
Alexander Doria@Dorialexanderยท
Starting to suspect most synthetic pipelines will be about topology long term (first with math, but not only).
English
7
1
69
4.2K
Ahmad
Ahmad@TheAhmadOsmanยท
Gemini will be the Android / iOS model ChatGPT will be the enterprise model Claude will be the specialized agents model
English
24
7
156
31.6K
Ahmad
Ahmad@TheAhmadOsmanยท
@CryptoElite007 Any, as long as you take a screenshot for proof for the form youโ€™ll receive later x.com/theahmadosman/โ€ฆ
Ahmad@TheAhmadOsman

RTX PRO 6000 (96GB VRAM, ~$15K) GIVEAWAY FAQ Q: Cost to enter? A: $0. Free. Q: Do I have to register for GTC? A: Yes, virtual attendance is COMPLETELY FREE Q: Where do I enter? A: Tap the link in my bio, thereโ€™s a clear button on the page Q: How do I increase my chances? A: Earn bonus entries: โ€ข +150 for signing up for GTC 2026 โ€ข +75 per referral when someone uses your code โ€ข Follow / subscribe on socials for extra entries Q: Is this officially sponsored? A: Yes, sponsored by NVIDIA Q: When do entries close? A: March 19 Q: What happens after I enter? A: After GTC, youโ€™ll receive a form by email Q: What do I need to submit? A: Proof of attendance: โ€ข Virtual โ†’ screenshot โ€ข In-person โ†’ selfie at GTC Q: When is the proof deadline? A: April 1 (preliminary date, may change based on response rates) Multiple reminders will be sent Q: How is the winner chosen? A: Random draw among verified entries Q: When is the winner announced? A: TBD I need time to verify all valid submissions Depends on verification volume Q: When does the GPU ship? A: TBD Q: Where will updates be posted? A: Email + my socials Q: Didnโ€™t get the verification email? A: Scroll down and hit โ€œSubmitโ€ on the Giveaway Entry page Q: Are there location restrictions? A: No, there was a bug, now fixed. Try again Q: Who can enter? A: Anyone who can attend GTC and provide valid proof Q: Is registering enough? A: No, you must attend and submit proof Q: Do I need to watch sessions or just register? A: You must attend and provide proof Q: Do I need to attend live? A: Yes, you must attend live and provide proof Replay views donโ€™t qualify Q: Is registering enough? A: No, you must attend and submit proof

English
1
0
2
324
Carlos Garrido ๐Ÿฟ ๋ฆฌํŠธ์œ—ํ•จ
Leonardo de Moura
Leonardo de Moura@Leonard41111588ยท
Prover correctness is becoming a central question as AI enters mathematics and software verification. New essay on why Lean's architecture is designed to survive AI pressure. leodemoura.github.io/blog/2026-3-16โ€ฆ
English
6
45
243
17.1K
SAIR
SAIR@SAIRfoundationยท
Terence Tao: Formal Verification Breaks the Trust Barrier in Mathematics Formal verification is transforming mathematical collaborations โ€” enabling anonymous contributions, machine-checked proofs, and radically more precise scientific discussion.
English
7
88
405
78.3K
Kevin A. Bryan
Kevin A. Bryan@Afinetheoremยท
I didn't realize this: Finland, which used to be the absolute star in the West on education, is now on math roughly at the OECD average, only a bit higher than the US (meaning way behind New England), and fell 60 pts in 20 years, worst in the world. What happened?!
Kevin A. Bryan tweet media
English
518
577
4.5K
809.8K
Carlos Garrido ๐Ÿฟ
Carlos Garrido ๐Ÿฟ@shouldomythesisยท
@Leonard41111588 Great article :) "What would a verification platform for the AI era require? A small, trusted kernel" Would love to hear your thoughts about how trust is built checking this kernel and also how trust builds around the kernel! x.com/JFPuget/statusโ€ฆ
JFPuget ๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿ‡จ๐Ÿ‡ฆ๐Ÿ‡ฌ๐Ÿ‡ฑ@JFPuget

How is Lean code proved correct? Is Lean written in Lean? It all stemmed form a wild thought I had: what is there is a bug in Lean? How would it impact al the proofs created with lean? Reason for the above is I never saw a bug free software.

English
0
1
5
1.8K
๐ŸŒผ๐Ÿชป๐Ÿชด
๐ŸŒผ๐Ÿชป๐Ÿชด@moonstar24689ยท
@RepThomasMassie I donโ€™t have a screenshot but the one that says โ€œwhat a night, glad I didnโ€™t kill anyoneโ€ or something along those lines. Hopefully someone has it and will post
English
8
1
611
46.9K
Thomas Massie
Thomas Massie@RepThomasMassieยท
Tomorrow I will go to DOJ to view the unredacted Epstein files. Which docs should I view? Include EFTA link in reply I will sort the replies to this post by โ€œnumber of likes,โ€ so instead of making redundant posts, please โ€œlikeโ€ replies that contain docs you think are important.
English
7.1K
14.7K
131.9K
5.7M
Carlos Garrido ๐Ÿฟ ๋ฆฌํŠธ์œ—ํ•จ
IPMA
IPMA@ipma_ptยท
#Tempo: De 2 a 8/Fev prevemos uma semana muito chuvosa, na generalidade do paรญs, especialmente no Norte e Centro. Episรณdios de trovoada com possรญvel granizo. Esperada neve na Estrela e extremo Norte. Vento moderado a forte e agitaรงรฃo marรญtima forte๐Ÿ‘‰ tinyurl.com/3jbfes7x
IPMA tweet media
Portuguรชs
5
26
127
8.3K
Carlos Garrido ๐Ÿฟ
Carlos Garrido ๐Ÿฟ@shouldomythesisยท
@Dorialexander This might be just anecdotal evidence, but I've been somewhat successful in doing simple scrips with Gemini. I like the explanations and it's easy to follow and debug. Never tried opus, but 5.2 sometimes overcomplicates v0
English
0
0
1
41
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShershยท
So, is anyone still interested in becoming a better SWE? Or is everyone obsessed with AI agent orchestrators to build and sell your next SaaS and escape poverty as soon as possible?
English
406
72
2.3K
273.5K
Aaron Stannard
Aaron Stannard@Aarononthewebยท
Not good
Aaron Stannard tweet media
English
100
152
2.8K
557.2K
Carlos Garrido ๐Ÿฟ
Carlos Garrido ๐Ÿฟ@shouldomythesisยท
@MKBHD I fell like it really depends if those 24 reviews are from a trusted reviewer or just random users. Also depends on the 24 reviews themselves, especially the worse ones. Any common failure modes? Common complaints? Etc.
English
0
0
0
27
Marques Brownlee
Marques Brownlee@MKBHDยท
Alright, even better ๐Ÿ‘€
English
131
17
637
329.8K
Marques Brownlee
Marques Brownlee@MKBHDยท
You're about to spend $100 on a product you've never tried. Which version do you pick?
English
273
42
1.6K
551.1K