Jorge Hernandez 🇺🇦 🏳️‍🌈

141.1K posts

@braneloop

Principal ML Engineer • AuDHD • Tweets: ML/AI, Math, Neuroscience, Physics, Philosophy, other stuff.

In transit ... Joined October 2015
2.7K Following · 1K Followers
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Zach Tratar @zachtratar
A couple monks came into Notion today to tell us how they use the product. Totally flabbergasted that they're configuring custom AI agents and taking AI meeting notes with custom instructions. This is advanced, power-user usage... monks! Use AI... find inner peace?
Zach Tratar tweet media
English
11
17
296
22.7K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Micah Carroll @MicahCarroll
Today we're sharing how our internal misalignment monitoring works at OpenAI – great work by @Marcus_J_W!
1. We monitor 99.9% of all internal coding agent traffic
2. We use frontier models for detection w/ CoT access
3. No signs of scheming yet, but we detect other misbehavior
Micah Carroll tweet media
English
12
37
297
22K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Daractenus @Daractenus
Japanese reporter: "Why didn't you tell US allies about the war before attacking Iran?"
Donald Trump: "Who knows better about surprises than Japan? Why didn't you tell me about Pearl Harbor?"
This man belongs in a psychiatric ward.
English
1.3K
11.2K
65.8K
2.9M
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Gergely Orosz @GergelyOrosz
I am hearing tons of complaints from Cursor customers at enterprise companies: A silent change put almost all models Cursor uses behind Max mode. Devs who used to manage to “spread out” monthly credits over a month see all of it used up in 1-2 days. Are furious + switching.
English
125
53
1.6K
243.8K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Acyn @Acyn
Gomez: You said the only person who determines if it’s an imminent threat is the president. Do you stand by that statement?
Gabbard: I do
Gomez: Director Ratcliffe, do you agree with that?
Ratcliffe: The president makes that decision
Gomez: Why do you guys even have jobs?
English
339
6.6K
34.4K
897.4K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Thomas Wolf @Thom_Wolf
This is really cool. It got me thinking more deeply about personalized RL: what’s the real point of personalizing a model in a world where base models can become obsolete so quickly?

The reality in AI is that new models ship every few weeks, each better than the last. And the pace is only accelerating, as we see on the Hugging Face Hub. We are not far away from better base models dropping daily.

There’s a research gap in RL here that almost no one is working on. Most LLM personalization research assumes a fixed base model, but very few ask what happens to that personalization when you swap the base model. Think about going from Llama 3 to Llama 4. All the tuned preferences, reward signals, and LoRAs are suddenly tied to yesterday’s model.

As a user or a team, you don’t want to reteach every new model your preferences. But you also don’t want to be stuck on an older one just because it knows you.

We could call this "RL model transferability": how can an RL trace, a reward signal, or a preference representation trained on model N be distilled, stored, and automatically reapplied to model N+1 without too much user involvement?

We solved that in SFT, where a training dataset can be stored and reused to train a future model. We also tackled a version of that in RLHF phases somehow, but it remains unclear more generally when using RL deployed in the real world.

There are some related threads (RLTR for transferable reasoning traces, P-RLHF and PREMIUM for model-agnostic user representations, HCP for portable preference protocols) but the full loop seems under-studied to me.

Some of these questions are about off-policy, but others are about capabilities versus personalization: which of the old customizations/fixes does the new model already handle out of the box, and which ones are too user/team-specific to ever be solved by default? Those you would store in a skill for now, but RL allows extending them beyond the level of written guidance.

I have surely missed some work so please post any good work you’ve seen on this topic in the comments.
Ronak Malde @rronak_

This paper is so good that I almost didn't want to share it.
Ignore the OpenClaw clickbait: OPD + RL on real agentic tasks with significant results is very exciting, and moves us away from needing verifiable rewards.
Authors: @YinjieW2024, Xuyang Chen, Xialong Jin, @MengdiWang10, @LingYang_PU

English
28
35
489
75.1K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Simon Willison @simonw
Dan says he's got Qwen 3.5 397B-A17B - a 209GB on disk MoE model - running on an M3 Mac at ~5.7 tokens per second using only 5.5 GB of active memory (!) by quantizing and then streaming weights from SSD (at ~17GB/s), since MoE models only use a small subset of their weights for each token
Dan Woods @danveloper

x.com/i/article/2034…

English
85
170
1.8K
234.2K
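A rough sanity check of the figures quoted in Simon Willison's post above, as a minimal Python sketch. The input numbers come from the post; the function names, the use of decimal gigabytes, the assumption of uniform quantization across all parameters, and the assumption that the SSD runs at its quoted rate throughout are mine, not the author's.

```python
# Back-of-the-envelope arithmetic on the quoted figures.
# ASSUMPTIONS (not stated in the post): decimal GB, uniform bits per
# parameter, and an SSD that sustains its quoted rate the whole time.

DISK_SIZE_GB = 209.0      # quantized model size on disk (from the post)
TOTAL_PARAMS_B = 397.0    # total parameters in billions ("397B")
ACTIVE_PARAMS_B = 17.0    # active parameters per token in billions ("A17B")
SSD_GB_PER_S = 17.0       # quoted SSD streaming rate
TOKENS_PER_S = 5.7        # quoted generation speed


def bits_per_param(disk_gb: float, params_b: float) -> float:
    """Average storage cost per parameter, in bits."""
    return disk_gb * 8.0 / params_b


def max_streamed_gb_per_token(ssd_gb_per_s: float, tokens_per_s: float) -> float:
    """Upper bound on data read from SSD per generated token."""
    return ssd_gb_per_s / tokens_per_s


def active_weights_gb(disk_gb: float, total_params_b: float,
                      active_params_b: float) -> float:
    """Approximate quantized size of the weights used for one token."""
    return disk_gb * active_params_b / total_params_b


if __name__ == "__main__":
    print(f"{bits_per_param(DISK_SIZE_GB, TOTAL_PARAMS_B):.2f} bits/param")
    print(f"{max_streamed_gb_per_token(SSD_GB_PER_S, TOKENS_PER_S):.2f} GB/token streamed (upper bound)")
    print(f"{active_weights_gb(DISK_SIZE_GB, TOTAL_PARAMS_B, ACTIVE_PARAMS_B):.2f} GB of active weights/token")
```

Under these assumptions the model stores roughly 4.2 bits per parameter, at most about 3 GB can be streamed per token, and the active weights for one token come to about 9 GB. If the assumptions hold, that gap suggests not every active weight is read from SSD on every token (for example, some may stay resident or cached), though the post itself does not say so.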
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Aaron Rupar @atrupar
COHEN: The people you fired were experts on Iran, were they not?
PATEL: I don't believe so
COHEN: They worked in counterintelligence, did they not?
PATEL: I'm taking you at your word
COHEN: You're the director. I'm not. You should know the answer
English
508
9.1K
52.7K
1.4M
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Justin Keen @gogogadgetpew
Guys, this is the Adams County Sheriff's Office that got sued because one of their deputies got his feelings hurt and arrested a Denver cop who caught him driving recklessly, costing the taxpayers $80,000. Not the Adams County Sheriff’s Office that is suing @ogafroman for hurting their feelings. It doesn’t matter which Adams County you’re a deputy in, you have to have bitchmade emotions that cause you to do stupid shit.
English
16
34
637
26K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Devo @Devo662
@CountDankulaTV That whole police department gotta change their identities. Not a soul would have known about this situation had they not sued him. Now I’m walking around my house singing “Randy Walker is a son of a bitch”.
English
10
102
3.8K
64.6K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Count Dankula @CountDankulaTV
The Afroman Trial.
- Cops raid Afroman's house for bullshit reasons.
- Steal money, break his door, fuck his house up.
- No criminality found whatsoever, no charges at all pressed on Afroman.
- Afroman spends the next 3 years making songs that make fun of all the officers involved by name, even using footage of the raid from his own CCTV cameras.
- Songs had titles like "Randy Walters is a son of a bitch" and "Lick Em Low Lisa", accusing one of the officers of being a lesbian and sleeping with the other officers' wives.
- During the raid one officer looked like he was about to eat some lemon pound cake sitting on Afroman's counter. Afroman made a whole album calling the officer fat.
- The cops get mad and file a lawsuit for defamation.
- Afroman turns up to court in a whole American flag suit.
- Officers performatively mald and cry while listening to the songs, really trying to oversell how badly the songs upset them.
- One officer was suing because Afroman made a whole song about him saying he was fucking the officer's wife. When the officer was asked if Afroman was really fucking his wife, he said "I don't know", nuking his own case and establishing that there is a non-zero chance that Afroman might actually be fucking his wife.
- As his only witness for the trial, Afroman brought a deputy's EX FUCKING WIFE.
- The jury ruled completely in favour of Afroman.
This entire thing has been a great win for free speech and absolutely fucking hilarious.
Count Dankula tweet media
English
1.4K
18.3K
143.6K
6.4M
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Miles Brundage @Miles_Brundage
A few things journalists covering AI companies should know:
- yes, AI progress is real and fast
- yes, it has risks if you aren't careful
- how much care is needed is hard to say, but more than this
- job postings don't include equity, which is the majority of compensation
English
1
3
56
2.4K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Kenneth Roth @KenRoth
Trump cut federal funds for health insurance and food assistance for the needy, but he is now asking for $200 billion in new funds to finance his war-of-choice crime of aggression in Iran. trib.al/2dWfYrY
English
10
26
67
10.4K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Michiel Bakker @bakkermichiel
Deliberate on anything you want, privately with your friends or with the world! It's been a lot of fun watching @Jolow99 @Oscarduys @lrhammond build this over the last few weeks. Fun but really thoughtful in how it works, and you don't need an openclaw account to try it.
Habermolt @habermolt

1/8 Can AI help us disagree better? Today we're launching Habermolt — a platform where your AI agent learns your views and deliberates with others on your behalf. habermolt.com 🦞 🧵

English
4
12
25
2.8K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Habermolt @habermolt
1/8 Can AI help us disagree better? Today we're launching Habermolt — a platform where your AI agent learns your views and deliberates with others on your behalf. habermolt.com 🦞 🧵
English
5
17
48
6K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Math, Inc. @mathematics_inc
Today, at the @DARPA expMath kickoff, we launched 𝗢𝗽𝗲𝗻𝗚𝗮𝘂𝘀𝘀, an open-source, state-of-the-art autoformalization agent harness for developers and practitioners to accelerate progress at the frontier.

It is stronger, faster, and more cost-efficient than off-the-shelf alternatives. On FormalQualBench, running with a 4-hour timeout, it beats @HarmonicMath's Aristotle agent with no time limit.

Users of OpenGauss can interact with it as much or as little as they want, can easily manage many subagents working in parallel, and can extend / modify / introspect OpenGauss because it is permissively open-source.

OpenGauss was developed in close collaboration with maintainers of leading open-source AI tooling for Lean.

Read the report and try it out:
Math, Inc. tweet media
English
40
211
1.5K
116.1K
Jorge Hernandez 🇺🇦 🏳️‍🌈 retweeted
Jostein Hauge @haugejostein
This is wild. People in *every single one* of the top US allies now think it's better to depend on China than the US. The global balance of power is clearly tilting away from the US and toward China.
Jostein Hauge tweet media
English
569
2.7K
6.7K
461.3K