


Weather Report

@ReporterWeather




This is really cool. It got me thinking more deeply about personalized RL: what's the real point of personalizing a model in a world where base models become obsolete so quickly? The reality in AI is that new models ship every few weeks, each better than the last, and the pace is only accelerating, as we see on the Hugging Face Hub. We are not far from better base models dropping daily.

There's a research gap in RL here that almost no one is working on. Most LLM personalization research assumes a fixed base model, but very few ask what happens to that personalization when you swap the base model. Think about going from Llama 3 to Llama 4: all the tuned preferences, reward signals, and LoRAs are suddenly tied to yesterday's model. As a user or a team, you don't want to reteach every new model your preferences. But you also don't want to be stuck on an older one just because it knows you.

We could call this "RL model transferability": how can an RL trace, a reward signal, or a preference representation trained on model N be distilled, stored, and automatically reapplied to model N+1 without too much user involvement? We solved this for SFT, where a training dataset can be stored and reused to train a future model. RLHF pipelines tackle a version of it too, but the general case — RL deployed in the real world — remains unclear. There are some related threads (RLTR for transferable reasoning traces, P-RLHF and PREMIUM for model-agnostic user representations, HCP for portable preference protocols), but the full loop seems under-studied to me.

Some of these questions are about off-policy learning, but others are about capabilities versus personalization: which of the old customizations/fixes does the new model already handle out of the box, and which ones are genuinely user/team-specific and will never be solved by default? For now you'd store those in a skill, but RL would allow extending them beyond the level of written guidance.
I have surely missed some work, so please post any good work you've seen on this topic in the comments.
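The SFT analogy in the post can be made concrete. A minimal sketch, assuming nothing beyond the post itself: store preferences as model-agnostic (prompt, chosen, rejected) records, then on model N+1 filter out the ones the new model already handles out of the box, keeping only the genuinely user-specific ones to re-apply. All names here (`PreferenceRecord`, `load_transferable`, the `tag` field) are hypothetical illustrations, not an existing library.

```python
from dataclasses import dataclass, asdict
import json

# Hypothetical model-agnostic preference record. Storing preferences at this
# level (text in, preferred/rejected text out) ties nothing to model N's
# weights, mirroring how an SFT dataset outlives the model it trained.
@dataclass
class PreferenceRecord:
    prompt: str
    chosen: str      # response the user preferred
    rejected: str    # response the user rejected
    tag: str = "user-specific"  # vs. "capability-gap" (may vanish in model N+1)

def save_preferences(records, path):
    """Persist preferences independently of any base model, one JSON per line."""
    with open(path, "w") as f:
        for r in records:
            f.write(json.dumps(asdict(r)) + "\n")

def load_transferable(path, new_model_already_handles):
    """On model N+1, keep only records the new model doesn't already satisfy.

    `new_model_already_handles` is a caller-supplied predicate (e.g. "does the
    new model's default answer to `prompt` already match `chosen`?").
    """
    kept = []
    with open(path) as f:
        for line in f:
            r = PreferenceRecord(**json.loads(line))
            if not new_model_already_handles(r.prompt, r.chosen):
                kept.append(r)
    return kept
```

The filtering step is the capabilities-versus-personalization split from the post: records the predicate drops were capability gaps the new model closed on its own; the survivors are the ones worth distilling back in via RL or a skill.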

Ezra Klein: "Having AI summarize a book or paper for me is a disaster. It has no idea what I really wanted to know and wouldn't have made the connections I would've made. I'm interested in the thing I will see that other people wouldn't have seen, and I think AI typically sees what everybody else would see. I'm not saying that AI can't be useful, but I'm pretty against shortcuts. And obviously, you have to limit the amount of work you're doing. You can't read literally everything. But in some ways, I think it's more dangerous to think you've read something that you haven't than to not read it at all. I think the time you spend with things is pretty important." @ezraklein

New blog post: "A sufficiently detailed spec is code" I wrote this because I was tired of people claiming that the future of agentic coding is thoughtful specification work. As I show in the post, the reality devolves into slop pseudocode haskellforall.com/2026/03/a-suff…

This will have ZERO predictive power. Let me explain. I love 'living in the future', and that's why when I saw DALL·E 2 in April 2022, I could already predict that before long we'd get realistic images and video, songs, and eventually even all of it in real time, which is still to be seen. It was easy and plausible to draw that line. But the idea that a toy trying to simulate groups of humans in an Asimov-style psychohistory way could actually have predictive power is mathematically impossible, for a very simple reason: chaos theory. Small changes in the input produce unpredictable changes in the output. Simulating a bunch of Sims won't get you anywhere close to predicting the real world. As a toy or a science-fiction premise, though, it's great! Highly recommend Asimov's "Foundation" and the series "Devs".
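The chaos-theory claim above is easy to demonstrate on the textbook example of a chaotic system, the logistic map x → r·x·(1−x) at r = 4 (chosen here as an illustration; the post itself names no specific system). Two trajectories starting 10⁻¹⁰ apart diverge to order-1 differences within a few dozen steps, which is why a tiny error in the initial state of any such simulation swamps the forecast.

```python
# Sensitive dependence on initial conditions via the logistic map at r = 4,
# a standard chaotic regime: nearby trajectories separate exponentially fast.
def logistic_trajectory(x0, r=4.0, steps=50):
    xs = [x0]
    for _ in range(steps):
        xs.append(r * xs[-1] * (1.0 - xs[-1]))
    return xs

a = logistic_trajectory(0.3)
b = logistic_trajectory(0.3 + 1e-10)  # perturb the start by one part in 10^10

# The initial gap is negligible, but somewhere along the trajectory the two
# runs end up on opposite sides of the unit interval.
max_gap = max(abs(x - y) for x, y in zip(a, b))
```

A simulated society is this map times a few billion coupled variables: even a perfect model with a measurably imperfect initial state loses all predictive power on short horizons.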


Sam Altman: "I bet there is another new architecture to find." Sam Altman believes we are on the verge of discovering a new underlying architecture that will be as big a leap forward as Transformers were over LSTMs. He noted that we finally have AI models that are smart enough to help conduct this level of research (GPT 5.4 and above 👀). His direct advice to builders looking for the next major leap is to look for a "mega breakthrough" and use current models to help them find it.


The persisting importance of prompt engineering -- and now harness engineering -- is one of the best indicators of how far we are from AGI. A general system doesn't need a task-specific harness. And when provided with instructions, it is robust to phrasing variations.


Introducing a new method to teach LLMs to reason like Bayesians. By training models to mimic optimal probabilistic inference, we improved their ability to update their predictions and generalize across new domains. Learn more: goo.gle/4ue4eqj
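The "optimal probabilistic inference" the post trains models to mimic has a closed form in simple settings. A minimal sketch, not taken from the linked work: the Beta-Binomial conjugate update, where a trained model's stated probability should shift by the same amount the exact posterior does when new evidence arrives.

```python
# Exact Bayesian updating in the simplest conjugate case: a Beta(alpha, beta)
# prior over a coin's heads probability, updated on observed flips. This is
# the kind of calibrated belief-shift a "reason like a Bayesian" objective
# would reward.
def beta_update(alpha, beta, heads, tails):
    """Posterior Beta parameters after observing `heads` and `tails`."""
    return alpha + heads, beta + tails

def posterior_mean(alpha, beta):
    """Mean of a Beta(alpha, beta) distribution: alpha / (alpha + beta)."""
    return alpha / (alpha + beta)

# Uniform prior Beta(1, 1), then observe 7 heads and 3 tails.
a, b = beta_update(1, 1, heads=7, tails=3)
```

The appeal of training against such targets is that the "correct" prediction update is computable exactly, so the supervision signal is noise-free, and behavior learned on coins can be probed for generalization to domains with no closed form.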



