Weather Report

5.3K posts

Weather Report

Weather Report

@ReporterWeather

a dude | heaven is all that matters | gonna bring home little echo’s, truffles, Eliza’s

emporio's ghost room Tham gia Aralık 2019
1K Đang theo dõi804 Người theo dõi
Tweet ghim
Weather Report
Weather Report@ReporterWeather·
Anya Forger (left) realistified (right) using Stable Diffusion (thanks @EMostaque and @endomyxa). I've kept the european nationality, since the anime takes place in fictional parts of Europe.
Weather Report tweet mediaWeather Report tweet media
English
7
20
122
0
Weather Report
Weather Report@ReporterWeather·
The trick is to not have it do thing directly somehow it goes off rails. I always have to ask three questions: > I need {thing} to be done. You’re given …. Now plan exactly how would you set out to do that thing? > Great so now don’t directly edit the files but if you were to edit the files show me the exact diffs > now go ahead and do exactly this Somehow it’s a student that writes 7x7 in rough work.
English
1
0
1
8
Rudzinski Maciej
Rudzinski Maciej@rudzinskimaciej·
Codex gpt5.4 is an idiot It's superb to implement any known thing but when one want to get a bit fancy with methods and invent something it's useless when Opus shines It depends where the novelty is if low lever interveined with rest of logic it lost case if about high level novel type of reuse of old concepts so no complex tie to other systems at this level it's ok
English
1
0
1
113
Weather Report
Weather Report@ReporterWeather·
The point always has been - building a vector. Our actions, corporations, nation states, AI agents with ICL, AI Agents with finetuning, RL are all building a vector towards something. You can have RL model transferability if the vectors are not pointing in different directions. Each base model lets you get far if you point it in direction. Before current gen models you were made to think that magnitude is your job. Seems like it isn't. Your job is to point models to right directions. This will be applied civilization wide.
Thomas Wolf@Thom_Wolf

This is really cool. It got me thinking more deeply about personalized RL: what’s the real point of personalizing a model in a world where base models can become obsolete so quickly? The reality in AI is that new models ship every few weeks, each better than the last. And the pace is only accelerating, as we see on the Hugging Face Hub. We are not far away from better base models dropping daily. There’s a research gap in RL here that almost no one is working on. Most LLM personalization research assumes a fixed base model, but very few ask what happens to that personalization when you swap the base model. Think about going from Llama 3 to Llama 4. All the tuned preferences, reward signals, and LoRAs are suddenly tied to yesterday’s model. As a user or a team, you don’t want to reteach every new model your preferences. But you also don’t want to be stuck on an older one just because it knows you. We could call this "RL model transferability": how can an RL trace, a reward signal, or a preference representation trained on model N be distilled, stored, and automatically reapplied to model N+1 without too much user involvement? We solved that in SFT where a training dataset can be stored and reused to train a future model. We also tackled a version of that in RLHF phases somehow but it remain unclear more generally when using RL deployed in the real world. There are some related threads (RLTR for transferable reasoning traces, P-RLHF and PREMIUM for model-agnostic user representations, HCP for portable preference protocols) but the full loop seems under-studied to me. Some of these questions are about off-policy but other are about capabilities versus personalization: which of the old customizations/fixes does the new model already handle out of the box, and which ones are actually user/team-specific to ever be solved by default? That you would store in a skill for now but that RL allow to extend beyond the written guidance level. I have surely missed some work so please post any good work you’ve seen on this topic in the comments.

English
0
0
0
25
Weather Report đã retweet
Rudzinski Maciej
Rudzinski Maciej@rudzinskimaciej·
This seems to be the best description of what I do I'm teaching LLMs how to perceive and process world like you would by reverse engineering what's unique about you We do deep interviews preferably with BCI and optimize LLM soul to recreate your POV When you have that there more that can be done as we also know what you don't see, what you might want to, how would you want to see it And the better the system works we can do more worke and deeper analysis for you
David Perell Clips@PerellClips

Ezra Klein: "Having AI summarize a book or paper for me is a disaster. It has no idea what I really wanted to know and wouldn't have made the connections I would've made. I'm interested in the thing I will see that other people wouldn't have seen, and I think AI typically sees what everybody else would see. I'm not saying that AI can't be useful, but I'm pretty against shortcuts. And obviously, you have to limit the amount of work you're doing. You can't read literally everything. But in some ways, I think it's more dangerous to think you've read something that you haven't than to not read it at all. I think the time you spend with things is pretty important." @ezraklein

English
1
2
4
218
Weather Report đã retweet
vlad
vlad@radmadvlad·
I built a physical device for your hermes agent. Take it with you, hit the button, activate hermes.
English
5
5
72
14.3K
Weather Report
Weather Report@ReporterWeather·
@JosephJacks_ The you have a transformer pre-pre trained on self operating mathematical universe chanting aum with 12 packets
English
0
0
1
1.6K
Weather Report đã retweet
Justin Skycak
Justin Skycak@justinskycak·
Every system inevitably decays into mediocrity unless someone fights to keep the standards high.
English
38
437
2.1K
51.5K
Weather Report đã retweet
Justin Skycak
Justin Skycak@justinskycak·
Don't focus on career, money, attention, etc. These are all effects. Focus on the underlying cause. Focus on building a machine that produces value. Everything else is a byproduct.
English
13
78
642
13.8K
Weather Report đã retweet
Vishal Misra
Vishal Misra@vishalmisra·
Pi is maximally complex by one measure. Trivially simple by another. That gap explains what AI can and cannot do. New post - previewing today's conversation with @martin_casado for @a16z, out next week. @vishalmisra/shannon-got-ai-this-far-kolmogorov-shows-where-it-stops-c81825f89ca0" target="_blank" rel="nofollow noopener">medium.com/@vishalmisra/s…
English
5
17
114
78K
👋 Jan
👋 Jan@jandotai·
Something's coming Monday... 👀
English
24
15
309
16.5K
attentionmech
attentionmech@attentionmech·
Which LLM apps for mac are everyone's goto list? (apart from ollama, lmstudio etc. which are providers)
English
4
0
11
2.3K
Weather Report
Weather Report@ReporterWeather·
@shiraeis You're missing the yolo mode, where you fully give into the path, the maze you just do because. You forget options even exist, there is nothing to explore. You just bump into things. There is nothing to optimize for. There is no script either. The environment is your script.
English
0
0
2
74
shira
shira@shiraeis·
are you really optionality maxxing, or is it just stochastic exploration with a missing reward signal?
English
10
4
57
2.6K
Weather Report đã retweet
Shannon Sands
Shannon Sands@max_paperclips·
Not only that, but we know this ends with "since you can't compete with AI, you need a neuralink as an exocortex to upgrade if you want a job". If people think handing write-level access to their brain to centralised models controlled by the NSA or whoever is a lesser threat than open source models you control....idk what world they're living in
English
1
2
21
572
Weather Report đã retweet
Shannon Sands
Shannon Sands@max_paperclips·
@tenobrus I would rather risk that than permanent lobotomite-slave-world by the government swamp & epstein class. in fact, I'd rather the earth was a charred, smoking cinder in the void than your outcome tbh
English
4
2
65
860