(☀️_☀️)

1.5K posts

(☀️_☀️) banner
(☀️_☀️)

(☀️_☀️)

@ontofractal

Only the birds fly first class 🐦

Earth Katılım Şubat 2015
1.4K Takip Edilen255 Takipçiler
(☀️_☀️) retweetledi
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
We focus a lot of policy on the AI models that are already released, but what about the AI models waiting internally in the wings? Our new paper explores risks from internal AI systems.
Oscar Delaney@oscar__delaney

1/ The most powerful AIs aren't public. For months, labs run highly capable internal models before release. @ashwinkacharya and I wrote a new @IAPS report, “Managing Risks from Internal AI Systems,” exploring the hidden dangers this creates. 🧵

English
0
3
34
2.2K
(☀️_☀️)
(☀️_☀️)@ontofractal·
@peterwildeford If gemini activity is saved they explicitly reserve the right to both train their models and use human workers to review chats. When gemini activity is disabled, gemini chats are ephemeral and are not saved to gemini dashboard. It's not great.
English
0
0
5
178
Peter Wildeford🇺🇸🚀
Peter Wildeford🇺🇸🚀@peterwildeford·
I'm confused about people who think we can't upload confidential data to Gemini when the data is already in Google Sheets and discussed over Google Hangout and Gmail.
English
6
7
141
5.5K
(☀️_☀️)
(☀️_☀️)@ontofractal·
@ericneyman a concept named using two or more surnames, e.g. Sapir-Whorf hypothesis, Einstein-Podolsky-Rosen paradox
English
0
0
1
13
Eric Neyman
Eric Neyman@ericneyman·
Sometimes you can tell that a Wikipedia article will be good based on its last word: "paradox", "effect", and "incident", for example. What are other examples?
English
2
0
7
459
(☀️_☀️) retweetledi
Firefly Aerospace
Firefly Aerospace@FireflySpace·
We have confirmation #BlueGhost stuck the landing! Firefly just became the first commercial company in history to achieve a fully successful Moon landing. This small step on the Moon represents a giant leap in commercial exploration. Congratulations to the entire Firefly team, our mission partners, and our @NASA customers for this incredible feat that paves the way for future missions to the Moon and Mars. Standby for the first image, expected in the next 30 minutes! #BGM1
English
730
3.1K
20.2K
1.8M
(☀️_☀️)
(☀️_☀️)@ontofractal·
some good points here
Will Bryk@WilliamBryk

Deep thoughts on Deepseek and Deep Research There was a lot of big AI news the past 2 weeks, but the actual biggest news wasn't what you'd think. The biggest news was not the trillion dollar Nvidia drop. That was a market overreaction bc of a spicy story. It was not the cheap Deepseek training run. That was impressive engineering under constraints but overblown. It was not the 500 billion dollar Stargate cluster. That was in line with predictions for big lab compute spend in the coming years. And it was not OpenAI's Deep Research. That was an impressive release but an entirely predictable combination of o3 with a traditional search engine API. So what was the biggest news? The biggest news… was that Reinforcement Learning for LLMs "just works". RL for LLMs is now easy to get working. We see RL for LLMs just working for Deepseek, given the speed they were able to replicate o1, and given the ease that other orgs had using the same RL algorithm on different training data. And we see RL for LLMs just working for OpenAI, with the speed they were able to get Deep Research working, only SOME WEEKS after o3 was trained. Something new has been discovered about reality, a statistical law of the universe. It's hard for us to grasp the power of billions of weights melding toward some reward signal. We're touching up against fundamental properties of information systems. If we ever meet superintelligent aliens out there, they'd probably tell us that they too discovered something akin to RL for LLMs long ago. All the other AI news stories the past two weeks will one day be minor details in the story. But RL for LLMs just working will power all the AI news going forward. It is this discovery that will usher in the next era of human history. So what will this next era look like if it's powered by RL + LLMs? Will it be run by startups or big tech companies? Will it be open source or closed? Will it be deeply sought or deeply researched? Yes. All of the above. I think the past two weeks suggest we're on track for a very diverse world, one where small players and big players, open and closed source, intelligent systems (deepseek) and knowledge systems (deep research), each have big roles to play. That's because the amount of value that's coming is absolutely massive (trillions of dollars) and no single player or single system will take it all. When you transition an entire economy to a new foundation built on compute, there will be opportunity everywhere for everyone. RL for LLMs just works, not just for OpenAI but for everyone. The Deepseek result complements what I've heard from people at the AI labs -- this new RL paradigm is no longer hard. It doesn't rely on some hard to replicate breakthrough like the transformer. It doesn't require some proprietary data mix like GPT-4, which took 2 years to replicate. It's an optimization function, one that requires a few thousand examples. The iteration cycles here are extremely fast. Deepseek replicated o1 in a couple months. OpenAI finetuned o3 for deep research in a couple weeks. All the big labs will have their o3-level models soon and their tool using agents soon after. And the open source versions will follow. Don't big labs have a massive compute advantage? Yes, because of the logarithmic test-time compute scaling law for RL + LLMs, you need exponentially more compute for linear gains in quality. The big companies will therefore own the frontier models. But Deepseek showed that startups and individuals will also have very good models of their own. These can be trained on proprietary data mixes to make them better than the frontier models for many tasks. There will be a powerful open source ecosystem of RL data, resources, and tools. And when the cost of serving goes down to basically the cost of the underlying GPUs, you won't need to run their o5 on their compute when you can run your personalized r5 on your own compute. Additionally we've seen that startups and individuals benefit from the race to the bottom that the big players play with their APIs, even from their frontier models. If RL + LLMs levels the playing field even more among the big labs, this probably gets more true. There is a wide distribution of tasks at all positions on the latency/intelligence/skill specialization/privacy graph, and no player will satisfy them all. You don't need a Terrence Tao o7 model to do your taxes. Trillions of dollars of new value is going to be created. There will be, and already are, a new breed of AI-first companies whose advantage come from streamlined integrations, prolific partnerships, magical product sense, shipping speed, access to unique data, connections to the physical world, viral marketing, building where big companies won't or can't. The world will overflow with models of all different types and sizes. Compute will power all pockets of the economy. This is the coolest time to be alive and to be building. Hectic and dangerous for our species for sure, but I'm optimistic. We'll get through it well if we act sensibly ( a big assumption yes). On the other side is abundance. (btw if you’re worried about lack of meaning in a world of abundance, don’t worry there will be plenty of scarcity — someone is gonna have more compute than you and you're gonna want it.) I wrote a post a couple weeks ago that predicted that at minimum by end of 2025 we’ll have phd level agents navigating the web doing complex tasks. Some called it hype. With Operator and Deep Research coming out some days later, we seem more than on track. These types of systems aren't accelerating people's work yet, but that's because they're bottlenecked on simple features that will come soon -- better integrations, longer context windows, connections to lots of data sources, and more training examples. We're only at the beginning. The past two weeks in AI were wild, and they point to many more wild weeks to come.

English
0
0
2
76
(☀️_☀️) retweetledi
François Fleuret
François Fleuret@francoisfleuret·
There is no going back. The moment information took over matter, there was no going back. Do you hear that sustained very low pitch note at the beginning of Strauss' "Also sprach Zarathustra"? DO YOU HEAR IT?
English
17
18
388
66K
(☀️_☀️)
(☀️_☀️)@ontofractal·
@ChShersh yes, both immediate and long-term yes, absolutely, though it’s actually not doing nothing
English
0
0
0
27
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
Serious question to people who meditate. Did you notice any improvements? Is it really worth it to spend 15 minutes a day just sitting and doing nothing?
English
118
0
196
34.6K
(☀️_☀️)
(☀️_☀️)@ontofractal·
Shocked at the poor quality of some (most?) of the top iOS heart health apps. Bogus but scientific sounding (totally fake) metrics, capital chart crimes, GIGO dashboards, noisy alerts. Hard to believe how bad it all is.
English
0
0
0
39
(☀️_☀️) retweetledi
Chris J. Maddison
Chris J. Maddison@cjmaddison·
What if the next medical breakthrough is hidden in plain text? Causal estimates drives progress but data is limited & RCTs slow. Introducing NATURAL: a pipeline for causal estimation from text data in hours, not years. Paper: tinyurl.com/ppr29 Site: tinyurl.com/web98
Chris J. Maddison tweet media
English
23
111
582
184.4K
Thomas Millar
Thomas Millar@thmsmlr·
Any Elixir folks ETL their stripe data? If so, how do you do it? I'm looking for something that can export all stripe data to a CSV daily
English
3
1
6
4K
(☀️_☀️) retweetledi
François Chollet
François Chollet@fchollet·
My interpretation of prompt engineering is this: 1. A LLM is a repository of many (millions) of vector programs mined from human-generated data, learned implicitly as a by-product of language compression. A "vector program" is just a very non-linear function that maps part of the latent space unto itself. 2. When you're prompting, you're fetching one of these programs and running it on an input -- part of your prompt serves as a kind of "program key" (as in database key) and part serves as program argument(s). Like, in "write this paragraph in the style of Shakespeare: {my paragraph}", the part "write this paragraph in the stye of X: Y" is a program key, with arguments X=Shakespeare and Y={my paragraph}. 3. The program fetched by your key may or may not work well for the task at hand. There's no reason why it should be optimal. There are lots of related programs to choose from. 4. Prompt engineering represents a search over many keys in order a find a program that is empirically more accurate for what you're trying to do. It's no different than trying different keywords when searching for a Python library. 5. Everything else is unnecessary anthropomorphism on the part of the prompter. You're not talking to a human who understands language the way you do. Stop pretending you are.
English
116
507
2.9K
710.8K
(☀️_☀️) retweetledi
Chris McCord
Chris McCord@chris_mccord·
👀 Llama2-13b running on GPU with Elixir/Bumblee/Phoenix
English
11
45
227
51.7K
(☀️_☀️) retweetledi
Cliff Pickover
Cliff Pickover@pickover·
Infinity, fractals, geometry, the universe. Yann writes: "An early concept for the 2023 Revision Party collab with @etiennejcb" By Yann Le Gall, @Yann_LeGall, Used with permission.
English
4
75
317
35.1K
(☀️_☀️) retweetledi
Cliff Pickover
Cliff Pickover@pickover·
If there is an infinite multiverse, then there is almost surely at least one universe in which you have RETWEETED this, SMILED, and had an extremely WONDERFUL day.
Cliff Pickover tweet media
English
6
28
121
12.5K
(☀️_☀️) retweetledi
François Chollet
François Chollet@fchollet·
You maximize your learning speed by minimizing your assumptions about how much you already know.
English
18
98
603
74.6K
(☀️_☀️)
(☀️_☀️)@ontofractal·
@josevalim The incremental value you’ll create by your outreach to individual developers and Elixir community in general is massively higher than negative externalities of twitter blue. Go for it even if it feels a bit ugh.
English
0
0
3
503