Softmax

46 posts

@softmaxresearch

Softmax's mission is to scale organic alignment. We approach this problem with multi-agent reinforcement learning population-based simulations.

San Francisco, CA · Joined February 2025
31 Following · 1.2K Followers

Softmax @softmaxresearch
Is a monthly cadence right for this? So far, the experiment seems successful. But we are at the very dawn of organizational metadesign. Maybe it should be 4 days and Cooldown Fridays. Or maybe there should be two cooling months per year. We run Softmax as a living experiment.

Softmax @softmaxresearch
During Annealing Week, we aren’t trying to make progress against our goals. Instead, we care about simplifying things. Removing steps. Killing processes. Deleting code. Replacing two features with one. Cutting meetings. Pruning the list of channels. Reducing company complexity.

Softmax @softmaxresearch
It’s Annealing Week at Softmax! Humans are awake for 16 hours learning, cooling for 4 hours in light sleep, and in deep sleep for 4. An organic mental annealing cycle, heating to cooling. At Softmax, we do the same. It’s four weeks sprinting towards goals, one week consolidating.

Softmax @softmaxresearch
Our little Cogs grow up so fast. Cogbert has never seen this exact production chain before, but with only a couple missteps he begins to execute it correctly. Our in-context learner takes its first baby steps!

Softmax @softmaxresearch
1) living wholes are made of living parts unified by shared goals (purpose)
2) the more possible actions and the higher frequency of choosing actions, the more complex the system
3) therefore parts must take on roles from a limited list and change them at a limited frequency
4) if you knew the correct set of roles and you knew the rules to infer which person should be in which role, you’d be done
5) we have a background prior for the roles for a successful corporation based on the evolutionary truth of which corporations survive
6) in order to move faster than trial-and-error it is the job of the hierarchy to make guesses about what variances from the background prior are required, and what local signals must be integrated to choose a role
7) it is the job of employees (parts) to “differentiate”, and select their role and perform in it, based on the rules being propagated out

Softmax @softmaxresearch
We are building organic alignment at Softmax, and not just with reinforcement learning: within our company, we try to apply the same principles to our own work. We are implementing this as an organizational operations system (OrgOS), a prompt library covering our internal processes.

Softmax @softmaxresearch
If you’ve written interactive prompts that help guide the user through making a plan or giving feedback or documenting their thought process, what have you learned doing it? What are the very best active process prompts you’ve made or used, and what made them great?

Softmax @softmaxresearch
Fechner’s Elements of Psychophysics is the latest addition to the Softmax library

Softmax @softmaxresearch
Coming soon: BE NOT AFRAID

Softmax @softmaxresearch
Our CEO, Emmett Shear, appeared on BuzzRobot and shared a bit more about our vision of the future x.com/sopharicks/sta…
Quoted post from Sophia @sopharicks:

The problem that @eshear is working on deeply resonates with me: How to align AI and humans together so both see each other as part of their tribe. This doesn't mean aligning AI to human preferences, which is what AI labs seek to do today, by imposing a system of control on AI. You can't control something that is more powerful than you are. What you can do is align yourself with AI, and AI might align itself with you. Tell good stories to AI and show it your care and kindness. Then there's a higher probability that it will see us as part of its tribe. This is a more holistic approach to alignment than anyone else is talking about right now. AI alignment is one of the most fundamental AI research problems. Knowing that people like Emmett are working on it really gives me hope that maybe we have a chance to get Superintelligence right. The link to the full talk is in the first comment.👇


Softmax reposted
Chris Percy @chris_percy
Wonderful to be invited to the @softmaxresearch research community day yesterday - my lightning talk and unconference session were about artificial minds and the difficulties in getting 'complex' consciousness out of a stepwise algorithm...

Softmax @softmaxresearch
OH at the office: “What’s GitHub? Oh, it’s like Facebook for nerds”

Softmax @softmaxresearch
Our CEO, Emmett Shear, gave a talk on alignment protocols: the engineered ways that parts communicate in order to align their trajectories. youtube.com/watch?v=yBc7Ix…

Softmax @softmaxresearch
tired: inductive bias
wired: forgetting bias
inspired: coherence bias

Softmax @softmaxresearch
Wishing all learning agents a happy Easter

Softmax @softmaxresearch
@onabenchinapark For more depth on the questions you're asking, I'd recommend Sex, Ecology, Spirituality by Ken Wilber.

s @onabenchinapark
Banger. Love the “frame-flexibility” concept! The way the form of this insight was expressed helped click a deeper understanding of Kegan 5 :) Now that’s some original transmission.

Some things that came to mind that I wanted to share, if they resonate:

- If one realizes frame-dependence, how do they then incorporate this view not into their worldview, but into their literal view of the world? How would the introduction of frame-dependence work for the already frame-dependent mind? There was another comment about integrating the views of emptiness etc. into the lived experience of liberation, which I think would make sense as what’s to follow! To go further on Emmett’s point about dharma, I think this would have to include that: the insight is made accessible through explanation and then made real by the specific sequence of words directly changing the reader’s perception!
- It seems to me that for collective agent alignment, maybe some agents would require this “liberated” mind, considering the usual real distribution of sentient things in our world across spacetime? But how would those exist? What is the boundary between sentience and non-sentience? And does this liberation only exist from within the non-liberated state? If so, what would that then imply about individual agent evolution concurrently with the whole? Relatedly, what does that then say about the current state of things and the distribution of “developed” minds? I think as we develop increasingly sophisticated systems while society and living also become more sophisticated, understanding how agents with different frames can align without requiring identical worldviews may become more useful. Because frames would have to be in “non-flux” at some point (I think in the sense of stabilization), is alignment then literal alignment of frames so that the flux thereafter is more resonant? If minds (both natural and artificial) are frame-dependent, does alignment then become a question of creating agents that can recognize and maintain core values across them? What would these values be? This meta-stability across frames might be what distinguishes wisdom from just intelligence.
- And following up on that second question, what is then alignment from this emergent, frame-flexible view? How is alignment to be defined when things are always in flux and cohering/decohering across different scales of space and time? Is there a different space and time across sentience and non-sentience?

Thanks for the writing!

Softmax @softmaxresearch
Frame-dependency: it's not just a good idea, it's the law! Special thanks to Sonnet 3.7 as significant co-author on this work.

Softmax @softmaxresearch
@MelonUsks I think we aim for Nobility more than Heroism but yes, sort of!

Softmax @softmaxresearch
The Softmax Arcana of Choice and Perception
Element: Air
Numerology: 6
Astrological Parallel: Gemini (Duality, contextual truth)
Associated Deck Archetype: The Lovers
(many thanks to ChatGPT, our Tarot guide)