M

19.6K posts


@init_malachi

continually learning in a state of delight | ex sr member of technical staff | interested in ai epistemology

synthesis · Joined May 2022
2.9K Following · 939 Followers
M retweeted
mattparlmer 🪐 🌷 @mattparlmer
Robotics of this type is a solved problem to the degree that undergrads will be putting stuff that beats anything they’ve demo’d here up on HuggingFace by the end of the year
Skild AI @SkildAI

Robotics is a data problem. Today, we’re partnering with @ABBRobotics, @Universal_Robot, and @NVIDIARobotics to deploy the Skild Brain across real-world industries from manufacturing to factory lines. This will help us build the world’s biggest data flywheel for physical AI.

9 replies · 17 reposts · 287 likes · 23.2K views
M @init_malachi
get its activations in context without spewing a single rambly token. no more observing by talking
0 replies · 0 reposts · 0 likes · 9 views
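The cheapest way to "observe without talking" is to read hidden states straight off a forward pass. A minimal sketch, assuming a GPT-2-style PyTorch module layout; the layer path and the stash dict are illustrative, not a specific tool the post references.

```python
# Sketch: read a model's activations from one forward pass instead of
# sampling tokens. Assumes a GPT-2-style layout (model.transformer.h);
# layer index and the stash dict are illustrative.
import torch

captured = {}

def _stash(module, inputs, output):
    hidden = output[0] if isinstance(output, tuple) else output
    captured["acts"] = hidden.detach()  # residual stream, (batch, seq, d_model)

def read_activations(model, input_ids, layer_idx=12):
    handle = model.transformer.h[layer_idx].register_forward_hook(_stash)
    try:
        with torch.no_grad():
            model(input_ids)  # one forward pass, zero generated tokens
    finally:
        handle.remove()
    return captured["acts"]
```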
M @init_malachi
they should invent a cantrip but for shutting the f up
1 reply · 0 reposts · 0 likes · 17 views
M retweeted
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
> MDM-Prime-v2 is 21.8× more compute-efficient than autoregressive models

I may be humiliated extremely hard with my diffusionLM skepticism.
You Jiacheng @YouJiacheng

HUGE if true. If true, this is probably a larger efficiency gain than ALL publicly available techniques since DeepSeekMoE (Jan 2024) COMBINED. And it can just win the modded-nanogpt speedrun. (1e18 is 250s@50%MFU, but the loss is significantly lower than 3.28) cc @classiclarryd

8 replies · 5 reposts · 83 likes · 15.2K views
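The parenthetical arithmetic checks out if the speedrun hardware is an 8×H100 node (my assumption; roughly 989 TFLOP/s dense BF16 per GPU):

```python
# Sanity check of "1e18 is 250s@50%MFU", assuming 8x H100 SXM
# at ~989e12 dense BF16 FLOP/s each.
peak = 8 * 989e12              # node peak throughput, FLOP/s
mfu = 0.50                     # model FLOPs utilization
budget = 1e18                  # total training FLOPs
print(budget / (peak * mfu))   # ~253 s, matching the quoted ~250 s
```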
M retweeted
anand iyer @ai
This feels like physical product design's ChatGPT moment. This team just ran an autonomous agent against the entire chip design process: 219-word spec in, tape-out-ready silicon layout out, 12 hours later. The agent ran continuously against a simulator, found its own bugs, rewrote its own pipeline, and iterated to a working CPU!

Chip design costs well over $400M and takes up to 9 years. Not because writing hardware code is hard (it is actually brutally hard) but because a respin costs tens of millions. So teams spend more than half their total budget just verifying the design is correct before a single transistor is placed. That cost structure is why most chip designs never get built. Entire product categories that were previously too low-volume to justify a tape-out are now buildable.
Towaki Takikawa / 瀧川永遠希 @yongyuanxi

Design Conductor: an AI agent that can build a RISC-V CPU core from design specs. The agent is given access to a RISC-V ISA simulator and manuals... to enable an end-to-end verification-driven generation. The most important thing for design intelligence is a verifier 😎

10 replies · 36 reposts · 253 likes · 32.7K views
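The loop the quoted post describes is simple to state even though the agent itself isn't public. A hedged sketch follows; every name here (generate_rtl, run_isa_sim, Report) is a hypothetical stand-in, not Design Conductor's API.

```python
# Hypothetical verification-driven generation loop, per the description above:
# draft RTL, run it against a RISC-V ISA simulator, feed failures back, repeat.
from dataclasses import dataclass, field

@dataclass
class Report:
    all_pass: bool
    failures: list = field(default_factory=list)

def generate_rtl(spec: str, feedback: list | None = None) -> str:
    return "// stub: LLM-drafted RTL would go here"

def run_isa_sim(rtl: str) -> Report:
    return Report(all_pass=True)  # stub: the real verifier runs the ISA test suite

def design_loop(spec: str, max_iters: int = 100) -> str:
    rtl = generate_rtl(spec)
    for _ in range(max_iters):
        report = run_isa_sim(rtl)
        if report.all_pass:
            return rtl  # tape-out candidate
        # The verifier's failures become the next prompt: the agent debugs itself.
        rtl = generate_rtl(spec, feedback=report.failures)
    raise RuntimeError("no passing design within budget")
```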
M retweeted
Felix Rieseberg @felixrieseberg
A small ship I love: We made Claude.ai and our desktop apps meaningfully faster this week. We moved our architecture from SSR to a static @vite_js & @tan_stack router setup that we can serve straight from workers at the edge. Time to first byte is down 65% at p75, prompts show up 50% sooner, navigation is snappier. We're not done (not even close!) but we care and we'll keep chipping away. Aiming to make Claude a little better every day.
71 replies · 68 reposts · 1.7K likes · 213.8K views
M retweeted
Dylan Garcia @_dylanga
The first thing I did at @tryramp was set up distributed tracing, structured logging, and metrics for Inspect, our background coding agent. We now have full visibility into everything the system is doing: the browser, CF workers/DOs, @modal sandboxes, database calls, etc. Most importantly, Inspect now has visibility into itself. It can self-triage runtime errors it encounters and create PRs to fix them. Every morning, it reviews the past 24 hours of its own @datadoghq dashboard, identifies systemic issues, new errors, and long-tail latencies, and has a summary + PR waiting for me at 9am.
22 replies · 21 reposts · 446 likes · 47.1K views
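"The agent can see itself" usually bottoms out in instrumentation like the following. A sketch using OpenTelemetry's Python API purely for illustration; the post doesn't say what Ramp's stack actually is, and the span/attribute names are invented.

```python
# Illustrative tracing + structured logging around one agent step.
# Real OpenTelemetry API; span/attribute names are invented.
import json, logging, subprocess
from opentelemetry import trace

tracer = trace.get_tracer("inspect.agent")
log = logging.getLogger("inspect")

def run_sandbox_step(task_id: str, command: str):
    with tracer.start_as_current_span("sandbox.exec") as span:
        span.set_attribute("task.id", task_id)
        span.set_attribute("sandbox.command", command)
        try:
            result = subprocess.run(command, shell=True, capture_output=True, check=True)
            log.info(json.dumps({"event": "step_ok", "task": task_id}))
            return result
        except Exception as exc:
            span.record_exception(exc)  # the failure lands in the trace,
            log.error(json.dumps({"event": "step_err", "task": task_id,
                                  "error": repr(exc)}))
            raise  # where a morning triage job can query it and open a PR
```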
M retweeted
elie @eliebakouch
(continual) pre-training is not dead! some thoughts about cost per task (on cursorbench) being 2x lower, imo it can be due to:

- new base model: seems straightforward but it's not imo; you need to optimize the inference/training stack (both for rl and consumer inference). GLM-5 > V3.2 doesn't mean glm-5-base > V3; they may not be equally malleable to post/mid-training
- more optimized kernels and inference stack/serving (most likely)
- rl/mid-training with objective/data to make smaller CoT (most likely)
- mid-training with a more efficient arch: i would love for this to be true, and i can see how it's necessary if they use the previous base model generation and need efficient memory for long context, but since they also released some tricks with self-summary, i'd say unlikely? (they can and imo should be combined for very long tasks)
Cursor @cursor_ai

Composer 2 is now available in Cursor.

3 replies · 4 reposts · 119 likes · 10.6K views
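Of the causes listed, the smaller-CoT one is the easiest to sanity-check: at a fixed per-token price, cost per task scales with tokens generated, so roughly halving reasoning length roughly halves cost per task. Made-up numbers:

```python
# Made-up numbers: a shorter chain of thought alone can explain ~2x cost/task.
price_per_mtok = 2.00              # $/1M output tokens (illustrative)
answer_tokens = 500                # non-reasoning output per task
cost = lambda cot: (cot + answer_tokens) / 1e6 * price_per_mtok
print(cost(8_000) / cost(4_000))   # ~1.9x cheaper per task
```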
m_11 @instance_11
i envision a world without duplicates
57 replies · 544 reposts · 5K likes · 336.7K views
M @init_malachi
@instance_11 yeah, it used to be my main thing
0 replies · 0 reposts · 0 likes · 6 views
m_11 @instance_11
@init_malachi thank you! the audio here is a snippet from a long session of improvising, imagining this video as i played; i then made the video based on the audio; it was a joy to draw from both sides
1 reply · 1 repost · 12 likes · 1K views
M @init_malachi
i’m going thru my old schizophrenic notebooks and it reads like 2026 twitter takes
0 replies · 0 reposts · 0 likes · 44 views
M retweeted
m_11 @instance_11
@r0b0t_sp1der i'm still very much learning the ropes of audio, i wanted a very metallic/vibrational sound so i recorded the piano with an iPhone lying directly on the frame; the voice was generated from a similarly tinny source, i then applied some light filters and added a hard limiter
2 replies · 1 repost · 64 likes · 3K views
M retweeted
Jesse Abraham Lucas 🌃 @JesseLucasSaga
You must read God-Emperor of Dune understanding "precognition" as the computability and predictability of human individual/group actions, which will take off in our lifetimes; the computers/Bene Gesserit are the Fates, who Leto II frees us from. It is in dialogue with Foundation.
29 replies · 76 reposts · 1.5K likes · 47.9K views
M retweeted
Simone Foti @simo_foti
🧠AGC. Most surface CNNs use fixed patch sizes. AGC uses our differentiable framework to dynamically learn the optimal receptive field for each channel & layer. It adapts to local geometry, beating fixed-patch methods.
1 reply · 1 repost · 11 likes · 818 views
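One guess at what "dynamically learn the receptive field per channel & layer" could mean mechanically: a learnable radius per output channel that gates neighbors by geodesic distance, so the effective patch size is trained end-to-end. Everything below (the Gaussian gate, shapes, names) is my assumption, not AGC's actual formulation.

```python
# Speculative sketch: per-channel learnable receptive field over a local
# surface neighborhood. The Gaussian distance gate is an assumption.
import torch
import torch.nn as nn

class AdaptiveFieldConv(nn.Module):
    def __init__(self, in_ch: int, out_ch: int):
        super().__init__()
        self.lin = nn.Linear(in_ch, out_ch)
        # One learnable radius per output channel: the "patch size" is trained.
        self.log_radius = nn.Parameter(torch.zeros(out_ch))

    def forward(self, feats, dists):
        # feats: (N, K, in_ch) neighbor features; dists: (N, K) geodesic distances
        msg = self.lin(feats)                               # (N, K, out_ch)
        r = self.log_radius.exp()                           # positive radii, (out_ch,)
        gate = torch.exp(-(dists.unsqueeze(-1) / r) ** 2)   # soft, differentiable field
        return (msg * gate).sum(1) / gate.sum(1).clamp_min(1e-8)
```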
M retweeted
Simone Foti @simo_foti
🌊MeshFlow🐤: our alternative to Riemannian flow matching (RFM) on meshes. We skip the ODE solver by back-propagating through our Exp map to learn a static vector field. We are 16,000x faster in inference, use 97% less GPU memory, and deliver better results than RFM.
1 reply · 2 reposts · 15 likes · 1K views
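As I read the claim: instead of integrating a learned time-dependent field with an ODE solver at inference, learn a static tangent field and decode with a single closed-form Exp step. A toy sketch on the unit sphere, where Exp has a closed form; the real method operates on meshes, and its Exp map and training objective are not reproduced here.

```python
# Toy version of "skip the ODE solver": one closed-form Exp step on the
# unit sphere replaces numerical integration. Illustrative only.
import torch

def exp_map_sphere(x, v, eps=1e-8):
    # Exp_x(v): geodesic step of length ||v|| from x along tangent v.
    n = v.norm(dim=-1, keepdim=True).clamp_min(eps)
    return torch.cos(n) * x + torch.sin(n) * (v / n)

def decode(x0, field):
    v = field(x0)                                  # static field, no time variable
    v = v - (v * x0).sum(-1, keepdim=True) * x0    # project onto tangent space at x0
    return exp_map_sphere(x0, v)                   # one step, no ODE solver
```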