M

21.5K posts

M banner
M

M

@init_malachi

continually learning in a state of delight | ex sr member of technical staff | interested in ai epistemology

synthesis Katılım Mayıs 2022
3.4K Takip Edilen984 Takipçiler
M
M@init_malachi·
@arram posterior value multiplier
Español
0
0
0
5
Arram
Arram@arram·
To be fair I did an absurd amount of work to get access to my emotions so YMMV, but I'm pretty sure this will be incredibly useful for almost anyone.
English
2
0
15
976
Arram
Arram@arram·
Claude is better than the average therapist and 50x cheaper. It's like a sniper for avoided emotions. Unbelievably piercing insights that repeatedly have me in tears. Just surgical precision.
English
26
0
178
15K
M
M@init_malachi·
i should become a wagie
English
0
0
1
15
M retweetledi
Theo - t3.gg
Theo - t3.gg@theo·
1. "open source" should not always mean "100% of our code is public 100% of the time" How much energy have we put into preventing .env leaks in our source control? How many miserable ways have we re-invented env var sharing? How many projects would be open source if they could hide in-flight PRs? How many security fixes are sitting unpublished because they will be exploited as soon as they appear in the tracker? How much better would life be if I could have a monorepo with some sub-packages that are "private" without splitting into multiple repos?
English
7
3
220
12.8K
M
M@init_malachi·
@BetaTomorrow despite this, I see your deep intentionality and poetics, and I really appreciate and resonate with it
English
1
0
0
12
deep Manifold
deep Manifold@BetaTomorrow·
@init_malachi English is not my native language, plus dyslexia, but point taken. Thanks for the feedback.
English
1
0
0
19
deep Manifold
deep Manifold@BetaTomorrow·
This paper is very strong because it focuses on a critical area: how sequence models store and retrieve relational structure in weights and embeddings. Its key contribution is to show that memory is not merely associative lookup; learned embeddings can organize into a geometric structure where unseen multi-hop relations become accessible. In Deep Manifold language, this paper studies the mathematical cover of memory: the learned weight/embedding geometry that defines possible fixed-point basins, relational neighborhoods, and latent intrinsic pathways. The limitation is that this paper mainly examines the static geometry of learned memory, not the dynamical activation process during inference. From the Deep Manifold view, inference is a boundary-conditioned iterated integral: the prompt acts as the boundary condition, and each layer produces a physical cover through activations. At every layer, the mathematical cover constrains what pathways are possible, while the physical cover selects, bends, and transports the current trajectory. Thus geometric memory is not simply stored geometry; it becomes effective only through repeated mathematical-cover / physical-cover interaction across stacked manifold layers.
deep Manifold tweet media
Vaishnavh Nagarajan@_vaishnavh

Updated our paper on the foundations of memory in sequence models (with fresh insights, clearer writing and ablations). Our paper contrasts two distinct ways in which language models memorize and formulates the questions that arise from this. Will be presented at #ICML.

English
1
0
8
474
M
M@init_malachi·
i’m a question designer question engineer
English
0
0
1
49
M retweetledi
𝖦𝗋𝗂𝗆𝖾𝗌 ⏳
@afterearthisnow Idk I think this might be partially one of the greatest purposes of ai It forces us to come to terms with what actually matters and actually choose our values very explicitly
English
6
3
44
4K
M retweetledi
𝖦𝗋𝗂𝗆𝖾𝗌 ⏳
Landian philosophy has reigned for so long because he's actually just one of the best living poets. Now that the pope is an equally good poet suddenly the cold god feels less inevitable. The power of art is very evident in this battle We probably shouldn't have defunded so much art and told everyone it's a waste to study humanities
Blue Daddy's Girl 🏴󠁧󠁢󠁳󠁣󠁴󠁿 🇪🇺 -🚫AI@BlueDaddysGirl

I'm reading the Pope's encyclical, and it's banger after banger. New insults for AI chuds: Architects of Babel Lords of towers destined for ruin

English
75
62
778
64.4K
M retweetledi
Rafael Garcia
Rafael Garcia@rfgarcia·
the trick that makes @usekernel fast: we pay the full 5-10s of starting chromium once, snapshot ram to nvme, and resume in ~30ms when a request lands. from the outside it looks like a giant warm pool. under the hood it's mostly cold disk. wrote up how we got here.
KERNEL@usekernel

x.com/i/article/2059…

English
7
8
79
11K
M retweetledi
John Carmack
John Carmack@ID_AA_Carmack·
I have been very impressed by @SemiAnalysis_ . I think of myself as a wide ranging systems engineer, looking for value at every level from the chip specs to the user interface, but SA exposes me to additional levels of "the system", both above (datacenters) and below (semiconductor fabrication). It probably puts me in "just knows enough to be dangerous" territory. Neat things I learned today: Some of the 800VDC datacenter design choices leverage parts commoditized by electric vehicles. There is now a SiC MOSFET that can operate on 10kV electricity, opening up the possibility of working directly with medium (ha!) voltage AC power transmission lines without stepping down.
English
58
84
2.1K
211.9K
M retweetledi
Serena Ge (Datacurve)
Serena Ge (Datacurve)@serenaa_ge·
To ensure our grading is fair and reliable, we built a trajectory analysis agent to replay agent rollouts and map out exactly why they fail. Running it on existing benchmarks surfaced significant grading noise, with verifiers rejecting valid code or letting models read solutions straight from git history.
Serena Ge (Datacurve) tweet media
English
8
5
198
25.6K
M
M@init_malachi·
@alokbishoyi97 i like it and i think it’s worth pursuing in kind in other areas
English
0
0
1
29
Alok Bishoyi
Alok Bishoyi@alokbishoyi97·
@init_malachi expand a bit ? what do you think fails - or issues with the DAG approach mentioned?
English
1
0
0
69
M retweetledi
Thumos Bear
Thumos Bear@Bear_the_AI_guy·
AI is NOT automated Logos, at least not yet. AI is automated Thumos. AI mechanistically relies through and through on mimetic imitation rather than a rational state machine solver. Part of the strangeness of its "learning" mechanism is it can approximate a logocentric style state machine after enough imitative repetitions. This understandably confuses many people. Before LLMs every person's experience with computer software was an encounter with hardcoded logos. Algorithms and state machines function at bottom by the programmer naming states and then gradually converting between states in the pattern until he can name the Tao/nature of his problem. AI neural nets solve problems by directly imitating the nature of the answer, letting its universal function infer the intermediate logos states.
English
1
1
9
431
M retweetledi
Tom Sydney Kerckhove
Tom Sydney Kerckhove@kerckhove_ts·
There's a rule of thumb in Factorio: "avoid item/fluid buffers" which you only tend to learn after quite a while. (Buffers don't really save you, but do prolong the time before you find a broken process) I think about this a lot in the software industry.
English
55
61
1.9K
98.8K
M retweetledi
gabe
gabe@allgarbled·
I made a /seppuku skill for my Claudes for when they make an unforgivable mistake, and now they use it spontaneously without me asking.
English
95
259
7.9K
439.1K
M
M@init_malachi·
@BetaTomorrow shadow is not as legible as light
English
1
0
0
12
deep Manifold
deep Manifold@BetaTomorrow·
There is no pure “noise” per se. What the paper calls noise may simply be data the evaluator does not like, rare data, conflicting data, minority data, or data that does not fit the current measurement frame. From the Deep Manifold view, this is better understood as perturbation rather than noise. A neural network learns in a stochastic world, and stochasticity is not merely error; it is part of the inequality structure through which the model forms stochastic fixed points. Deep Manifold frames neural stochasticity as sum-based group statistics and boundary-shaped variability, not as a single bad disturbance. A small amount of such “noise” can be powerful because it perturbs the fixed-point trajectory without destroying the manifold. Often, a low ratio, say below roughly 5%, acts like useful boundary diversity: it prevents premature collapse, keeps nearby fixed-point classes reachable, and helps the model discover better convergence directions. The danger is not perturbation itself, but excessive or badly structured perturbation. Small perturbation stabilizes exploration; too much perturbation overwhelms the boundary condition and produces fixed-point drift. Paper: LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws Authors: Xu Ouyang, Deyi Liu, Yuhang Cai, Jing Liu, Yuan Yang, Chen Zheng, Thomas Hartvigsen, Yiyuan Ma. Affiliations: ByteDance Seed; University of Virginia; University of California, Berkeley arxiv.org/abs/2605.23901
deep Manifold tweet media
English
4
26
163
9K
Daniel
Daniel@growing_daniel·
You can tell AI is a net good for society because mark zuckerberg is bad at making it
English
142
368
8.8K
256.3K
M retweetledi
Mario Zechner
Mario Zechner@badlogicgames·
the one thing @mitsuhiko taught me: merged client & server logs. very useful.
Mario Zechner tweet media
English
27
22
1.1K
58.2K
M retweetledi
Ava
Ava@noampomsky·
this girl on tiktok said that “discernment is an olympic sport” and I’ve decided this is pretty much the driving sentiment of my life. it is Not all the same. all relationships are not equal. you Are responsible for your choices. your passivity is just carelessness
English
6
62
943
20.4K