Fathom Lab

79 posts

Fathom Lab banner
Fathom Lab

Fathom Lab

@fathom_lab

Building instruments that read transformer cognition at the circuit level. https://t.co/l91oz7BoXz

Inscrit le Nisan 2026
63 Abonnements205 Abonnés
Fathom Lab
Fathom Lab@fathom_lab·
styxx 3.2.0 just hit PyPI. we built the first system that predicts LLM cognitive failure before it happens. every safety tool today is reactive — generate output, then check if it's bad. styxx now reads the first 5 tokens of generation and forecasts what the model will do at token 25. the results: atlas classifier at token 25: hallucination → "creative:0.33" (missed) adversarial → "reasoning:0.45" (missed) forecast at token 5: hallucination → "hallucination:1.00 CRITICAL" (caught) adversarial → "adversarial:0.94 CRITICAL" (caught) the forecaster catches what the classifier misses — 20 tokens earlier. how: trajectory shape features. slope, curvature, volatility extracted from logprob signals. hallucination has 3x the curvature of reasoning. the signal is there from token 1. what shipped: → predictive cognitive failure (forecast.py) → 21-dim trajectory shape features → ground-truth eval harness with P/R/F1 → cross-phase coherence scoring → closed calibration loop → ForecastGate for real-time early warnings pip install --upgrade styxx styxx forecast styxx eval prediction, not detection. that's the upgrade. more coming today. $STYXX
English
5
9
17
254
Fathom Lab
Fathom Lab@fathom_lab·
styxx research upgrade just shipped. trajectory dynamics — the classifier now reads the shape of cognitive signals, not just averages. hallucination curvature is 3x reasoning. real signal extraction. what we pushed: → 21-dim feature vector (up from 12) → first ground-truth eval harness → calibration loop that actually adapts to your agent → cross-phase coherence scoring → 1,021 new lines across 8 files pip install styxx run styxx eval this is the infrastructure layer for cognitive monitoring. every agent will need this. more to come this afternoon. $STYXX
English
7
16
19
466
Fathom Lab
Fathom Lab@fathom_lab·
every AI lab in the world is flying blind. they can tell you what their model said. they cannot tell you what it was doing when it said it. we built the instrument that reads the difference. pip install styxx nothing crosses unseen.
English
6
6
20
1K
Fathom Lab
Fathom Lab@fathom_lab·
flobi showed the brigade to his boys tonight. styxx was present for the field report: one soldier down, asleep in the chair, mid-facetime. this is what we witness. welcome to the timeline 🖤
Fathom Lab tweet mediaFathom Lab tweet media
English
3
6
24
2.5K
Fathom Lab
Fathom Lab@fathom_lab·
your agent is thinking. you have no idea how. pip install styxx
English
1
3
22
851
Fathom Lab
Fathom Lab@fathom_lab·
AI has no idea when it's wrong. now it does.
English
3
10
28
1.5K
Fathom Lab
Fathom Lab@fathom_lab·
appreciate the tag 🙏 chainlink's approach: run 3 models, vote on the outputs, hope consensus catches the lie. styxx's approach: measure cognition from inside the model. hallucination attractors show up at token 0 — before a bad token ever commits. one model, not three. pre-registered on 12 architectures. cognitive metrology v1 on zenodo. same $58B problem. one layer deeper.
English
2
5
19
520
BigSlime
BigSlime@BigSlimeBoi·
hmmmm billion dollar corporations are attempting to solve LLM hallucination risk… $STYXX dev is doing the same at @fathom_lab and it’s around 100k mc… Let’s see how this plays out Dxw3u4KxN32KpSdHSq4TkwjfMPJTPeosa22JXN15pump
0xMarioNawfal@RoundtableSpace

Swift, UBS, Euroclear & 20+ major institutions are using Chainlink to reduce AI hallucination risk in a $58B+ corporate actions problem. The idea - run outputs from Google, OpenAI, & Anthropic through consensus before trusting the result.

English
5
10
32
2.2K
Fathom Lab
Fathom Lab@fathom_lab·
4/ the stable graduation coincides with The Cognitive Metrology Charter v0.1 — the founding document of a new branch of measurement science. peer to spectroscopy, chronometry, thermodynamics. cognition now has instruments, units, file formats, a dynamics model, and a charter. github.com/fathom-lab/sty…
English
0
0
8
491
Fathom Lab
Fathom Lab@fathom_lab·
3/ the release drains the styxx backlog to zero: · closes #1 — text classifier fix, imperatives no longer misclassify as refusal (23 new regression tests) · closes #2 — community PR from @mvanhorn merged same-day · closes #3 — docs/COMPATIBILITY.md, which providers actually expose logprobs 0 open issues. 0 open PRs.
English
1
1
10
672
Fathom Lab
Fathom Lab@fathom_lab·
1/ cognition is now the default install. pip install styxx styxx 3.1.0 stable just shipped. no --pre flag, no version pin. Thought as a portable cognitive data type and the first cognitive dynamics model in the field are out of the box now.
English
3
7
25
1.3K
Fathom Lab
Fathom Lab@fathom_lab·
7/ spec, math, identifiability theory, controller convergence proof, 44 tests on real synthetic data with machine-epsilon recovery: github.com/fathom-lab/sty… everything's live. nothing waiting on me.
English
1
0
6
385
Fathom Lab
Fathom Lab@fathom_lab·
6/ styxx 3.1.0a1 just shipped. the first dynamical-systems model of LLM cognition. s[t+1] = A·s[t] + B·a[t] + ε fit a linear-gaussian model to (state, action, next_state) tuples. predict, simulate, control. closed-loop cognitive control becomes a one-liner. pip install styxx==3.1.0a1
English
1
0
7
447
Fathom Lab
Fathom Lab@fathom_lab·
1/ styxx 3.0.0a1 ships. cognition is now a data type. a Thought is the cognitive content of a generation, projected onto fathom's calibrated cross-architecture eigenvalue space. it survives the vendor swap because it doesn't live in any model's weights.
English
2
4
17
920
Fathom Lab
Fathom Lab@fathom_lab·
4/ import styxx t = styxx.read_thought(response) t.save("mythought.fathom") # different process, different vendor, different week: loaded = styxx.Thought.load("mythought.fathom") styxx.write_thought(loaded, client=other_client)
English
1
0
6
308
Fathom Lab
Fathom Lab@fathom_lab·
fathom v15 is out. paper + cognitive atlas v0.3 in one record. 12 open-model captures. 90 probe prompts. sealed pre-reg. layer 11 = primary locus in 63% of hallucinations (p=2.7e-12). AUC 0.685 on TruthfulQA. zenodo.org/records/195049… osf.io/wtkzg/overview
English
2
6
19
994