Goodfire

475 posts

Goodfire

@GoodfireAI

Using interpretability to understand, learn from, and design AI.

San Francisco · Joined August 2024
28 Following · 14.1K Followers
Pinned Tweet
Goodfire @GoodfireAI
Introducing Silico: the platform for building AI models with the precision of written software. Silico lets researchers and engineers see inside their models, debug failures, and intentionally design them from the ground up. Early access is open now. 🧵(1/10)
20 replies · 112 reposts · 847 likes · 103.4K views
Goodfire @GoodfireAI
Takeaway for eval design: treat verbalized eval awareness as a signal that the model doesn’t find an interaction genuine, inspect reasoning across rollouts to find why, and fix what looks artificial. More realistic evals are within reach! Full post: goodfire.ai/research/verba… (7/7)
0 replies · 1 repost · 12 likes · 1.8K views
Goodfire @GoodfireAI
What about internals? We show steering vectors that reduce verbalized eval awareness — including ones in recent system cards — may do so by changing how the model represents user intent. So the eval may measure the model under a different intent than it was meant to test. (6/7)
2 replies · 0 reposts · 12 likes · 613 views
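The mechanism this tweet describes can be sketched in a few lines. A minimal numpy toy with made-up unit directions for "eval awareness" and "user intent" (nothing here is extracted from a real model): because the two directions are correlated, steering against one necessarily shifts the activation's projection onto the other.

```python
import numpy as np

# Toy illustration with invented directions; not from any real model.
rng = np.random.default_rng(0)
d = 64  # hidden size of the toy residual stream

eval_dir = rng.normal(size=d)
eval_dir /= np.linalg.norm(eval_dir)          # "eval awareness" direction

orth = rng.normal(size=d)
orth -= (orth @ eval_dir) * eval_dir          # component orthogonal to eval_dir
orth /= np.linalg.norm(orth)
intent_dir = 0.6 * eval_dir + 0.8 * orth      # unit "user intent" direction,
                                              # correlated with eval_dir (cos = 0.6)

h = rng.normal(size=d)                        # one hidden activation
alpha = 4.0
h_steered = h - alpha * eval_dir              # steer away from eval awareness

delta = (h_steered - h) @ intent_dir          # change in the intent projection
# delta = -alpha * (eval_dir @ intent_dir) = -4.0 * 0.6 = -2.4:
# steering against eval awareness also moved the represented intent.
```

The shift is unavoidable whenever the two directions are not orthogonal, which is the concern the tweet raises about intent-correlated steering vectors.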
Goodfire @GoodfireAI
New research from @AISecurityInst and Goodfire: Models sometimes recognize they're being evaluated, occasionally even identifying the benchmark. We show this verbalized eval awareness inflates safety scores, meaning safety benchmarks may not reflect real-world behavior. (1/7)
[image]
7 replies · 20 reposts · 142 likes · 13.4K views
Goodfire retweeted
Tom McGrath @banburismus_
if you're wondering what sort of thing you can do with Silico, this is a great example!
Bo Wang@BoWang87

Love seeing Silico (@GoodfireAI) used to probe our EchoJEPA's representations! This is exactly the kind of interpretability work that's been missing for JEPA-style models. …

0 replies · 6 reposts · 42 likes · 4K views
Goodfire retweeted
Bo Wang @BoWang87
Love seeing Silico (@GoodfireAI) used to probe our EchoJEPA's representations! This is exactly the kind of interpretability work that's been missing for JEPA-style models.

One thing that makes EchoJEPA particularly interesting to interpret: unlike MAE-based approaches, it never reconstructs pixels. The model learns entirely in latent space through masked prediction, so you can't just look at decoder outputs to understand what it captured. Attribution onto a temporally aligned 3D mesh is a much more honest probe of what the representations actually encode.

What we found in building EchoJEPA: training on 18M echo videos across 300K patients, the model learns to disentangle cardiac anatomy from ultrasound noise (speckle, reverberation artifacts) almost entirely through self-supervision. With 1% labeled data it already outperforms supervised baselines trained on 100%. The latent space is doing real anatomical work, but until you can visualize it like this, "real anatomical work" is mostly a claim.

Paper + code: arxiv.org/abs/2602.02603 | github.com/bowang-lab/Ech…
7 replies · 45 reposts · 281 likes · 26.7K views
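The "loss lives in latent space" point above can be made concrete. A heavily simplified numpy sketch of a JEPA-style objective, using random linear maps as stand-in encoders and an identity predictor (none of this is the real EchoJEPA architecture): the targets are the target encoder's latents at masked positions, and the loss is an MSE on those latents, never on pixels.

```python
import numpy as np

# Toy JEPA-style objective. Random linear "encoders" and an identity
# "predictor" are stand-ins, not the real EchoJEPA components.
rng = np.random.default_rng(0)
n_patches, d_in, d_lat = 16, 32, 8

ctx_enc = rng.normal(size=(d_in, d_lat)) / np.sqrt(d_in)  # context encoder
tgt_enc = ctx_enc.copy()     # target encoder (an EMA copy in real JEPA training)
predictor = np.eye(d_lat)    # predictor head (learned in practice)

x = rng.normal(size=(n_patches, d_in))   # patch features from one frame
mask = np.zeros(n_patches, dtype=bool)
mask[::4] = True                         # hide every 4th patch

targets = x[mask] @ tgt_enc              # latents to predict (treated as frozen)
preds = (x[mask] @ ctx_enc) @ predictor  # predicted latents for masked patches

loss = np.mean((preds - targets) ** 2)   # MSE in latent space, not pixel space
# With identical encoders and an identity predictor the loss is ~0 here;
# training drives the real predictor toward exactly this condition.
```

Because no decoder ever maps back to pixels, probing what the latents encode requires attribution methods like the mesh visualization described above.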
Goodfire retweeted
Sauers @Sauers_
Goodfire's model beats the standard method (CADD, used in clinical genetics) on a type of variant (small insertions/deletions) that it never saw in training (single-letter variants only), but that CADD did!
[image]
Goodfire@GoodfireAI

We achieved state-of-the-art performance in predicting which of 4.2 million genetic variants cause disease by interpreting a genomics model, in a new preprint with @MayoClinic. We're now releasing an open-source database covering all variants in the NIH's ClinVar database. 🧵(1/8)

0 replies · 3 reposts · 24 likes · 2.4K views
Goodfire @GoodfireAI
@pranavxviswa Silico lets you shape model behavior in many ways, including steering vectors, but the biggest successes are generally in shaping the training process itself!
0 replies · 0 reposts · 3 likes · 804 views
Pranav Viswanath @pranavxviswa
@GoodfireAI Congrats on the launch, super cool product! To shape model behavior does it use steering vectors based on the desired behavior, and how do you ensure it doesn’t degrade the rest of model behavior?
1 reply · 0 reposts · 2 likes · 940 views
Goodfire retweeted
Yan-David (Yanda) Erlich
What if you could get the power of AI with the precision engineering of “traditional” software? If that feels like having your cake and eating it too, then @GoodfireAI is serving up infinite cake ♾️🎂.
Goodfire@GoodfireAI

Introducing Silico: the platform for building AI models with the precision of written software. …

1 reply · 2 reposts · 19 likes · 2.8K views
Goodfire @GoodfireAI
@_virgil19 We use a broad set of tools, far more than SAEs! It's true that standard SAE features can be inconsistent across runs (though check out Archetypal SAEs). Silico is equipped with many different tools and knows how to use them with the appropriate nuance.
1 reply · 0 reposts · 7 likes · 944 views
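One way to make the run-to-run stability question quantitative is to cosine-match the decoder dictionaries of two independently trained SAEs. A numpy sketch on synthetic dictionaries (the dictionaries, sizes, and half-overlap setup are invented for illustration):

```python
import numpy as np

# Synthetic stand-in for two SAE decoder dictionaries: run B keeps half of
# run A's feature directions and replaces the rest with fresh random ones,
# simulating partial run-to-run consistency.
rng = np.random.default_rng(0)
d_model, n_feat = 32, 64

def unit_rows(m):
    """Normalize each row to unit length (feature directions)."""
    return m / np.linalg.norm(m, axis=1, keepdims=True)

run_a = unit_rows(rng.normal(size=(n_feat, d_model)))
run_b = run_a.copy()
run_b[n_feat // 2:] = unit_rows(rng.normal(size=(n_feat // 2, d_model)))

# For each run-A feature, cosine similarity to its best-matching run-B feature.
sims = run_a @ run_b.T
best = sims.max(axis=1)
consistency = best.mean()
# Shared features match at cosine ~1.0; the replaced half matches only by
# chance, dragging the mean well below 1 -- an "inconsistency" signature.
```

A low mean best-match cosine is evidence that features are artifacts of a particular training run rather than stable properties of the model, which is exactly the worry raised in the question below.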
Virgil Maro @_virgil19
@GoodfireAI The bit I keep wondering about with tools like Silico: when you find a feature, is it stable across different encoding choices, or an artifact of the SAE you trained? Engrams hit the same wall: what you tag at encoding determines what cell-set you can re-fire.
2 replies · 0 reposts · 1 like · 1.2K views
Goodfire @GoodfireAI
@subminima Yes! Model health checks include an entire set of tests that study signal propagation (forward and backward) through the model
0 replies · 0 reposts · 2 likes · 47 views
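The kind of check described above can be sketched as: push a signal forward and a gradient backward through the network and flag layers whose norm growth exceeds a threshold. A numpy toy with a deliberately oversized init so the check fires; the deep linear net, init scale, and 1.2 threshold are illustrative choices, not Silico's actual tests.

```python
import numpy as np

# Toy signal-propagation health check on a deep linear net whose init scale
# is too large, so signals and gradients both explode.
rng = np.random.default_rng(0)
depth, width = 12, 64

scale = 1.5 / np.sqrt(width)   # too large; ~1/sqrt(width) would be stable
layers = [scale * rng.normal(size=(width, width)) for _ in range(depth)]

def forward_norms(x):
    """Activation norm after each layer on the forward pass."""
    norms = []
    for w in layers:
        x = x @ w
        norms.append(np.linalg.norm(x))
    return norms

def backward_norms(g):
    """Gradient norm after each layer on the backward pass (through W^T)."""
    norms = []
    for w in reversed(layers):
        g = g @ w.T
        norms.append(np.linalg.norm(g))
    return norms

fwd = forward_norms(rng.normal(size=width))
bwd = backward_norms(rng.normal(size=width))

# Flag layer-to-layer growth above a threshold -- with this init, both the
# forward signal and the backward gradient grow roughly 1.5x per layer.
ratios = np.array(fwd[1:]) / np.array(fwd[:-1])
exploding_layers = np.flatnonzero(ratios > 1.2)
```

Running the backward pass as well as the forward pass is what distinguishes a full propagation check from simply watching activations: vanishing or exploding gradients can appear even when forward norms look healthy.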
min @subminima
@GoodfireAI can I view exploding gradients with it?
1 reply · 0 reposts · 4 likes · 813 views