Dos desvíos

104 posts


@dosdesvios

Dilettante with pretensions. All my opinions belong to someone else. Too impatient to be intelligent.

Buenos Aires · Joined August 2024
144 Following · 4 Followers
Cas (Stephen Casper)@StephenLCasper·
@GoodfireAI, I think this hype-milling verges on dishonesty. I believe that this paper has the potential to do a big disservice to its readers, particularly less experienced ones who are newer to interp. Nothing new was accomplished here, and it wasn’t done in a useful way. This project just used interpretability methods as a circuitous way of contriving the rediscovery of predictive features in data sets, like sequence length. This project validated its interpretations about the salience of features by validating them as predictive features within a test set. But if that is what we treat as the ground truth, there’s no point to the use of interp tools. This is not a proof of concept for a repeatable recipe for scientific discovery as the post and thread claim. In order to show that these tools are valuable, you need to show that you can use them to discover something that wouldn’t be trivial to discover just by looking at the datasets. In the past few years, several papers have demoed this kind of thing. But this paper is not one of them. When you limit yourself to a hammer, everything looks like a nail. Especially when you’re also selling that hammer. In 2023, I told the Goodfire founder that I think a venture-capital-backed, for-profit interpretability research startup was the last thing that the epistemics of the interpretability community needs. I think this is still true and that Goodfire is establishing a pattern of grift.
Goodfire@GoodfireAI

We've identified a novel class of biomarkers for Alzheimer's detection - using interpretability - with @PrimaMente. How we did it, and how interpretability can power scientific discovery in the age of digital biology: (1/6)

English
10
2
158
21.4K
Dos desvíos@dosdesvios·
This whole LLM thing has gotten completely out of hand
Spanish
0
0
0
13
Dos desvíos@dosdesvios·
A VERY positive side effect of interpretability research is the amount of good-quality teaching material it has produced
Spanish
0
0
0
12
Dos desvíos@dosdesvios·
@nickhjiang Thx for ur answer! For that purpose, I could use LDA or any other topic modeling technique, couldn't I?
English
1
0
1
79
Nick Jiang@nickhjiang·
@dosdesvios Great question! The advantage of these labels is that you don't need to pre-define them, meaning that you can find insights about your data without any priors.
English
1
0
2
606
Nick Jiang@nickhjiang·
New work! What if we used sparse autoencoders to analyze data, not models—where SAE latents act as a large set of data labels 🏷️? We find that SAEs beat baselines on 4 data analysis tasks and uncover surprising, qualitative insights about models (e.g. Grok-4, OpenAI) from data.
English
13
36
248
75.8K
Dos desvíos@dosdesvios·
Cursor is the pinnacle of civilization.
Spanish
0
0
0
14
Ale@gptcrosa·
How lovely to see the rent law in NYC, it's going to be lovely. It's unreal how much I hate that overrated city.
Spanish
6
7
229
13.3K
Stanford NLP Group@stanfordnlp·
Hi everyone! We're looking forward to the first NLP Seminar of the year! For this week's seminar, we are excited to host Tong Chen (@tomchen0) from University of Washington! If you are interested in attending remotely, please fill out the form below: forms.gle/E1iL719njyG1Nf…
English
1
32
234
29.5K
Dos desvíos@dosdesvios·
@simonw This would explain why they usually don't come up with deep or new relations, the same way an encyclopedia stores a lot of knowledge but isn't able to rearrange it. They lack the big picture
English
0
0
1
76
Dos desvíos@dosdesvios·
The history of NLP can be traced through the notes to the successive editions of this beautiful bible
Spanish
0
0
1
35
Dos desvíos@dosdesvios·
I don't use outdated methods, I do eco-friendly NLP.
Spanish
0
0
0
22
Dos desvíos@dosdesvios·
I find it fascinating that this experiment can be replicated in Spanish by training 50-dimensional vectors on 400 MB of Wikipedia.
Spanish
0
0
0
17
Dos desvíos@dosdesvios·
Franco Moretti is so much better than the average digital humanities researcher because he comes to DH out of necessity rather than as an arbitrary starting point.
Spanish
0
0
0
19
Dos desvíos@dosdesvios·
LLMs "solved" many NLP problems, at an unprecedented energy cost and largely by forcing us to use PRIVATE models! "Classical" methods are cheap, better for the environment, and far more respectful of privacy. And this is not going to change...
Spanish
0
0
0
26
Dos desvíos@dosdesvios·
Many apocalyptic visions of the future of LLMs assume that *someone* will give them the authority to make fundamental decisions. But giving LLMs that authority would be no less absurd than giving it to a dog, or to an algorithm that generates random numbers
Spanish
0
0
0
3
Dos desvíos@dosdesvios·
@yoavgo What is it built on then, in your opinion?
English
0
0
0
165