Pantelis Vafidis

117 posts

@vafidisp

AI Research Scientist @Meta GenAI. @Caltech PhD. Ex-@Instacart, @CSHL, @bccn_berlin, ECE @Auth_University. Used to be brains, now it’s LLMs

Pasadena, CA · Joined August 2020
330 Following · 349 Followers
Pinned Tweet
Pantelis Vafidis@vafidisp·
Excited to share our work on disentangled/abstract representations, to appear at #ICLR2025 (@iclr_conf)! We mathematically prove and experimentally demonstrate that multi-task learning leads to disentangled representations, and propose a unifying mechanism for generalization in brains and machines: parallel processing (🧵+paper below) Our work connects to the Platonic representation hypothesis, suggests why alignment across models/organisms can occur, and shows why transformers excel at constructing world models 🤖🚀
GIF
1
6
27
14.4K
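To make the multi-task claim concrete, here is a minimal sketch (illustrative only, not the paper's code; the dimensions, tasks, and the linear R² probe are my assumptions): a shared encoder is trained through several parallel linear task readouts, and a crude probe then checks how linearly recoverable the ground-truth factors are from the learned bottleneck.

```python
# Minimal sketch (assumed setup, not the paper's code): multi-task training with
# parallel linear readouts from a shared low-dimensional bottleneck.
import torch
import torch.nn as nn

torch.manual_seed(0)
n, n_tasks = 4096, 6
z = torch.rand(n, 2) * 2 - 1                            # ground-truth latent factors in [-1, 1]
x = z @ torch.randn(2, 50) + 0.1 * torch.randn(n, 50)   # entangled high-dimensional observations
task_dirs = torch.randn(n_tasks, 2)
y = (z @ task_dirs.T > 0).float()                       # each task: a random linear boundary in latent space

encoder = nn.Sequential(nn.Linear(50, 32), nn.ReLU(), nn.Linear(32, 2))
heads = nn.Linear(2, n_tasks)                           # parallel task readouts from the shared bottleneck
opt = torch.optim.Adam(list(encoder.parameters()) + list(heads.parameters()), lr=1e-2)

for _ in range(500):                                    # full-batch training on all tasks at once
    opt.zero_grad()
    loss = nn.functional.binary_cross_entropy_with_logits(heads(encoder(x)), y)
    loss.backward()
    opt.step()

# Crude disentanglement probe: R^2 of linearly recovering the true factors from the
# bottleneck (values near 1 suggest an abstract/disentangled code).
h = encoder(x).detach()
h1 = torch.cat([h, torch.ones(n, 1)], dim=1)            # add a bias column for the probe
beta = torch.linalg.lstsq(h1, z).solution
r2 = 1 - ((h1 @ beta - z).var(dim=0) / z.var(dim=0))
print("per-factor R^2:", r2)
```

The probe here is just ordinary least squares on the bottleneck activity; the paper's actual metrics and architectures differ.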
Pantelis Vafidis@vafidisp·
@YinChaoqun @KordingLab Agreed. But this blogpost shows that the criterion used to detect “line attractors” is extremely lax: even random dynamical systems pass it 50% of the time, and get classified as “approximate line attractors”, which is clearly wrong.
0
0
1
53
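To illustrate the laxness point, here is a toy proxy (an assumption on my part, not the blog post's actual test): sample random stable linear dynamical systems and label one an "approximate line attractor" whenever its slowest mode decays much more slowly than the next slowest. The threshold and the criterion itself are illustrative only.

```python
# Toy proxy (assumed criterion, not the blog post's analysis): how often do random
# stable linear systems get labeled "approximate line attractor" under a simple
# slow-mode test?
import numpy as np

rng = np.random.default_rng(0)

def is_approx_line_attractor(A, ratio=5.0):
    # time constants of x_{t+1} = A x_t: tau = -1 / log|eigenvalue|
    lam = np.sort(np.abs(np.linalg.eigvals(A)))[::-1]
    lam = np.clip(lam, 1e-6, 1 - 1e-6)             # keep the taus finite in this toy check
    tau = -1.0 / np.log(lam)
    return tau[0] / tau[1] > ratio                 # slowest mode much slower than the rest?

n_sys, dim, passed = 1000, 6, 0
for _ in range(n_sys):
    A = rng.normal(scale=1.0 / np.sqrt(dim), size=(dim, dim))    # random dynamics matrix
    A *= 0.95 / max(np.abs(np.linalg.eigvals(A)).max(), 1e-9)    # rescale to be stable
    passed += is_approx_line_attractor(A)

print(f"fraction labeled 'approximate line attractor': {passed / n_sys:.2f}")
```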
Chaoqun Yin@YinChaoqun·
@vafidisp @KordingLab I agree that the model somehow appears to fit everything. But I think it results from the very low-dimensional neural data itself. Its simple dynamics (mostly slow ramping) limit the model's complexity.
1
0
0
44
Kording Lab 🦖@KordingLab·
Attractors are usually not mechanisms.
Kording Lab 🦖 tweet media
8
21
132
14.6K
Pantelis Vafidis@vafidisp·
@rudzinskimaciej Yes, I could certainly see inhibition helping with decorrelation. Thanks for pointing it out. Not sure about the frequencies part though. Any references?
1
0
1
13
Rudzinski Maciej@rudzinskimaciej·
Columns seem to have a few mechanisms that keep their representations different as a design choice, both through inhibitory neurons and frequency organization. I'm pointing this out as it's a testable architectural choice that maps exactly to what you suggested. Random initial weights wouldn't give different enough results: we already know that NNs with random initialization but trained on the same data usually form similar representations.
1
0
1
24
Pantelis Vafidis@vafidisp·
Yes, great point. I forgot to reference the 1000 Brains Theory here, but we do in the paper. One main difference is that they require dense signals to map the world, while we show that it can be done with sparse signals. With regards to the differentiation of cortical columns, it may simply come about due to the random initial projections (similar to heads in transformers or filters in CNNs).
1
0
1
14
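A minimal sketch of the random-initial-projections point (sizes and the correlation probe are my assumptions, not from the paper): several parallel heads, stand-ins for cortical columns or attention heads, receive the same input but start from different random weights, so their unit activities are already largely decorrelated at initialization.

```python
# Toy illustration (assumed sizes): parallel heads with different random initial
# weights represent the same input in largely decorrelated ways.
import numpy as np

rng = np.random.default_rng(1)
x = rng.normal(size=(2000, 64))                    # shared input seen by every head
heads = [rng.normal(scale=1 / np.sqrt(64), size=(64, 16)) for _ in range(4)]
reps = [x @ W for W in heads]                      # each head's representation of the same data

# mean absolute correlation between units of different heads (small => decorrelated)
for i in range(len(reps)):
    for j in range(i + 1, len(reps)):
        c = np.corrcoef(reps[i].T, reps[j].T)[:16, 16:]
        print(f"heads {i},{j}: mean |corr| = {np.abs(c).mean():.2f}")
```

Whether training on the same data then pulls these heads back toward similar representations, as raised in the reply above, is the testable part.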
Pantelis Vafidis@vafidisp·
If you're at #ICLR2025 and interested in how we can guarantee true out-of-distribution generalization in neural networks (extrapolation), Aman Bhargava (@ABhargava2000) and I will be presenting our work tomorrow, Saturday the 26th, at 3:00-5:30pm in Hall 3 (poster #69). We will be happy to see you there! Short presentation + slides: iclr.cc/virtual/2025/p…
Pantelis Vafidis tweet media
0
2
11
807
Pantelis Vafidis@vafidisp·
Finally, huge thanks to my amazing collaborator Aman Bhargava (@ABhargava2000) for recognizing the mathematical potential of this project and doing the theory part, and to my advisor Antonio Rangel! This project is a prime example of the amplifying effect of great collaborations. Looking forward to more! Link to top:
Pantelis Vafidis @vafidisp
[quoted tweet: the pinned thread-opening tweet above]
0
0
1
248
Pantelis Vafidis@vafidisp·
Thanks for reading this far! For an in-depth view of the above, I include the paper below (it's 40 pages long!). TL;DR: it worked no matter what we threw at it! And if you happen to be in Singapore for #ICLR2025, we will be presenting at poster session 6 on Saturday the 26th, 3:30-5 pm (Hall 3 + Hall 2B #69). We will be happy to see you there! arxiv.org/abs/2407.11249
1
0
0
243
Pantelis Vafidis@vafidisp·
Excited to share our work on disentangled/abstract representations, to appear at #ICLR2025 (@iclr_conf)! We mathematically prove and experimentally demonstrate that multi-task learning leads to disentangled representations, and propose a unifying mechanism for generalization in brains and machines: parallel processing (🧵+paper below) Our work connects to the Platonic representation hypothesis, suggests why alignment across models/organisms can occur, and shows why transformers excel at constructing world models 🤖🚀
GIF
1
6
27
14.4K