Yaroslav Bulatov

2.3K posts

Yaroslav Bulatov

Yaroslav Bulatov

@yaroslavvb

@southparkcommons (early OpenAI, Google Brain, Meta) https://t.co/bxo5udY3ib https://t.co/SLix8Hrt4w https://t.co/Ur3GWKpmp6

San Francisco, CA Katılım Şubat 2011
1.1K Takip Edilen10.4K Takipçiler
Sabitlenmiş Tweet
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Gradients are energy-inefficient due to long-range dependencies, yet we lack a viable alternative. What if we crowdsource the discovery of what's next? Starting a weekly in-person reading group in SF. WebGPU, online learning, Hinton. Email yaro.slavvb@gmail.com.
English
9
9
177
32K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
@FrancoisChauba1 Also I wonder if "order" is a false dichotomy. For instance take Richardson iteration. It is an "adjoint-free" method, no derivative is needed. But you could also derive it as form of gradient descent. So logically it's first order, but it has cost of 0th order
Yaroslav Bulatov tweet media
English
1
0
2
248
Tim Kanarsky
Tim Kanarsky@tkanarsky·
@yaroslavvb Could you explain what you mean by "L layers simultaneously"? isn't the whole point of chain rule that it allows gradient of an op further up thr graph to be defined only in terms of the most proximate gradient down the graph
English
1
1
0
266
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
@FrancoisChauba1 I thought Hinton's Forward-Forward is a pretty cool idea. It addresses the energy issue. It doesn't work sufficiently well from learning standpoint, but I wonder if it can be fixed
English
0
0
2
31
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Spending Sunday morning reading about Recurrent Laryngeal Nerve
Yaroslav Bulatov tweet media
English
2
0
14
1.5K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Perhaps the most important skill in the age of agents becomes management skill. Famous example from organizational theory, Boeing 777 got things unstuck by switching from functional partitioning to product partitioning
Yaroslav Bulatov tweet mediaYaroslav Bulatov tweet media
English
0
0
5
1K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
@cosminnegruseri but yeah, I think there are not enough funding strategies which allow founders to be honest, hence a lot of "fake it till you make it"
English
1
0
0
67
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Have been thinking today about the practice of starting company before you know what it's for, seems backward to me. Zuck talked about same issue in an interview in 2016 gemini.google.com/share/63600785…
English
2
0
10
1.7K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
@srush_nlp @AryaTschand I don't see much codesign happening. Alex Smola gives a tutorial on evolution of attention at icml-2019. Something like 7 steps by researchers, trying things out on CPUs and GPUs. TPUs were pretty clunky in the early days, not a good fit for research
Yaroslav Bulatov tweet media
English
0
2
4
394
Sasha Rush
Sasha Rush@srush_nlp·
@AryaTschand Oh lol. Looks like you worked on TPUs so you probably know way more about this than me.
English
1
0
1
132
Sasha Rush
Sasha Rush@srush_nlp·
Hypothetically, Tensor Cores could have predated most general purpose computing, right? Feels like there should be people creating worlds where GPT-5 exists, but MS Word is wildly impossible.
English
18
2
161
25.8K
Yaroslav Bulatov retweetledi
Vladimir Bulatov
Vladimir Bulatov@bulatov_org·
Chaotic Solitons in Gray-Scott reaction
English
1
51
402
25.2K
Yaroslav Bulatov
Yaroslav Bulatov@yaroslavvb·
Coasen Singularity - firms exist to minimize transaction costs. If AI agents reduce these transaction costs to near-zero, economic need for large firms disappears, potentially leading to a "singular" economy of individuals or "one-person unicorns."
Yaroslav Bulatov tweet media
English
9
13
75
36.8K