Vaisakh M

962 posts

Vaisakh M banner
Vaisakh M

Vaisakh M

@m__vaisakh

AI Efficiency Research | Independent Researcher

Kochi, India Katılım Mayıs 2018
1K Takip Edilen193 Takipçiler
Vaisakh M
Vaisakh M@m__vaisakh·
@vikhyatk claude's cloud sitting above claudes and clouds
English
0
0
1
34
vik
vik@vikhyatk·
spacex? the gpu neocloud company?
English
4
0
51
1.5K
Greg Yang
Greg Yang@TheGregYang·
turns out my place has high carbon monoxide! fire department brought a whole troop alarm beeped yesterday and today for a period of time I did wake up a bit tired and felt a strain all day so I didn't take any chances and called 911 ongoing situation -- fire department investigating source of CO, which doesn't seem to come from my place I'll update as we have new findings
English
19
2
272
29.4K
Vaisakh M retweetledi
signüll
signüll@signulll·
“laid back” is what high agency ppl look like from the outside when they’ve correctly identified which games are worth playing & simply declined the rest.
English
84
1.2K
11.9K
378.6K
Vaisakh M retweetledi
Natural Philosophy
Natural Philosophy@Naturalphilosy·
“Above all, do not lose your desire to walk.” — Søren Kierkegaard
Natural Philosophy tweet media
English
260
6.9K
45.8K
1.6M
Vaisakh M
Vaisakh M@m__vaisakh·
@difficultyang This is one part. The other part is corporate can do this without involving the developers/researchers.
English
0
0
0
50
difficultyang
difficultyang@difficultyang·
Look guys, just because the copyright owner released something under a particular free license, doesn't mean that can't sell it under a different license to other parties
English
3
0
6
1.1K
Vaisakh M retweetledi
The Nobel Prize
The Nobel Prize@NobelPrize·
“Timing is very important. You need to pick hard problems to solve and be ambitious with them. But you've also got to pick the right time when the world and the context that you're in is the right kind of environment for those ideas to flourish.” In his official Nobel Prize interview, Demis Hassabis discussed how his aspirations as a young gaming programmer were ahead of their time. Watch our official interview: bit.ly/41DGkXr
The Nobel Prize tweet media
English
88
459
3.5K
279.1K
sway
sway@SwayStar123·
Thinking of making a "ML intuitions bench", which will be MCQs for what happens if you make certain tweaks to tranformers or other archs. I have a bunch of findings that'll probably never make into a paper, and most of which are pretty surprising to me. If LLMs can predict these accurately then that's a pretty huge thing for autoresearch
English
1
0
6
718
Vaisakh M retweetledi
•
@WordsCocoon·
march 13, friday...
• tweet media
English
25
8.7K
60.1K
926.4K
Andrew Carr 🤸
Andrew Carr 🤸@andrew_n_carr·
it's awesome to think that nvidia released 25T training tokens. that is so fantastically hard to collect well. the proof is in their models too. I'd expect if you were interested in writing your own constitution against that data, you could train an exceedingly companionable AI
English
1
0
6
964
Vaisakh M
Vaisakh M@m__vaisakh·
HBD to us Mandate of heaven
Vaisakh M tweet media
English
0
0
0
23
Vaisakh M
Vaisakh M@m__vaisakh·
@sytelus like an indicator of (future) conflict of interest?
English
0
0
0
18
Shital Shah
Shital Shah@sytelus·
My friend who just recently raised round for his startup: “the most important thing you look in your investors is where do they have board seats”.
English
1
0
2
579
Matej Sirovatka
Matej Sirovatka@m_sirovatka·
you can just do things when you're gpu rich (full post-train GLM5 being the things)
English
3
1
86
16.5K
Vaisakh M
Vaisakh M@m__vaisakh·
bloat breeds bugs
English
0
0
0
41
Vaisakh M retweetledi
the tiny corp
the tiny corp@__tinygrad__·
“Simplicity is a great virtue, but it requires hard work to achieve and education to appreciate. And to make matters worse, complexity sells better.” — Edsger Dijkstra
English
8
81
776
21.5K
Vaisakh M
Vaisakh M@m__vaisakh·
@kuchaev @natolambert Time here is the time a human takes to complete a task iirc. This eval only takes into account how reliably a model finishes a task and not the time taken to do it.
English
0
0
1
22
Oleksii Kuchaiev
Oleksii Kuchaiev@kuchaev·
@natolambert I am not convinced that hours is a proper metric here. Anything can and will be made faster so, if hypothetically, Anthropic made claude faster (better SW, newer/more hw) that would show up as *worse* on this plot?
English
1
0
1
250
Vaisakh M retweetledi
kepano
kepano@kepano·
taste is a study of nuance, it requires slowing down
English
45
528
3.1K
80K
Vaisakh M
Vaisakh M@m__vaisakh·
@elliotarledge iirc saw a mention of the method not working (at all) in the pixel space and requires a good latent representation to work.
English
0
0
1
446