Vaisakh M

962 posts

Vaisakh M

@m__vaisakh

AI Efficiency Research | Independent Researcher

Kochi, India Katılım Mayıs 2018

1K Takip Edilen193 Takipçiler

Vaisakh M@m__vaisakh·21 May

@vikhyatk claude's cloud sitting above claudes and clouds

English

vik@vikhyatk·21 May

spacex? the gpu neocloud company?

English

1.5K

Vaisakh M@m__vaisakh·14 May

@xhluca @giffmana You mean time to release MUSE Nano Banana by Google right? muse-model.github.io

English

Xing Han Lu@xhluca·14 May

@giffmana Time to release Muse Nano Banana

English

1.6K

Lucas Beyer (bl16)@giffmana·14 May

Gemini Spark? Why does it sound familiar 🤔🤔

Fandu@mrfanduuuuu

Gemini Spark 👀

English

152

36.3K

Vaisakh M@m__vaisakh·6 May

@TheGregYang @itsclivetime kaboom risk?

Dansk

Greg Yang@TheGregYang·6 May

@itsclivetime why?

242

Greg Yang@TheGregYang·6 May

turns out my place has high carbon monoxide! fire department brought a whole troop alarm beeped yesterday and today for a period of time I did wake up a bit tired and felt a strain all day so I didn't take any chances and called 911 ongoing situation -- fire department investigating source of CO, which doesn't seem to come from my place I'll update as we have new findings

English

272

29.4K

Vaisakh M@m__vaisakh·29 Nis

@willccbb @kalomaze On Politburo Distillation

Français

will brown@willccbb·29 Nis

@kalomaze One-Child Policy Distillation

English

2.4K

kalomaze@kalomaze·29 Nis

big lab burner account humor be like chinese distillation. chinese? distillation. did you know that the chinese... distill? chinese. the chinese are distilling,

typedfemale@typedfemale

really exciting to see an LLM trained on pre-1930 data - post-2022 is already crowded with qwen, deepseek, and kimi

English

342

25.4K

Vaisakh M retweetledi

signüll@signulll·10 Nis

“laid back” is what high agency ppl look like from the outside when they’ve correctly identified which games are worth playing & simply declined the rest.

English

1.2K

11.9K

378.6K

Vaisakh M retweetledi

Natural Philosophy@Naturalphilosy·4 Nis

“Above all, do not lose your desire to walk.” — Søren Kierkegaard

English

260

6.9K

45.8K

1.6M

Vaisakh M@m__vaisakh·25 Mar

@difficultyang This is one part. The other part is corporate can do this without involving the developers/researchers.

English

difficultyang@difficultyang·25 Mar

Look guys, just because the copyright owner released something under a particular free license, doesn't mean that can't sell it under a different license to other parties

English

1.1K

Vaisakh M retweetledi

The Nobel Prize@NobelPrize·15 Mar

“Timing is very important. You need to pick hard problems to solve and be ambitious with them. But you've also got to pick the right time when the world and the context that you're in is the right kind of environment for those ideas to flourish.” In his official Nobel Prize interview, Demis Hassabis discussed how his aspirations as a young gaming programmer were ahead of their time. Watch our official interview: bit.ly/41DGkXr

English

459

3.5K

279.1K

Vaisakh M@m__vaisakh·15 Mar

@SwayStar123 share!!! Also a bench would be cool.

English

sway@SwayStar123·15 Mar

Thinking of making a "ML intuitions bench", which will be MCQs for what happens if you make certain tweaks to tranformers or other archs. I have a bunch of findings that'll probably never make into a paper, and most of which are pretty surprising to me. If LLMs can predict these accurately then that's a pretty huge thing for autoresearch

English

718

Vaisakh M retweetledi

•@WordsCocoon·13 Mar

march 13, friday...

English

8.7K

60.1K

926.4K

Vaisakh M@m__vaisakh·13 Mar

@andrew_n_carr I vaguely checked earlier and only found this (nowhere near 25T iiuc) huggingface.co/collections/nv…

English

Andrew Carr 🤸@andrew_n_carr·13 Mar

it's awesome to think that nvidia released 25T training tokens. that is so fantastically hard to collect well. the proof is in their models too. I'd expect if you were interested in writing your own constitution against that data, you could train an exceedingly companionable AI

English

964

Vaisakh M@m__vaisakh·13 Mar

HBD to us Mandate of heaven

English

Vaisakh M@m__vaisakh·11 Mar

@sytelus like an indicator of (future) conflict of interest?

English

Shital Shah@sytelus·10 Mar

My friend who just recently raised round for his startup: “the most important thing you look in your investors is where do they have board seats”.

English

579

Vaisakh M@m__vaisakh·10 Mar

@m_sirovatka But there isn't a base model (atleast not yet)

English

Matej Sirovatka@m_sirovatka·9 Mar

you can just do things when you're gpu rich (full post-train GLM5 being the things)

English

16.5K

Vaisakh M@m__vaisakh·7 Mar

bloat breeds bugs

English

Vaisakh M retweetledi

the tiny corp@__tinygrad__·4 Mar

“Simplicity is a great virtue, but it requires hard work to achieve and education to appreciate. And to make matters worse, complexity sells better.” — Edsger Dijkstra

English

776

21.5K

Vaisakh M@m__vaisakh·1 Mar

@_arohan_ x.com/i/status/20279…

QME

409

Vaisakh M@m__vaisakh·21 Şub

@kuchaev @natolambert Time here is the time a human takes to complete a task iirc. This eval only takes into account how reliably a model finishes a task and not the time taken to do it.

English

Oleksii Kuchaiev@kuchaev·21 Şub

@natolambert I am not convinced that hours is a proper metric here. Anything can and will be made faster so, if hypothetically, Anthropic made claude faster (better SW, newer/more hw) that would show up as *worse* on this plot?

English

250

Nathan Lambert@natolambert·20 Şub

This'll really solve the claude vs codex debate surely. I'm still team claude.

METR@METR_Evals

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours (95% CI of 6 hrs to 98 hrs) on software tasks. While this is the highest point estimate we’ve reported, this measurement is extremely noisy because our current task suite is nearly saturated.

English

164

22.5K

Vaisakh M retweetledi