Sathwik-70B

344 posts

Sathwik-70B banner
Sathwik-70B

Sathwik-70B

@VishnuSathvik1

Reasoning in continuous latent spaces @precogatiiith | ug @iiit_hyderabad | Views are strictly my own

Hyderabad, India Katılım Ocak 2023
1.3K Takip Edilen761 Takipçiler
Sathwik-70B
Sathwik-70B@VishnuSathvik1·
We all know the next Anthropic model is going to be crazy.
English
0
0
1
54
Ravi Sharma
Ravi Sharma@ravishar313·
First person in history to read a ChatGPT generated 22 page document word by word and fact check it.
English
6
0
79
4.3K
Sathwik-70B
Sathwik-70B@VishnuSathvik1·
What differentiates mediocre research with great research?
English
1
0
2
234
Sathwik-70B retweetledi
the tiny corp
the tiny corp@__tinygrad__·
Humans are 100T models with 1T active parameters that run at 10 tok/s. They train on only 10B tokens.
English
146
233
3.7K
224.3K
Benno Krojer
Benno Krojer@benno_krojer·
What are your favorite papers that can serve as excellent examples how to write great scientific paper, present results, great figures, make it engaging and easy to follow? Doesn't necessarily have to be the most cited or impactful ones
English
4
1
13
5.1K
Kabir
Kabir@kabir_j25·
I've been putting in a lot of hours, but I still gotta grow in efficiency...time & efficiency are the key
English
4
0
11
422
Sathwik-70B retweetledi
Damek
Damek@damekdavis·
Solve harder problems.
English
12
9
109
6.4K
Sathwik-70B
Sathwik-70B@VishnuSathvik1·
Today’s benchmark is tomorrow’s training data - @rao2z
English
0
1
4
227
Sathwik-70B
Sathwik-70B@VishnuSathvik1·
@srijatwt Please don't do that. Let our RSAI project live.
English
0
0
0
39
srija
srija@srijatwt·
almost feels like it had a perfect answer ready to defend all eval awareness allegations hmmmm
srija tweet media
English
1
0
1
187
JMB 🧙‍♂️
JMB 🧙‍♂️@jmbollenbacher·
nanbeige4.1 is the biggest overthinker ive ever seen. it was 1500+ tokens off "hello." and its like this every time. this was two different runs, the same thing.
JMB 🧙‍♂️ tweet mediaJMB 🧙‍♂️ tweet media
English
3
0
5
638
Marius Mosbach
Marius Mosbach@mariusmosbach·
🚨 I'm looking for emergency reviewers for ARR submissions in the interpretability and analysis track. Topics include: Analysis of CoT, Supervised Fine-tuning, and Matryoshka Representation Learning.
English
2
6
14
3.3K