Aryan Seth
733 posts

Aryan Seth retweeted

@willccbb but plain OPD has issues - mostly pointed out in this paper; arxiv.org/pdf/2604.03128
"privileged information" leads to an irreducible loss term, and i haven't had much luck with training stability (maybe this is a personal issue lol)


calling Mythos a preview now feels ragebait-y
Anthropic@AnthropicAI
New on the Science Blog: We gave Claude 99 problems analyzing real biological data and compared its performance against an expert panel. On 23 problems, the experts were stumped. Our most recent models solved roughly 30% of those stumped problems, as well as most of the remaining ones.

SpaceXAI and @cursor_ai are now working closely together to create the world’s best coding and knowledge work AI.
The combination of Cursor’s leading product and distribution to expert software engineers with SpaceX’s million-H100-equivalent Colossus training supercomputer will allow us to build the world’s most useful models.
Cursor has also given SpaceX the right to acquire Cursor later this year for $60 billion or pay $10 billion for our work together.
Aryan Seth retweeted

@_lyraaaa_ they all track the same underlying signal but the directions are different because each layer performs a Richelot step on the representation.
the polynomial gets factored.
the curve transforms.
the class persists.

count the false positives count the false positives count the false positives count the false positives count the false positives count the false positives
Stanislav Fort@stanislavfort
New post: We tested the Mythos showcase vulnerabilities with open models. They recovered similarly scoped analyses! 8/8 models found the flagship FreeBSD zero-day, including a 3B model. Rankings reshuffle completely across tasks => the AI cybersecurity frontier is super jagged!
Aryan Seth retweeted

We got into @ycombinator!
A few months ago, @onkar_borade_10, @SujaySriv, and I met at a football game in HSR, Bengaluru, and went deep on one question -
Why does SaaS take months and a huge team to get delivered after the sale? Building a good product should be enough, right?
Right?
Messy integrations. Handoffs. Siloed information. Poor documentation. Fragmented data. The list goes on.
We started a company with the vision to make SaaS self-serve.
@lab0_ai
Huge thanks to @dessaigne , @collinmathilde , and the YC team for this opportunity.
If your product rollouts take months or you're a system integrator/partner/FDE implementing SaaS, let's talk. Link below.
Aryan Seth retweeted

@viplismism sure, harnessing works at inference, but I still don't follow how this solves sparse reward, because that's a training-time problem; recursive call = tool call, so you can train on the subagent output, but the reward is still a scalar (higher thanks to the harness) and still sparse

@AryanSeth07 true in principle i guess, but in practice the harness has to do the heavy lifting man. models tend to jump to conclusions or lose context fast if the harness isn't driving the search process properly

most people don't realize that rlms are just solving the sparse reward problem for long context! instead of an llm hunting for checkmate in one giant forward pass, you break it into bite-sized reasoning tasks. every recursive step is a checkpoint where the model updates its internal value of the context before moving to the next piece. it turns a massive search space into a dense signal
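A minimal sketch of the idea in this tweet, with all names illustrative and no claim about any real RLM implementation: a long-context task is processed as a sequence of recursive subcalls, and each step is a checkpoint that can be scored on its own, so one sparse terminal reward becomes a dense per-step signal.

```python
# Hypothetical sketch: recursive decomposition turns a sparse terminal
# reward into dense per-step rewards. `solve_recursively` and
# `score_step` are illustrative names, not any library's API.

def solve_recursively(chunks, score_step):
    """Process the context chunk by chunk. Each iteration stands in for
    one recursive subcall that updates a running summary (`state`), and
    each updated state is scored immediately (a checkpoint reward),
    instead of scoring only the final answer."""
    state, rewards = "", []
    for chunk in chunks:
        state = state + chunk              # subcall folds the next piece into its summary
        rewards.append(score_step(state))  # dense signal: one reward per recursive step
    return state, rewards

# Toy usage: reward each checkpoint by whether the running summary
# already contains the target token.
chunks = ["the key ", "is under ", "the mat"]
final, rewards = solve_recursively(chunks, lambda s: 1.0 if "mat" in s else 0.0)
```

In the toy run above, a single end-of-episode score would yield one reward for the whole trajectory; the per-checkpoint scoring yields a reward at every step, which is the "dense signal" the tweet describes.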
Aryan Seth retweeted
