ROXAS

187 posts

ROXAS

@roxasery

Katılım Nisan 2010

57 Takip Edilen7 Takipçiler

ROXAS@roxasery·24 Şub

どんな #ポケモン会えるかな？ #ポケモン30周年 #PR @poke_times

日本語

ROXAS retweetledi

AI at Meta@AIatMeta·14 Ağu

Introducing DINOv3: a state-of-the-art computer vision model trained with self-supervised learning (SSL) that produces powerful, high-resolution image features. For the first time, a single frozen vision backbone outperforms specialized solutions on multiple long-standing dense prediction tasks. Learn more about DINOv3 here: ai.meta.com/blog/dinov3-se…

English

343

754

4.4K

898.9K

ROXAS@roxasery·1 Eki

@mishalsalim Please

English

ROXAS@roxasery·1 Eki

@basedonmoon Please

English

BasedOnMoon@basedonmoon·1 Eki

I have 13 Sora 2 Invite Codes: okay let’s be fair Raffle instructions below Follow and comment below and I’ll randomise who gets one in your DM :) #Sora2 #invitecode

English

3.5K

ROXAS@roxasery·1 Eki

@Jaydenfn86 SORA

Português

Jaydenfn@Jaydenfn86·1 Eki

I got 3 invite codes left Comment “SORA” then I’ll send it to your dms

English

287

ROXAS@roxasery·1 Eki

@ozgrozer Can I have a code? Thanks!

English

Ozgur Ozer@ozgrozer·1 Eki

I just got access to Sora 2 and have some invite codes. Like and reply this post. I’ll be sharing the codes in the comments.

English

3.4K

Vincent Zhan@Vince2000_·30 Eyl

I have 100 Sora invite codes. Like, comment, and repost this post. Follow @Vince2000_ and @Martini373469 I will randomly select 100 from the comments.

English

509

137

685

102K

ROXAS@roxasery·1 Eki

@Vince2000_ @Martini373469 Can I have a code?

English

ROXAS retweetledi

Ahmad@TheAhmadOsman·7 Eyl

Comparing & Contrasting Recent LLMs Architecture > DeepSeek-V3/R1 > OLMo 2 > Gemma 3 > Mistral Small 3.1 > Llama 4 > Qwen3 (dense+MoE) > SmolLM3 > Kimi 2 > GPT-OSS Are 2025 LLMs really that different from each other? MoE, MLA, GQA, sliding window, normalization games & more.

English

158

977

85.6K

ROXAS retweetledi

Rohan Paul@rohanpaul_ai·7 Eyl

The paper shows a small model trained with reinforcement learning can outperform prompt only agents on machine learning engineering. Most agents just prompt large models and search longer, but they do not learn from experience. This work instead trains a 3B Qwen model with reinforcement learning, updating its decision rule from task feedback. Challenge 1, actions take different time to run, so naive distributed training overcounts fast but weak ideas. They fix this by weighting each update by action runtime, so slower high value runs matter. Challenge 2, rewards are sparse, a near miss and a total failure look the same to the learner. They add environment instrumentation, a separate frozen model inserts print lines into the code and awards small credit for milestones like loaded data or trained model. These steady signals steer the agent away from metric gaming and toward better modeling, like simple feature engineering or stronger classifiers. With these fixes plus a self improvement prompt, the small learner keeps improving over runs and often beats frontier models. ---- Paper – arxiv. org/abs/2509.01684 Paper Title: "Reinforcement Learning for Machine Learning Engineering Agents"

English

126

712

55K

ROXAS@roxasery·7 Ağu

research method比预想中的要难啊……加上之前在香港已经折腾地精疲力尽了，这五天过得简直就是灾难T_T事实上那些让你连参考资料都找不着的课才是真正难的课=_=

中文

ROXAS@roxasery·2 Ağu

太多时候，错过了就错过了

中文

ROXAS@roxasery·1 Ağu

@EssieZMY 自己评论自己？

中文

ROXAS@roxasery·31 Tem

@EssieZMY 我还是觉得高调点的进步比较快……不过把别人教得比自己还明白什么的……及时暴露缺点才能及时去改正啊=_=

中文

ROXAS@roxasery·13 Nis

@ElevenerT 其实只按上左下右交替着按就能打到宋朝……

中文

ROXAS@roxasery·13 Nis

“想到这里，我就悲哀得难以自禁。因为，直子连爱都没爱过我的。”

中文

ROXAS@roxasery·2 Nis

RT @roxasery: 而我现在所处的痛苦……几乎就是永恒了，我一直都是这么愚蠢

中文

ROXAS@roxasery·19 Şub

我非常好奇当初的我是怎么想到把饭否和twitter连到一起的= =

中文

ROXAS@roxasery·9 Oca

我要去见那高山。

中文

ROXAS@roxasery·8 Oca

蠢死了我……给个陷阱就往里跳=_=

中文

Keşfet

@poke_times @mishalsalim @basedonmoon @Jaydenfn86 @ozgrozer @Vince2000_ @EssieZMY @elonmusk