Reshinth

378 posts

Reshinth

@reshinth_

Research | Post Training of CodeLMs, LMs.

London, England Katılım Ağustos 2019

471 Takip Edilen276 Takipçiler

Sabitlenmiş Tweet

Reshinth@reshinth_·7 Nis

How to define Diversity in the context of CodeLMs and Programming Languages ? 1. Diversity is positively correlated with Performance in solving a problem. 2. Shortcomings of diversity in small codeLMs. 3. Code Embedding models don't capture semantics. reshinthadithyan.github.io/blog/2023/code…

English

4.5K

Reshinth retweetledi

Varun Jampani@jampani_varun·1 Eki

🎬 Introducing Stable Cinemetrics, to be presented at NeurIPS 2025. We present the first taxonomy of professional controls to systematically study and control video generative models through the lens of filmmaking. Interactive webpage with paper link: stable-cinemetrics.github.io 🧵

English

Reshinth@reshinth_·17 Tem

@tylerangert Happy Birthday 🎉

English

Tyler Angert@tylerangert·17 Tem

My forever fun fact: my birthday is the calendar emoji 📅

English

196

13.1K

Reshinth retweetledi

Xiao Liang@MasterVito0601·13 Haz

🙋‍♂️ Can RL training address model weaknesses without external distillation? 🚀 Please check our latest work on RL for LLM reasoning! 💯 TL;DR: We propose augmenting RL training with synthetic problems targeting model’s reasoning weaknesses. 📊Qwen2.5-32B: 42.9 → SwS-32B: 68.4

English

134

12.1K

Reshinth@reshinth_·12 Haz

ZXX

114

Reshinth@reshinth_·8 Haz

@evanthebouncy Congrats Evan.

English

122

evanthebouncy@evanthebouncy·8 Haz

I've recently started my job as an asst professor at NTU, Singapore. If you are ever in town come say hi :)

English

682

37.8K

Reshinth@reshinth_·7 Haz

ZXX

107

Reshinth@reshinth_·6 Haz

With CodeLMs scaling actually solved models intrinsically learning internal structural syntactical & semantic information.

English

129

Reshinth retweetledi

Josh@JoshPurtell·6 Haz

Open AI gave a talk on writing software through specs today. I thought it was my little secret, but seems like quite a few smart builders in the space have also found it's a useful approach. Now that the secrets out joshuapurtell.com/posts/spec_eng/

English

733

76.6K

Reshinth@reshinth_·31 May

ZXX

Reshinth retweetledi

Morph@morph_labs·8 May

Our Infinibranch Sandboxes power @huggingface OpenR1's code-based rewards for training LLMs with GRPO. From multi-file complex evaluation pipelines for IOI problems in C++, to Jupyter execution for Python: we evaluate in seconds, not minutes github.com/huggingface/op…

English

747

Reshinth@reshinth_·5 May

@scychan_brains Resonates a lot. I myself found stuck in such scenario late last year. Talking with collaborators and colleagues helped a lot. Thanks a lot for writing. ♥️

English

493

Stephanie Chan@scychan_brains·4 May

Some years ago, I got trapped in a Massive Trough of Imposter Syndrome. It took more than a year to dig myself out of it, but the following framework really helped me. It feels a bit vulnerable to share, but I hope it might help a few others too! A short thread 🧵🙂

English

302

38.6K

Reshinth@reshinth_·26 Nis

@ben_trevett

QME

Ben Trevett 🫟@ben_trevett·26 Nis

Does anyone else remember a year or so ago when all the AI podcasts did AGI debates? What an awful time that was, glad it’s over

English

Reshinth@reshinth_·21 Nis

@FraserGreenlee @arcprize Am I understanding it right that you're describing program synthesis ?

English

Fraser@FraserGreenlee·21 Nis

Wonder if there's room for an @arcprize game where you gradually build up an abstraction library to solve tasks of increasing difficulty. Bonus points for using less code.

English

156

Reshinth retweetledi

Augment Code@augmentcode·1 Nis

🧵We just released the #1 open-source agent on the SWE-bench Verified leaderboard by assembling the best of Claude Sonnet 3.7 and O1. Open-source repo here: github.com/augmentcode/au… Here's how we achieved 65.4% success rate on the hardest coding benchmark in the industry: 🧠👇

English

276

51.3K

Reshinth retweetledi

Morph@morph_labs·13 Mar

OpenAI's CUA plays Pokemon in the multiverse with Infinibranch by Morph Cloud choose every starter: no more pesky 'decisions'

English

Reshinth@reshinth_·23 Şub

ML Twitter lately.

Casey Muratori@cmuratori

Remember folks: if you aren't a subject matter expert, don't know the context, and have nothing valuable to add to a thread, you always have the option of not replying!

English

137

Reshinth retweetledi

Alex Havrilla@Dahoas1·5 Ara

How important is the quality, diversity, and complexity (QDC) of synthetic data for LLM performance? What effect does QDC data composition have on self-improvement? We just released a comprehensive survey discussing these questions (and many more) 🧵

English

111

16.9K

Reshinth retweetledi

Nathan Cooper@ncooper57·5 Ara

As R&D staff @answerdotai, I work a lot on boosting productivity with AI. A common theme that always comes up is the combination of human+AI. This combination proved to be powerful in our new project ShellSage, which is an AI terminal buddy that learns and teaches with you. A 🧵

English

201

69.1K

Reshinth retweetledi

Anthropic@AnthropicAI·19 Kas

New Anthropic research: Adding Error Bars to Evals. AI model evaluations don’t usually include statistics or uncertainty. We think they should. Read the blog post here: anthropic.com/research/stati…

English

299

2.1K

755.6K

Reshinth@reshinth_·8 Kas

@ncooper57 @fastdotai Congrats Nathan <3

English

Nathan Cooper@ncooper57·8 Kas

I'm so excited to be working on this new course from @fastdotai ! Education has always been a huge driving factor in my life. It is surreal that I'm getting to do this as part of my job. Really looking forward to working with students again 🤓

Jeremy Howard@jeremyphoward

Today, we're announcing that @fastdotai is joining @AnswerdotAI, marking a new phase in making AI accessible. And we're launching a new a new kind of "AI-first" educational experience, "How To Solve It With Code". answer.ai/posts/2024-11-…

English

793

Keşfet

@tylerangert @evanthebouncy @huggingface @scychan_brains @ben_trevett @FraserGreenlee @arcprize @elonmusk