Reshinth

378 posts

Reshinth banner
Reshinth

Reshinth

@reshinth_

Research | Post Training of CodeLMs, LMs.

London, England Katılım Ağustos 2019
471 Takip Edilen276 Takipçiler
Sabitlenmiş Tweet
Reshinth
Reshinth@reshinth_·
How to define Diversity in the context of CodeLMs and Programming Languages ? 1. Diversity is positively correlated with Performance in solving a problem. 2. Shortcomings of diversity in small codeLMs. 3. Code Embedding models don't capture semantics. reshinthadithyan.github.io/blog/2023/code…
Reshinth tweet mediaReshinth tweet media
English
1
9
24
4.5K
Reshinth retweetledi
Varun Jampani
Varun Jampani@jampani_varun·
🎬 Introducing Stable Cinemetrics, to be presented at NeurIPS 2025. We present the first taxonomy of professional controls to systematically study and control video generative models through the lens of filmmaking. Interactive webpage with paper link: stable-cinemetrics.github.io 🧵
English
1
3
24
4K
Tyler Angert
Tyler Angert@tylerangert·
My forever fun fact: my birthday is the calendar emoji 📅
English
55
0
196
13.1K
Reshinth retweetledi
Xiao Liang
Xiao Liang@MasterVito0601·
🙋‍♂️ Can RL training address model weaknesses without external distillation? 🚀 Please check our latest work on RL for LLM reasoning! 💯 TL;DR: We propose augmenting RL training with synthetic problems targeting model’s reasoning weaknesses. 📊Qwen2.5-32B: 42.9 → SwS-32B: 68.4
Xiao Liang tweet media
English
7
37
134
12.1K
evanthebouncy
evanthebouncy@evanthebouncy·
I've recently started my job as an asst professor at NTU, Singapore. If you are ever in town come say hi :)
evanthebouncy tweet media
English
28
11
682
37.8K
Reshinth
Reshinth@reshinth_·
With CodeLMs scaling actually solved models intrinsically learning internal structural syntactical & semantic information.
English
0
0
0
129
Reshinth retweetledi
Josh
Josh@JoshPurtell·
Open AI gave a talk on writing software through specs today. I thought it was my little secret, but seems like quite a few smart builders in the space have also found it's a useful approach. Now that the secrets out joshuapurtell.com/posts/spec_eng/
English
20
44
733
76.6K
Reshinth retweetledi
Morph
Morph@morph_labs·
Our Infinibranch Sandboxes power @huggingface OpenR1's code-based rewards for training LLMs with GRPO. From multi-file complex evaluation pipelines for IOI problems in C++, to Jupyter execution for Python: we evaluate in seconds, not minutes github.com/huggingface/op…
English
1
2
4
747
Reshinth
Reshinth@reshinth_·
@scychan_brains Resonates a lot. I myself found stuck in such scenario late last year. Talking with collaborators and colleagues helped a lot. Thanks a lot for writing. ♥️
English
1
0
1
493
Stephanie Chan
Stephanie Chan@scychan_brains·
Some years ago, I got trapped in a Massive Trough of Imposter Syndrome. It took more than a year to dig myself out of it, but the following framework really helped me. It feels a bit vulnerable to share, but I hope it might help a few others too! A short thread 🧵🙂
English
7
34
302
38.6K
Ben Trevett 🫟
Ben Trevett 🫟@ben_trevett·
Does anyone else remember a year or so ago when all the AI podcasts did AGI debates? What an awful time that was, glad it’s over
English
1
0
1
85
Fraser
Fraser@FraserGreenlee·
Wonder if there's room for an @arcprize game where you gradually build up an abstraction library to solve tasks of increasing difficulty. Bonus points for using less code.
English
1
0
2
156
Reshinth retweetledi
Augment Code
Augment Code@augmentcode·
🧵We just released the #1 open-source agent on the SWE-bench Verified leaderboard by assembling the best of Claude Sonnet 3.7 and O1. Open-source repo here: github.com/augmentcode/au… Here's how we achieved 65.4% success rate on the hardest coding benchmark in the industry: 🧠👇
Augment Code tweet media
English
7
61
276
51.3K
Reshinth retweetledi
Morph
Morph@morph_labs·
OpenAI's CUA plays Pokemon in the multiverse with Infinibranch by Morph Cloud choose every starter: no more pesky 'decisions'
English
2
12
45
6K
Reshinth retweetledi
Alex Havrilla
Alex Havrilla@Dahoas1·
How important is the quality, diversity, and complexity (QDC) of synthetic data for LLM performance? What effect does QDC data composition have on self-improvement? We just released a comprehensive survey discussing these questions (and many more) 🧵
Alex Havrilla tweet media
English
5
32
111
16.9K
Reshinth retweetledi
Nathan Cooper
Nathan Cooper@ncooper57·
As R&D staff @answerdotai, I work a lot on boosting productivity with AI. A common theme that always comes up is the combination of human+AI. This combination proved to be powerful in our new project ShellSage, which is an AI terminal buddy that learns and teaches with you. A 🧵
Nathan Cooper tweet media
English
5
39
201
69.1K
Reshinth retweetledi
Anthropic
Anthropic@AnthropicAI·
New Anthropic research: Adding Error Bars to Evals. AI model evaluations don’t usually include statistics or uncertainty. We think they should. Read the blog post here: anthropic.com/research/stati…
English
49
299
2.1K
755.6K
Nathan Cooper
Nathan Cooper@ncooper57·
I'm so excited to be working on this new course from @fastdotai ! Education has always been a huge driving factor in my life. It is surreal that I'm getting to do this as part of my job. Really looking forward to working with students again 🤓
Jeremy Howard@jeremyphoward

Today, we're announcing that @fastdotai is joining @AnswerdotAI, marking a new phase in making AI accessible. And we're launching a new a new kind of "AI-first" educational experience, "How To Solve It With Code". answer.ai/posts/2024-11-…

English
2
1
18
793