Anil Thomas

19 posts

@anlthms

Time to build...

Joined May 2013

29 Following · 75 Followers

Anil Thomas retweeted

Ashish Vaswani@ashVaswani·7 Mar

Rnj-1 aces telco benchmarks. Congratulations to Farbod Tavakkoli, @GregoryDiamos, @tensorwave, @AMD, and the @essential_ai team!

Essential AI@essential_ai

Rnj-1’s performance is especially good in correctness and abstention in its weight class, which are the two most important metrics for this work.

5.1K

Anil Thomas@anlthms·12 Dec

@omkizzy we used <suf_fim>{suffix}<pre_fim>{prefix}<mid_fim>{middle} (a.k.a. SPM variant 1) in pre-training. You should still be able to use <pre_fim><suf_fim>{suffix}<mid_fim>{prefix}{middle} during inference, as it is just PSM with an empty prefix.
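A minimal sketch of the prompt layouts discussed in this reply, assuming the three sentinel strings appear literally as written in the tweet and that PSM follows the usual `<pre_fim>{prefix}<suf_fim>{suffix}<mid_fim>` ordering (implied by "PSM with an empty prefix", but not spelled out in the thread). The function names are hypothetical and are not part of any Rnj-1 tooling; each function returns only the prompt, and the model is expected to generate `{middle}` after it.

```python
# Sentinel strings copied from the tweet; whether the tokenizer maps them
# to single special tokens is an assumption not confirmed by the thread.
PRE, SUF, MID = "<pre_fim>", "<suf_fim>", "<mid_fim>"


def psm_prompt(prefix: str, suffix: str) -> str:
    """Prefix-Suffix-Middle: the model continues with the middle."""
    return f"{PRE}{prefix}{SUF}{suffix}{MID}"


def spm_v1_prompt(prefix: str, suffix: str) -> str:
    """SPM variant 1, the layout the tweet says was used in pre-training."""
    return f"{SUF}{suffix}{PRE}{prefix}{MID}"


def spm_v2_prompt(prefix: str, suffix: str) -> str:
    """Layout suggested for inference: PSM with an empty prefix,
    with the real prefix placed after <mid_fim> so the model simply
    continues it into the middle."""
    return psm_prompt("", suffix) + prefix


if __name__ == "__main__":
    prefix = "def add(a, b):\n    return "
    suffix = "\n\nprint(add(1, 2))\n"
    for build in (psm_prompt, spm_v1_prompt, spm_v2_prompt):
        print(build.__name__, repr(build(prefix, suffix)))
```

In the second SPM layout the suffix comes first and the prefix sits at the very end of the prompt, so text appended to the prefix only extends the prompt rather than changing earlier tokens.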

omkaar@omkizzy·12 Dec

@anlthms awesome! it worked well in my simple vibe tests, so I just wanted to confirm: SPM is <pre_fim><suf_fim>{suffix}<mid_fim>{prefix}{middle}, right?

omkaar@omkizzy·11 Dec

hi @_saurabh @_jainyash, rnj-1 came out really well. I see PSM working for code FIM, has there been SPM format pre-training as well? SPM is better for KV caching

368

Anil Thomas@anlthms·6 Dec

Thrilled to share what we have been building!

Ashish Vaswani@ashVaswani

We are beyond thrilled to share our first flagship models, Rnj-1 base and instruct 8B parameter models. Rnj-1 is the culmination of 10 months of hard work by a phenomenal team, dedicated to advancing American SOTA OSS AI. Lots of wins with Rnj-1. 1. SWE bench performance close to GPT 4o. 2. Tool use outperforming all comparable open source models. 3. Mathematical reasoning (AIME’25) nearly at par with GPT OSS MoE 20B. ….

508

Anil Thomas retweeted

Essential AI@essential_ai·5 Sep

[1/2] We at Essential are driven by a mission to advance fundamental research, guided by first principles, rigor, and sharing research openly.

5.4K

Anil Thomas retweeted

Essential AI@essential_ai·22 Jun

Why run the same race when we can pioneer our own path? That's how we approach AI, by taking big bets and pushing on the foundations of AI 💥 Check out @ashVaswani's recent interview with @EconomicTimes

27.6K

Anil Thomas retweeted

Ashish Vaswani@ashVaswani·18 Jun

Check out our latest research on data. We're releasing 24T tokens of richly labelled web data. We found it very useful for our internal data curation efforts. Excited to see what you build using Essential-Web v1.0!

Essential AI@essential_ai

[1/5] 🚀 Meet Essential-Web v1.0, a 24-trillion-token pre-training dataset with rich metadata built to effortlessly curate high-performing datasets across domains and use cases!

653

145.3K

Anil Thomas retweeted

Essential AI@essential_ai·3 May

🗞️ We just launched our new landing page and dropped a fresh blog post on how LLMs learn to reflect and revise their thinking: In order to advance reasoning, it's vital to measure and understand its constituents, such as reflection. More to come - essential.ai

9.1K

Anil Thomas retweeted

Ashish Vaswani@ashVaswani·8 Apr

Reinforcement learning has shown success in eliciting reflection from LLMs, but what if this capability actually manifests earlier in pre-training? We investigated this question and our results are surprising 👇 [1/4]

100

806

137.7K

Anil Thomas@anlthms·14 Nov

@sama see you on the leaderboard next year

642

Sam Altman@sama·14 Nov

@DavidSHolz @willdepue in your heart do you believe we’ve solved that one or no?

726

505.2K

will depue@willdepue·13 Nov

scaling has hit a wall and that wall is 100% eval saturation.

355.1K

Anil Thomas@anlthms·12 Nov

139

Anil Thomas@anlthms·11 Nov

@arcprize

ARC Prize@arcprize

ARC Prize 2024 is now closed for code submissions! 🏁 Thank you to everyone who participated. We made incredible progress on ARC-AGI. Next: paper deadline Tuesday + a review period where the Kaggle & ARC Prize teams will verify winning solutions. Winners announced Dec. 6.

593

Anil Thomas@anlthms·4 Dec

@317070 *Maniacal laughter*

317070@317070·4 Dec

Did you know that you can build a virtual machine inside ChatGPT? And that you can use this machine to create files, program, and even browse the internet? engraved.blog/building-a-vir…

217

2.1K

7.8K

Anil Thomas@anlthms·4 Dec

@317070

Anil Thomas@anlthms·4 Dec

@317070 The output seems incorrect for non-trivial commands, but it does a pretty good job of hallucinating what the output might look like.

Anil Thomas retweeted

Luminide@LuminideInc·31 May

@karpathy You'd probably also like Luminide. It's just as easy to "get a GPU in the cloud", but also includes AI model dev features like Experiment Tracking and Hyperparameter Tuning. And it's available to individuals today! luminide.com/features

221
