Christopher Settles

919 posts

Christopher Settles

@never_settles_

Building RL gyms @refresh_dev | Prev AI @Uber , CS @UofIllinois | believer in community

San Francisco Katılım Şubat 2022

2K Takip Edilen2.2K Takipçiler

Christopher Settles retweetledi

Shuyan Zhou@shuyanzh36·2d

In 2023, WebArena took 7 grad students more than 6 months to build just 5 environments with 812 variable browser-use tasks. Now, it takes under 10 hours and less than $100 per environment, with easy support for parallel generation. Excited to introduce WebArena-Infinity: a scalable approach for automatically generating high-authenticity, high-complexity browser environments with verifiable tasks suitable for RL training and benchmarking. Even strong open-source models that already achieve 60%+ success rates on WebArena and OSWorld complete fewer than 50% of tasks here. Project page: webarena.dev/webarena-infin… Repo: github.com/web-arena-x/we… 🧵 (1/n)

GIF

English

317

38.8K

Christopher Settles retweetledi

Ivan Bercovich@neversupervised·4d

x.com/i/article/2035…

ZXX

77.6K

Christopher Settles retweetledi

Arcee.ai@arcee_ai·5d

Here are a few of our favorite shots from our recent out-of-home campaign. Loving how the Arcee teal cuts right through the noise of downtown SF and the traffic on the 101 + a bonus shot from the DC metro.

English

1.7K

Christopher Settles retweetledi

Luke Melas-Kyriazi@lukemelas·6d

Our first frontier-level model! It's the result of our first continued pretraining run as well as further scaling RL. Very excited to hear how people like it! Feel free to send me feedback and we'll incorporate it into future models.

Cursor@cursor_ai

Composer 2 is now available in Cursor.

English

4.9K

Christopher Settles retweetledi

Tzafon@tzafon_company·16 Mar

We're open sourcing Northstar CUA Fast, a frontier 4B open-source Computer Use Action (CUA) model, built for accuracy and long-horizon action planning.

English

1.7K

Christopher Settles@never_settles_·15 Mar

@arlanr @nozomioai I'll try to stop by to say hi!

English

106

Arlan@arlanr·15 Mar

@never_settles_ @nozomioai for 6 days only

English

561

Arlan@arlanr·15 Mar

If you do a work trial or work at @nozomioai, the least you get is: - unlimited doordash and steaks - unlimited Hinge and Tinder - Airbnb - unlimited access to white monster and diet coke - $5,000 worth of claude code every week - handsome founder

English

264

17.9K

Christopher Settles retweetledi

Eli Mernit@mernit·15 Mar

@tekbog

QME

195

9.5K

Christopher Settles retweetledi

RunRL@runrl_com·15 Mar

ZXX

241

Christopher Settles retweetledi

Suman@0xSuman·12 Mar

Introducing MeetClaw 🦞 (aka OpenUtter) `npx openutter` on your OpenClaw Let the lobster take over meetings, send live updates and screenshots. Say goodbye to @meetgranola , @otter_ai github.com/sumansid/openu…

Suman@0xSuman

🦞taking over meetings

English

488

134.4K

Christopher Settles retweetledi

Ishaan Sehgal@ishaansehgal·3 Mar

every dev wants to code from anywhere but SSH is a pain. cloud sandboxes don't know your environment. remote control apps die when the laptop closes. so we mapped every approach 🔗 omnara.com/blog/mobile-co…

English

820

Christopher Settles@never_settles_·7 Mar

It's finally hot in SF because Claude has been running all the GPUs overclocked

English

202

Christopher Settles retweetledi

Xiangyi Li@xdotli·7 Mar

Room ready for the largest Agent Skills hackathon at @fdotinc Sat March 7. 🌁 We added the following speakers: @underyx from Anthropic @FurqanR founder of @fdotinc @thirdweb @nebulagg @ryanmart3n creator of Harbor and Terminal Bench @xdotli yours truly who made SkillsBench as well. We have two tracks: - Make skills in economically valuable domains where models are less trained on, and help the model. This can be OpenClaw skills for marketing, writing compliance, excels, etc. - Make skills continual learning pipelines. Join us to make agent skills reliable and self-evolving! @belindmo @roeybc @turboblitzzz @ruslanjabari @never_settles_ @fdotinc

English

3.9K

Christopher Settles@never_settles_·4 Mar

@ay_ushr the sky is the limit for the cap!

English

101

Ayush@ay_ushr·4 Mar

$6 uncapped >>> $6m on 60m.

Ayush@ay_ushr

Alright, it's been leaked. Today we're announcing the @autumnpricing seed round.

English

6.1K

Christopher Settles retweetledi

Daanish Khazi@bertgodel·4 Mar

We’re announcing Kos-1 Lite, a medical model that achieves SOTA on HealthBench Hard at 46.6%. As a medium sized language model (~100B), it achieves these results at a fraction of the serving cost of frontier trillion-parameter models.

English

319

25K

Christopher Settles@never_settles_·3 Mar

@defi_dua @AnswersAi_ai @quizlet congrats!!!

English

Shubhan Dua@defi_dua·2 Mar

I’m pleased to announce that @AnswersAi_ai has been acquired and our Team SF is joining @quizlet We started this journey 3 years ago as Juniors at Cal and UCLA as a hackathon project and built our way through Senior year, three offices and more. I’m proud of our team that got us 2M Users, 1M followers, 2 Billion views and $3.5M+ across our lifetime. We’re grateful for our investors, supporters and team that took a bet on us, starting with my co founders. Thank you to Kurt Beidler, Ismail Orujov and the entire Quizlet team for taking a bet on us. We continue our journey there alongside the incredible @satapathy_dev_ , @DanielBerezhnoy and @angeldzzz23 Always day one as we continue making a dent in the universe

English

107

371

110.1K

Christopher Settles@never_settles_·2 Mar

Claude code took off partly because its core feature was dead clear (Agent Mode). Cursor led with next edit prediction for a while, even though Agent Mode was just as good as Claude code, so lots of people formed an opinion before ever trying it. Any parallels in CUA apps?

English

379

Christopher Settles@never_settles_·27 Şub

Realtime desktop screencast experience is actually game changing for users to use a product

Tzafon@tzafon_company

Everyone loves a fast product. That's why we decided to rebuild our Lightcone OS for significantly faster interaction – we now support full real-time streaming at 30 FPS. We did this to address one of the most common complaints with computer use – namely that it feels slow and clunky. It doesn't have to be this way. Our goal is to build the fastest computer use experience in the world and this brings us one step closer to this goal. We're especially keen to see what developers can build on top of this. Available to try out in Lightcone beta, and for developers via the API.

English

463

Christopher Settles retweetledi

Sam Altman@sama·27 Şub

We have raised a $110 billion round of funding from Amazon, NVIDIA, and SoftBank. We are grateful for the support from our partners, and have a lot of work to do to bring you the tools you deserve.

English

4.2K

2.6K

39.5K

8.9M

Christopher Settles retweetledi

will brown@willccbb·27 Şub

the reason every model is bad at multimodal is because literally nobody except @vikhyatk is even trying there’s prob a lot of easy wins on the CUA path still to be found

Tzafon@tzafon_company

We showed model colored squares for a few hours. It learned to use a computer better than models trained on thousands of real screenshots.

English

586

61.6K

Keşfet

@arlanr @nozomioai @tekbog @meetgranola @otter_ai @fdotinc @underyx @FurqanR