Sam Wolfstone

663 posts

@SamWolfstone

Sculpting, AI, Philosophy, Coding

Joined November 2020

134 Following, 246 Followers

Sam Wolfstone@SamWolfstone·7h

@Teknium Will eagerly wait to use this once the required PRs have been merged in! (I'd rather just stick to official updates rather than apply random PRs on my own instance...)

Teknium (e/λ)@Teknium·2d

If you liked lossless claw, here's a new memory plugin that brings it to Hermes Agent 👇

Stephen Schoettler@StephenSch44748

I built an LCM plugin for @NousResearch Hermes Agent. Required a patch to make the context engine pluggable. Lossless Context Management — every message preserved, nothing lost to compaction. PR: github.com/NousResearch/h… Plugin: github.com/stephenschoett…

143

10.9K

Sam Wolfstone@SamWolfstone·2d

@AcerFur @synthwavedd Such a cool frickin' benchmark!! Very nice work. Can't wait to see how the new GPT one does on this. Do you have other tests you're keeping in your back pocket in case the labs target these specific tests?

Acer@AcerFur·3d

I'm not one to be too interested in image gen capabilities, but I do care about reasoning capabilities, so I am introducing a new benchmark testing reasoning during image generation. Introducing the Image Reasoning Generation Benchmark (IRGB): pellaml.github.io/iumb/#irgb

198

12.5K

Sam Wolfstone@SamWolfstone·2d

@synthwavedd @chatgpt21 @DrBeavisAI Can't help but think that 5o and 5.5 will be different models. 5o needs to be really fast/cheap if it's going to power advanced voice mode. If 5.5 is a step change in intelligence, probably huge, slow and expensive.

216

leo 🐾@synthwavedd·2d

@chatgpt21 @DrBeavisAI I mean naturally they would say that Internally it's currently slated as 5.5, but 5o isn't out of contention either. Doubt it's 6

1.5K

leo 🐾@synthwavedd·2d

big week coming up

381

68.8K

Sam Wolfstone@SamWolfstone·3d

@johnennis Definitely did at first. Got one of my smartest friends into LLMs so now I have someone who's fairly interested in talking about AI stuff with me. Also have a work-colleague-turned-friend who also loves talking about AI with me now, who has very different views from me Need more

John Ennis@johnennis·3d

I think one of the biggest challenges when it comes to going hard into using AI is loneliness I am learning all these awesome things and becoming super capable But the set of people that I can really talk to about it is very small Is anyone else having this experience?

1.1K

173

3.8K

161.6K

Sam Wolfstone retweeted

Aidan McLaughlin@aidan_mclau·4d

one of my all-time favorite plots

2.1K

229.3K

Sam Wolfstone@SamWolfstone·4d

@AcerFur Feel kinda stupid asking but I'm too curious, what makes it not quite pass the test here?

368

Acer@AcerFur·5d

doesn't quite pass the animal keyboard test

7.8K

Acer@AcerFur·5d

maskingtape-alpha gaffertape-alpha packingtape-alpha Seems like good image models on the image arena (try them out)... but they're not quite perfect just yet. Still fails the Rubik's Cube reflection test.

Acer@AcerFur

@m__dehghani Alright, @m__dehghani time for these next: A validly scrambled Rubik's cube placed by a mirror, clearly showing its mirror reflection. No harsh light reflections. Incorrect centres, edge, and corner pairings:

295

232.1K

Sam Wolfstone@SamWolfstone·4d

@KarolCodes @0xSero @theo Even with some details in the SOUL.md, it's really hard to get GPT-5.4 not to yap. If you ask it to be succinct, it'll just be curt in its sentences but still send you 50 lines in the response...

Karol@KarolCodes·4d

@0xSero @theo don't you get 5-page essays from 5.4 every time you ask it to do something?

Theo - t3.gg@theo·5d

So, uh, what subscription should I be using for my OpenClaw now? 🙃

264

1.4K

264.1K

Sam Wolfstone@SamWolfstone·4d

@synthwavedd @sawlygg Was the prompt for this one also fairly simple?

333

leo 🐾@synthwavedd·5d

yeah it's over holy shit h/t @sawlygg

leo 🐾@synthwavedd

for those wondering, yes, gpt image 2 (not based on 4o ;3) is now on @arena enjoy

1.1K

286.6K

Sam Wolfstone@SamWolfstone·1 Apr

@0xSero :o Keep up the good work :P

0xSero@0xSero·1 Apr

@SamWolfstone I have 3 more much larger giveaways in the works, it ain’t easy

371

0xSero@0xSero·1 Apr

Do you want 3 months of Codex Pro? Comment under the post; we are selecting 5 people very soon. 3 MONTHS of Pro. I genuinely think this is the highest leverage sub we have

Sarah Chieng@MilksandMatcha

Giving away 5 Codex Pro plans Each person will get 3 months of free Codex Pro (highest tier). Winners will be selected from comments in 48 hours, comment below why you want it.

449

699

75.8K

Sam Wolfstone@SamWolfstone·1 Apr

@MilksandMatcha Pick me, pick me!

Sarah Chieng@MilksandMatcha·1 Apr

Giving away 5 Codex Pro plans Each person will get 3 months of free Codex Pro (highest tier). Winners will be selected from comments in 48 hours, comment below why you want it.

OpenAI@OpenAI

Today, we closed our latest funding round with $122 billion in committed capital at an $852B post-money valuation. The fastest way to expand AI’s benefits is to put useful intelligence in people’s hands early and let access compound globally. This funding gives us resources to lead at scale. openai.com/index/accelera…

148

3.5K

580.9K

Sam Wolfstone@SamWolfstone·1 Apr

@0xSero Colour me intrigued.

0xSero@0xSero·1 Apr

Do you want to try Droid? I’m doing a giveaway. 3 people will win 100M Factory credits each. That’s 5 months of their $20 a month subscription. Winners selected randomly from comments in 48 hours.

1.1K

789

70.8K

Sam Wolfstone@SamWolfstone·31 Mar

@rohanvarma Overnight research/experimentation tasks!

Rohan Varma@rohanvarma·31 Mar

If we made /slow mode in Codex, would you use it? What for? (Slower inference at a cheaper cost)

953

2.2K

185.6K

Sam Wolfstone@SamWolfstone·31 Mar

@dfrsrchtwts Very cute, I must say

Daniel Filan@dfrsrchtwts·31 Mar

Some evaluations work we are getting up to at METR

3.1K

Sam Wolfstone retweeted

Adam Kranz@adam_kranz·31 Mar

Claude, in a world full of unknown unknowns: Good, now I have a complete picture

460

12.9K

Sam Wolfstone@SamWolfstone·30 Mar

@robbensinger @theaidocfilm I wish it was showing somewhere in central London :(

Rob Bensinger ⏹️@robbensinger·30 Mar

I've seen @theaidocfilm three times, I have never been this excited to rewatch a movie?? feed me your reactions if you've seen it

2.2K

Sam Wolfstone@SamWolfstone·30 Mar

@EntropyChase @catehall Also, happy to continue in DMs if you'd like, as tweets are a bit annoying for this type of convo :)

Sam Wolfstone@SamWolfstone·30 Mar

@EntropyChase @catehall of things in the world, and they'd still be able to 'understand' those things, even if those things were incorrect. Maybe you and I have a different definition of 'understanding', so maybe we're slightly talking past each other...

Cate Hall@catehall·30 Mar

“Stochastic parrot” is such a potent coinage — so fun to say! so conceptually efficient! — that it seems to have permanently colonized a lot of people’s minds despite not being true of today’s models. Genuinely a linguistic work of art.

748

94.9K
