Noema

773 posts

@noemaclips

Nostalgic for the future. A lantern into the latent space ✧

The Long Now · Joined December 2015

1.8K Following · 189 Followers

Noema@noemaclips·13h

@gabepereyra @deredleritt3r you might be interested in monitoring this space!

Gabe Pereyra@gabepereyra·1d

x.com/i/article/2051…

Noema@noemaclips·1d

@stalkermustang @Bayesian0_0 The return of the king

Igor Kotenkov@stalkermustang·1d

@Bayesian0_0 I'm beginning to believe

Bayesian@Bayesian0_0·1d

Saw this one coming last month!

Noema@noemaclips·2d

@KLieret @jyangballin @18jeffreyma @parth007_96 @dpedch @sten_sootla @micmylin @pengchengyin @magpie_rayhou @syhw @Diyi_Yang @OfirPress @sootla_sten Made my day! Keep up the good work, you legends :)

Kilian Lieret@KLieret·2d

@noemaclips @jyangballin @18jeffreyma @parth007_96 @dpedch @sten_sootla @micmylin @pengchengyin @magpie_rayhou @syhw @Diyi_Yang @OfirPress @sootla_sten We're definitely planning to also update CodeClash again! @jyangballin has also recently been working on making it easier & cheaper to evaluate

John Yang@jyangballin·2d

How much of SQLite, FFmpeg, PHP compiler can LMs code from scratch? Given just an executable and no starter code or internet access. Introducing ProgramBench: 200 rigorous, whole-repo generation tasks where models design, build, and ship a working program end to end. 🧵

Noema@noemaclips·2d

@jyangballin @KLieret @18jeffreyma @parth007_96 @dpedch @sten_sootla @micmylin @pengchengyin @magpie_rayhou @syhw @Diyi_Yang @OfirPress @sootla_sten I hope you get enough support to keep the leaderboard updated from time to time, unlike Codeclash, which is one of my favs :(

John Yang@jyangballin·2d

ProgramBench is a joint effort across Meta FAIR, Meta TBD, Stanford, Harvard @KLieret (co-first author) @18jeffreyma @parth007_96 @dpedch @sten_sootla @micmylin @pengchengyin @magpie_rayhou @syhw @Diyi_Yang @OfirPress Paper: programbench.com/static/paper.p…

Noema@noemaclips·2d

@jyangballin @KLieret Wake up, another banger eval by John Yang just dropped @stalkermustang @scaling01 @xeophon @spicey_lemonade

Noema@noemaclips·2d

@akorinek @spicey_lemonade @emollick

Anton Korinek@akorinek·2d

1/🆕 New NBER paper: 𝗪𝗵𝗲𝗻 𝗗𝗼𝗲𝘀 𝗔𝘂𝘁𝗼𝗺𝗮𝘁𝗶𝗻𝗴 𝗔𝗜 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗣𝗿𝗼𝗱𝘂𝗰𝗲 𝗘𝘅𝗽𝗹𝗼𝘀𝗶𝘃𝗲 𝗚𝗿𝗼𝘄𝘁𝗵? Under empirically grounded calibrations, a singularity could arrive within just a few years of automating AI research. 🧵 📄 nber.org/papers/w35155

Noema@noemaclips·2d

@Delachica_ @runwayml Awesome!! Did you use AI for the audio as well? If so, which one?

Contanimation@Cont_animation·3d

Sincitium is finally here. We are pleased to present our latest piece: a concept trailer created specifically for the @runwayml Big Pitch Contest. For this project, we wanted to explore a completely different aesthetic from our usual studio style, and this film is the result of that experimentation. We hope you enjoy it as much as we enjoyed the creative process.
Produced by: Contanimation
Directed by: Javier De La Chica and Guillermo Miranda
Art Direction: Javier De La Chica
Editing: Guillermo Miranda
Voices: Juan Rabadán
#runwaybigpitchcontest

Noema@noemaclips·3d

@j_dekoninck Amazing. Are you planning to develop your own aggregated index? I'd love something similar to Epoch's ECI or the Artificial Analysis Index but for Matharena. Keep up the good work!!🫶

Jasper Dekoninck@j_dekoninck·3d

Introducing a new paper! Beyond Benchmarks: MathArena as an Evaluation Platform for Mathematics with LLMs Static benchmarks are no longer enough. Models improve too quickly and numbers become stale quickly. Instead, we argue for continuously maintained evaluation platforms.

Noema@noemaclips·3d

@Miles_Brundage @thsottiaux happening to me too! (In Windows)

Miles Brundage@Miles_Brundage·3d

Common Codex (app) bug - you close the window but the app doesn't close, and then when you try to open up a new window, it doesn't work, and you have to reset the app

Noema@noemaclips·Apr 24

@stalkermustang @Ahmad_Al_Dahle Savage

Igor Kotenkov@stalkermustang·Apr 24

@Ahmad_Al_Dahle Curious to learn why you didn't make this bet @ Meta though 👀

Ahmad Al-Dahle@Ahmad_Al_Dahle·Apr 24

The most interesting thing about DeepSeek-V4 isn't the benchmarks, it's the bet: efficient ultra-long context is the precondition for test-time scaling and long-horizon agents. 27% of V3's FLOPs at 1M tokens! The rest flows from this ...

DeepSeek@deepseek_ai

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at chat.deepseek.com via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: huggingface.co/deepseek-ai/De…
🤗 Open Weights: huggingface.co/collections/de…
1/n

Noema@noemaclips·Apr 22

@DotCSV OAI has already said it's a bug and that they're going to fix it: x.com/BoyuanChen0/st…

Boyuan Chen@BoyuanChen0

@SEO @kenjihata @nicdunz @gabeeegoooh @SecrtAgntSquirl @ayaanzhaque @dibyayB @jianfw @kiwhansong0 @liang_weixin @Marco_B_Liang @mengchaozzz @yuguang_yang Will fix!

Carlos Santana@DotCSV·Apr 22

Here it's super obvious! Look at the water and the foam x.com/i/status/20469…

🍔 Campero De Pollo 🍔@campero_depollo

@DotCSV Totally blatant in the water 🌊

Carlos Santana@DotCSV·Apr 22

Have you already spotted what the "yellow tint" of the new version of ChatGPT images is? I'm sure of it 🤗 Here's a visual test, but be careful, because once you see it you won't stop noticing it.

Noema@noemaclips·Apr 22

@DotCSV Looks like it's a bug! If you run it from the API it never happens (as far as I've noticed)

Noema@noemaclips·Apr 21

@Angaisb_ Imagegen skill?

Angel 🌼@Angaisb_·Apr 21

I also made this game using GPT Image 2 and Codex (GPT-5.4) We can now build so much stuff thanks to GPT Image 2

Noema@noemaclips·Apr 21

@AcerFur 2 models though, maybe the nerfed one is the instant one? Hyped to see how well it scores on IRGB!

Acer@AcerFur·Apr 21

GPT-image-2 reasons during image generation. Now you know why I made IRGB ;)

Noema@noemaclips·Apr 21

@fofrAI Fofr dubstep room, smoke guns triggering on a big drop, crushed monstera leaves strong aroma fills the air

fofr@fofrAI·Apr 21

Soon after we’ll have generative ai for aromas, and I’ll be out here prompting stuff like “the smell of that old book, but with a hint of jasmine” and “newly opened apple product with freshly cut grass”

Pirat_Nation 🔴@Pirat_Nation

Researchers have created a new device that uses ultrasound to trigger smells directly in the brain for VR The small device rests on the forehead with a soft gel pad. It sends gentle sound waves through the skull to the area of the brain that processes smell. Current VR smell systems use cartridges that need constant refills and can make a mess. This new method is cleaner and could one day let people smell virtual worlds like forests or oceans naturally. The project is still in early prototype stage from a small research team. Safety levels were kept low, but more testing is needed.

Noema@noemaclips·Apr 20

@spicey_lemonade That really looks like images v1.5, are you sure it's the new one?

spicylemonade@spicey_lemonade·Apr 20

GPT Image 2 failure point: I tested whether the model could reconstruct a figure from a physics paper using only the text description, and it generated a completely unrelated image. It seems like it can’t handle long context. This reinforces the idea that it’s not part of an Omni 5.5 model, but rather a separate image generation model that the system routes to

Noema@noemaclips·Apr 15

@AcerFur On top of Prism or a totally different tool?

Acer@AcerFur·Apr 15

I can’t wait to work on some things in the summer. Hopefully we can bring a *very* good tool for mathematicians

Noema@noemaclips·Apr 15

@AcerFur Had to try <3 Excited to see you pushing the limits on your stay at OAI bro! 🫶

Acer@AcerFur·Apr 15

@noemaclips Hah I’m not gonna answer that one

Acer@AcerFur·Apr 15

It is funny to see how much AI bros are like eager to see what mathematicians say about models nowadays lol I don’t think society has ever cared so much about what mathematicians have to say

Noema@noemaclips·Apr 15

@AcerFur For a .1 jump or for a new pre-training?

Acer@AcerFur·Apr 15

@noemaclips The models have done pretty much exactly as well as I expected
