Not Me

1.8K posts

Not Me

@davidschlangen

This is just a placeholder. For updates from me, please find me at social media sites that are not swamped by troglodytes and owned by enemies of humanity.

@[email protected] Katılım Şubat 2013

236 Takip Edilen1.4K Takipçiler

Sabitlenmiş Tweet

Not Me@davidschlangen·18 Kas

It was fun while it lasted, etc etc. If you’re still here, don’t wait any longer and come over to … that other thing. It really isn’t that hard. If you identify as an “AI person”, sigmoid.social is probably the server for you. I’m at davidschlangen@scholar.social

English

Not Me@davidschlangen·8 Eki

Bob Dylan getting the Nobel Prize in Literature really paved the way for Geoff Hinton getting the Nobel Prize in Physics.

English

417

Not Me@davidschlangen·26 Eyl

I see another area is getting the treatment. (I was there for dialogue systems being discovered (in the way America was discovered).)

Matthew Guzdial@MatthewGuz

It's funny to watch the stream of Google research/Deepmind papers that say they want to do automated game generation with 0 citations to anything in the area's 30+ year history.

English

573

Not Me@davidschlangen·26 Eyl

Meta following Apple’s playbook?

English

279

Not Me@davidschlangen·26 Eyl

@techyalzay Yeah, we briefly looked into this and didn’t find an obvious easy source in the code of the page. (And the HELM webpage is… peculiar.) And it somehow feels wrong to have to scrape this data, which the producers should only have an interest in making accessible.

English

Not Me@davidschlangen·25 Eyl

Does anyone know how to get the HELM and Arena rankings in a machine readable format (and ideally, programmatically)? #lazyweb #LLMs

English

290

Not Me retweetledi

Kranti@krantich_·20 Eyl

🥳This work has been accepted to #EMNLP 2024 Findings. Thanks to my co-authors @SherzodHakimov, @davidschlangen paper link: arxiv.org/abs/2406.17553

English

358

Not Me@davidschlangen·20 Eyl

@roman_klinger I’m old school, for me it’s only real when the notification letter arrives by post … erm, the email arrives. (Which now appears to be the case.) We did have a case recently however where something briefly was visible on OpenReview, and then the final decision was different.

English

100

Not Me@davidschlangen·20 Eyl

OpenReview hiding the EMNLP decisions until notifications have been sent out.

English

3.1K

Not Me retweetledi

SIGdial@sigdial·18 Eyl

The SIGdial 2024 proceedings can now be found on the ACL Anthology 🎉 So many fantastic papers: aclanthology.org/volumes/2024.s… And if you are sharing your papers here, make sure to tag us @sigdial so we can repost it too! #SIGdial #SIGdial2024 @aclanthology

English

1.6K

Not Me@davidschlangen·15 Eyl

@wdavidmarx That’s hilarious. I don’t know what it means in Beck’s American context, but in Germany a reference to Heino in that situation would have been signalling an “I’m too cool to be embarrassed” attitude, because it could quite likely be true.

English

W. David Marx@wdavidmarx·15 Eyl

But oops: "Haino" is actually Heino, the German schlager singer. Still a very IYKYK answer! I'm hoping to fix this in newer editions of the book, but I think the point still stands that Beck dropped outré references to hint that he was a musical prodigy not an uneducated fluke

English

1.6K

W. David Marx@wdavidmarx·15 Eyl

I want to acknowledge an error in "Status and Culture" around the Beck anecdote that starts Chapter Three. As a kid, I watched this crazy Beck/Thurston Moore on 120 Minutes, at a time when DGC positioned Beck as a formerly homeless leaf blower savant. youtube.com/watch?v=zdzY49…

YouTube

English

3.8K

Not Me@davidschlangen·13 Eyl

Stay tuned for the full run. In the meantime, you can check out the clembench leaderboard here: clembench.github.io

English

190

Not Me@davidschlangen·13 Eyl

We still have to run the whole benchmark, mind you. This is slow and eye-wateringly expensive 🥹. (Actually, expensive & slow enough for there to be humans on the other side. 😅 )

English

211

Not Me@davidschlangen·13 Eyl

Ok, whatever it is that @OpenAI has done to o1, it has payed off. At least on wordle, which used to be one of the hardest parts of our “conversational agency” benchmark. 4o: 23 (previous best) o1: 75.33 (Human expert players: 72)

English

837

Not Me retweetledi

Sherzod Hakimov@SherzodHakimov·12 Eyl

"Reflection-Llama-3.1-70B" got first attention then frustration regarding the validity of the results. We benchmarked it with clembench and compared against stock model: Reflection-Llama-3.1-70B - 17/100 Meta-Llama-3.1-70B-Instruct - 39/100 It got worse.

GIF

Matt Shumer@mattshumer_

I'm excited to announce Reflection 70B, the world’s top open-source model. Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes. 405B coming next week - we expect it to be the best model in the world. Built w/ @GlaiveAI. Read on ⬇️:

English

353

Not Me@davidschlangen·18 Ağu

Observation was that neither linguistics nor NLP/AI cared too much about CL, leaving it free to reinvent itself. Slides: clp.ling.uni-potsdam.de/assets/docs/st… Video (if you really must): mediaup.uni-potsdam.de/Player/4BDFGgaE

English

182

Not Me@davidschlangen·18 Ağu

Re: "ACL is (not) an AI conf.", was reminded that I did some similar soul searching some years ago. But a) openly prescriptive, b) coming to conclusion that domain to be claimed could be "linguistic intelligence".

English

314

Not Me@davidschlangen·14 Ağu

I guess “ACL is, or at Least Ought to be, Not Just an AI Conference” would have required a font that is too small. #ACL2024NLP

English

644

Not Me@davidschlangen·14 Ağu

@yoavgo Had the same impression @ EMNLP last year. My ad hoc expl was the demographic pyramid in a rapidly growing field — fewer senior people, who also travel less than they used to (inconvenience, guilt abt spent CO2 budget); lots of younger people who hv 2 go & don’t know <1k ppl cnfs

English

1.2K

(((ل()(ل() 'yoav))))👾@yoavgo·14 Ağu

an observation about ACL 2024: there are surprisingly few people around my age or older here. now this may also reflect on travel preferences of people in Europe and North America and Bangkok being far away. But it also reflects on the decline of ACL as the main, central venue.

English

12.9K

Not Me@davidschlangen·14 Ağu

@rulimanurung Flooding predatory conferences with bogus work would of course be a sensible use case. But the results will just be that ARR is flooded with more bogus papers.

English

Ruli Manurung@rulimanurung·14 Ağu

@davidschlangen TBF, it addresses a lot of the shortcomings of pdos.csail.mit.edu/archive/scigen😂 Personally I think it's a really neat project, but seeing the speed at which techbros explode with excitement over this is just so amusing.

English

152

Not Me@davidschlangen·14 Ağu

You have to admire the dedication to the bit. They even went ahead and created a website and actual fake papers. Just to make a satirical point about what AI as a research field has become.

Sakana AI@SakanaAILabs

Introducing The AI Scientist: The world’s first AI system for automating scientific research and open-ended discovery! sakana.ai/ai-scientist/ From ideation, writing code, running experiments and summarizing results, to writing entire papers and conducting peer-review, The AI Scientist opens a new era of AI-driven scientific research and accelerated discovery. Here are 4 example Machine Learning research papers generated by The AI Scientist. We published our report, The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, and open-sourced our project! Paper: arxiv.org/abs/2408.06292 GitHub: github.com/SakanaAI/AI-Sc… Our system leverages LLMs to propose and implement new research directions. Here, we first apply The AI Scientist to conduct Machine Learning research. Crucially, our system is capable of executing the entire ML research lifecycle: from inventing research ideas and experiments, writing code, to executing experiments on GPUs and gathering results. It can also write an entire scientific paper, explaining, visualizing and contextualizing the results. Furthermore, while an LLM author writes entire research papers, another LLM reviewer critiques resulting manuscripts to provide feedback to improve the work, and also to select the most promising ideas to further develop in the next iteration cycle, leading to continual, open-ended discoveries, thus emulating the human scientific community. As a proof of concept, our system produced papers with novel contributions in ML research domains such language modeling, Diffusion and Grokking. We (@_chris_lu_, @RobertTLange, @hardmaru) proudly collaborated with the @UniOfOxford (@j_foerst, @FLAIR_Ox) and @UBC (@cong_ml, @jeffclune) on this exciting project.

English

885

Keşfet

@techyalzay @SherzodHakimov @sigdial @aclanthology @wdavidmarx @OpenAI @yoavgo @elonmusk