Not Me

1.8K posts

@davidschlangen

This is just a placeholder. For updates from me, please find me at social media sites that are not swamped by troglodytes and owned by enemies of humanity.

@[email protected] Joined February 2013
236 Following · 1.4K Followers
Pinned Tweet
Not Me
Not Me@davidschlangen·
It was fun while it lasted, etc etc. If you’re still here, don’t wait any longer and come over to … that other thing. It really isn’t that hard. If you identify as an “AI person”, sigmoid.social is probably the server for you. I’m at davidschlangen@scholar.social
0
1
2
0
Not Me
Not Me@davidschlangen·
Bob Dylan getting the Nobel Prize in Literature really paved the way for Geoff Hinton getting the Nobel Prize in Physics.
0
0
3
417
Not Me
Not Me@davidschlangen·
Meta following Apple’s playbook?
Not Me tweet media
0
0
0
279
Not Me
Not Me@davidschlangen·
@techyalzay Yeah, we briefly looked into this and didn’t find an obvious easy source in the code of the page. (And the HELM webpage is… peculiar.) And it somehow feels wrong to have to scrape this data, which the producers should only have an interest in making accessible.
0
0
0
38
Not Me
Not Me@davidschlangen·
Does anyone know how to get the HELM and Arena rankings in a machine readable format (and ideally, programmatically)? #lazyweb #LLMs
0
0
0
290
Not Me
Not Me@davidschlangen·
@roman_klinger I’m old school, for me it’s only real when the notification letter arrives by post … erm, the email arrives. (Which now appears to be the case.) We did have a case recently however where something briefly was visible on OpenReview, and then the final decision was different.
0
0
1
100
Not Me
Not Me@davidschlangen·
OpenReview hiding the EMNLP decisions until notifications have been sent out.
Not Me tweet media
1
0
38
3.1K
Not Me
Not Me@davidschlangen·
@wdavidmarx That’s hilarious. I don’t know what it means in Beck’s American context, but in Germany a reference to Heino in that situation would have been signalling an “I’m too cool to be embarrassed” attitude, because it could quite likely be true.
1
0
1
69
W. David Marx
W. David Marx@wdavidmarx·
But oops: "Haino" is actually Heino, the German schlager singer. Still a very IYKYK answer! I'm hoping to fix this in newer editions of the book, but I think the point still stands that Beck dropped outré references to hint that he was a musical prodigy not an uneducated fluke
W. David Marx tweet mediaW. David Marx tweet media
3
0
26
1.6K
W. David Marx
W. David Marx@wdavidmarx·
I want to acknowledge an error in "Status and Culture" around the Beck anecdote that starts Chapter Three. As a kid, I watched this crazy Beck/Thurston Moore on 120 Minutes, at a time when DGC positioned Beck as a formerly homeless leaf blower savant. youtube.com/watch?v=zdzY49…
2
3
24
3.8K
Not Me
Not Me@davidschlangen·
Stay tuned for the full run. In the meantime, you can check out the clembench leaderboard here: clembench.github.io
0
0
0
190
Not Me
Not Me@davidschlangen·
We still have to run the whole benchmark, mind you. This is slow and eye-wateringly expensive 🥹. (Actually, expensive & slow enough for there to be humans on the other side. 😅 )
1
0
0
211
Not Me
Not Me@davidschlangen·
Ok, whatever it is that @OpenAI has done to o1, it has paid off. At least on wordle, which used to be one of the hardest parts of our “conversational agency” benchmark. 4o: 23 (previous best) o1: 75.33 (Human expert players: 72)
Not Me tweet media
1
1
5
837
Not Me retweeted
Sherzod Hakimov
Sherzod Hakimov@SherzodHakimov·
"Reflection-Llama-3.1-70B" first got attention, then frustration over the validity of its results. We benchmarked it with clembench and compared it against the stock model: Reflection-Llama-3.1-70B - 17/100 Meta-Llama-3.1-70B-Instruct - 39/100 It got worse.
GIF
Matt Shumer@mattshumer_

I'm excited to announce Reflection 70B, the world’s top open-source model. Trained using Reflection-Tuning, a technique developed to enable LLMs to fix their own mistakes. 405B coming next week - we expect it to be the best model in the world. Built w/ @GlaiveAI. Read on ⬇️:

0
1
1
353
Not Me
Not Me@davidschlangen·
Re: "ACL is (not) an AI conf.", was reminded that I did some similar soul searching some years ago. But a) openly prescriptive, b) coming to conclusion that domain to be claimed could be "linguistic intelligence".
Not Me tweet media
1
0
2
314
Not Me
Not Me@davidschlangen·
I guess “ACL is, or at Least Ought to be, Not Just an AI Conference” would have required a font that is too small. #ACL2024NLP
0
0
1
644
Not Me
Not Me@davidschlangen·
@yoavgo Had the same impression @ EMNLP last year. My ad hoc expl was the demographic pyramid in a rapidly growing field — fewer senior people, who also travel less than they used to (inconvenience, guilt abt spent CO2 budget); lots of younger people who hv 2 go & don’t know <1k ppl cnfs
0
0
3
1.2K
(((ل()(ل() 'yoav))))👾
an observation about ACL 2024: there are surprisingly few people around my age or older here. now this may also reflect on travel preferences of people in Europe and North America and Bangkok being far away. But it also reflects on the decline of ACL as the main, central venue.
10
1
66
12.9K
Not Me
Not Me@davidschlangen·
@rulimanurung Flooding predatory conferences with bogus work would of course be a sensible use case. But the results will just be that ARR is flooded with more bogus papers.
0
0
2
47
Not Me
Not Me@davidschlangen·
You have to admire the dedication to the bit. They even went ahead and created a website and actual fake papers. Just to make a satirical point about what AI as a research field has become.
Sakana AI@SakanaAILabs

Introducing The AI Scientist: The world’s first AI system for automating scientific research and open-ended discovery! sakana.ai/ai-scientist/ From ideation, writing code, running experiments and summarizing results, to writing entire papers and conducting peer-review, The AI Scientist opens a new era of AI-driven scientific research and accelerated discovery. Here are 4 example Machine Learning research papers generated by The AI Scientist. We published our report, The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, and open-sourced our project! Paper: arxiv.org/abs/2408.06292 GitHub: github.com/SakanaAI/AI-Sc… Our system leverages LLMs to propose and implement new research directions. Here, we first apply The AI Scientist to conduct Machine Learning research. Crucially, our system is capable of executing the entire ML research lifecycle: from inventing research ideas and experiments, writing code, to executing experiments on GPUs and gathering results. It can also write an entire scientific paper, explaining, visualizing and contextualizing the results. Furthermore, while an LLM author writes entire research papers, another LLM reviewer critiques resulting manuscripts to provide feedback to improve the work, and also to select the most promising ideas to further develop in the next iteration cycle, leading to continual, open-ended discoveries, thus emulating the human scientific community. As a proof of concept, our system produced papers with novel contributions in ML research domains such as language modeling, Diffusion and Grokking. We (@_chris_lu_, @RobertTLange, @hardmaru) proudly collaborated with the @UniOfOxford (@j_foerst, @FLAIR_Ox) and @UBC (@cong_ml, @jeffclune) on this exciting project.

2
0
6
885