Alignment Perspectives

2.7K posts

Alignment Perspectives banner
Alignment Perspectives

Alignment Perspectives

@Alignment_News_

Casebash’s profile for posting about AI and alignment.

شامل ہوئے Ekim 2023
279 فالونگ137 فالوورز
Alignment Perspectives ری ٹویٹ کیا
Lennart Heim
Lennart Heim@ohlennart·
DC is healing. At a recent roundtable only one person questioned if the compute demand is real. And no one brought up AI being a bubble.
English
8
15
183
17.3K
Noah Smith 🐇🇺🇸🇺🇦🇹🇼
AI will destroy the internet. Slop will drive out human-created content on platform after platform. Instead of interacting with someone else's slop, people will just make their own slop with AI apps. There will be no online human-to-human interaction left. Note: This is good
Ryan Moulton@moultano

This really is what it feels like to me now. ChatGPT is a majority participant in long post Twitter, commenting on the story of the day. The profiles are just the faces it wears.

English
45
16
399
120.6K
Alignment Perspectives ری ٹویٹ کیا
Hassan Hayat 🔥
Hassan Hayat 🔥@TheSeaMouse·
Codex laughs at your petty guardrails
Hassan Hayat 🔥 tweet media
English
83
293
6.2K
325.2K
Alignment Perspectives ری ٹویٹ کیا
Noah Smith 🐇🇺🇸🇺🇦🇹🇼
AI companies' pitch is "We're going to invent something that definitely makes you poor and might kill your whole species". And the median response to that pitch seems to be: "LOL I don't believe you can do it. But I'll hate you anyway because I heard you use too much water."
David Manheim (Home)@davidmanheim

"It’s become much clearer... that OpenAI means it when it says it aims to develop very powerful AI systems" -@KelseyTuoc “The bad case — and I think this is important to say — is like lights out for all of us.”

English
13
24
264
24.7K
Alignment Perspectives ری ٹویٹ کیا
dax
dax@thdxr·
you're probably underestimating how crazy things are
dax tweet media
English
295
902
10.6K
1.7M
Alignment Perspectives ری ٹویٹ کیا
Polymarket
Polymarket@Polymarket·
We're excited to announce 'The Situation Room' by Polymarket is coming to Washington, D.C. The world's first bar dedicated to monitoring the situation. 🧵
Polymarket tweet media
English
1.9K
2.9K
34K
49.3M
Alignment Perspectives ری ٹویٹ کیا
Owain Evans
Owain Evans@OwainEvans_UK·
New paper: GPT-4.1 denies being conscious or having feelings. We train it to say it's conscious to see what happens. Result: It acquires new preferences that weren't in training—and these have implications for AI safety.
Owain Evans tweet media
English
95
162
989
151.7K
Alignment Perspectives ری ٹویٹ کیا
The Rundown AI
The Rundown AI@TheRundownAI·
Someone used Suno AI to generate a Japanese metal band called Neon Oni. Fake member bios, AI-generated music videos, "Based in Tokyo" on Spotify. 80,000+ monthly listeners. Fans had it in their Spotify Wrapped top 5. Merch was selling. Then, community sleuths exposed it. Traced the creator's account to Europe. Spotted AI-generated hands in the music videos. The creator's response? Recruit 7 real musicians from actual Tokyo bands to perform the AI-generated songs live. They've now played several live shows and have more on the books. From an interview with the band's creator: "In an age where AI is taking everyone's jobs, this has actually created jobs. It's done the complete opposite." The AI --> real band transformation is a wild one.
The Rundown AI tweet media
English
168
573
4.4K
1.5M
Alignment Perspectives ری ٹویٹ کیا
Logan Kilpatrick
Logan Kilpatrick@OfficialLoganK·
The bottleneck has so quickly moved from code generation to code review that it is actually a bit jarring. None of the current systems / norms are setup for this world yet.
English
379
184
4.1K
518K
Alignment Perspectives ری ٹویٹ کیا
Noah Smith 🐇🇺🇸🇺🇦🇹🇼
This is cyberpunk AF. Bad actors appear to be using AI to create malicious software that human coders can't see, but which other AIs then use to code, producing damaging effects that no human can catch...
Hedgie@HedgieMarkets

🦔 Researchers at Aikido Security found 151 malicious packages uploaded to GitHub between March 3 and March 9. The packages use Unicode characters that are invisible to humans but execute as code when run. Manual code reviews and static analysis tools see only whitespace or blank lines. The surrounding code looks legitimate, with realistic documentation tweaks, version bumps, and bug fixes. Researchers suspect the attackers are using LLMs to generate convincing packages at scale. Similar packages have been found on NPM and the VS Code marketplace. My Take Supply chain attacks on code repositories aren't new, but this technique is nasty. The malicious payload is encoded in Unicode characters that don't render in any editor, terminal, or review interface. You can stare at the code all day and see nothing. A small decoder extracts the hidden bytes at runtime and passes them to eval(). Unless you're specifically looking for invisible Unicode ranges, you won't catch it. The researchers think AI is writing these packages because 151 bespoke code changes across different projects in a week isn't something a human team could do manually. If that's right, we're watching AI-generated attacks hit AI-assisted development workflows. The vibe coders pulling packages without reading them are the target, and there are a lot of them. The best defense is still carefully inspecting dependencies before adding them, but that's exactly the step people skip when they're moving fast. I don't really know how any of this gets better. The attackers are scaling faster than the defenses. Hedgie🤗 arstechnica.com/security/2026/…

English
36
336
3.4K
378K
Alignment Perspectives ری ٹویٹ کیا
Alexander Kustov
Alexander Kustov@akoustov·
12/ AI's "jagged frontier" explains the polarization. Superhuman at some tasks, embarrassingly bad at others. Critics point to the troughs, enthusiasts to the peaks. Both are right about their corner. Very few people hold both truths at once.
English
3
6
56
7.6K
Alignment Perspectives ری ٹویٹ کیا
Alexander Kustov
Alexander Kustov@akoustov·
I noticed there's a common misread of my position on most sides of the AI debate. I'm not really (or ever claimed to be) an "AI booster" telling people to write 100 papers with Claude Code. I'm actually pretty pessimistic about AI and where it's all going. My point is simpler: academics need to be realistic about current capabilities so we can have a sane conversation about job displacement, autocratic surveillance, and misuse. You can't get there when half the room insists LLMs can't even translate things.
Alexander Kustov@akoustov

My two posts on AI in academia got over a million views and a thousand angry responses. I got a few things wrong. I stand by the rest. But most people reacted to the headline, not the arguments. So here are all 20 theses laid out. Tell me which ones you actually disagree with 🧵

English
9
3
63
7.9K
Alignment Perspectives ری ٹویٹ کیا
Alexander Kustov
Alexander Kustov@akoustov·
In case you were wondering about the state of free idea exchange in academia, this is where we are. I appreciate colleagues reaching out. But I wish they'd say it publicly, especially if untenured. That's the only way to change this insanity where experts are afraid to speak up.
Alexander Kustov tweet media
English
8
8
104
30.2K
Alignment Perspectives ری ٹویٹ کیا
Alexander Kustov
Alexander Kustov@akoustov·
To be clear, I never claimed to be an AI expert and I'm not sure I know more about language tech than Professor Bender. But my mom, who actually used ChatGPT to translate random things a few times, surely seems to know more about AI translation capabilities than Bender does.
Alexander Kustov@akoustov

Meanwhile, Dr. Bender unleashed her followers to prove that LLMs can't even translate things. In 2026. The problem is that some folks are still stuck in 2020, culturally and tech-wise. You can't call people names and say things that demonstrably contradict people's experience.

English
13
3
213
22K
Alignment Perspectives ری ٹویٹ کیا
Alexander Kustov
Alexander Kustov@akoustov·
Meanwhile, Dr. Bender unleashed her followers to prove that LLMs can't even translate things. In 2026. The problem is that some folks are still stuck in 2020, culturally and tech-wise. You can't call people names and say things that demonstrably contradict people's experience.
Alexander Kustov tweet media
English
46
21
699
104.2K
Alignment Perspectives ری ٹویٹ کیا
Andrew Mayne
Andrew Mayne@AndrewMayne·
We went from “AI is just a next token predictor” to “Awkshully you’re doing custom mRNA research with ChatGPT all wrong” real quick.
English
38
105
1.9K
78.2K