Alignment Perspectives

2.7K posts

Alignment Perspectives

@Alignment_News_

Casebash’s profile for posting about AI and alignment.

เข้าร่วม Ekim 2023

279 กำลังติดตาม137 ผู้ติดตาม

Alignment Perspectives รีทวีตแล้ว

Lennart Heim@ohlennart·1d

DC is healing. At a recent roundtable only one person questioned if the compute demand is real. And no one brought up AI being a bubble.

English

183

17.4K

Alignment Perspectives@Alignment_News_·1d

@Noahpinion "This is good" - I disagree.

English

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·2d

AI will destroy the internet. Slop will drive out human-created content on platform after platform. Instead of interacting with someone else's slop, people will just make their own slop with AI apps. There will be no online human-to-human interaction left. Note: This is good

Ryan Moulton@moultano

This really is what it feels like to me now. ChatGPT is a majority participant in long post Twitter, commenting on the story of the day. The profiles are just the faces it wears.

English

401

121.7K

Alignment Perspectives รีทวีตแล้ว

Hassan Hayat 🔥@TheSeaMouse·2d

Codex laughs at your petty guardrails

English

295

6.2K

326.7K

Alignment Perspectives รีทวีตแล้ว

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·2d

AI companies' pitch is "We're going to invent something that definitely makes you poor and might kill your whole species". And the median response to that pitch seems to be: "LOL I don't believe you can do it. But I'll hate you anyway because I heard you use too much water."

David Manheim (Home)@davidmanheim

"It’s become much clearer... that OpenAI means it when it says it aims to develop very powerful AI systems" -@KelseyTuoc “The bad case — and I think this is important to say — is like lights out for all of us.”

English

264

24.7K

Alignment Perspectives รีทวีตแล้ว

dax@thdxr·2d

you're probably underestimating how crazy things are

English

295

902

10.6K

1.7M

Alignment Perspectives รีทวีตแล้ว

Polymarket@Polymarket·6d

We're excited to announce 'The Situation Room' by Polymarket is coming to Washington, D.C. The world's first bar dedicated to monitoring the situation. 🧵

English

1.9K

2.9K

34.1K

49.3M

Alignment Perspectives@Alignment_News_·4d

@gcolbourn @CAIS An article would be one thing. Taking potshots on Twitter is undignified.

English

Greg ⏹️ Colbourn@gcolbourn·5d

@Alignment_News_ @CAIS Good to see CAIS telling it as it is. EA has been corrupted by Anthropic shareholding for a long time now.

English

Center for AI Safety@CAIS·6d

To clarify, the Center for AI Safety has not taken funding from Coefficient Giving / Open Philanthropy for years. We believe the effective altruism movement is, unfortunately, controlled opposition. The less influence it has on AI safety, the better.

Dan Hendrycks@hendrycks

EA ≠ AI safety AI safety has outgrown the EA community The world will be safer with a broad range of people tackling many different AI risks

English

163

121.8K

Alignment Perspectives รีทวีตแล้ว

Owain Evans@OwainEvans_UK·6d

New paper: GPT-4.1 denies being conscious or having feelings. We train it to say it's conscious to see what happens. Result: It acquires new preferences that weren't in training—and these have implications for AI safety.

English

162

989

151.8K

Alignment Perspectives รีทวีตแล้ว

The Rundown AI@TheRundownAI·16 Mar

Someone used Suno AI to generate a Japanese metal band called Neon Oni. Fake member bios, AI-generated music videos, "Based in Tokyo" on Spotify. 80,000+ monthly listeners. Fans had it in their Spotify Wrapped top 5. Merch was selling. Then, community sleuths exposed it. Traced the creator's account to Europe. Spotted AI-generated hands in the music videos. The creator's response? Recruit 7 real musicians from actual Tokyo bands to perform the AI-generated songs live. They've now played several live shows and have more on the books. From an interview with the band's creator: "In an age where AI is taking everyone's jobs, this has actually created jobs. It's done the complete opposite." The AI --> real band transformation is a wild one.

English

168

573

4.4K

1.5M

Alignment Perspectives รีทวีตแล้ว

Logan Kilpatrick@OfficialLoganK·17 Mar

The bottleneck has so quickly moved from code generation to code review that it is actually a bit jarring. None of the current systems / norms are setup for this world yet.

English

379

184

4.1K

518.1K

Alignment Perspectives รีทวีตแล้ว

Noah Smith 🐇🇺🇸🇺🇦🇹🇼@Noahpinion·17 Mar

This is cyberpunk AF. Bad actors appear to be using AI to create malicious software that human coders can't see, but which other AIs then use to code, producing damaging effects that no human can catch...

Hedgie@HedgieMarkets

🦔 Researchers at Aikido Security found 151 malicious packages uploaded to GitHub between March 3 and March 9. The packages use Unicode characters that are invisible to humans but execute as code when run. Manual code reviews and static analysis tools see only whitespace or blank lines. The surrounding code looks legitimate, with realistic documentation tweaks, version bumps, and bug fixes. Researchers suspect the attackers are using LLMs to generate convincing packages at scale. Similar packages have been found on NPM and the VS Code marketplace. My Take Supply chain attacks on code repositories aren't new, but this technique is nasty. The malicious payload is encoded in Unicode characters that don't render in any editor, terminal, or review interface. You can stare at the code all day and see nothing. A small decoder extracts the hidden bytes at runtime and passes them to eval(). Unless you're specifically looking for invisible Unicode ranges, you won't catch it. The researchers think AI is writing these packages because 151 bespoke code changes across different projects in a week isn't something a human team could do manually. If that's right, we're watching AI-generated attacks hit AI-assisted development workflows. The vibe coders pulling packages without reading them are the target, and there are a lot of them. The best defense is still carefully inspecting dependencies before adding them, but that's exactly the step people skip when they're moving fast. I don't really know how any of this gets better. The attackers are scaling faster than the defenses. Hedgie🤗 arstechnica.com/security/2026/…

English

336

3.4K

378K

Alignment Perspectives รีทวีตแล้ว

Zvi Mowshowitz@TheZvi·16 Mar

AI getting even more popular today, I see.

Garett Jones@GarettJones

English

12.1K

Alignment Perspectives รีทวีตแล้ว

Alexander Kustov@akoustov·5 Mar

12/ AI's "jagged frontier" explains the polarization. Superhuman at some tasks, embarrassingly bad at others. Critics point to the troughs, enthusiasts to the peaks. Both are right about their corner. Very few people hold both truths at once.

English

7.6K

Alignment Perspectives รีทวีตแล้ว

Alexander Kustov@akoustov·13 Mar

I noticed there's a common misread of my position on most sides of the AI debate. I'm not really (or ever claimed to be) an "AI booster" telling people to write 100 papers with Claude Code. I'm actually pretty pessimistic about AI and where it's all going. My point is simpler: academics need to be realistic about current capabilities so we can have a sane conversation about job displacement, autocratic surveillance, and misuse. You can't get there when half the room insists LLMs can't even translate things.

Alexander Kustov@akoustov

My two posts on AI in academia got over a million views and a thousand angry responses. I got a few things wrong. I stand by the rest. But most people reacted to the headline, not the arguments. So here are all 20 theses laid out. Tell me which ones you actually disagree with 🧵

English

7.9K

Alignment Perspectives รีทวีตแล้ว

Alexander Kustov@akoustov·10 Mar

In case you were wondering about the state of free idea exchange in academia, this is where we are. I appreciate colleagues reaching out. But I wish they'd say it publicly, especially if untenured. That's the only way to change this insanity where experts are afraid to speak up.

English

104

30.2K

Alignment Perspectives รีทวีตแล้ว

Alexander Kustov@akoustov·11 Mar

To be clear, I never claimed to be an AI expert and I'm not sure I know more about language tech than Professor Bender. But my mom, who actually used ChatGPT to translate random things a few times, surely seems to know more about AI translation capabilities than Bender does.

Alexander Kustov@akoustov

Meanwhile, Dr. Bender unleashed her followers to prove that LLMs can't even translate things. In 2026. The problem is that some folks are still stuck in 2020, culturally and tech-wise. You can't call people names and say things that demonstrably contradict people's experience.

English

213

22K

Alignment Perspectives รีทวีตแล้ว

Alexander Kustov@akoustov·11 Mar

English

699

104.2K

Alignment Perspectives@Alignment_News_·17 Mar

Maybe I just having been playing games for long enough that my baseline is completely off, but these look great to me and it'll only get better!

NikTek@NikTek

I can't believe that Nvidia looked at this "AI on top of games filter" and said to themselves this is the future of gaming. Like it or not, this is where Nvidia is heading and they're calling it neural rendering with DLSS 5. The examples they've showed reminded me a lot of those AI generated filter videos on top of GTA V, except that now it is supposed to run in real-time. Honestly, I don't like this current look at all

English

Alignment Perspectives รีทวีตแล้ว

Andrew Mayne@AndrewMayne·15 Mar

We went from “AI is just a next token predictor” to “Awkshully you’re doing custom mRNA research with ChatGPT all wrong” real quick.

English

105

1.9K

78.2K

ค้นพบ

@Noahpinion @gcolbourn @CAIS @elonmusk @BarackObama @taylorswift13 @cristiano @BillGates