Ruth Baader-Meinhof
737 posts

Ruth Baader-Meinhof
@mattbenyo
AI at Jamf
Minneapolis, MN Katılım Ağustos 2007
932 Takip Edilen382 Takipçiler

@benoitc You don't have to trust me blindly. Just read my tweets (or ban me, unfollow me, what you like) and obtain from it a signal that you may ignore, or partially account for, and so forth. But to ask for verification of statements on social networks is the wrong move.
English

Opus 4.8 is a total disaster. The problem is not the model per-se, they have Mythos and can anyway train a better model. The problem is: what is it happening inside Anthropic, at the management level? Since this is a product failure. If there was a technological issue NOT delivering is better.
English

@lanaberryy @alpohlovebot “I wish my movie would make a billion dollars”
English

@alpohlovebot actually same i watched it twice + allll the interviews i can find on youtube of the cast im weirdly fixated on it
English

@Luminouslead The sound design in this scene also really elevated it. the track from the soundtrack is "R U Ok"
English

The start of the wish was the creepiest scene for me
The way Nikki seems to be calibrating and learning how to be “human”… The way she thought her cat died instead of Bear.. And when she says “The cat, yeah... I'm so sorry... yeah..” She’s so creepy and robotic. And the way she was acting so unpredictable like laughing and smiling one second and then acting sad and scared. She behaves like a child by crossing her arms and then acts silly like she’s keeping a secret.
It was also the first time we saw Nikki snap back in and say “this is so weird” and then smiled deviously…It really set the tone for the rest of the movie. I thought to myself “Oh yeah..this is gonna be insane”

English

@LonelyGoomba @Zhane_Star A lot of the horror was anxiety/cringe based and laughing is a way to release that tension. Audience dynamics are wild too. Once the crowd establishes that’s how they’re going to deal with the discomfort, there’s no turning back. Becomes a lens or filter for the whole movie
English

@Zhane_Star @mattbenyo I know the movie has funny moments but it was like, every single fuckin scene regardless.
I ended up becoming an armchair psychologist mid-film trying to understand the intent behind the laughs.
English

I mean she was standing in the corner in a dark room being weird af lmao of course people will laugh
sailor thats no moon@mirandaiiisms
I finally saw Obsession and the amount of people I heard laugh during the movie actually disturbed me.
English

@Lons On my second viewing, at the beginning I had the thought “hmm not quite as good as I remembered” and then when she enters the story, the whole energy of the film shifts
English

@Fire_of_Liberty @internetfairie Fair, but I do think OP is on to something. It’s less that she was dehumanized and more that he was fixated on an idealized fantasy version of her. The instagram version
English

Except no.
If that had been the case, then Bear would not have been concerned about Nikki at any point. he would not have slept on the floor, assumed she was on molly, asked her if she was happy, been concerned she was no longer her, would not have despaired that their love wasn't real, would not have tried to end the wish, and would not have attempted suicide to save her (notoriously hard to do BTW).
What he would have done have been to killed her at many points to save himself.
But he did see her as a person, so that's not what happened. But it's a sad thing to see the assumptions of misandrists never fail.
English

@LonelyGoomba @Zhane_Star Someone on here was upset that people laughed at her frowning face at the party. Cmon man.
English

@mattbenyo @Zhane_Star At the start when he walks into his house and the cat is dead
English

@LonelyGoomba @Zhane_Star Okay, I’ll give you that. Pretty weird. There is lots of humor throughout though, even in the horror. Saw it twice, first crowd was packed and latched on to the comedy, second crowd smaller, no laughs.
English

@DonaldClarke63 Yeah no one bothering to cite which scenes were inappropriately laughed at
English

I mean... it's definitely a black comedy (among other things). I suppose it matters what scenes they were laughing at.
sailor thats no moon@mirandaiiisms
I finally saw Obsession and the amount of people I heard laugh during the movie actually disturbed me.
English

@shoka_sonjuku Which were the parts that were inappropriately laughed at?
English

didn't think there was anything laugh out loud funny in the movie at all, giggle to yourself typa comedic moments were there but so many people were laughing so much at so many scenes
sailor thats no moon@mirandaiiisms
I finally saw Obsession and the amount of people I heard laugh during the movie actually disturbed me.
English

@LonelyGoomba @Zhane_Star People keep making this complaint and I just want some specific examples of the things that were inappropriately laughed it.
English

@Zhane_Star Nah I experienced this. People were just laughing at everything regardless. Was fuckin weird to experience.
English

@EvenDroppinDime I assumed it was forking realities where wishes were used to cancel the existence of the one wish willow
English

Obsession Lore Discussion:
We need to talk about the Mandela Effect premise that was hinted at in this movie.
It appears that Curry Barker wanted us to believe that much of the happenings within the wish making community was a product of mass hysteria and that no wishes ever actually existed (outside of the green ham crystal shop) but clearly it isn’t that simple
If the mandela effect genuinely exists in this universe and supported by the parallel dimension theory that causes it, could it be due to the One Wish Willow that the mandela effect exists at all?
Like maybe the shop owner’s wish that he hinted at was to change the berenstein bears to the berenstain bears, something stupid like that.
Let me know your thoughts
English

@HatchTheBrute @missmayn @jbillinson I agree with you on aesthetics. Kind of reminds me of it follows. It has a dream like timelessness. Eg the analog dashboard in his car
English

@mattbenyo @missmayn @jbillinson Also I just realized on the girls college letter it said class of 2027
English

@HatchTheBrute @jbillinson iphones came out in 2007 hence the date i chose
English

@KarlMarxian2 @ducksuprem4cist @citizenofpawnee @VlNDACAT What makes you say it was inappropriate to laugh at her frown? It can be scary and funny
English

@ducksuprem4cist @citizenofpawnee @VlNDACAT True, if we’re gonna talk about this we need to define what moments were laughed at. That’s fair.
At my viewing people laughed at moments that weren’t supposed to be laughed at. Like Nikki’s frown, or Bear being told to go to sleep and he slowly goes to sleep.
English

@edzitron First five innings or so could be considered the early ones
English

@OzzyYanker @Nick_Newman 100% been saying this since the first story
English

@Nick_Newman Nathan’s gonna make a theranos machine that works lol
English

Was just reminded of this, and that Amanda Seyfried (who played her) recently told a friend that she's working on something with Nathan Fielder. What if season 3 of The Rehearsal is about Elizabeth Holmes.
DiscussingFilm@DiscussingFilm
Nathan Fielder has been reportedly visiting Elizabeth Holmes in prison as part of a top-secret documentary. (Source: theinsneider.com/p/nathan-field…)
English

@rwdaigle @danshipper @every Based on what he’s said in various places, I think he specifically means the codex desktop app
English

@danshipper @every What makes Codex so much better as a harness? Is it the agentic harness itself or the desktop app that sets it apart?
English

BREAKING:
Anthropic just dropped Opus 4.8—and it is a MONSTER
We've been testing for about a week @every and our verdict is they could've just called it Opus 5, it's that good.
Here's our vibe check:
- Beats GPT-5.5 on Senior Engineer bench. On our toughest benchmark Opus 4.8 scores a 63—a hair higher than GPT-5.5's score of 62, and a full 30 points higher than Opus 4.7. It tackled a ground-up rewrite of a production codebase, and actually built something that works.
HOWEVER: Coding performance varied a lot at different reasoning levels. We recommend using it on xhigh for best results.
- Incredibly good writer. Opus 4.8 scored a 79.6 on our writing benchmark—measuring models on real-world writing tasks we do all of the time like essay writing, promo email writing, and more. It beats GPT-5.5 by 6 points. It produces well-written prose with fewer "AI-isms". It's also very good at writing in your voice given the right context.
HOWEVER: Writing performance also varied with reasoning levels. Medium reasoning had higher incidence of AI-isms—we found best results with high.
- Beast at knowledge work. Opus 4.8 is very good at general knowledge work tasks like report creation, research and more. It produced the best PowerPoint one-shot we've ever seen on our deck generation benchmark.
- Emotionally intelligent, willing to question the frame. I've also found it to be quite good at talking through psychological or interpersonal issues. It has a high EQ, and it's also good at not glazing and helping to expand your perspective. Its thought process feels extremely rich and dynamic.
THE BAD:
These days a model is only as good as its harness, and Codex is still a far superior harness to the Claude Desktop app. This has kept me using Codex + GPT-5.5 as my daily driver, but I am flipping back and forth a lot more between Codex and Claude.
Anthropic is back baby!
Read the rest on @every:
every.to/vibe-check/opu…
English









