Shadow C | 🍔Arbiter🍕 🌭of🥪 🌮Sandwiches🌯

6.3K posts

@ShadowC5

Ordained by Highest Priestess of Sexual Humanism

Joined April 2014
61 Following 111 Followers
Michael Waples@michael_waples·
@ShadowC5 @0xdippo @thsottiaux That comparison still does not fix the issue, because the loss happens at the transition. In no situation can you get them back. If they change the reset day, you lose out if you were going to use the tokens before the reset.
1
0
0
28
Tibo@thsottiaux·
Hi. Over the last 24 hours we had three separate small incidents that affected Codex reliability. Those are three too many and we are taking active steps for them to not reproduce. I have reset usage limits for Codex across all paid plans. May the tokens flow again.
1K
508
10.8K
977.5K
Shadow C | 🍔Arbiter🍕 🌭of🥪 🌮Sandwiches🌯
@athiestboi It's completely fair to ask others to make their case if they so choose. Atheism being the default position is more about one's own internal evaluation, and is not applicable in situations where one tries to convince another of a position.
0
0
0
15
Ivan Burazin@ivanburazin·
I've been on testosterone replacement for two years now. Once I figured out what was going on and got on a low dose, everything improved. My levels were at 301 when I found out. The healthy range for a man under 40 is anywhere between 500 and 1000. 301 is dangerously low. Everything is harder when your levels are that low. Sleep, focus, energy, building muscle, keeping fat off, all of it. Whenever I tell this to guys, the response I get always boils down to one of two things. Either "I've never checked my levels in my life..." or "I'm a man, I don't have that problem!" - which I always find hilarious. Women periodically visit doctors and get their hormones checked because they have structural reasons to. Men don't, so they never find out.
17
3
149
29.8K
💥 Nurse D, RN/1L@TakeThatNurses·
@LinchZhang Why are you using diameter instead of radius? I get that the answer comes out the same in terms of which is more pizza, but I’m just wondering why you made that choice for the area of a circle
3
0
1
7.5K
Shadow C | 🍔Arbiter🍕 🌭of🥪 🌮Sandwiches🌯
@thsottiaux Also, it is much easier to reach over the desk and ask a friend how he got the model or harness to work the way he likes best. Benchmarks don't give that. They just give general scores for untuned models. There are no next steps or suggestions for what to try next.
0
0
0
8
Shadow C | 🍔Arbiter🍕 🌭of🥪 🌮Sandwiches🌯
@thsottiaux Friends are an underrated but big factor. Your closest connections are inevitably close because you have something in common: what you work on, how you work, think, or communicate, etc. A friend's success is often a better predictor of your own success than benchmarks.
1
0
0
89
Tibo@thsottiaux·
Do you still trust benchmarks or do you just listen to your friends? What makes you try a new model?
975
37
2.1K
231.5K
Sabre9186@Sabre9186·
@cmuratori Schmidt's using the phrasing to reflect *intention*, not culpability. He didn't target these effects, but they were real externalities of his choices. He's telling the next generation that he was naive, and to make use of this hindsight when they have similar responsibilities.
3
0
2
512
Casey Muratori@cmuratori·
[1/2] I don't normally make commentary videos, but after seeing the entirety of Eric Schmidt's University of Arizona commencement speech, I felt like there was a lot more going on than just "CEO mentions AI, gets booed". So I made a video to explain what upset me about it.
22
51
788
60.6K
Kol Tregaskes@koltregaskes·
Many developers have suspected for months that GPT-5.5 outperforms Claude Sonnet for coding. But SWE-Bench reported near-parity, and it made people question what they’d been seeing in practice. DeepSWE aligns more closely with that day-to-day experience: GPT-5.5 scores 70% versus Claude Sonnet at 32%. That difference is substantial. DeepSWE focuses on what tends to matter in real workflows: whether an agent can take a short behavioral prompt, locate the correct area of the codebase, and implement the change cleanly - without needing you to enumerate files, modules, and functions. SWE-Bench often fails to capture that, due to dataset contamination and weaker verification. deepswe.datacurve.ai/blog
139
139
1.9K
456.4K
LabelGuy@josephathomas·
@theo I was under the impression that he used GMT or UTC because I’ve noticed my resets happen around 6 PM central
1
0
1
2.1K
Theo - t3.gg@theo·
My Claude Code sub expires tomorrow. I barely use it, but I still had it installed on my Windows PC so I used it to debug some crashing earlier. They hard cut me off over 24 hours early.
129
19
1.7K
780.8K
childless cat lady@laurie_guilbeau·
@NepsisVT @todayyearsold I thought the answer was zero because if two sides are removed from a rectangle you no longer have a shape since a shape must be enclosed
2
0
5
622
Shadow C | 🍔Arbiter🍕 🌭of🥪 🌮Sandwiches🌯
@weswinder To be fair, asking the model to self-identify is already the wrong approach. How many more variations of Gemini 3 Pro identifying itself as Gemini 1.5 Pro do we need? It would be better if he just pulled up an official source for the model list and checked.
0
0
0
20
Wes Winder@weswinder·
lol this dude just blocked me for telling him gpt-5.5 doesn't have a codex model variant?? he was (understandably) confused because codex (the app) identified itself as "codex" in chat. openai has done irreversible damage by using the term "codex" for a million different things
47
1
303
20.9K
Lior Messika@lior_eth·
What the Enhanced Games have shown in real time: 1) everyone is already enhanced. Thor couldn’t even improve his deadlift by >1%. Either steroids are useless or he was already juiced to the gills. 2) enhancement without star athletes is kind of sad? “This 44 year old swimmer almost beat his personal best from 10 years ago!” is not a great look lol. 3) people are drawn to sports, particularly when the idea of performance & enhancement is not taboo. For the first event, I think it’s going well. Will be cool to see this thing grow.
Enhanced Games@enhanced_games

Watch the Enhanced Games LIVE: x.com/i/broadcasts/1…

117
42
1.5K
563.3K