SandyBay

377 posts

@_SandyBay_

Abortion abolitionist, LGBTQIA+ supporter, vegan, environmentalist, anti-gun, pro-male, men's rights advocate, animal rights advocate, anti-slavic, capitalist.

Joined October 2023
234 Following 15 Followers
Pinned post
SandyBay
SandyBay@_SandyBay_·
New York Times about feminism: “Their idea of equality is to enjoy all the rights men are supposed to have with none of their responsibilities” - New York Times 1946
English
0
0
1
560
Anthropic
Anthropic@AnthropicAI·
We've signed an agreement with Google and Broadcom for multiple gigawatts of next-generation TPU capacity, coming online starting in 2027, to train and serve frontier Claude models.
English
508
1.1K
16.9K
2.1M
Noah
Noah@NoahKingJr·
Who's gonna win the AI race? > OpenAI > Anthropic > Google > xAI
English
436
8
267
45.9K
SandyBay
SandyBay@_SandyBay_·
@ai_for_success Every single proprietary model has this problem. They lack compute, so they are forced to drop precision.
English
0
0
3
932
SandyBay
SandyBay@_SandyBay_·
@evgeniymikholap Did you try to create a database of Claude's outputs to sell to DeepSeek?
English
0
0
1
32
Evgeniy Mikholap
Evgeniy Mikholap@evgeniymikholap·
1 min of using Claude 😅
Evgeniy Mikholap tweet media
English
230
105
6.9K
775.3K
SandyBay
SandyBay@_SandyBay_·
@scaling01 They use Amazon GPUs instead of Nvidia.
English
0
0
2
313
SandyBay
SandyBay@_SandyBay_·
@scaling01 For mathematical and physical discoveries, price doesn’t matter. Just a small group of scientists who can afford this model can unlock its full potential of existence.
English
0
0
2
123
SandyBay
SandyBay@_SandyBay_·
@scaling01 MiniMax, Qwen, ZAI, StepFun, Mimo, Ernie, and Hunyuan are outstanding.
Română
0
0
2
71
Lisan al Gaib
Lisan al Gaib@scaling01·
almost forgot that qwen is dead the chinese top dogs are now moonshot, deepseek and bytedance
English
21
0
203
15.4K
Ibragim
Ibragim@ibragim_bad·
🚨 SWE-rebench update! SWE-rebench is a live benchmark with fresh SWE tasks (issue+PR) from GitHub every month.
updates:
> we removed demonstrations and the 80-step limit (modern models can now handle huge contexts without getting trapped in loops!).
> we added auxiliary interfaces for specific tasks like in SWE-bench-Pro to evaluate larger tasks fairly, ensuring valid solutions don't fail just because of mismatched test calls.
insights:
> Top models perform similarly. Among open-source options, GLM @Zai_org shows strong results, and StepFun @StepFun_ai is very cheap for its performance level ($0.14 per task).
> GPT-5.4 shows high token efficiency, it ranks in the top 5 overall but uses the lowest number of tokens (774k per task)
> Qwen3-Coder-Next & Step-3.5-Flash benefit massively from huge contexts. Qwen is an extreme case, averaging a wild 8.12M tokens.
> We evaluated agentic harnesses (Claude Code, Codex, and Junie) and found a few things. Even in headless mode, they sometimes ask for additional context or attempt web searches. We explicitly disabled search and verified their curl commands to ensure they aren't just pulling solutions from the web.
🏆 You can find the full leaderboard here: swe-rebench.com
👾 Also, we launched our Discord! Join our leaderboard channel to discuss models, share ideas, ask questions, or report issues: discord.gg/V8FqXQ4CgU
Ibragim tweet media
English
40
36
450
155.9K
SandyBay
SandyBay@_SandyBay_·
@scaling01 ONE BILLION OF OPTIMUS ROBOTS🤦‍♂️🤦‍♂️🤦‍♂️🤦‍♂️🤦‍♂️ SHOW OF CLOWNS🤡🤡🤡🤡🤡
English
0
0
1
10
SandyBay
SandyBay@_SandyBay_·
@scaling01 Yes! I love it very much! Great style of communication! I talk with Claude Opus 4.6 on LMArena all the time. This comment has 4 screenshots of examples, but I will reply to this comment with another 2 (Twitter has a limit of 4 images per comment).
SandyBay tweet media (4 images)
English
1
0
1
212
Lisan al Gaib
Lisan al Gaib@scaling01·
talked to Opus 4.6 for a couple of hours about personal problems and it has this weird response mode where it's very commanding "put the phone down", "close the laptop", "Save this conversation. Set the reminder. Go to sleep.", do this, do that not sure how I feel about it
Lisan al Gaib tweet media
English
533
71
4.5K
415.8K
Gender Studies for Men
Gender Studies for Men@JohnDavisJDLLM·
This is representative of simp culture we inherited from European wussies. The big asshole in the red coat is a classic wussy boy who uses the woman's assault on a man as an excuse to harm the male victim of the woman's assault.
English
15
50
402
7.6K
SandyBay
SandyBay@_SandyBay_·
@scaling01 I hate it due to terrible performance. DLSS 4.5 upscales 1080P to 4K with FPS like it's native 4K. It's completely pointless, Nvidia fools you.
English
0
0
1
18
SandyBay
SandyBay@_SandyBay_·
@scaling01 Because misandrists control Twitter entirely.
English
0
0
3
129
Lisan al Gaib
Lisan al Gaib@scaling01·
and suddenly this becomes a cancelable offense and a hate crime or something
English
2
0
96
3.8K
Lisan al Gaib
Lisan al Gaib@scaling01·
Seedance, swap the gender of each person in the video
English
8
2
343
37.1K
SandyBay
SandyBay@_SandyBay_·
@scaling01 But much worse than GPT-5.2-latest.
English
0
0
2
305
Lisan al Gaib
Lisan al Gaib@scaling01·
GPT-5.4 completely destroys GPT-5.2 in the Arena
Lisan al Gaib tweet media
Arena.ai@arena

GPT-5.4 High by @OpenAI has landed in the top 10 Text Arena. Let’s dig into why. Overall the latest model is much more rounded than the previous GPT-5.2-High, with significant improvements across quite a large number of categories. Below are where it has made the largest gains:
Text categories:
- Creative Writing (+46pts, #6 vs. #52)
- Longer Query (+25, #11 vs #36)
- Arena Expert (+17pts, #4 vs #21)
Occupational categories:
- Writing, Literature & Language (35pts, #4 vs #39)
- Entertainment, Sports & Media (+33pts, #6 vs #39)
- Life, Physical & Social Science (+30pts, #6 vs #36)
- Legal & Government (+30pts, #1 vs #31)
Math is the only category in similar range with the older model (+4pts, #8 vs #12)

English
14
36
645
41.2K
Ambar
Ambar@Ambar_SIFF_MRA·
His simp energy disappeared after this.
English
40
39
317
13.2K
SandyBay
SandyBay@_SandyBay_·
@venom1s Why do they sexualize themselves? Because they are sluts. It's easy.
English
0
0
1
38
︎ ︎venom
︎ ︎venom@venom1s·
Someone’s future wives. Look at the men. All of them are wearing proper shirts, T-shirts, and jeans. While these girls are wearing bras and touching each other’s chests, then posting it on Instagram. Would you marry such girls? Why do girls always sexualize themselves?
English
58
127
835
16.3K