MelonLeather

784 posts


@MelonLeather

Manila City Joined August 2015
195 Following 29 Followers
MelonLeather
MelonLeather@MelonLeather·
21 kills in 16 rounds at the major for d0nk... It's official, @Team__Spirit will be in the Major finals
English
0
0
0
11
MelonLeather retweeted
Razed Esports
Razed Esports@RazedEsport·
krabeni with the massive BM, Riot would fine him $500k for this 💀
English
13
17
1K
88.6K
MelonLeather
MelonLeather@MelonLeather·
@HLTVorg @nitr0 Don't ever think you're the problem! It's a team game. There's always another major. Unlucky spread on that galil at the end, could have happened to anyone.
English
1
0
0
3.5K
HLTV.org
HLTV.org@HLTVorg·
"Maybe I am the problem after all" 😞 Heartbreaking scenes in the NRG camp after allowing the historic 12-0 comeback
English
33
36
3K
352.9K
Tekee
Tekee@Tekeee·
So which one are you choosing?
Tekee tweet media
English
103
6
163
13.3K
MelonLeather
MelonLeather@MelonLeather·
The bubble will burst, then will recover quickly
English
0
0
0
2
MelonLeather retweeted
Retro Anime
Retro Anime@retro_anime·
💔
Retro Anime tweet media
QME
101
1.2K
8.6K
159.8K
MelonLeather
MelonLeather@MelonLeather·
@theo Scare me into never installing anything ever again
English
0
0
0
44
Theo - t3.gg
Theo - t3.gg@theo·
Finally back and ready to stream. What should I film videos about today?
English
81
1
263
21.1K
MelonLeather retweeted
Cursor
Cursor@cursor_ai·
With Design Mode, you can now point, draw, or talk to update your UI.
English
106
159
2.4K
1.2M
Arena.ai
Arena.ai@arena·
Introducing Agent Arena: real-world agentic evals at scale. How do you evaluate agents doing actual work? We measure millions of live sessions where real users accomplish real tasks. On Arena, models now get web search, filesystem, and terminal tools to complete complex workflows: writing code, creating slide decks, researching the web, building apps, and analyzing documents. Every session produces rich signals. Users iterate with the agent turn-by-turn: approving, editing, correcting, praising, or expressing frustration. The environment gives feedback too: shell errors, tool failures, recovery attempts, and more. Our leaderboard measures each model's agentic performance using causal inference across five signals: task success, steerability, error recovery, user praise vs. complaint, and tool hallucination. This leaderboard snapshot is built from 300K+ tasks, 2M+ tool calls, and 40M lines of code written by agents. Top labs in Agent Arena: - #1 @OpenAI: GPT-5.5 (High) - #2 @AnthropicAI: Claude-Opus-4.7 (Thinking) - #3 @Zai_org: GLM-5.1 - #4 @GoogleDeepMind: Gemini-3.1-Pro - #5 @Kimi_Moonshot: Kimi-K2.6 More analysis in the thread, with the full technical blog below.
Arena.ai tweet media
Arena.ai@arena

Introducing Agent Mode: Agentic AI is now measured in the Arena. Agent Mode can do deep research, create reports, generate images, build websites, debug code, and more. It completes more complex tasks by using tools like web search, bash in a sandbox environment, image generation, file writing, and asking follow-up questions. Frontier models are waiting for you in Agent Mode to take on real-world tasks. GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro, and top open models. Test them yourself.

English
70
146
1.2K
357.5K
MelonLeather
MelonLeather@MelonLeather·
Where the hell is GPT 5.6?
English
0
0
1
103
MelonLeather retweeted
Sam Korus
Sam Korus@skorusARK·
“We’ve done the analysis, reusable rockets aren’t economic.” SpaceX makes reusable rockets economic. “We’ve done the analysis satellite internet isn’t economic. The antenna alone is tens of thousands of dollars. The cost to manage a constellation that size, the radiation, the space hardened solar cost…” Satellite internet appears to be a very good business with antennas in the $100 range. “We’ve done the analysis, orbital data centers aren’t economic. The radiators, launch costs, the radiation, the solar…” You are here.
English
158
479
6.1K
212.8K
MelonLeather retweeted
NVIDIA AI
NVIDIA AI@NVIDIAAI·
Today we're shipping Nemotron 3 Ultra. A 550B MoE frontier-intelligence open model built for long-running agents. It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.
English
175
444
3.4K
1.2M
MelonLeather
MelonLeather@MelonLeather·
@ThePrimeagen What's the multiplier for "I didn't know how to code before, now I can ship simple landing pages, and I'm working towards building simple apps"?
English
0
0
0
20
ThePrimeagen
ThePrimeagen@ThePrimeagen·
I am so thoroughly convinced that anyone who thinks AI 100x's their output is a liar or a lunatic. You are telling me you can make 1 year's worth of decisions in 3.65 days? Let alone describing those accurately and coaxing the result from the AI... (1.8 days european time)?
English
549
180
4.6K
216.8K
MelonLeather retweeted
Wise
Wise@trikcode·
You basically need to be unemployed to keep up with all this AI stuff.
English
624
1K
12.9K
447.2K