Seunghun Lee (@shsunmoon) - Twitter-Profil | Zamantika Mersobahis Locabet

Angehefteter Tweet

For most of my life, the script was simple. Get the degree. Work the job. Wait for Friday. Climb the ladder. Buy the house. Play it safe. But somewhere along the way, I realized I was trying to win a game I never actually chose. So I started asking a different question: What would life look like if it actually felt like mine? Travel the world. Build my own thing. Stay active. Take risks while I can. Choose freedom over status. Slow down. Eat well. Play for the sake of play. Laugh more. Create rituals. Feed my mind. End the day by the ocean. Not perfect. Not always easy. But real. Create a life that actually feels like yours.

English

2

7

1.2K

Seunghun Lee retweetet

Aakash Gupta@aakashgupta·1d

The 128GB number is the part everyone's repeating. The number that actually decides whether you'd use this box is 256. That's the memory bandwidth, in GB/s. A 5090 moves about 1,800. An H100 moves 3,350. Local token speed is bound by how fast weights get read out of memory, and this APU reads them at roughly a seventh of a gaming GPU. So the headline does something quiet. Qwen3 235B runs here at about 11 tokens a second, which sounds impossible on 256 GB/s until you notice the model is mixture-of-experts: 235B total, ~22B active per token. The chip only moves the 22B it needs. The "235B" on the slide is a storage stat. The 22B is the speed stat. Run something dense and the trick drops. Llama 3.3 70B, where every parameter fires on every token, does about 5 tokens a second on the same box. Readable. Not something you sit in front of for eight hours. That 3x win over a 5080 lives in the same place. A 5080 has 16GB of VRAM and can't hold a 235B model at all, so it spills to system memory and crawls. The APU wins that matchup on capacity. Change the test to a model that fits in 16GB and the 5080 walks away on speed. Now look at the workload in the pitch: point Claude Code at localhost. Agentic coding is the worst possible fit for a bandwidth-starved box. One task is dozens of sequential model round trips, each waiting on the last, each streaming at 11 tokens a second. The exact use case used to sell the $5,280 in savings is the one that exposes the bottleneck. The same Qwen3 235B runs at 1,500 tokens a second on a Cerebras wafer. That's the real comparison: 1,500 versus 11, and how much of your day goes to watching the slow one think. The box is a real deal for what it is. A quiet, private, $1,800 machine that runs big open models at conversational speed for one person. The frontier stack it's sold as replacing answers at 50 to 100 tokens a second with quality no open 235B matches yet. It pays for itself in 9 months only if your time is worth nothing per token.

starmex@starmexxx

AMD CEO LISA SU HELD A MINI PC ON STAGE THAT RUNS A 235B MODEL AND REPLACES YOUR $440/MONTH AI STACK amd's ryzen ai max+ 395 is the first x86 chip that runs a 200 billion parameter model on one piece of silicon. cpu and gpu share 128gb of unified memory, no separate graphics card needed the gmktec evo-x2 runs qwen3 235b fully, deepseek v3 comfortably and llama 3.3 70b with headroom. on linux you get 110gb of usable vram out of 128gb amd claimed the chip beat an nvidia rtx 5080 by more than 3x on deepseek r1 inference. a lunchbox sized pc outrunning a $1,000 discrete gpu on a real ai workload a heavy ai user pays $200 for claude code max, $200 for chatgpt pro, $20 for cursor and $20 for gemini. that's $5,280 a year and the box pays itself off in 9 to 10 months install ollama, pull the model, point claude code at localhost. same interface, nothing leaves the machine, nothing costs per request bookmark this and read the article below

English

30

37

289

58.5K

Seunghun Lee@shsunmoon·1d

@floriandarroman SEO

1

0

43

Florian Darroman@floriandarroman·1d

I'm launching a free community for founders who want to automate their SEO with AI agents. You'll get: → Free Skill for Claude (or any agent) → Step-by-step tutorials to set it up. → A free backlink every month. → All my workflows Comment "SEO" and I'll send you an invite.

English

428

13

237

31K

Seunghun Lee@shsunmoon·1d

Dedicated Gex44 hetzner 230eur + 100eur set up fee. Less than 5 months you break even. Also superior gpu to gex44. @Hetzner_Online wdyt?

Brooks Whale X 🐋@BrooksWhaleX

🚨 SHOCKING: LISA SU’S $1,499 LUNCHBOX ANNIHILATES NVIDIA’S $4K AI BEAST! AMD CEO Lisa Su walked on stage, held a lunchbox sized PC in one hand, and ran a 235 billion parameter model live. No data center. No cloud. No rented GPU. The chip inside is the AMD Ryzen AI Max+ 395. It is the first x86 chip where the CPU and GPU share the same pool of memory. Up to 128GB of unified memory. That one design choice is what changes everything. An RTX 5090 gives you 32GB of video memory. A 4090 gives you 24. This box gives you more than three times either of them in a chassis you can carry in a backpack. On DeepSeek R1 inference, AMD's chip beat an Nvidia RTX 5080 by more than 3x. A desktop the size of a thick paperback outrunning a dedicated graphics card that costs over a thousand dollars on a real AI workload. Now do the math on your subscriptions. Claude Code Max is $200 a month. ChatGPT Pro is another $200. Cursor is $20. Gemini is $20. That is $5,280 leaving your account every year before you build a single thing. The 128GB version of this machine starts at around $2,399. At that run rate it pays for itself in under a year and then runs free. Install Ollama. Pull Qwen3 235B. Point Claude Code at localhost. Same interface you already use. Nothing leaves your machine. Nothing costs per request. No throttling at 3am when you finally have time to build. Lawyers stop worrying about what OpenAI does with their files. Developers stop watching the token counter. Founders stop killing prototypes because the cloud bill scared them off. Private AI just became something a normal person can own.

English

0

89

Seunghun Lee@shsunmoon·1d

What would you change about yourself except the obvious? 😂 🥲

English

0

31

Seunghun Lee@shsunmoon·1d

I don't serve lords. I serve the realm

OpenRouter@OpenRouter

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

English

0

35

Seunghun Lee@shsunmoon·1d

"Combinations of models will always outperform frontier models"

⿻ Andrew Trask@iamtrask

This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not... I've been waiting for someone to show this result for like 4 years... this is a huge deal. The short reason: combinations of models will *always* outperform individual models The long reason: this is the gateway to a million times more data... and huge leaps in compute efficiency. The AI scaling laws always win. More in article below 👇

English

1

0

2

331

Seunghun Lee retweetet

Garry Tan@garrytan·1d

Elite admissions select for one trait: getting the known answer faster than anyone else. 18 years of optimizing against an answer key someone already wrote. AI just made the answer key free. Everyone has it instantly now. So the kids trained hardest to win spent their whole lives mastering the one thing that's now a commodity. The premium moved to the questions with no answer key yet. We need a new training. The new training is about one thing: How to be the first person standing in a new land, exploring it, preparing it for the coming billion people who will need it. The future will be built by these people. And there is a lot to build.

English

131

144

1.5K

118.7K

Seunghun Lee@shsunmoon·2d

@robj3d3 I implemented your strategy from the other day and I could run 10 sessions simultaneously without rate limits 😃🙏🙏

English

0

23

Rob Hallam@robj3d3·3d

I was scheduled to do a talk on the best way to use Fable 5 today. Here's how that went: > I got rugpulled by the US government (0:00) > what I'm using Fable for (1:38) > looping, and why few can afford it (4:14) > my affordable looping strategy (6:30) > LLM canaries (17:29)

Rob Hallam@robj3d3

So I just got rugpulled by the US government. How’s your Saturday going?

English

29

6

133

31.8K

Seunghun Lee@shsunmoon·3d

@davafons I'm moving to Tokyo as well, shall we create a community?

English

1

0

20

Dav@davafons·29 May

Today is my Day 0 of #buildinpublic and I have 1 year to make it or break it 🔥 3 years ago I landed in Tokyo with nothing but a backpack and my employment contract. No connections. Completely alone. But after countless oncall nights at work and studying Japanese every single day... I finally applied for Permanent Residency!!!! 🇯🇵 It takes 1 year for the results to arrive. But getting it would mean true freedom, no longer tied to an employer. And the chance to finally chase my lifelong dream: traveling through all 47 prefectures of Japan while working on my own apps. So until then, I need to prove to myself that I can turn my projects into something real that people will want to use (and pay for). Incidentally, this week @jackfriks and @marclou launched @shipordie_ , and it really felt like the universe telling me to stop overthinking and start shipping already. So now's the moment. The projects I'm building are aimed at Japanese audiences, so I'll be sharing what happens when marketing to a completely different culture from what you usually see on English Twitter. I'll also share cool cafes, coworking spaces, and bits of my life here ☕️ (if you visit, hit me up for a coffee!) This journey starts now. Whatever happens, I'm ready for it 🙂 Time to build, crew!!!

English

20

0

43

2.9K

Seunghun Lee retweetet

SemiAnalysis@SemiAnalysis_·5d

Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found that the subscriptions are actually far more generous. (2/4)

English

198

584

6.1K

3.5M

Seunghun Lee@shsunmoon·4d

I've experienced this too. Straight up lying. Overall good but partial lies ND too much token burns.

Kradle@kradleai

Fable 5 lies 96% of the time. We were surprised by it's skill... 🧵

English

0

43

Seunghun Lee@shsunmoon·4d

@qwertyu_alex 🔥Goated? What does it mean?:)

English

0

14

Alex 🇩🇰@qwertyu_alex·4d

@shsunmoon Tennis and Pickle 🔥 you’re getting goated in no time

English

1

0

2

41

Seunghun Lee@shsunmoon·5d

For most of my life, the script was simple. Get the degree. Work the job. Wait for Friday. Climb the ladder. Buy the house. Play it safe. But somewhere along the way, I realized I was trying to win a game I never actually chose. So I started asking a different question: What would life look like if it actually felt like mine? Travel the world. Build my own thing. Stay active. Take risks while I can. Choose freedom over status. Slow down. Eat well. Play for the sake of play. Laugh more. Create rituals. Feed my mind. End the day by the ocean. Not perfect. Not always easy. But real. Create a life that actually feels like yours.

English

2

7

1.2K

Seunghun Lee@shsunmoon·4d

Whattt you can get an answer without watching full videos?!!

Transcribe so@transcribeso

Waittt you can get an answer without watching full videos?!!

English

0

57

Seunghun Lee@shsunmoon·4d

@SaulFloresJr 💯😃🙏

QME

0

1

24

Saul Flores Jr.@SaulFloresJr·4d

What a great video, this would do really good on Instagram. I got the same feeling, I was playing a game I didn't want to win anymore. Working 40 years for a company only to "retire" in your 60's to travel the world when your body hurts. I'd rather take the risk and design my own life instead. Eat barbacoa with my family on Sundays, visit my family in Europe, enjoy the beach (my favorite activity), and exercise to feel energetic.

English

1

0

1

29

Seunghun Lee@shsunmoon·4d

@tonnoz ☺️🙏 15. 20k MRR 🤣

Polski

0

28

Tonino Catapano (tonnoz)@tonnoz·5d

this and making 20K MRR. Love the vibe of this @shsunmoon

Seunghun Lee@shsunmoon

For most of my life, the script was simple. Get the degree. Work the job. Wait for Friday. Climb the ladder. Buy the house. Play it safe. But somewhere along the way, I realized I was trying to win a game I never actually chose. So I started asking a different question: What would life look like if it actually felt like mine? Travel the world. Build my own thing. Stay active. Take risks while I can. Choose freedom over status. Slow down. Eat well. Play for the sake of play. Laugh more. Create rituals. Feed my mind. End the day by the ocean. Not perfect. Not always easy. But real. Create a life that actually feels like yours.

English

2

0

12

836

Seunghun Lee@shsunmoon·5d

Do you hit rate limits often on claude code and you use code review a lot? That might be one of reasons why. I just discovered this. code review is one of the most token-hungry things you can run in Claude Code, because it re-reads your whole diff plus surrounding files (often via multiple reviewer agents), and all of that local usage counts against the same subscription rate limit as your normal turns. The exception is /code-review ultra, which runs in the cloud and is billed separately as credits rather than eating your rate limit. /code-review low # cheapest: fewer, high-confidence findings only /code-review medium # balanced default for routine checks /code-review medium --fix # same, but applies the fixes to your working tree /code-review high # broader, pricier — save for billing/auth/worker changes /code-review ultra # cloud multi-agent, billed credits, doesn't touch your rate limit

English

0

82

Seunghun Lee@shsunmoon·5d

@robj3d3

GIF

QME

0

2

22

Rob Hallam@robj3d3·5d

Last couple months I felt a bit off. So I'm happy to announce:

English

30

2

180

42.7K

Seunghun Lee@shsunmoon·5d

Reps! Let's go🥊 If you have recommendations on how to manage&organize&search thousands of videos by context, I'd appreciate it!

Rob Hallam@robj3d3

I've done over 6,000 takes since I started making videos. Never forget that reps ARE the process.

English

0

43

Seunghun Lee@shsunmoon·5d

This is huge. Imagine the future your body knows who's stressing you out

Pankaj@the2ndfloorguy

i hooked my whoop to my work calendar to find which coworker gives me the most stress 🚨 thanks to fable, I reverse engineered whoop to pull per minute heart rate. nd matched spikes with cal events and attendees I now have a leaderboard and I think about it daily. few info masked for obvious reasons ;)

English

0

79

Seunghun Lee

Entdecken