Seunghun Lee

497 posts

Seunghun Lee banner
Seunghun Lee

Seunghun Lee

@shsunmoon

move | build | travel 🌎 building https://t.co/JKkEWrHUgf @transcribeso like a friend who watched the whole thing ↓

Da Nang → Tokyo Beigetreten Temmuz 2014
332 Folgt182 Follower
Angehefteter Tweet
Seunghun Lee
Seunghun Lee@shsunmoon·
For most of my life, the script was simple. Get the degree. Work the job. Wait for Friday. Climb the ladder. Buy the house. Play it safe. But somewhere along the way, I realized I was trying to win a game I never actually chose. So I started asking a different question: What would life look like if it actually felt like mine? Travel the world. Build my own thing. Stay active. Take risks while I can. Choose freedom over status. Slow down. Eat well. Play for the sake of play. Laugh more. Create rituals. Feed my mind. End the day by the ocean. Not perfect. Not always easy. But real. Create a life that actually feels like yours.
English
2
2
7
1.2K
Seunghun Lee retweetet
Aakash Gupta
Aakash Gupta@aakashgupta·
The 128GB number is the part everyone's repeating. The number that actually decides whether you'd use this box is 256. That's the memory bandwidth, in GB/s. A 5090 moves about 1,800. An H100 moves 3,350. Local token speed is bound by how fast weights get read out of memory, and this APU reads them at roughly a seventh of a gaming GPU. So the headline does something quiet. Qwen3 235B runs here at about 11 tokens a second, which sounds impossible on 256 GB/s until you notice the model is mixture-of-experts: 235B total, ~22B active per token. The chip only moves the 22B it needs. The "235B" on the slide is a storage stat. The 22B is the speed stat. Run something dense and the trick drops. Llama 3.3 70B, where every parameter fires on every token, does about 5 tokens a second on the same box. Readable. Not something you sit in front of for eight hours. That 3x win over a 5080 lives in the same place. A 5080 has 16GB of VRAM and can't hold a 235B model at all, so it spills to system memory and crawls. The APU wins that matchup on capacity. Change the test to a model that fits in 16GB and the 5080 walks away on speed. Now look at the workload in the pitch: point Claude Code at localhost. Agentic coding is the worst possible fit for a bandwidth-starved box. One task is dozens of sequential model round trips, each waiting on the last, each streaming at 11 tokens a second. The exact use case used to sell the $5,280 in savings is the one that exposes the bottleneck. The same Qwen3 235B runs at 1,500 tokens a second on a Cerebras wafer. That's the real comparison: 1,500 versus 11, and how much of your day goes to watching the slow one think. The box is a real deal for what it is. A quiet, private, $1,800 machine that runs big open models at conversational speed for one person. The frontier stack it's sold as replacing answers at 50 to 100 tokens a second with quality no open 235B matches yet. It pays for itself in 9 months only if your time is worth nothing per token.
starmex@starmexxx

AMD CEO LISA SU HELD A MINI PC ON STAGE THAT RUNS A 235B MODEL AND REPLACES YOUR $440/MONTH AI STACK amd's ryzen ai max+ 395 is the first x86 chip that runs a 200 billion parameter model on one piece of silicon. cpu and gpu share 128gb of unified memory, no separate graphics card needed the gmktec evo-x2 runs qwen3 235b fully, deepseek v3 comfortably and llama 3.3 70b with headroom. on linux you get 110gb of usable vram out of 128gb amd claimed the chip beat an nvidia rtx 5080 by more than 3x on deepseek r1 inference. a lunchbox sized pc outrunning a $1,000 discrete gpu on a real ai workload a heavy ai user pays $200 for claude code max, $200 for chatgpt pro, $20 for cursor and $20 for gemini. that's $5,280 a year and the box pays itself off in 9 to 10 months install ollama, pull the model, point claude code at localhost. same interface, nothing leaves the machine, nothing costs per request bookmark this and read the article below

English
30
37
289
58.5K
Florian Darroman
Florian Darroman@floriandarroman·
I'm launching a free community for founders who want to automate their SEO with AI agents. You'll get: → Free Skill for Claude (or any agent) → Step-by-step tutorials to set it up. → A free backlink every month. → All my workflows Comment "SEO" and I'll send you an invite.
Florian Darroman tweet media
English
428
13
237
31K
Seunghun Lee
Seunghun Lee@shsunmoon·
Dedicated Gex44 hetzner 230eur + 100eur set up fee. Less than 5 months you break even. Also superior gpu to gex44. @Hetzner_Online wdyt?
Brooks Whale X 🐋@BrooksWhaleX

🚨 SHOCKING: LISA SU’S $1,499 LUNCHBOX ANNIHILATES NVIDIA’S $4K AI BEAST! AMD CEO Lisa Su walked on stage, held a lunchbox sized PC in one hand, and ran a 235 billion parameter model live. No data center. No cloud. No rented GPU. The chip inside is the AMD Ryzen AI Max+ 395. It is the first x86 chip where the CPU and GPU share the same pool of memory. Up to 128GB of unified memory. That one design choice is what changes everything. An RTX 5090 gives you 32GB of video memory. A 4090 gives you 24. This box gives you more than three times either of them in a chassis you can carry in a backpack. On DeepSeek R1 inference, AMD's chip beat an Nvidia RTX 5080 by more than 3x. A desktop the size of a thick paperback outrunning a dedicated graphics card that costs over a thousand dollars on a real AI workload. Now do the math on your subscriptions. Claude Code Max is $200 a month. ChatGPT Pro is another $200. Cursor is $20. Gemini is $20. That is $5,280 leaving your account every year before you build a single thing. The 128GB version of this machine starts at around $2,399. At that run rate it pays for itself in under a year and then runs free. Install Ollama. Pull Qwen3 235B. Point Claude Code at localhost. Same interface you already use. Nothing leaves your machine. Nothing costs per request. No throttling at 3am when you finally have time to build. Lawyers stop worrying about what OpenAI does with their files. Developers stop watching the token counter. Founders stop killing prototypes because the cloud bill scared them off. Private AI just became something a normal person can own.

English
0
0
0
89
Seunghun Lee
Seunghun Lee@shsunmoon·
What would you change about yourself except the obvious? 😂 🥲
English
0
0
0
31
Seunghun Lee retweetet
Garry Tan
Garry Tan@garrytan·
Elite admissions select for one trait: getting the known answer faster than anyone else. 18 years of optimizing against an answer key someone already wrote. AI just made the answer key free. Everyone has it instantly now. So the kids trained hardest to win spent their whole lives mastering the one thing that's now a commodity. The premium moved to the questions with no answer key yet. We need a new training. The new training is about one thing: How to be the first person standing in a new land, exploring it, preparing it for the coming billion people who will need it. The future will be built by these people. And there is a lot to build.
English
131
144
1.5K
118.7K
Seunghun Lee
Seunghun Lee@shsunmoon·
@robj3d3 I implemented your strategy from the other day and I could run 10 sessions simultaneously without rate limits 😃🙏🙏
English
0
0
0
23
Seunghun Lee
Seunghun Lee@shsunmoon·
@davafons I'm moving to Tokyo as well, shall we create a community?
English
1
0
0
20
Dav
Dav@davafons·
Today is my Day 0 of #buildinpublic and I have 1 year to make it or break it 🔥 3 years ago I landed in Tokyo with nothing but a backpack and my employment contract. No connections. Completely alone. But after countless oncall nights at work and studying Japanese every single day... I finally applied for Permanent Residency!!!! 🇯🇵 It takes 1 year for the results to arrive. But getting it would mean true freedom, no longer tied to an employer. And the chance to finally chase my lifelong dream: traveling through all 47 prefectures of Japan while working on my own apps. So until then, I need to prove to myself that I can turn my projects into something real that people will want to use (and pay for). Incidentally, this week @jackfriks and @marclou launched @shipordie_ , and it really felt like the universe telling me to stop overthinking and start shipping already. So now's the moment. The projects I'm building are aimed at Japanese audiences, so I'll be sharing what happens when marketing to a completely different culture from what you usually see on English Twitter. I'll also share cool cafes, coworking spaces, and bits of my life here ☕️ (if you visit, hit me up for a coffee!) This journey starts now. Whatever happens, I'm ready for it 🙂 Time to build, crew!!!
Dav tweet media
English
20
0
43
2.9K
Seunghun Lee retweetet
SemiAnalysis
SemiAnalysis@SemiAnalysis_·
Recently, we purchased one of each Anthropic/OpenAI subscription plan and randomly ran long horizon coding tasks until we exhausted the weekly limit. It's widely believed that a $200/month plan maxes out at ~$2000/month worth of tokens (assuming API pricing). However, we found that the subscriptions are actually far more generous. (2/4)
SemiAnalysis tweet media
English
198
584
6.1K
3.5M
Alex 🇩🇰
Alex 🇩🇰@qwertyu_alex·
@shsunmoon Tennis and Pickle 🔥 you’re getting goated in no time
English
1
0
2
41
Seunghun Lee
Seunghun Lee@shsunmoon·
For most of my life, the script was simple. Get the degree. Work the job. Wait for Friday. Climb the ladder. Buy the house. Play it safe. But somewhere along the way, I realized I was trying to win a game I never actually chose. So I started asking a different question: What would life look like if it actually felt like mine? Travel the world. Build my own thing. Stay active. Take risks while I can. Choose freedom over status. Slow down. Eat well. Play for the sake of play. Laugh more. Create rituals. Feed my mind. End the day by the ocean. Not perfect. Not always easy. But real. Create a life that actually feels like yours.
English
2
2
7
1.2K
Saul Flores Jr.
Saul Flores Jr.@SaulFloresJr·
What a great video, this would do really good on Instagram. I got the same feeling, I was playing a game I didn't want to win anymore. Working 40 years for a company only to "retire" in your 60's to travel the world when your body hurts. I'd rather take the risk and design my own life instead. Eat barbacoa with my family on Sundays, visit my family in Europe, enjoy the beach (my favorite activity), and exercise to feel energetic.
English
1
0
1
29
Seunghun Lee
Seunghun Lee@shsunmoon·
Do you hit rate limits often on claude code and you use code review a lot? That might be one of reasons why. I just discovered this. code review is one of the most token-hungry things you can run in Claude Code, because it re-reads your whole diff plus surrounding files (often via multiple reviewer agents), and all of that local usage counts against the same subscription rate limit as your normal turns. The exception is /code-review ultra, which runs in the cloud and is billed separately as credits rather than eating your rate limit. /code-review low # cheapest: fewer, high-confidence findings only /code-review medium # balanced default for routine checks /code-review medium --fix # same, but applies the fixes to your working tree /code-review high # broader, pricier — save for billing/auth/worker changes /code-review ultra # cloud multi-agent, billed credits, doesn't touch your rate limit
Seunghun Lee tweet media
English
0
0
0
82
Rob Hallam
Rob Hallam@robj3d3·
Last couple months I felt a bit off. So I'm happy to announce:
Rob Hallam tweet media
English
30
2
180
42.7K