Aung Kyaw Soe (AK)

488 posts

Aung Kyaw Soe (AK)

@kernelsoe

Software engineer exploring computer graphics and design tools. UI, math & physics

Tokyo Katılım Ocak 2017

1.5K Takip Edilen147 Takipçiler

Sabitlenmiş Tweet

Aung Kyaw Soe (AK)@kernelsoe·13 Kas

You have two engines: one runs on symbols (CPU), the other on vision (GPU). Use both 🔥 Symbolic reasoning is a valuable but i think Spatial reasoning is more accessible. Mathematical ideas are still communicating in very narrow bandwidth medium and legacy static pdf/paper

English

517

Aung Kyaw Soe (AK)@kernelsoe·23 Şub

Color prompting 🎨 Steer image generation not just with words but palettes...

English

Aung Kyaw Soe (AK)@kernelsoe·17 Şub

Cooking ...

English

Aung Kyaw Soe (AK)@kernelsoe·15 Şub

The iOS Photos app has one of the most impressive interactions and UIs like you can pinch to control photos density timeline. Recreated the photo-editing “styles” pad, experimenting with an imperial blue color.

English

145

Aung Kyaw Soe (AK)@kernelsoe·14 Şub

Neither ClaudeCode/Cursor nor Figma can replace this workflow yet Until a device with 120fps, without backlit exists 😄

English

Aung Kyaw Soe (AK)@kernelsoe·13 Şub

Exploring joystick as user interface for mobile. iPod inspired 😄

English

Aung Kyaw Soe (AK)@kernelsoe·13 Şub

Wow! i'm impressed by this 1 shot @v0 max

English

Aung Kyaw Soe (AK)@kernelsoe·12 Şub

Added Transform UI. Direct manipulation is always fun!

English

Aung Kyaw Soe (AK)@kernelsoe·9 Şub

@brdrck Super cool lines!

English

141

Jeff Broderick@brdrck·9 Şub

I've almost got volume fill down! Still some obvious bugs, but damn this looks insanely cool! ortho.brdrck.me

English

289

14.6K

Aung Kyaw Soe (AK)@kernelsoe·9 Şub

zoom-to-cursor (keep the point under the mouse fixed) and geometric zoom, two ideas inspired from the first time I used Figma

English

Aung Kyaw Soe (AK)@kernelsoe·9 Şub

draw shapes with SDFs in a single draw call so they stay crisp at any zoom x.com/kernelsoe/stat…

Aung Kyaw Soe (AK)@kernelsoe

made an interactive signed distanced fields to learn pixel shading and drawing smooth shapes

English

Aung Kyaw Soe (AK)@kernelsoe·9 Şub

Started exploring what an AI-native design tool that runs in the browser on WebGPU looks like. I learned how to:

English

154

Aung Kyaw Soe (AK)@kernelsoe·8 Şub

made an interactive signed distanced fields to learn pixel shading and drawing smooth shapes

English

111

Aung Kyaw Soe (AK)@kernelsoe·15 Oca

@ctatedev Thanks! Just needed this today!

English

Chris Tate@ctatedev·15 Oca

Introducing json-render AI-generated UI. Deterministic output. 1. Define your component catalog 2. AI steams JSON 3. Render interactive UI Let users prompt dashboards, widgets and apps - safely constrained to components and actions you define

English

252

468

700.2K

Aung Kyaw Soe (AK)@kernelsoe·8 Oca

@karpathy Thanks for sharing! Can you also share your thoughts on making/training/optimizing smaller models efficiently useable locally?

English

Andrej Karpathy@karpathy·8 Oca

New post: nanochat miniseries v1 The correct way to think about LLMs is that you are not optimizing for a single specific model but for a family models controlled by a single dial (the compute you wish to spend) to achieve monotonically better results. This allows you to do careful science of scaling laws and ultimately this is what gives you the confidence that when you pay for "the big run", the extrapolation will work and your money will be well spent. For the first public release of nanochat my focus was on end-to-end pipeline that runs the whole LLM pipeline with all of its stages. Now after YOLOing a few runs earlier, I'm coming back around to flesh out some of the parts that I sped through, starting of course with pretraining, which is both computationally heavy and critical as the foundation of intelligence and knowledge in these models. After locally tuning some of the hyperparameters, I swept out a number of models fixing the FLOPs budget. (For every FLOPs target you can train a small model a long time, or a big model for a short time.) It turns out that nanochat obeys very nice scaling laws, basically reproducing the Chinchilla paper plots: Which is just a baby version of this plot from Chinchilla: Very importantly and encouragingly, the exponent on N (parameters) and D (tokens) is equal at ~=0.5, so just like Chinchilla we get a single (compute-independent) constant that relates the model size to token training horizons. In Chinchilla, this was measured to be 20. In nanochat it seems to be 8! Once we can train compute optimal models, I swept out a miniseries from d10 to d20, which are nanochat sizes that can do 2**19 ~= 0.5M batch sizes on 8XH100 node without gradient accumulation. We get pretty, non-itersecting training plots for each model size. Then the fun part is relating this miniseries v1 to the GPT-2 and GPT-3 miniseries so that we know we're on the right track. Validation loss has many issues and is not comparable, so instead I use the CORE score (from DCLM paper). I calculated it for GPT-2 and estimated it for GPT-3, which allows us to finally put nanochat nicely and on the same scale: The total cost of this miniseries is only ~$100 (~4 hours on 8XH100). These experiments give us confidence that everything is working fairly nicely and that if we pay more (turn the dial), we get increasingly better models. TLDR: we can train compute optimal miniseries and relate them to GPT-2/3 via objective CORE scores, but further improvements are desirable and needed. E.g., matching GPT-2 currently needs ~$500, but imo should be possible to do <$100 with more work. Full post with a lot more detail is here: github.com/karpathy/nanoc… And all of the tuning and code is pushed to master and people can reproduce these with scaling_laws .sh and miniseries .sh bash scripts.

English

227

675

5.4K

711.2K

Aung Kyaw Soe (AK) retweetledi

Joseph Suarez 🐡@jsuarez·11 Tem

x.com/i/article/1941…

ZXX

266

2.8K

558K

Aung Kyaw Soe (AK) retweetledi

Andrej Karpathy@karpathy·19 Ara

x.com/i/article/2002…

ZXX

364

2.9K

15.5K

Aung Kyaw Soe (AK)@kernelsoe·1 Ara

@JacklouisP It has no intention of reusing the bags 😂

English

434

Jack 🤖@JacklouisP·1 Ara

Something very satisfying about the way this system opens bags of bulk plastic powder. Not sure if it's the slice or the wiggle

English

741

102.2K

Aung Kyaw Soe (AK)@kernelsoe·1 Ara

@JieWang_ZJUI @physical_int @sundayrobotics Great explanation! Can we transfer the teaching plan and knowledge for factory robotic arms? I mean same robotic arms but can perform a wide range of tasks with precisions

English

Jie Wang@JieWang_ZJUI·30 Kas

Brewing a latte is one of the HARDEST manipulation tasks today. We saw impressive results from @physical_int and @sundayrobotics . But why is it so hard? Shouldn’t we have had fully autonomous coffee machines for decades? Happy Sunday with coffee—here’s a thread on why. 👇☕️🤖

English

115

18.8K

Aung Kyaw Soe (AK)@kernelsoe·1 Ara

@chris_j_paxton Looks cool! How’s the visual intelligence work here?

English

Chris Paxton@chris_j_paxton·1 Ara

A wonderful AI powered home robot that does actually work

Matic Robots@maticrobots

Wired: 10/10 💯 The Verge: "The best robot vacuum" + 9/10 🔥 Shopify Shop: 5.0 perfect score ⭐ ZDNet: Editor's Choice 🏆 Gizmodo: Editor's Choice 🏆 Meet Matic — your home's best helper 🤖

English

5.8K

Aung Kyaw Soe (AK)@kernelsoe·23 Kas

@nandafyi The theme and interface is on 🔥!

English

nanda@nandafyi·22 Kas

interactive visuals for a new site (still some bugs to iron out)

English

215

14.4K

Keşfet

@v0 @brdrck @ctatedev @karpathy @JacklouisP @elonmusk @BarackObama @taylorswift13