Lee Higgins

8.9K posts

Lee Higgins

@Depthperpixel

AI augmented Flutter dev. We have entered the age of AI. What a time to be alive! #flutter #flutterDev

Barcelona, Spain Katılım Mayıs 2010

919 Takip Edilen963 Takipçiler

Sabitlenmiş Tweet

Lee Higgins@Depthperpixel·30 Nis

birdfeedgames.com We are looking for commissions. Lets build a beautiful game together.

English

2.9K

Lee Higgins@Depthperpixel·1d

@jezell I had to do something like this a few years ago and made this package. audio_io | Flutter package. Very primitive. share.google/Isq01nwL0W3Oh9…

English

Jesse Ezell@jezell·1d

Tried every audio package there is, all of either don't support or suck on flutter web. By suck I mean all of them can't do basic audio streaming without clicks and artifacts every few milliseconds, especially in the debugger. Take soloud for example, it's a really cool library, probably amazing on native, but you just can't pump those js byte arrays through to WASM fast enough or something (byte array copy perf from js to WASM is the bane of a lot of things). So, after wasting hours trying everything on pub.dev and getting crap results with all of them, I just asked codex to write me one that uses JS interop and web audio. Worked great on the first try. Codex is the best package manager.

English

Lee Higgins@Depthperpixel·2d

@elonmusk What will V10 look like !!! 🤯

English

Elon Musk@elonmusk·5d

🔥🔥🔥 Starship Static Fire 🔥🔥🔥

SpaceX@SpaceX

Full duration and full thrust 33-engine static fire with Super Heavy V3

English

5.5K

15.3K

118.4K

31.5M

Lee Higgins@Depthperpixel·4d

@sudoingX Did it work?

English

108

Sudo su@sudoingX·4d

do you understand what's happening here? if this doesn't excite you about local ai nothing will. my dgx spark is writing custom CUDA kernels to optimize its own inference. the agent studied the triton-proven algorithm, understood the dispatch chain, and is now writing a native CUDA kernel as a fast path for Q8 matmul decode. this is a machine improving itself. autonomously. powered by hermes agent /goal running qwen 27B locally. no human wrote this. no api was called. just local silicon teaching itself to run faster.

Sudo su@sudoingX

my dgx spark is writing custom CUDA kernels to make itself faster. let that sink in. hermes agent running qwen 3.6 27B Q8 autonomously decided to port its own triton kernel to native CUDA C++ for llama.cpp integration. it understood the dispatch chain. studied the mmq kernel structure. now it's writing the port itself. this machine is literally optimizing its own inference pipeline. no human in the loop. i set a /goal last night and woke up to a 12.91x speedup on SSM and 9.66x on Q8 matmul. now it wants another 2-3x through FP8 tensor cores. local ai. autonomous agents. self-improving inference. this is not science fiction. this is my friday.

English

181

14.3K

Lee Higgins@Depthperpixel·5d

@thekitze @elonmusk Depends how many petabytes of swap space you have.

English

kitze@thekitze·6d

@elonmusk how many electron apps can it run

English

4.7K

Elon Musk@elonmusk·6d

The GB300 is the best AI computer

NVIDIA@nvidia

Two frontier labs. One accelerated computing platform. Congrats to @SpaceX and @AnthropicAI on the new compute partnership, powered by 220,000+ NVIDIA GPUs inside Colossus 1. The future of AI runs on NVIDIA.

English

2.8K

5.7K

57.6K

34.1M

Lee Higgins@Depthperpixel·6d

@dakshgup @greptile Fine with usage based, it's just not all PRs are the same. We have many small PRs that cost the same as big PRs.

English

Daksh Gupta@dakshgup·6d

hey! it’s 30 per month, with 50 reviews and you can turn off usage based pricing from the dashboard if you’d prefer. it’s unlikely we’ll move off a usage model any time soon. we want to be able to continue using the best frontier models in our product and it’s hard to do so without it.

English

Lee Higgins@Depthperpixel·6d

@greptile is a great product, but the pricing is far too high. 90 per month per person with only 30 reviews, then 1 dollar per review.... 250 in extra spend a week into this month. The pace of PRs with AI makes you too expensive. And I can't seem to find a way to turn off the overage and waste a bunch of my reviews on tiny PRs. You need a better pricing model. Let me know when you have one I might come back.

English

121

Lee Higgins@Depthperpixel·6d

@sudoingX Vibe code as fast as possible -> improve structure -> Image gen 2 chat loop with codex on the codebase and then build a design system with separate storybook app. worked shockingly well for me today.

English

Sudo su@sudoingX·6d

ill just say it. chatgpt 5.5 frontend skills are retarded. great at agentic backend, terrible at design execution.

English

Lee Higgins@Depthperpixel·6d

@Hesamation Let us dream. 🥺

English

ℏεsam@Hesamation·6 May

> 12M context window (read it again) > 52x faster than FlashAttention > beats Opus 4.6 on SWE-Bench > 5% the cost of Opus BUT WAIT A MINUTE: > technical blog not technical > access coming soon > paper coming soon > ““Built by researchers from Meta, Google, Oxford, Cambridge, BYU” doesn’t name a single one of them if this is not a scam, or the numbers aren’t dishonest, it’s disgustingly promotional.

Alexander Whedon@alex_whedon

Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.

English

1.3K

123.1K

Lee Higgins@Depthperpixel·6d

@NoahKingJr @patokekar F*CK up on purpose.

GIF

English

Noah@NoahKingJr·6d

TELL ME SOMETHING YOU CAN DO THAT CLAUDE CANNOT

English

3.1K

1.8K

896.7K

Lee Higgins@Depthperpixel·6d

@alex_whedon

GIF

QME

Alexander Whedon@alex_whedon·5 May

English

1.5K

2.9K

23K

12.6M

Lee Higgins@Depthperpixel·6d

I have run both for a few months, they catch things each other misses. And greptile caches more things. Happy to have them both if the price was reasonable. Code rabbit is less than half the price for us. We can do 50prs a day and at $1 a PR that gets expensive. Some PRs are small so the pricing does not work for us.

English

just-a-programmer@programmer_just·6d

@Depthperpixel @greptile Agreed, Greptile lost our business to CodeRabbit.

English

Lee Higgins@Depthperpixel·6d

@steipete M porter 👌

Español

Peter Steinberger 🦞@steipete·6d

Me and codex were busy. 🔊 sonoscli.sh — Sonos 🗃️ wacli.sh — WhatsApp 🪶 birdclaw.sh — X archive 🧰 gitcrawl.sh — GitHub archive 🛰️ discrawl.sh — Discord archive 🎧 spogo.sh — Spotify 💬 imsg.sh — iMessage 🧳 mcporter.sh — MCP to CLI 🗣️ sag.sh — ElevenLabs voice 🧿 askoracle.sh — second opinion Upgrading the 🦞 OpenClaw army.

English

238

385

6.3K

521K

Lee Higgins@Depthperpixel·6d

@sudoingX Ssshhhh you will spike the price!

English

Sudo su@sudoingX·6d

if you want mac portability and you want to learn cuda, the dgx spark is the silent king nobody is talking about. 128gb unified memory in a form factor that fits on a desk corner, full cuda stack, runs nemotron 30b q8 at 56 tok/s on hermes agent, multimodal + tool calls nobody has written custom kernels for this specific silicon yet. spark has its own architecture (gb10 blackwell, aarch64), the whole ecosystem of model-specific kernel work for 3090 / 4090 / 5090 has not been ported here. that is an openlane for builders who want the territory. i expect nvidia to focus on its ecosystem more this year. the hardware is in front of builders, the software needs to catch up to make spark the developer-default for portable ai workstations. if you have one and you have not written or tested anything model-specific on it yet, you are sitting on the most underexplored consumer AI silicon shipping right now.

English

175

15.8K

Lee Higgins@Depthperpixel·6d

@XorDev Yeah this is what makes it hard. You need a contribution heatmap. Looks at conditions etc. nightmare

English

Xor@XorDev·10 Ağu

@Depthperpixel Essentially every bit of the code effects every pixel

English

Xor@XorDev·8 Ağu

for(float i,z,d,f;i++<1e2;o+=vec4(4,6,8.+z,0)/f-min(dFdx(z)*r.y+z,0.)/exp(d*d/.1)){vec3 p=z*(FC.rgb*2.-r.xyy)/r.y,c=p;p.z+=8.;c.z*=3.;for(f=1.;f++<9.;c+=sin(c.yzx*f+z+t*.5)/f);z+=min(f=.1+abs(.2*c.y+abs(p.y+.8)),d=max(length(p)-3.,.9-length(p-vec3(-1,1,3))))/7.;}o=tanh(o/2e3);

Sam Altman@sama

177

3.2K

166.3K

Lee Higgins@Depthperpixel·5 May

Interesting though experiment to anyone following the open AI case. Replace the name "OpenAI", with "Starving Baby Food Aid". Does your opinion change?

English

Lee Higgins@Depthperpixel·4 May

Goblins 😂

English

Lee Higgins@Depthperpixel·4 May

@code_coded @LinusEkenstam Nice one!

English

Sky@code_coded·3 May

@Depthperpixel @LinusEkenstam Happy days! Turns out there’s a recently formed fencing club here in Pattaya! Gonna check it out soon! Pretty sure I’ll feel a like a fat knight in armour though when I put the kit back on 🤣

English

Linus ✦ Ekenstam@LinusEkenstam·20 Nis

I was skeptical, but now I’m completely convinced. Fencing will become super popular due to this one very particular improvement to the sport. “Sword tip visualization” It’s going to debut at the summer olympics. Every single duel will look like a bloody lightsaber fight

English

972

4.2K

66.1K

3.9M

Lee Higgins@Depthperpixel·3 May

@sama The correct answer is to use them all in a diverse team. Works the same as humans did.

English

Sam Altman@sama·1 May

you know what all of these "which is better" polls are silly use codex or claude code, whatever works best for you i am grateful we live in a time with such amazing tools, and grateful there is a choice

English

2.2K

1.1K

23K

1.6M

Lee Higgins@Depthperpixel·2 May

@sudoingX How does the output compare to the frontier models? Benchmarks say one thing but every time I used local models in the past there's a massive gap.

English

516

Sudo su@sudoingX·2 May

most of you don't know how big a deal it is that a single rtx 3090 from 2020 runs qwen 27b dense q4 with 256k context at 40 tok/s, full agentic loops on hermes agent, zero tool call failures. the more i build on this card the more i think nobody really knows how untapped it actually is. the silicon was always capable, the models finally caught up.

English

569

243.6K

Lee Higgins@Depthperpixel·2 May

@PeterSweden7 Hey Spain 🇪🇸☝️

English

PeterSweden@PeterSweden7·2 May

They did it. Poland has enacted ZERO income tax for parents who have at least two children. Parents will pay no income tax on income up to around €33.000 This is being done to increase the birthrates. Very good 🇵🇱👍

English

382

1.4K

12.4K

250.9K

Keşfet

@jezell @elonmusk @sudoingX @thekitze @dakshgup @greptile @Hesamation @NoahKingJr