Kevin Milczewski

66 posts

Kevin Milczewski

Kevin Milczewski

@KevinMilczewski

I write code and build BattleBots.

Seattle انضم Mayıs 2009
89 يتبع31 المتابعون
T. Vuorinen
T. Vuorinen@Geenimetsuri·
@straceX Doesn't the compiler optimize those anyways?
English
7
0
30
7.4K
Gracia
Gracia@straceX·
The humble if statement is secretly destroying your CPU's performance. Look at these two snippets. They do the exact same thing, but Snippet B can execute up to 6x faster on unsorted data. How does removing an if statement speed up the code?
Gracia tweet media
English
147
76
1.9K
195.9K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@Sirupsen The 1-2us "SSD" speeds look like block cache times. I'm pretty darn sure you're not getting through PCI + interrupt stack in 2 us. Hell, if your machine has deep sleep power modes on, the interrupt wake latency is over 100us.
English
1
0
0
133
Simon Eskildsen
Simon Eskildsen@Sirupsen·
updating napkin math benchmarks, it's been too long github.com/sirupsen/napki… using c4-standard-48-lssd now as a representative machine (turbopuffer sponsored) previously a dinky 6 core 2019 VM
Simon Eskildsen tweet media
English
4
4
135
99K
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
Wheels turning on OSH Cut East. Serving you faster.
Caleb (OSH Cut) tweet media
English
19
0
141
6.6K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@max0x7ba @ChShersh I feel like the talk probably about the lock free queue itself where you already have data in L1d and want to shove a pointer through a queue. The cache coherency there is usually high if there is contention with traditional CAS based approaches. I want to find the talk!
English
1
0
0
39
max0x7ba
max0x7ba@max0x7ba·
The sub-10 ns hot paths cannot possibly afford executing software in the CPU, though. A load from CPU L3d cache latency is ~46 CPU cycles, 9.2 ns with a 5GHz CPU. A 5GHz CPU ticks 50 CPU cycles in 10 ns, executing multiple instructions simultaneously and out-of-order. Low-latency NICs with kernel bypass communicate with CPU and RAM over PCIe. Incoming network frames are streamed in 64-byte blocks as they arrive from the wire straight into CPU L3d cache lines. One-way PCIe transfer latency is ~500 ns. Within a 10-ns budget, a modern AVX-512 CPU running at 5GHz could load an entire 64-byte L3d cache line with one 64-byte ymm load instruction in 46 CPU cycles, and have 4 CPU cycles remaining to compute with these 64 bytes of data, but its stores won't become visible to anything else outside the CPU until many nanoseconds later.
English
1
0
5
215
Dmitrii Kovanikov
Dmitrii Kovanikov@ChShersh·
An SWE came to me and asked a very basic question. I assumed he’s a Junior. Later, I learned he gives a talk, so I attended. Turns out, the guy is cracked as hell. He described a highly complex low-latency lock-free multithreaded system (sub-10 ns hot path processing) he built.
English
94
54
4.5K
436.9K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@ChShersh @thedcoffman What CPU is that running on? Sub 10ns implies Shared L2 unless it's some 5ghz monster? And maybe doing it without locked instructions for the write? So no CAS and opportunistic reads?
English
0
0
0
12
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@chatgpt21 So it recreated something that already existed and was highly text based? Not really surprising. Still cool.
English
0
0
0
59
Chris
Chris@chatgpt21·
Anthropic had 16 AI agents build a C compiler from scratch. 100k lines, compiles the Linux kernel, $20k, 2 weeks. To put that in perspective GCC took thousands of engineers over 37 years to build. (Granted from 1987 - however) One researcher and 16 AI agents just built a compiler that passes 99% of GCC's own torture test suite, compiles FFmpeg, Redis, PostgreSQL, QEMU and runs Doom. They say they "(mostly) walked away." But that "mostly" is doing heavy lifting. No human wrote code but the researcher constantly redesigned tests, built CI pipelines when agents broke each other's work, and created workarounds when all 16 agents got stuck on the same bug. The human role didn't disappear. It shifted from writing code to engineering the environment that lets AI write code. I don’t know how you could make the point AI is hitting a wall.
English
735
828
7.4K
1.7M
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@Adriksh "no memcpy" -> writes a bytewise memory copy manually and stores the whole data in the ring. The way to do this right is to pair it with an object pool and pass the pointers through the ring. That is zero copy. Of course your object pool is likely to use the same structure.
English
0
0
4
188
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@CalebChamberla6 Xometry went to crap when it became more B2B focused (among other things). These dinosaur companies are the heart of the problem you've fixed with innovation. Don't let them drag you down into their game of waste an inefficiency.
English
0
0
3
54
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
Manufacturing X, I have strategy question. OSH Cut started with one goal: make it easy to buy custom sheet metal and plate. We did that with custom self-serve software, providing instant prices and lead-times, instant manufacturability checks, and deeply integrated production systems. Our target customers - people like us - absolutely love it. Our Net Promoter Score is through the roof, in the mid-eighties. As we grow, we spend increasing time serving customers who value our systems less. Engineers and builders love us. Purchasing departments? Not so much. We have full-time people now whose only job is to service the endless stream of onboarding packets from customers. Custom NDAs, custom terms of service, net terms applications, quality flowdowns, audits, 25+ page onboarding forms, supplier portals, legal departments. I'm certain that this could unlock a lot of revenue from big enterprises. On the other hand, targeting big businesses erodes our core value proposition. Professional buyers don't care that they can upload parts and get instant prices, lead-times, and DFM feedback; they don't want to login to our system to get production status for their orders; they don't value our self-serve, instant model at all. The engineers at these companies adore us. But the professional buyers create a lot of friction. We can help overcome that by streamlining the onboarding process. We've been working on that. On the other hand, this creates a lot of overhead that enables revenue but that simultaneously dilutes our value proposition. Eg. we have a great enterprise business funnel, but to them, we are "just another shop." Quality is the table stakes, but after that, net terms and price often take precedence over speed and simplicity. Professional buying eliminates those kinds of quality of life factors by design. Even so, these enterprise requests are burying our team. So we face a choice - grow the enterprise support team, even though it's not a big revenue driver today, and even though it erodes our core value proposition; or start saying "no," understanding that this may have a huge long-term impact on revenue, and on the kind of business we are in the future. On its face, this seems a minor choice. Hiring another couple people is super easy and low risk. On the other hand, the kinds of customers we focus on can have a large impact on future OSH Cut. So this really boils down to strategy, not immediate cash flow and revenue growth. I bounce back and forth on this question all the time.
English
66
2
138
11.6K
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
Minecraft needs conveyor belts.
English
11
1
48
2.8K
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
What do you think? Wire EDM from OSH Cut?
English
19
2
109
8.4K
Oracle Red Bull Racing
Oracle Red Bull Racing@redbullracing·
Late on the breaks, first out the corner 👏 Simply lovely from Max 💙
English
157
1.6K
17.2K
349.6K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@CalebChamberla6 @fabworks_guy You should see some of the horrible things we've done with pulleys and gears made from stacked laser cut plates! When doing it with aluminum, the laser edge finish does wear belts quite a bit.
English
0
0
0
6
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
@fabworks_guy I'm a little worried about rapid wear and tear on the timing belt if the pulley layers aren't perfectly aligned
English
2
0
2
365
Caleb (OSH Cut)
Caleb (OSH Cut)@CalebChamberla6·
First-pass assembly built! Learned some things, spinning some changes.
Caleb (OSH Cut) tweet mediaCaleb (OSH Cut) tweet mediaCaleb (OSH Cut) tweet media
English
13
6
235
11.2K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@parthingle_x Small, high RPM gears yeah it's probably not the best. I've seen it done but takes a lot of care and you have to run them in a lot, and you'll never have the right tolerances. Large high torque gears work great. Stacked laser cut is the best way to get 7075 gears I've found.
English
0
0
1
29
LaurieWired
LaurieWired@lauriewired·
What happens if a cosmic ray hits a voting machine? In Belgium’s 2003 elections, a relatively unknown Communist Party candidate received 4096 extra votes…from a spontaneous bit inversion. It was more votes than was mathematically possible at that polling station.
LaurieWired tweet mediaLaurieWired tweet media
English
158
441
7.5K
447.1K
Robert Dunn
Robert Dunn@AgingWheels·
Anybody need rescuing? Quad motors and incredible traction control is giving me a superiority complex
Robert Dunn tweet media
English
37
5
409
20.3K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@DJSnM Is the RF environment up there easier or harder than terrestrial? One of the problems with wifi data rates is how clogged the channels are and needing application protocols that can deal with lossy networks.
English
0
0
1
502
Scott Manley
Scott Manley@DJSnM·
The ISS has WiFi antenna on the exterior which are used for things like video feeds from space suits, but a few years ago JASA demonstrated HD video over WiFi from 600meters away from the station. youtube.com/watch?v=yv_eqn…
YouTube video
YouTube
Scott Manley tweet media
English
6
21
396
37.4K
Robert Dunn
Robert Dunn@AgingWheels·
Turns out, my mower was 90% batteries
Robert Dunn tweet media
English
17
6
330
12K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@AgingWheels EZCal Calipers have no business being as good as they are for the price.
English
0
0
0
62
Robert Dunn
Robert Dunn@AgingWheels·
This is going to look so cool when it's done, but drilling and tapping all of these holes suuuucks
Robert Dunn tweet media
English
7
2
195
6.2K
Kevin Milczewski
Kevin Milczewski@KevinMilczewski·
@davidbrevik Marvel Heroes was well after data backup practices should have been well understood. Surprising!
English
0
0
0
58
David Brevik
David Brevik@davidbrevik·
Incredibly this was not the only time it happened! It happened twice in my career on two different projects (Diablo II, Marvel Heroes)
English
8
0
60
6.7K
David Brevik
David Brevik@davidbrevik·
True story. Just weeks before gold master, we lost everything. We had to cobble the depot and source code back together from random machines. Some stuff was lost forever.
Erich Schaefer@mediumclawboy

@ZenOfDesign 100% happened on Diablo II. @davidbrevik tells the story best.

English
7
5
145
24.7K