Rowan Crowe
2.1K posts


> you open a website
> it says: "click all the traffic lights"
> it takes you 10 seconds
> 200 million people do this every day
> you weren't passing a security check
> you were labeling training data for a self-driving car
> google acquires recaptcha in 2009
> deploys it on every bank, store, and government site on earth
> turns the entire internet into an unpaid AI annotation factory
> waymo is now worth $45 billion
> you didn't pass the test. you built it.
Sharbel@sharbel
English

@crisisofconsc A family 'friend' moved out and left behind a heavily pregnant cat. I collected mum+kittens and fostered them for several weeks.
I suspect our cat, who randomly appeared in our yard one day, was a "hey, you want a cat?" thing, and the 'owner' never really cared enough for him.
English

What I learned today is that some people, even with the best intentions, take minimal care of their pets.
The foster cats I took in (it was an owner surrender) have: fleas, tapeworms, upper respiratory infections, and one with an infected toenail that might need surgery.
Their owner thought they were right as rain. 😵💫 She actually thought I was odd for requesting a vet visit before taking them in.
English

@martyrdison Years ago, in group chat, there was a participant who was a lesbian in a wheelchair. Frequently seeking sympathy. After some time, people started asking questions, then her partner informed us that she'd died. Turned out both were played by an able man - also a chat participant.
English

saw this tweet today about me in middle school and laughed because it’s ACTUALLY partially true:
> be me
> ugly 12–13 year old theater kid in middle school
> have one best friend, (we’ll call her emily)
> emily and i meet at musical theater camp
> one day she says she has a boyfriend
> emo kid, dark swishy hair, exactly my type
> then tells me
> “his best friend likes you”
> suddenly i have a boyfriend too
> entire relationship happens over AIM
> never meet him because he “goes to another school”
> then one day she tells me he killed himself
> train tracks, dramatic note, whole story
> i’m devastated
> my mom starts calling around like the FBI, trying to see if anyone actually died
> turns out none of it happened
> emily made up the boyfriend, the best friend, the suicide
> her mom eventually calls my parents and confesses
> never speak to emily again
> switch schools out of pure embarrassment because of a middle school catfish ring run by exactly one person

English

@BonesMoses @loyalmoses I acquired a CD-ROM drive around the same time, although it was given to me. It was CD player size, with a big fat SCSI cable going to an ISA card.
(I realised later that this would have cost a fortune new, and was probably worth a bit, even used)
English

@loyalmoses I bought a CD drive specifically for my BBS in '94 or '95. I started with So Much Shareware! Vol. 3 or something and upgraded whenever I was at the local computer shop. 😭
English

When I ran a multi-node BBS back in the early 90s, I started with a 40MB card hard drive.
The BBS was so busy, and we didn’t have enough storage, so we had a published schedule, a list and a request system to unmount and mount two 1.44MB 3.5” floppy disks throughout the day.
This allowed us to serve far more downloads… what a crazy time that was. 🤓
GIF
English

@AmazonHelp Yes, I've now removed it, but it's been dormant for years. Amazon, for some reason, decided to start using it as primary??? Which then made it my problem to fix.
English

Hello, we're sorry to hear about this. If your main payment method has an issue, such as declining etc., then we will try one of the other available payment methods on your account. If this is an old or expired card we advise removing it from your account, and updating your payment method. You can learn more about this here: amzn.to/3NrsLaF. Thank you.
-Morgan
English

@tsoding I often use "if (1==2)" to make it stand out as something unusual, and be absolutely clear that this condition will never be satisfied. I do like "if (0)", might give that a try.
English

Very often I want to temporarily disable a piece of code. I comment it out, but then I'm faced with a problem that since the code is never compiled it gets "stale". Some functions it uses may have changed and it is never type checked. So the next time I enable it, it doesn't compile and I spent a lot of time fixing it.
The solution I came up with so far is to "comment out" the code with the runtime `if (0)`. The code will never be executed, the optimizer will very like eliminate the code entirely, but before doing so the compiler will type check it, and will force me to fix it on the spot.

English

@Andrew___Morris Yes, the threshold seems far too low. A few blackholed packets to port 22 should be noted, but shouldn't be sufficient to auto-generate an abuse report, as the source can't be verified.
Connect and negotiation resulting in a failed login attempt? That's more like it.
English

@levelsio I guess the host logging all outbound connections would definitively prove that your VPS didn't initiate the claimed abuse, but that's going to be difficult at scale, and it's ethically questionable.
English

Any way I can avoid my IP getting spoofed?
Can hosting providers check if abuse reports are based on real or spoofed packages?
Matthew Prince 🌥@eastdakota
@levelsio Probably a reflection attack. Your IP spoofed in Port 22 queries to other services which echo back a larger response. Been a lot of these going around lately.
English

@levelsio @Hetzner_Online How do you even respond to a superfluous abuse report?
- "idk what's happening"
- "k, carry on"
I ran a TOR exit node at Hetzner some years back. That didn't last long. :)
English

What's scary is how easy it is to take down anyone's site if they're hosted on @Hetzner_Online
I'm the biggest Hetzner fan btw so that's why I say it
But you can just send an abuse report to any site on Hetzner and if the site owner doesn't reply within 24 hours they generally block the server IP taking down the site
It happened to me this week for the nth time!
The question is if you're a solo biz like me, if you get to that email on time
Essentially you have to be on standby every day of the week forever just to keep your sites up
@Cloudflare is much more reasonable with abuse reports, taking the side of their paying customers first, and only taking down if forced by a court (as it should be!)
@levelsio@levelsio
I keep getting abuse reports on Hetzner about portscans on port 22 outbound But I have port 22 outbound blocked in Hetzner's firewall and yes it's enabled etc. I'm starting to think their abuse reports are some kind of abuse itself?
English

@financedystop Ehhh. I wanted to add a rear courtesy lamp to my mid level sedan. Had the cutout for the lamp lens, but once I removed the console I realised there was no harness. Ford said I had to pay $230 for a high level harness, just to get those 2 wires going to the rear lamp.
English

@RosannaPansino I've had the same IP address for years. It's never watched a MrBeast video. One day I cleared cookies and loaded Youtube. Two or three MrBeast videos featured in the first few rows.
Does he even need to buy views when Youtube feed him to us by default?
English

Had the team run some tests!
Apparently, if you spend $1 Million (the amount MrBeast gives away in a video) you can buy nearly 600 MILLION views using YouTube Promote.
This is a screenshot of YouTubes backend where you can put how many views you want to buy for your video.
Turns out it’s pretty easy to inflate views and subs when you’re a billionaire with some extra cash to spend 💸📈

English

@val_nemo @kingofknowwhere How do you get that 5TB of data to your rented CPU? At 100Mbytes/sec it would take half a day to transfer.
English

@kingofknowwhere > be a normal person
> word count is io bound
> rent 4x16 core cpu, shard and grep
> 30 min max and $4.
> faster than read the ai generated wall of slop
English

> be data scientist
> boss gives 5TB of raw .txt files. (In interview always ask file format btw)
> "just do some word counts and searches, anon"
> MFW it would take 3 years to grep this on his t3 micro mini aws
> time to go full autist and achieve maximum throughput
The "Ascended" Data Pipeline
> Step 1: The Great Partitioning
> don't just "read" the files because byte io streams will cry
> use pyarrow.dataset to scan the disk
> fragment the raw text into 128MB Parquet chunks
> optimise for L3 caching and NVMe sequential read speed
> partition by date/source so the OS doesn't cry
> Or partition by random (non English words)
> Step 2: The Information Theory Play
> take a 1% statistical sample of the whole 5TB
> build a global FSST (Fast Static String Taxonomy) dictionary
> global dictionary for high-cardinality terms
> identify the common prefixes/suffixes for trie storage
> language entropy into 1-byte symbols
> A LOUDS Trie uses only 2.14 bits per node.
> You can then run POPCNT at O(1)
> Step 3: The Ingestion (The "Warm-up")
> spin up every CPU core you own
> pq.ParquetWriter with dictionary_encoding=True
> convert raw strings into integer keys pointing to the FSST Trie
> attach a Bloom Filter to every Row Group
> if the word "abcd" isn't in the filter, the CPU won't even look at the data
> key based counting is easy but we take one step ahead
> Step 4: Memory Mapping (Arrow fs "No-Copy" Hack)
> set mmap=True in the PyArrow scanner
> OS kernel handles the data
> RAM usage stays at 500MB while processing TBs
> physical disk bytes are mapped directly to virtual address space
> ZERO. COPIES. MADE.
> Step 5: More optim: The GPGPU Option
> standard Python is for children
> pipe the Arrow Buffers directly to NVIDIA cuDF via GPUDirect Storage
> bypass the CPU and System RAM entirely
> NVMe -> PCIe -> VRAM
> Step 6: The Calculation (The "Sigma" Move)
> bit-sliced parallel search on the GPU
> 5,000+ CUDA cores performing integer comparisons on the FSST keys
> search O(1) instead of O(N)
> word counts happen at 40GB/sec
> finish the 5TB task before your coffee gets cold
> Step 7: CPU "SIMD" cope-poor people pathway
> if you're poor and don't have a GPU
> use AVX-512 bit-slicing in C++
> process 64 words in a single clock cycle on one core
> still faster than 99% of "Big Data" engineers
Raj Dabre@prajdabre
Technical interview question: Suppose you have 5 TB worth of text data and you want to count the total number of words, how will you do this?
English

@0xSweep No one (as far as we know) has a way to see into the future.
I was a relatively early adopter. If I'd HODL'd every Bitcoin I ever mined or purchased, I'd be a millionaire.
Would life be nicer? Yeah. Do I beat myself up because I didn't predict the explosion of Bitcoin? Nah.
English

Everyone knows about the 10,000 Bitcoin pizza but nobody talks about the other 150,000 BTC he wasted after
Before the famous pizza order, Laszlo Hanyecz invented GPU mining for Bitcoin
Which made it possible for him to mine thousands of BTC per day
Satoshi personally messaged him about it, and was worried it would break the network
Laszlo's response: "I feel like I crapped up your project"
He kept buying strangers dinner in Bitcoin until he couldn't mine fast enough to keep up
His wallet has a total volume of 162,864 BTC
That's $10.7 BILLION in today's value and $20.3 BILLION at the peak price of Bitcoin
His balance now? 0.00018 BTC around $12
The guy who invented Bitcoin mining actually wasted BILLIONS
Probably the biggest fumble in the history of Bitcoin


English

@AaronDodd @DailyMailAU She would have been trained and had it repeatedly drilled into her over her 14 years that patient records are private.
The extra restricted access step means he's probably marked as a confidential patient, but that wasn't enough to make her reconsider!?
"lolz everyone does it"
English

Point of order on this @DailyMailAU. The staff were sacked because they illegally accessed a patient's confidential health records *NOT* because they were Dan Andrews' records. That's actually incidental. #auspol mol.im/a/15583153
English





