ron

1.2K posts

ron banner
ron

ron

@RonxldWilson

Builder | Autodidact in making.

India Katılım Haziran 2020
312 Takip Edilen753 Takipçiler
Sabitlenmiş Tweet
ron
ron@RonxldWilson·
how I built a search engine from scratch here's what I have been building over the course of last month resulting in visiting of over 55 million unique domains 130 GB of sqlite DB, 200 million rows and over 4 million unique Indian B2B businesses 1/n
ron tweet media
English
11
19
164
32.2K
ron retweetledi
difficultyang
difficultyang@difficultyang·
You've implemented FIFO and LIFO data structures, but have you implemented FAFO?
English
36
55
909
67.2K
ron
ron@RonxldWilson·
RAM shortage so insane that have to rewrite established docker images to make them efficient
English
0
0
5
60
ron
ron@RonxldWilson·
mac mini crashed thrice today running self hosted services system data space on device would just randomly bloat up by 50+GB fill up all device storage and then crash the device would shutdown and then just startup normally as if nothing happened so annoying
English
0
0
4
88
ron
ron@RonxldWilson·
@terencebuilds Workers have access to the git repo they detect change in git commit in 15 min interval and restart the run with updated code
English
0
0
0
14
Terence
Terence@terencebuilds·
@RonxldWilson remote worker control is such a game changer. how are you handling network failures when pushing updates?
English
1
0
1
14
ron
ron@RonxldWilson·
@ryancarson recently tested out restoring backups of my search engine db and realized its impossible to recover from the backups I was taking so diligently had to change entire backup strategy due to it
English
0
0
3
54
Ryan Carson
Ryan Carson@ryancarson·
Please, pretty please, *actually* test restoring from your db backup (which, for the love of god, should be from an offsite location).
English
28
2
106
11.3K
ron
ron@RonxldWilson·
One of the things that I found was most Indian businesses are siloed sites having no outbound/inbound links to other sites in industry Which made making discovery harder as they couldn't be found via normal hopping Which lead me to solution I call as search engine distillation
ron@RonxldWilson

how I built a search engine from scratch here's what I have been building over the course of last month resulting in visiting of over 55 million unique domains 130 GB of sqlite DB, 200 million rows and over 4 million unique Indian B2B businesses 1/n

English
0
0
9
248
ron
ron@RonxldWilson·
Some of these domains in the clusters were legit business that got hacked and were acting as a proxy and taking part in seo farm activities without being aware of it
English
0
0
3
56
ron
ron@RonxldWilson·
I would go in and manually prune these clusters from the frontier to save resources while doing in depth crawling.
English
1
0
3
72
ron
ron@RonxldWilson·
These are basically group of 50-100 websites which rapidly linked to each other. Forming a dense cluster of their own. Think of them like SEO farms which just link to each other to boost each other's popularity
Kinder • Grinder@kinder_grinder

@RonxldWilson what does those isolated groups represent? like PBN?

English
3
0
9
185
ron
ron@RonxldWilson·
@kinder_grinder Each dot is a site, different colors here represents various TLDs For my usecase I was mostly concerned with .in .co.in .org .net and .com domains This graph here is just the top 10k most connected domians, wasn't able to render more as that slows down the device by a lot
English
1
0
1
18
Kinder • Grinder
Kinder • Grinder@kinder_grinder·
@RonxldWilson I can imagine. I am very interested in things u do, created some simple crawlers in the past, but always wanted to crawl whole web (ik its huge job). What does each dot represent? A website or something bigger?
English
1
0
1
14
ron
ron@RonxldWilson·
@kinder_grinder It does look like that, the interactive view is fun to play around with, can drag around any single one and it causes ripples across the whole landscape
English
1
0
1
17
ron
ron@RonxldWilson·
@byLuocca Thanks Luca!
English
0
0
1
10
ron
ron@RonxldWilson·
how I built a search engine from scratch here's what I have been building over the course of last month resulting in visiting of over 55 million unique domains 130 GB of sqlite DB, 200 million rows and over 4 million unique Indian B2B businesses 1/n
ron tweet media
English
11
19
164
32.2K
ron
ron@RonxldWilson·
@byLuocca Original usecase was just to use it as an ancillary service for our main service but we might launch it as a standalone service something in the form of serper or apollo like
English
1
0
2
58
ron
ron@RonxldWilson·
@robaiapps Haha yes! No idea how I ended up with it
English
0
0
2
56
ron
ron@RonxldWilson·
this was all just step 1 of getting all the urls in existence there are around 10+ more steps that were done in the project. its getting late will share about the next steps in done in another thread soon. 17/X
English
2
0
6
472
ron
ron@RonxldWilson·
@Dmarketsniper this was originally started as a side project to see the feasibility of this becoming a ancillary service to our main service have reached a point where this alone can be converted into a standalone service but we still need to internally align on where we plan to take this
English
1
0
2
150
Brandon | Outreach
Brandon | Outreach@Dmarketsniper·
@RonxldWilson Impressive build. Curious how are you planning to turn that into distribution or customers? Feels like that’s where most projects like this stall.
English
1
0
1
167