
@sublime_sec TL;DR How to make ~90% similarity search
Instead of one hash for a 100% match,
1. Use many min hashes (400-500)
2. Group those into a handful of big hashes (10-20)
3. Find an exact matching big hash to get close
4. Count matching small hashes to calculate similarity
English







