trying shingling / resemblance / simhash / sketching to do some data deduping
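
For anyone curious what that looks like in practice, here is a rough sketch, not the actual code from this experiment. It assumes word-level shingles of size 3, Jaccard similarity as the resemblance measure, and a 64-bit simhash built from truncated MD5 digests. The function names (`shingles`, `resemblance`, `simhash`, `hamming`) and all parameters are made up for illustration.

```python
import hashlib


def shingles(text, k=3):
    """Return the set of k-word shingles (contiguous word windows) in text."""
    words = text.lower().split()
    if not words:
        return set()
    if len(words) < k:
        # Text shorter than one window: treat the whole text as one shingle.
        return {" ".join(words)}
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def resemblance(a, b, k=3):
    """Jaccard resemblance of the two texts' shingle sets, in [0, 1]."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty texts count as identical (an assumption)
    return len(sa & sb) / len(sa | sb)


def simhash(text, k=3):
    """64-bit simhash fingerprint built from the text's shingles."""
    counts = [0] * 64
    for s in shingles(text, k):
        # First 8 bytes of MD5 give a stable 64-bit hash per shingle.
        h = int.from_bytes(hashlib.md5(s.encode("utf-8")).digest()[:8], "big")
        for i in range(64):
            counts[i] += 1 if (h >> i) & 1 else -1
    # Bit i of the fingerprint is set when the votes for it are positive.
    return sum(1 << i for i in range(64) if counts[i] > 0)


def hamming(x, y):
    """Number of differing bits between two fingerprints."""
    return bin(x ^ y).count("1")


if __name__ == "__main__":
    doc_a = "the quick brown fox jumps over the lazy dog"
    doc_b = "the quick brown fox jumped over the lazy dog"
    print(resemblance(doc_a, doc_b))
    print(hamming(simhash(doc_a), simhash(doc_b)))
```

Near-duplicates should score a high resemblance and a small Hamming distance between fingerprints; where to put the cutoff for "duplicate" depends on the data and is left open here.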
dedup deduplication