D
Analyzed 12 months ago
Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. There is also a genetic algorithm for automatically tuning configurations. Duke is based on Lucene.
18.7K
lines of code
0
current contributors
over 1 year
since last commit
2
users on Open Hub