D
Analyzed 1 day ago
Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. There is also a genetic algorithm for automatically tuning configurations. Duke is based on Lucene.
18.7K
lines of code
0
current contributors
6 months
since last commit
2
users on Open Hub