2
I Use This!
Inactive
Analyzed 1 day ago. based on code collected 1 day ago.

Project Summary

Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. There is also a genetic algorithm for automatically tuning configurations. Duke is based on Lucene.

Tags

dedup deduplication java recordlinkage recordlinking

Badges

In a Nutshell, Duke (Dupe Killer)...

Apache License 2.0
Permitted

Commercial Use

Modify

Distribute

Place Warranty

Sub-License

Private Use

Use Patent Claims

Forbidden

Hold Liable

Use Trademarks

Required

Include Copyright

State Changes

Include License

Include Notice

These details are provided for information only. No information here is legal advice and should not be used as such.

This Project has No vulnerabilities Reported Against it

Did You Know...

  • ...
    nearly 1 in 3 companies have no process for identifying, tracking, or remediating known open source vulnerabilities
  • ...
    compare projects before you chose one to use
  • ...
    55% of companies leverage OSS for production infrastructure
  • ...
    check out hot projects on the Open Hub

Languages

Java
94%
XML
6%
HTML
<1%

30 Day Summary

Sep 20 2025 — Oct 20 2025

12 Month Summary

Oct 20 2024 — Oct 20 2025