Tags : Browse Projects

Select a tag to browse associated projects and drill deeper into the tag cloud.

dupeGuru

Compare

  Analyzed about 9 hours ago

dupeGuru is a tool to find duplicate files on your computer. It can scan either filenames or contents. The filename scan features a fuzzy matching algorithm that can find duplicate filenames even when they are not exactly the same. dupeGuru runs on Windows and Mac OS X.

17.8K lines of code

3 current contributors

2 months since last commit

4 users on Open Hub

Low Activity
0.0
 
I Use This

Duke (Dupe Killer)

Compare

  Analyzed about 6 hours ago

Duke is a fast record linkage and deduplication engine written in Java. It provides both an API and a command-line interface, and supports incremental processing. There is also a genetic algorithm for automatically tuning configurations. Duke is based on Lucene.

18.7K lines of code

0 current contributors

7 months since last commit

2 users on Open Hub

Very Low Activity
4.0
   
I Use This

lessfs - data deduplication for less

Compare

  Analyzed about 6 hours ago

Lessfs is an userspace (fuse) inline data de-duplicating filesystem for Linux that includes support for lzo or QuickLZ compression and encryption.

20.1K lines of code

0 current contributors

almost 14 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

resemblance

Compare

  Analyzed about 1 hour ago

trying shingling / resemblance / simhash / sketching to do some data deduping

3.01K lines of code

0 current contributors

over 8 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This
Licenses: No declared licenses

DedupFS

Compare

  Analyzed about 8 hours ago

A Python FUSE file system that features transparent deduplication and compression which make it ideal for archiving backups.

1.78K lines of code

0 current contributors

almost 14 years since last commit

0 users on Open Hub

Inactive
0.0
 
I Use This