rdfind - Online in the Cloud

This is the command rdfind that can be run in the OnWorks free hosting provider using one of our multiple free online workstations such as Ubuntu Online, Fedora Online, Windows online emulator or MAC OS online emulator

PROGRAM:

NAME


rdfind - finds duplicate files

SYNOPSIS


rdfind [ options ] directory1 | file1 [ directory2 | file2 ] ...

DESCRIPTION


rdfind finds duplicate files across and/or within several directories. It calculates
checksum only if necessary. rdfind runs in O(Nlog(N)) time with N being the number of
files.

If two (or more) equal files are found, the program decides which of them is the original
and the rest are considered duplicates. This is done by ranking the files to each other
and deciding which has the highest rank. See section RANKING for details.

If you need better control over the ranking than given, you can use some preprocessor
which sorts the file names in desired order and then run the program using xargs. See
examples below for how to use find and xargs in conjunction with rdfind.

To include files or directories that have names starting with -, use rdfind ./- to not
confuse them with options.

RANKING


Given two or more equal files, the one with the highest rank is selected to be the
original and the rest are duplicates. The rules of ranking are given below, where the
rules are executed from start until an original has been found. Given two files A and B
which have equal content, the ranking is as follows:

If A was found while scanning an input argument earlier than than B, A is higher ranked.

If A was found at a depth lower than B, A is higher ranked (A closer to the root)

If A was found earlier than B, A is higher ranked.

The last rule is needed when two files are found in the same directory (obviously not
given in separate arguments, otherwise the first rule applies) and gives the same order
between the files as the operating system delivers the files while listing the directory.
This is operating system specific behaviour.

OPTIONS


Searching options etc:

-ignoreempty true|false
Ignore empty files. (default)

-followsymlinks true|false
Follow symlinks. Default is false.

-removeidentinode true|false
removes items found which have identical inode and device ID. Default is true.

-checksum md5|sha1
what type of checksum to be used: md5 or sha1. Default is md5.

Action options:

-makesymlinks true|false
Replace duplicate files with symbolic links

-makehardlinks true|false
Replace duplicate files with hard links

-makeresultsfile true|false
Make a results file results.txt (default) in the current directory.

-outputname name
Make the results file name to be "name" instead of the default results.txt.

-deleteduplicates true|false
Delete (unlink) files.

General options:

-sleep Xms
sleeps X milliseconds between reading each file, to reduce load. Default is 0 (no
sleep). Note that only a few values are supported at present: 0,1-5,10,25,50,100
milliseconds.

-n -dryrun
displays what should have been done, dont actually delete or link anything.

-h, -help, --help
displays brief help message.

-v, -version, --version
displays version number.

EXAMPLES


Search for duplicate files in home directory and a backup directory:
rdfind ~ /mnt/backup

Delete duplicate in a backup directory:
rdfind -deleteduplicates true /mnt/backup

Search for duplicate files in directories called foo:
find . -type d -name foo -print0 |xargs -0 rdfind

Use rdfind online using onworks.net services



Latest Linux & Windows online programs