This is a Perl module that measures the similarity of two files or
two strings based on the number of overlapping (shared) words, scaled
by the lengths of the files. It computes the F-Measure, the Dice
Coefficient, the Cosine, and the Lesk measure.
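As a rough illustration of how overlap-based scores of this kind can be computed, here is a minimal sketch in Python. It is not the module's code and does not use its API. It assumes whitespace tokenization, lowercasing, and single-word overlaps counted as a multiset intersection, and it uses the common textbook forms of the F-Measure, Dice Coefficient, and Cosine. The function name `overlap_measures` is hypothetical. The Lesk measure is left out because its definition depends on phrase-matching details not described here.

```python
import math
from collections import Counter


def overlap_measures(text1, text2):
    """Score two strings by shared words, scaled by their lengths.

    Assumption: tokens are lowercased, whitespace-separated words, and a
    word shared k times in both strings contributes k to the overlap.
    """
    words1 = text1.lower().split()
    words2 = text2.lower().split()
    if not words1 or not words2:
        return {"overlap": 0, "f_measure": 0.0, "dice": 0.0, "cosine": 0.0}

    # Multiset intersection: per word, the smaller of the two counts.
    shared = Counter(words1) & Counter(words2)
    overlap = sum(shared.values())

    # Precision and recall relative to the second and first string.
    precision = overlap / len(words2)
    recall = overlap / len(words1)
    if overlap:
        f_measure = 2 * precision * recall / (precision + recall)
    else:
        f_measure = 0.0

    dice = 2 * overlap / (len(words1) + len(words2))
    cosine = overlap / math.sqrt(len(words1) * len(words2))

    return {
        "overlap": overlap,
        "f_measure": f_measure,
        "dice": dice,
        "cosine": cosine,
    }


if __name__ == "__main__":
    scores = overlap_measures("the cat sat on the mat",
                              "the dog sat on the log")
    for name, value in scores.items():
        print(name, value)
```

Under these particular definitions the F-Measure and the Dice Coefficient reduce to the same value, 2 * overlap / (length1 + length2). The module itself may tokenize, normalize, and match phrases differently, so its scores need not agree with this sketch.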
The project has mailing lists.
The current version is v0.09, released January 22, 2013.
Text-Similarity Development Team
The development of Text-Similarity has been supported by a National
Science Foundation Faculty Early Career Development (CAREER) Program
award (#0092784, 2001-2007), by a Grant in Aid of Research, Artistry and
Scholarship from the Graduate School of the University of Minnesota
(2003-2004), and by the Digital Technology Initiative of the Digital
Technology Center of the University of Minnesota (2004-2005).
By: Ted Pedersen (tpederse AT d umn edu)