Hello @all,
I'm looking for a solution to find out if there is already some data in
my dataset that is similar to a new entry.
Example:
Companynames
I would like to find out if there are already companies in my
addressbook (DB) which are similar to a given name to avoid double entries.
How to measure similarity:
I'am thinking of the hammingdistance. That means the difference between
Linux and Linus is 1 as there is one letter different. The distance
between Linux and Lisa is 3 as there is one letter more and two are
different.
Does anyone have an idea how to realize that?
Can one realize this with code running on the database (PL/SQL or
something) or is there a way doing that with DBIx::Class (drawback: all
data had to read before processing).
Thank you for any hint.
Greets,
Mario Minati
_______________________________________________
List: http://lists.rawmode.org/cgi-bin/mailman/listinfo/dbix-class
Wiki: http://dbix-class.shadowcatsystems.co.uk/
IRC: irc.perl.org#dbix-class
SVN: http://dev.catalyst.perl.org/repos/bast/trunk/DBIx-Class/
Searchable Archive: http://www.mail-archive.com/[email protected]/