Matthias Brantner has proposed merging lp:~matthias-brantner/zorba/data-cleaning into lp:zorba/data-cleaning-module.
Requested reviews: Diogo Simões (diogo-simoes89) For more details, see: https://code.launchpad.net/~matthias-brantner/zorba/data-cleaning/+merge/79759 reenabled the smith-waterman The function finally terminates and is much faster than before (with the latest version of Zorba). -- https://code.launchpad.net/~matthias-brantner/zorba/data-cleaning/+merge/79759 Your team Zorba Coders is subscribed to branch lp:zorba/data-cleaning-module.
=== modified file 'src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq' --- src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq 2011-08-02 14:07:32 +0000 +++ src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq 2011-10-19 02:20:29 +0000 @@ -165,7 +165,6 @@ : @param $penalty The penalty value. : @return The Smith-Waterman distance between the two strings. :) -(: declare function simc:smith-waterman ( $s1 as xs:string, $s2 as xs:string, $score as xs:integer, $penalty as xs:integer ) as xs:double{ if(string-length($s1) = 0) then 0 else if(string-length($s2) = 0) then 0 else @@ -176,4 +175,3 @@ simc:smith-waterman(substring($s1, 2), substring($s2, 2), $score, $penalty) + ( if(substring($s1, 1, 1) = substring($s2, 1, 1)) then $score else -$penalty ) )) }; -:)
-- Mailing list: https://launchpad.net/~zorba-coders Post to : zorba-coders@lists.launchpad.net Unsubscribe : https://launchpad.net/~zorba-coders More help : https://help.launchpad.net/ListHelp