Matthias Brantner has proposed merging
lp:~matthias-brantner/zorba/data-cleaning into lp:zorba/data-cleaning-module.
Requested reviews:
Diogo Simões (diogo-simoes89)
For more details, see:
https://code.launchpad.net/~matthias-brantner/zorba/data-cleaning/+merge/79759
reenabled the smith-waterman
The function finally terminates and is much faster than before (with the latest
version of Zorba).
--
https://code.launchpad.net/~matthias-brantner/zorba/data-cleaning/+merge/79759
Your team Zorba Coders is subscribed to branch lp:zorba/data-cleaning-module.
=== modified file 'src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq'
--- src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq 2011-08-02 14:07:32 +0000
+++ src/com/zorba-xquery/www/modules/data-cleaning/character-based-string-similarity.xq 2011-10-19 02:20:29 +0000
@@ -165,7 +165,6 @@
: @param $penalty The penalty value.
: @return The Smith-Waterman distance between the two strings.
:)
-(:
declare function simc:smith-waterman ( $s1 as xs:string, $s2 as xs:string, $score as xs:integer, $penalty as xs:integer ) as xs:double{
if(string-length($s1) = 0) then 0 else
if(string-length($s2) = 0) then 0 else
@@ -176,4 +175,3 @@
simc:smith-waterman(substring($s1, 2), substring($s2, 2), $score, $penalty) + ( if(substring($s1, 1, 1) = substring($s2, 1, 1)) then $score else -$penalty )
))
};
-:)
--
Mailing list: https://launchpad.net/~zorba-coders
Post to : [email protected]
Unsubscribe : https://launchpad.net/~zorba-coders
More help : https://help.launchpad.net/ListHelp