Jason Lee created TEXT-129:
------------------------------
Summary: incorrect result from JaroWinklerDistance(计算不正确)
Key: TEXT-129
URL: https://issues.apache.org/jira/browse/TEXT-129
Project: Commons Text
Issue Type: Bug
Affects Versions: 1.4
Environment: commons-lang3:3.7
Reporter: Jason Lee
Attachments: Screenshot from 2018-07-31 13-04-24.png
JaroWinklerDistance resolves 0 similariy between "_trump_" and "_donald trump_"
scala exmaple here:
scala> val jw=new JaroWinklerDistance
scala> jw("*trump*","*donald trump*")
*res1: Double = 0.0*
scala> jw("ivanka trump","donald trump") // correct
res2: Double = 0.736111111111111
scala> jw(" trump","trump") // correct result; there's a leading space in first
string
res13: Double = 0.9444444444444445
scala> jw("a trump","trump") // correct
res14: Double = 0.9047619047619048
scala> jw("aa trump","trump") // correct
res15: Double = 0.875
scala> jw("aaa trump","trump") // *INCORRECT*
res16: Double = 0.0
scala> jw("hillary cliton","clinton") // correct
res8: Double = 0.30952380952380953
scala> jw("donald trump","trump") // INCORRECT
res9: Double = 0.0
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)