SWORD uses Lucene’s StandardAnalyzer which in turn uses WhitespaceTokenizer. It
doesn’t use WordDelimiterFilter. As such it doesn’t handle hyphenated words
well, including soft hyphen.
In Him,
DM
> On Apr 1, 2017, at 8:56 AM, David Haslam <[email protected]> wrote:
>
> Does SWORD search using Lucene ignore the presence of a soft hyphen in any
> word?
>
> i.e. If the user searches for 'violence' and the word in the text was
> 'violence' would it be found?
>
> NB. The second instance contains a soft hyphen \xAD between 'vio' and
> 'lence'.
>
> Best regards,
>
> David
>
>
>
> --
> View this message in context:
> http://sword-dev.350566.n4.nabble.com/Soft-hyphens-tp4657045.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
>
> _______________________________________________
> sword-devel mailing list: [email protected]
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page