Thanks, DM, for the reminder.

The same problem arises even for English, once we include those modern
versions that make use of contractions such as

"I'm"
"You've"
"He's"
"They're"
"We'd"
"She'll"
"Can't"

It's easy for humans to spot the fact that "m" "ve" "s" "re" "d" "ll" & "t"
are not whole words in and of themselves. Yet that's what would result from
stripping the apostrophes.
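
To make that concrete, here is a rough Python sketch. It assumes that
"stripping" means the apostrophe is treated as a word separator; the
naive_tokens helper is made up for illustration and is not what SWORD or
Lucene actually runs.

```python
contractions = ["I'm", "You've", "He's", "They're", "We'd", "She'll", "Can't"]

def naive_tokens(text):
    # Assumption: the apostrophe is replaced by a space, so each
    # contraction falls apart into two separate tokens.
    return text.replace("'", " ").split()

# Collect the second token of each contraction, i.e. the orphaned suffix.
fragments = [naive_tokens(word)[1] for word in contractions]
print(fragments)  # ['m', 've', 's', 're', 'd', 'll', 't']
```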

Anyone using a front-end in which one of the search options is "whole
words" might end up with misleading results.
The Lucene search index would presumably include such suffixes as distinct
words.

A proper indexing method would classify each whole contraction as a "word".
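
As a rough illustration of what I mean, here is a minimal Python sketch of
a tokenizer that keeps each contraction whole. The WORD pattern and the
tokenize and whole_word_match helpers are my own hypothetical names, not
anything from SWORD or Lucene, and the pattern only covers unaccented
Latin letters, so it is a sketch for the English case only.

```python
import re

# A word is a run of letters, optionally followed by one or more
# "apostrophe + letters" groups. Both the ASCII apostrophe and the
# typographic one (U+2019) are accepted.
WORD = re.compile(r"[A-Za-z]+(?:['\u2019][A-Za-z]+)*")

def tokenize(text):
    # Lowercase each match so that lookups are case-insensitive.
    return [m.group(0).lower() for m in WORD.finditer(text)]

def whole_word_match(query, text):
    # A "whole words" search hits only if the query equals a full token.
    return query.lower() in tokenize(text)

print(tokenize("They're sure she'll say I'm right"))
print(whole_word_match("re", "They're here"))       # False
print(whole_word_match("they're", "They're here"))  # True
```

With tokens like these, a "whole words" search for "re" no longer matches
inside "They're", while a search for the full contraction still does.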

David



