Please remember,
SWORD already supports a search normalization layer. We have
normalizers for many things like accents, diacritics, etc., that we run
on the text before passing the text to lucene (or using our own search
mechanism).
SWORD has distinct stages where it applies filters. The two most
obvious are the render stage and the search stage (names Render and
Strip in the engine). We have many filters that do many different
things and any can be applied to a module for normalizing during search
by including a: LocalStripFilter=FilterName in the module's .conf file.
Here are the filters currently available:
http://www.crosswire.org/svn/sword/trunk/src/modules/filters/
So, for example, we use have:
LocalStripFilter=UTF8GreekAccents
LocalStripFilter=PapyriPlain
To normalize papyrilogical searches on the Duke Databank of Papyri:
http://crosswire.org/study/wordsearchresults.jsp?mod=DDP&searchTerm=%CF%80%CE%B1%CF%81%CE%B1%CE%B3%CE%B3%CE%B5%CE%BB%CE%BB*
These normalizations discussed certainly need to be discussed and
considered but we have a mechanism in place to do this in SWORD.
Troy
On 03/03/2013 05:57 PM, DM Smith wrote:
On Mar 3, 2013, at 11:53 AM, Chris Burrell <ch...@burrell.me.uk
<mailto:ch...@burrell.me.uk>> wrote:
Yes although in French only the contacted form is correct
WRT indexing and searching, it really doesn't matter which is correct.
The normalization is not visible to the user. Normalization often goes
to forms that are ugly for the end-user.
-- DM
On 3 Mar 2013 16:10, "David Haslam" <dfh...@googlemail.com
<mailto:dfh...@googlemail.com>> wrote:
There are similar issues in French modules.
e.g. Some French Bibles have "coeur", some have "coeur", and some
even use
both!
etc., etc.
David
--
View this message in context:
http://sword-dev.350566.n4.nabble.com/Searching-for-hyphenated-words-tp4652016p4652042.html
Sent from the SWORD Dev mailing list archive at Nabble.com
<http://Nabble.com>.
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
<mailto:sword-devel@crosswire.org>
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
<mailto:sword-devel@crosswire.org>
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page