IIRC, the StandardAnalyzer that SWORD uses doesn't allow for that. Its
handling of punctuation is fixed. As I've said before, the analyzer is only
good for English-like languages.
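
To illustrate the request, here is a minimal Python sketch of a tokenizer
that can be told to treat certain punctuation marks as letters. It is not
SWORD or Lucene code, and it does not reproduce StandardAnalyzer's actual
rules; the function name tokenize and the extra_letters parameter are
hypothetical, and "su'aal" is just an illustrative Somali-style word.

```python
import re

def tokenize(text, extra_letters=""):
    """Split text into lowercase word tokens.

    Characters in extra_letters are treated as letters instead of
    punctuation (a hypothetical option, not an existing conf setting).
    """
    # \w matches Unicode letters, digits and underscore; extend the
    # character class with the extra "letters" for this language.
    pattern = re.compile("[\\w" + re.escape(extra_letters) + "]+")
    return [token.lower() for token in pattern.findall(text)]

# Apostrophe treated as punctuation: the word is split in two.
print(tokenize("su'aal"))
# Apostrophe treated as a letter: the word is indexed whole.
print(tokenize("su'aal", extra_letters="'"))
```

With a fixed rule set, only the first behaviour is available; a per-module
setting would be needed to get the second.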

In Him,
        DM

On Dec 10, 2012, at 11:17 AM, David Haslam <dfh...@googlemail.com> wrote:

> There are some languages in which the apostrophe is used as a letter of the
> alphabet rather than an item of punctuation.
> 
> e.g. Somali, in which the apostrophe represents the /Alef/.
> 
> See http://en.wikipedia.org/wiki/Somali_alphabet
> 
> Guessing that our Lucene indexing method generally strips out such
> punctuation marks, it would be a useful enhancement in SWORD to be able to
> specify in the conf file that a particular punctuation mark should be parsed
> as a letter, such that the search index would then include the words
> containing this letter.
> 
> David
> 
> PS. There is a related issue in the SomKQA module that I'm researching with
> the providers of the source text. 
> It's conceivable that all the single right quotation marks should really be
> apostrophes.
> Their inclusion in the text may easily have been due to an artifact of their
> original editing environment.
> 
> 
> 
> --
> View this message in context: 
> http://sword-dev.350566.n4.nabble.com/Re-Search-bug-New-Arabic-Bible-Not-Shaped-SVD-Version-tp4651330p4651383.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
> 
> _______________________________________________
> sword-devel mailing list: sword-devel@crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page

