I don’t know what SWORD does.

JSword uses the Thai word break algorithm that’s part of Lucene. So to do a 
search, you either have to know Thai well enough to know where the words break, 
or do an OR search.

I don’t remember whether JSword does Khmer. I don’t think so.

The other technique that is available in Lucene is a windowing technique that 
breaks the search request into overlapping windows, each a few characters wide 
(I think it is 4 to 5). I haven’t played with it.
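
To make the windowing idea concrete, here is a rough sketch in plain Python. It 
is not Lucene’s implementation and uses no Lucene API; the function names 
(char_windows, build_index, search) and the default window size of 4 are 
assumptions made up for illustration. It indexes every overlapping run of 
characters and treats a query as a match when all of its windows occur in a 
document.

```python
def char_windows(text, size=4):
    """Split text into overlapping character windows of `size` code points."""
    # Drop whitespace so windows are not affected by optional spacing.
    chars = "".join(text.split())
    if not chars:
        return []
    if len(chars) <= size:
        # Text shorter than one window is kept as a single short window.
        return [chars]
    return [chars[i:i + size] for i in range(len(chars) - size + 1)]


def build_index(docs, size=4):
    """Map each window to the set of document ids that contain it."""
    index = {}
    for doc_id, text in docs.items():
        for window in char_windows(text, size):
            index.setdefault(window, set()).add(doc_id)
    return index


def search(index, query, size=4):
    """Return ids of documents that contain every window of the query."""
    windows = char_windows(query, size)
    if not windows:
        return set()
    result = None
    for window in windows:
        hits = index.get(window, set())
        result = set(hits) if result is None else result & hits
    return result


# Hypothetical sample data: two short Thai phrases, not taken from any module.
docs = {
    1: "ในปฐมกาลพระเจ้าทรงสร้าง",
    2: "พระเจ้าทรงรักโลก",
}
index = build_index(docs)
```

With this, search(index, "พระเจ้า") finds both sample phrases without anyone 
having to know where the word breaks are. The trade-offs are a larger index, 
possible false hits (the windows only have to occur somewhere in the document, 
not next to each other), and queries shorter than the window size generally 
finding nothing. The sketch also slices by code point, so a window can start or 
end in the middle of a base character plus vowel or tone mark.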

DM

> On Oct 8, 2018, at 12:29 PM, David Haslam <[email protected]> wrote:
> 
> How does SWORD index a module written in a language whose writing system has 
> no space between words?
> 
> Examples include Khmer and Thai.
> 
> David
> 
> Sent from ProtonMail Mobile
> _______________________________________________
> sword-devel mailing list: [email protected]
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page


