ne 10, 2008 2:09:10 AM
> Subject: Re: How international languages are supported in Lucene
>
> On Tuesday 10 June 2008 07:49:29 Otis Gospodnetic wrote:
> > Hi Daniel,
> >
> > What makes you say that about language detection? Wouldn't that depend on
> > the lang
On Tuesday 10 June 2008 07:49:29 Otis Gospodnetic wrote:
> Hi Daniel,
>
> What makes you say that about language detection? Wouldn't that depend on
> the language detection approach or tool one uses and on the type and amount
> of content one trains language detector on? And what is the threshold
Thanks,
Otis --
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Daniel Noll <[EMAIL PROTECTED]>
> To: java-user@lucene.apache.org
> Sent: Thursday, June 5, 2008 7:36:11 PM
> Subject: Re: How international languages are support
Thanks,
Otis --
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Daniel Noll <[EMAIL PROTECTED]>
> To: java-user@lucene.apache.org
> Sent: Thursday, June 5, 2008 7:36:11 PM
> Subject: Re: How international languages are support
Thanks Erick.
-Original Message-
From: Erick Erickson [mailto:[EMAIL PROTECTED]
Sent: Thursday, June 05, 2008 9:51 AM
To: java-user@lucene.apache.org
Subject: Re: How international languages are supported in Lucene
See below
On Thu, Jun 5, 2008 at 12:04 PM, Michael Siu <[EM
nalyze them
with the same analyzer. Especially when you get different
language encodings in the document.
Best
Erick
>
> Thanks again.
>
>
>
> -Original Message-
> From: Grant Ingersoll [mailto:[EMAIL PROTECTED]
> Sent: Thursday, June 05, 2008 8:53 AM
> To: ja
@lucene.apache.org
Subject: Re: How international languages are supported in Lucene
Hi Michael,
That's a pretty open ended question and, I'm assuming, by
"international languages" you mean non-English :-). You might get
some mileage out of
http://wiki.apache.org/lucene-java/IndexingOther
Hi Michael,
That's a pretty open ended question and, I'm assuming, by
"international languages" you mean non-English :-). You might get
some mileage out of http://wiki.apache.org/lucene-java/IndexingOtherLanguages
but it is a bit out of date (namely the sandbox references).
Lucene inde