quite stable - you
should consider using it even pre-release.
Steve
> -Original Message-
> From: Martin O'Shea [mailto:app...@dsl.pipex.com]
> Sent: Sunday, October 24, 2010 5:29 PM
> To: java-user@lucene.apache.org
> Subject: FW: Use of hyphens in StandardAnalyzer
&g
Browne.
--- On Mon, 10/25/10, Martin O'Shea wrote:
> From: Martin O'Shea
> Subject: FW: Use of hyphens in StandardAnalyzer
> To: java-user@lucene.apache.org
> Date: Monday, October 25, 2010, 12:28 AM
> A good suggestion. But I'm using
> Lucene 3.0.2 and the construct
t: RE: Use of hyphens in StandardAnalyzer
Hi Martin,
StandardTokenizer and -Analyzer have been changed, as of future version 3.1
(the next release) to support the Unicode segmentation rules in UAX#29. My
(untested) guess is that your hyphenated word will be kept as a single token if
you se
n the constructor.
Steve
> -Original Message-
> From: Martin O'Shea [mailto:app...@dsl.pipex.com]
> Sent: Sunday, October 24, 2010 3:59 PM
> To: java-user@lucene.apache.org
> Subject: Use of hyphens in StandardAnalyzer
>
> Hello
>
>
>
> I have a Standar
constructor.
Steve
> -Original Message-
> From: Martin O'Shea [mailto:app...@dsl.pipex.com]
> Sent: Sunday, October 24, 2010 3:59 PM
> To: java-user@lucene.apache.org
> Subject: Use of hyphens in StandardAnalyzer
>
> Hello
>
>
>
> I have a StandardAnal
Hello
I have a StandardAnalyzer working which retrieves words and frequencies from
a single document using a TermVectorMapper which is populating a HashMap.
But if I use the following text as a field in my document, i.e.
addDoc(w, "lucene Lawton-Browne Lucene");
The word frequencie