RE: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Steven A Rowe
quite stable - you should consider using it even pre-release. Steve > -Original Message- > From: Martin O'Shea [mailto:app...@dsl.pipex.com] > Sent: Sunday, October 24, 2010 5:29 PM > To: java-user@lucene.apache.org > Subject: FW: Use of hyphens in StandardAnalyzer &g

Re: FW: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Ahmet Arslan
Browne. --- On Mon, 10/25/10, Martin O'Shea wrote: > From: Martin O'Shea > Subject: FW: Use of hyphens in StandardAnalyzer > To: java-user@lucene.apache.org > Date: Monday, October 25, 2010, 12:28 AM > A good suggestion. But I'm using > Lucene 3.0.2 and the construct

FW: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Martin O'Shea
t: RE: Use of hyphens in StandardAnalyzer Hi Martin, StandardTokenizer and -Analyzer have been changed, as of future version 3.1 (the next release) to support the Unicode segmentation rules in UAX#29. My (untested) guess is that your hyphenated word will be kept as a single token if you se

RE: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Martin O'Shea
n the constructor. Steve > -Original Message- > From: Martin O'Shea [mailto:app...@dsl.pipex.com] > Sent: Sunday, October 24, 2010 3:59 PM > To: java-user@lucene.apache.org > Subject: Use of hyphens in StandardAnalyzer > > Hello > > > > I have a Standar

RE: Use of hyphens in StandardAnalyzer

2010-10-24 Thread Steven A Rowe
constructor. Steve > -Original Message- > From: Martin O'Shea [mailto:app...@dsl.pipex.com] > Sent: Sunday, October 24, 2010 3:59 PM > To: java-user@lucene.apache.org > Subject: Use of hyphens in StandardAnalyzer > > Hello > > > > I have a StandardAnal

Use of hyphens in StandardAnalyzer

2010-10-24 Thread Martin O'Shea
Hello I have a StandardAnalyzer working which retrieves words and frequencies from a single document using a TermVectorMapper which is populating a HashMap. But if I use the following text as a field in my document, i.e. addDoc(w, "lucene Lawton-Browne Lucene"); The word frequencie