Re: WhiteSpaceTokenizer

2014-08-15 Thread Jack Krupansky
-user@lucene.apache.org Subject: Re: WhiteSpaceTokenizer Thanks, Jack. I haven't added myself to the contributor list yet, will do that and then login and comment on that ticket. One quick comment: wouldn't it be more reasonable to throw exception it a token length is more than 255, if relaxin

Re: WhiteSpaceTokenizer

2014-08-15 Thread Sheng
11:38 PM > To: java-user@lucene.apache.org > Subject: WhiteSpaceTokenizer > > The length of token has to be shorter than 255, otherwise there will > be unpredictable behaviors for this tokenizer. I see 255 is set as a > private final in the src code, but there is no documentation to

Re: WhiteSpaceTokenizer

2014-08-15 Thread Jack Krupansky
: https://issues.apache.org/jira/browse/LUCENE-5785 -- Jack Krupansky -Original Message- From: Sheng Sent: Thursday, August 14, 2014 11:38 PM To: java-user@lucene.apache.org Subject: WhiteSpaceTokenizer The length of token has to be shorter than 255, otherwise there will be unpredictable

WhiteSpaceTokenizer

2014-08-14 Thread Sheng
The length of token has to be shorter than 255, otherwise there will be unpredictable behaviors for this tokenizer. I see 255 is set as a private final in the src code, but there is no documentation to explicitly address that. Can we either make that number configurable (if not an option, I'd like

Re: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException

2013-04-15 Thread Jack Krupansky
:53 AM To: java-user@lucene.apache.org Subject: RE: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException Hi, It was always mandatory! In Lucene 2.x/3.x some Tokenizers just returned bogus, undefined stuff if not correctly reset before usage, especially when Tokenizers are "reused&q

RE: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException

2013-04-15 Thread Uwe Schindler
213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Jack Krupansky [mailto:j...@basetechnology.com] > Sent: Monday, April 15, 2013 4:25 PM > To: java-user@lucene.apache.org > Subject: Re: WhitespaceTokenizer, incrementToke() > ArrayOutOfBound

RE: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException

2013-04-15 Thread andi rexha
Thank you, that was the reason. > From: j...@basetechnology.com > To: java-user@lucene.apache.org > Subject: Re: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException > Date: Mon, 15 Apr 2013 10:25:26 -0400 > > I didn't read your code, but do you have the "

Re: WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException

2013-04-15 Thread Jack Krupansky
I didn't read your code, but do you have the "reset" that is now mandatory and throws AIOOBE if not present? -- Jack Krupansky -Original Message- From: andi rexha Sent: Monday, April 15, 2013 10:21 AM To: java-user@lucene.apache.org Subject: WhitespaceTokenizer

WhitespaceTokenizer, incrementToke() ArrayOutOfBoundException

2013-04-15 Thread andi rexha
Hi, I have tryed to get all the tokens from a TokenStream in the same way as I was doing in the 3.x version of Lucene, but now (at least with WhitespaceTokenizer) I get an exception: Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: