Hi, No problem! I also updated the JavaDocs in trunk, 4.x and 3.6.1 to prevent this wrong usage (missing offset, count, charset).
I am glad that I was able to assist you! Uwe ----- Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de > -----Original Message----- > From: Mansour Al Akeel [mailto:mansour.alak...@gmail.com] > Sent: Saturday, June 23, 2012 11:21 PM > To: java-user@lucene.apache.org > Subject: Re: StandardTokenizer and split tokens > > Uwe, > thank you for the advice. I updated my code. > > > On Sat, Jun 23, 2012 at 3:15 AM, Uwe Schindler <u...@thetaphi.de> wrote: > >> I found the main issue. > >> I was using ByteRef without the length. This fixed the problem. > >> > >> String word = new > > String(ref.bytes,ref.offset,ref.length); > > > > Please see my other mail, using no character set here is the second > > problem of your code, this is the correct way to do: > > > > String word = ref.utf8ToString(); > > > > Uwe > > > > > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > > For additional commands, e-mail: java-user-h...@lucene.apache.org > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org