I copied the workaround from HTMLCharFilter to StandardTokenizer's code generator. It is a regex that strips those ctors from the generated source. Unfortunately, the regex depends on a missing period in the javadocs (I was not able to fix it with lazy or non-greedy quantifiers, or any other regex trick).
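To illustrate the idea, here is a minimal sketch of such a stripping step. It is not the regex from the commit: the class name `StripInputStreamCtor`, the method `strip`, and the pattern itself are made up for this example. It assumes the ctor body contains no nested braces and that the javadoc comment directly precedes the ctor. Instead of a lazy quantifier it uses a negative lookahead so that the comment match can never run past the first `*/`, which avoids depending on the exact javadoc wording.

```java
import java.util.regex.Pattern;

public class StripInputStreamCtor {

    // Hypothetical pattern, not the one used in the Lucene build:
    //   [ \t]*                   leading indentation
    //   /\*\*(?:(?!\*/).)*\*/    one javadoc comment; the lookahead stops it at the first "*/"
    //   public Name(java.io.InputStream x) { ... }   ctor, body assumed to have no nested braces
    private static final Pattern INPUT_STREAM_CTOR = Pattern.compile(
        "[ \\t]*/\\*\\*(?:(?!\\*/).)*\\*/\\s*"
            + "public\\s+\\w+\\s*\\(\\s*java\\.io\\.InputStream\\s+\\w+\\s*\\)\\s*"
            + "\\{[^}]*\\}\\s*",
        Pattern.DOTALL);

    /** Returns the source with every javadoc'd InputStream ctor removed. */
    public static String strip(String generatedSource) {
        return INPUT_STREAM_CTOR.matcher(generatedSource).replaceAll("");
    }

    public static void main(String[] args) {
        String src =
            "  /**\n"
          + "   * Creates a new scanner\n"
          + "   *\n"
          + "   * @param in the java.io.Reader to read input from.\n"
          + "   */\n"
          + "  public StandardTokenizerImpl(java.io.Reader in) {\n"
          + "    this.zzReader = in;\n"
          + "  }\n"
          + "\n"
          + "  /**\n"
          + "   * Creates a new scanner.\n"
          + "   * There is also java.io.Reader version of this constructor.\n"
          + "   *\n"
          + "   * @param in the java.io.Inputstream to read input from.\n"
          + "   */\n"
          + "  public StandardTokenizerImpl(java.io.InputStream in) {\n"
          + "    this(new java.io.InputStreamReader(in));\n"
          + "  }\n";
        // Only the Reader ctor should remain in the printed source.
        System.out.println(strip(src));
    }
}
```

The lookahead costs a bit of readability, but it makes the match independent of punctuation inside the comment text.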
See my commit in the lucene4199 branch.

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: [email protected]

> -----Original Message-----
> From: Robert Muir [mailto:[email protected]]
> Sent: Sunday, July 08, 2012 2:32 PM
> To: [email protected]
> Subject: prohibit jflex generation of Tokenizer(InputStream) that uses system
> default charset?
>
> Have a look at StandardTokenizerImpl:769
>
>   /**
>    * Creates a new scanner.
>    * There is also java.io.Reader version of this constructor.
>    *
>    * @param in the java.io.Inputstream to read input from.
>    */
>   public StandardTokenizerImpl(java.io.InputStream in) {
>     this(new java.io.InputStreamReader(in));
>   }
>
> Is there any jflex option to prevent generating this?
>
> --
> lucidimagination.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
