None in this example. The analyzer could be a PerFieldAnalyzerWrapper, and the actual TokenStream retrieved would depend on the field specified. The fieldName parameter is not used in StandardAnalyzer.TokenStream, I could have passed null if I knew that when I wrote the code.

On 2012-06-15 14:28, vicente garcia wrote:
Thank you very much, it works!!

But what is the meaning of "field"?

Thanks a lot :)

On Fri, Jun 15, 2012 at 2:23 PM, Simon Svensson<si...@devhost.se>  wrote:
            var analyzer = new StandardAnalyzer(Version.LUCENE_29);
            var textReader = new StringReader("hola mi nombre es Vicente");
            var tokenStream = analyzer.TokenStream("field", textReader);
            var terms = new List<String>();
            var termAttribute =
(TermAttribute)tokenStream.GetAttribute(typeof(TermAttribute));
            while(tokenStream.IncrementToken()) {
                terms.Add(termAttribute.Term());
            }

            // terms = { "hola", "mi", "nombre", "es", "vicente" ]


On 2012-06-15 14:01, vicente garcia wrote:
Hi, I have a little doubt.

I'd like to tokenize a string. Something like this:

StandardAnalyzer analyzer = new StandardAnalyzer("hola mi nombre es
Vicente");

List<string>    tokens = analyzer.GetTokens();

And tokens is: [hola] [mi] [nombre] [es] [Vicente]

is this possible?

Thanks :)




Reply via email to