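// Namespaces assumed for Lucene.Net 2.9.x
using System.Collections.Generic;
using System.IO;
using Lucene.Net.Analysis.Standard;
using Lucene.Net.Analysis.Tokenattributes;
using Version = Lucene.Net.Util.Version;

var analyzer = new StandardAnalyzer(Version.LUCENE_29);
var textReader = new StringReader("hola mi nombre es Vicente");
var tokenStream = analyzer.TokenStream("field", textReader);
var terms = new List<string>();
var termAttribute = (TermAttribute)tokenStream.GetAttribute(typeof(TermAttribute));
while (tokenStream.IncrementToken()) {
    terms.Add(termAttribute.Term());
}

// terms = { "hola", "mi", "nombre", "es", "vicente" }
// (StandardAnalyzer lowercases each token, so "Vicente" becomes "vicente")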

On 2012-06-15 14:01, vicente garcia wrote:
Hi, I have a quick question.

I'd like to tokenize a string. Something like this:

StandardAnalyzer analyzer = new StandardAnalyzer("hola mi nombre es Vicente");

List<string> tokens = analyzer.GetTokens();

And tokens would be: [hola] [mi] [nombre] [es] [Vicente]

Is this possible?

Thanks :)

