Re: Iterating over all documents in an index

2011-02-12 Thread Georger Araujo
27;re really undoing all the work that went into > inverting the index in the first place. > 2> it's lossy. For instance, a term that's been stemmed will only have > the stemmed version in the index. Is that OK? > > Best > Erick > > On Sat, Feb 12, 2011 at 9:07 AM,

Iterating over all documents in an index

2011-02-12 Thread Georger Araujo
Hi, I want to iterate over all documents in a given index. I've found the following piece of code [1]: IndexReader reader = // create IndexReader for (int i=0; ihttp://stackoverflow.com/questions/2311845/is-it-possible-to-iterate-through-documents-stored-in-lucene-index [2] https://cwiki.apache.or

Re: Extending org.apache.lucene.analysis.br.BrazilianAnalyzer to discard numeric tokens

2011-02-08 Thread Georger Araujo
2011/2/7 Robert Muir > On Sun, Feb 6, 2011 at 3:28 PM, Georger Araujo > wrote: > > Hi, > > I started using Lucene a few weeks ago, and I must say I'm amazed. Hats > off > > to the developers and the community! > > I'd like to write

Extending org.apache.lucene.analysis.br.BrazilianAnalyzer to discard numeric tokens

2011-02-06 Thread Georger Araujo
Hi, I started using Lucene a few weeks ago, and I must say I'm amazed. Hats off to the developers and the community! I'd like to write a custom analyzer whose only difference to org.apache.lucene.analysis.br.BrazilianAnalyzer is that I want it to discard numeric tokens from the input. I've looked a