This sounds like a good solution, but may not be viable in my situation. I
think I might run into problems since my index changes very often; several
times an hour. I don't think it would be very efficient to rebuild the
field mapped array after each new document is incrementally added to the
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
I think this still works if the the document number continue
to increase
by one when documents are added incrementally.
Does anyone know if this is true (I haven't looked at the code yet).
Yes, that is true, so long as you do not delete
Hello. We currently have an older version of Lucene running on our intranet,
which I did not install (I'm new to Lucene), and I've been testing it out. I
noticed that it appears not to be using meta tag information in ranking or
in displaying. I was wondering if the newer version will allow this
I am confused about how Lucene performs the parsing of an Html document. It
doesn't do any tag striping (or does it?) consequently does that mean it
also indexes all html tags? If so then a request for searching body will
return any and all html documents previously indexed.
I'd appreciate anyone
here is the full indexing code. and it does use the SimpleAnalyzer.
thanks for your help!
import java.io.File;
import java.io.IOException;
import java.io.OutputStream;
import java.util.*;
import org.apache.lucene.analysis.SimpleAnalyzer;
import org.apache.lucene.index.IndexWriter;
import
I am trying index a set of data, storing only a primary key. This primary key I
left un-indexed. There is one text field, that I indexed and tokenized.
The others I neither want to store or tokenized. My reasoning was that not
tokenizing would produce the smallest index. The remaining
I am new here too but here's my 2 cents.
If you don't tokenize your db textvalues, what do you say will be the
resulting terms indexed? I think not what you expect.
Your non tokenized fields probably are not filtered out hence a lastname
like 'Smith' will not be a hit if the query is 'smith' the