Lucene should work quite well for this, you'll just need some
infrastructure around it to get the file and extract the contents (see
Lucene's Tika project). And, yes, Lucene is thread-safe, so you can
index safely as you describe.
On Oct 11, 2008, at 10:22 AM, Mag Gam wrote:
Hello All,
How large of a subset are you talking?
You might look at the FitleredTermEnum class, but you will probably
have to do some work on it to extend it to what you want
If you are talking a smallish subset (say, at most a couple hundred
docs), then you could store Term Vectors and use the TermVe
Have a look at the o.a.lucene.search.function package and the
ValueSourceQuery. You will probably be able to factor in those pieces
during scoring, so no need to resort at all.
-Grant
On Oct 8, 2008, at 11:15 AM, excitingComm2 wrote:
Hi everybody,
I am using Lucene for searching items i
Hi,
I want to search for sets of documents. For instance I index some folders
with documents in it and now I do not want to find certain documents but
folders.
Sample:
folder A
doc 1, contains X, Y
doc 2, contains Y, Z
folder B
doc 3, contains X, Y
doc 4, contains A, Z
Now I want to fi
Hello, I noticed that indexSearcher.explain() method is not supposed to be
run for a large collection of documents, so I am looking for an alternative
that just explains why a document matched without all the scoring
information. Basically, I would like to know which field of the document was
resp
Hello, How can I get a list of all the terms for a particular field? Is
the right approach to extend FilteredTermEnum?
Thanks!!
all folders which match "A AND Y", do you search for file name?
If yes, A or Y in "A AND Y" is a Strring too, so you can do it by:
construct a Lucene Document for each folder, and name of files under the
folder is the search data.
2008/10/13 <[EMAIL PROTECTED]>
> Hi,
>
> I want to search for sets
Someone just asked this question a week ago (unforunatley they asked it on
the wrong list)...
http://www.nabble.com/Can-I-filter-the-results-returned-by-IndexReader.terms%28field%29-using-a-field--to19849593.html#a19849593
: Subject: Enumerating all the terms of a particular field
-Hoss
--
: the "anonymous" SVN (http://svn.apache.org/repos/asf/lucene/java/trunk/)
: does not work for me (I am using Eclipse 3.3, and have the subversion
plug-in, v.
: 1.2.4, and have successfully checked out code using SVN from other
repositories).
: Apparently here I need a user-id and pwd -- what is