, November 10, 2004 7:23 PM
Subject: RE: Acedemic Question About Indexing
I have an application that I run monthly that indexes 40 million documents
into 6 indexes, then uses a multisearcher. The advantage for me is that I
can have multiple writers indexing 1/6 of that total data reducing the time
indexing 1/6 of that total data reducing the time it takes to
index by about 5X.
-Original Message-
From: Luke Shannon [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 10, 2004 2:39 PM
To: Lucene Users List
Subject: Re: Acedemic Question About Indexing
Don't worry, regardless of what I
11:37 AM
To: Lucene Users List
Cc: Will Allen
Subject: Re: Acedemic Question About Indexing
Will,
could you give more details about your architecture?
-each time update o create new indexes
-data stored at each index
etc.
because it is quite interesting, and I would like to test it.
Sodel
I am working on debugging an existing Lucene implementation.
Before I started, I built a demo to understand Lucene. In my demo I indexed
the entire content hierarhcy all at once, and than optimize this index and
used it for queries. It was time consuming but very simply.
The code I am currently
Uh, I hate to market it, but it's in the book. But you don't have
to wait for it, as there already is a Lucene demo that does what you
described. I am not sure if the demo always recreates the index or
whether it deletes and re-adds only the new and modified files, but if
it's the former,
?
Thanks,
Luke
- Original Message -
From: Otis Gospodnetic [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Wednesday, November 10, 2004 2:08 PM
Subject: Re: Acedemic Question About Indexing
Uh, I hate to market it, but it's in the book. But you don't have
Shannon [mailto:[EMAIL PROTECTED]
Sent: Wednesday, November 10, 2004 2:39 PM
To: Lucene Users List
Subject: Re: Acedemic Question About Indexing
Don't worry, regardless of what I learn in this forum I am telling my
company to get me a copy of that bad boy when it comes out (which as far as
I am
Hi Every body !
i'm working with Lucene LARM Crawler for about 3
weeks ; so i'm a beginner ! and have a lot of question
that some of them have answered in Mailing list
Archive;but for some of them (most important of them )
i have'nt find any clear answer yet !
i ask my Questions in each email
Parag,
I'm not sure if I understood your question correctly, but it seems like
you want to create a Field that holds the path information (e.g.
TEST/subdir1 or TEST/subdir2, and so on), and then include that in the
query based on which path(s) you want to search.
You could use TEST to search
Hi all,
If I will create the index of files in different thread (which may be
invoked at any time)then is it possible to index on files from the root
directory and then selectively search on the different path on created
index.
For example first I will index from root directiory say , TEST.
10 matches
Mail list logo