RE: search a subdirectory (New to Lucene)

2006-02-23 Thread John Hamilton
I reindexed with the path as a keyword field and now the PrefixQuery filter 
does exactly what I need.  Thanks!

I'm going to hold off on the paragraph-level indexing for now, but that does 
sound interesting.

many thanks,

John

-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: Wednesday, February 22, 2006 3:18 PM
To: java-user@lucene.apache.org
Subject: Re: search a subdirectory (New to Lucene)


I presume by saying subdirectory you're referring to filesystem  
directories and you're indexing a directory tree of files.   If you  
index the path (perhaps relative from the root is best) as a keyword  
field (untokenized, but indexed) you could perform filtering on a / 
path/subpath sort of way using PrefixQuery.

As for paragraphs - how you index a document is entirely  
application dependent.  Maybe it makes sense to parse the documents  
before handing them to Lucene such that you're creating a Lucene  
Document for each paragraph rather than for each entire file.   
Slicing the granularity of a domain into Documents is a fascinating  
topic :)

Erik


On Feb 22, 2006, at 1:00 PM, John Hamilton wrote:

 I'm new to Lucene and was wondering what is the best way to perform  
 a search on a subdirectory or subdirectories within the index?  My  
 thought at this point is to build a query to first search for files  
 in the required directory(ies) and then use that query to make a  
 QueryFilter and use that QueryFilter in the actual search.  Is  
 there an easier way?

 On an unrelated note, does anybody know of a way to get results a  
 the section level within a document?  For example, could I find not  
 just a document that matches my query, but the paragraph within  
 that document that best matches the query?

 thanks,

 John


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



search a subdirectory (New to Lucene)

2006-02-22 Thread John Hamilton
I'm new to Lucene and was wondering what is the best way to perform a search on 
a subdirectory or subdirectories within the index?  My thought at this point is 
to build a query to first search for files in the required directory(ies) and 
then use that query to make a QueryFilter and use that QueryFilter in the 
actual search.  Is there an easier way?
 
On an unrelated note, does anybody know of a way to get results a the section 
level within a document?  For example, could I find not just a document that 
matches my query, but the paragraph within that document that best matches the 
query?
 
thanks,
 
John


Re: search a subdirectory (New to Lucene)

2006-02-22 Thread Erik Hatcher
I presume by saying subdirectory you're referring to filesystem  
directories and you're indexing a directory tree of files.   If you  
index the path (perhaps relative from the root is best) as a keyword  
field (untokenized, but indexed) you could perform filtering on a / 
path/subpath sort of way using PrefixQuery.


As for paragraphs - how you index a document is entirely  
application dependent.  Maybe it makes sense to parse the documents  
before handing them to Lucene such that you're creating a Lucene  
Document for each paragraph rather than for each entire file.   
Slicing the granularity of a domain into Documents is a fascinating  
topic :)


Erik


On Feb 22, 2006, at 1:00 PM, John Hamilton wrote:

I'm new to Lucene and was wondering what is the best way to perform  
a search on a subdirectory or subdirectories within the index?  My  
thought at this point is to build a query to first search for files  
in the required directory(ies) and then use that query to make a  
QueryFilter and use that QueryFilter in the actual search.  Is  
there an easier way?


On an unrelated note, does anybody know of a way to get results a  
the section level within a document?  For example, could I find not  
just a document that matches my query, but the paragraph within  
that document that best matches the query?


thanks,

John



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]