Ashwin Satyanarayana wrote:

Hello,

I am new to Lucene. I was trying to use Lucene with TREC-6 Data. The dataset 
for TREC-6 used in 1997 contains many input files.  Each input file hasmultiple 
documents
(some files contain over 200 documents) tagged by DOCNO. The result given
by Lucene to a query is a list of files and not documents.

Q1) Is there a way of getting the query results in terms of documents
within the files rather than files ( without modifying the code)?
In lucene a Document object is the unit of search/storage/indexing. It may or may not correspond to an user's view of files or documents.


Q2) If the above is not posssible, what would be the best way to modify
the code?
To achieve what you want, I think you need to store and/or index each of your documents as a lucene Document. You may also want to store the file name and document identifier as a lucene field in the lucene Document.


Thanks and Regards,
Ashwin

Questions on how to use lucene should be addressed to the lucene users mailing list. This one is for developers developing lucene itself.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to