I have a lucene index that basically contains parsed IIS logs. I use this to
calculate some custom web statistics.

 

Currently, my lucene index contains only indexed fields - nothing is stored.
E.g. I can query "cs-uri-stem:/index.htm and date:20080304" to check how
often index.htm was requested today, but I cannot subsequently read
cs-uri-stem or date from the returned documents.

 

Now, this approach works pretty well with one disadvantage: I cannot figure
out the date of my last stored item. I can search all documents and sort on
date, but I cannot tell the dates of my returned documents. I need this for
only one purpose: figuring out the date of the last added document.

 

Does anyone have an idea if I would be able to tackle this _without_
indexing all data and _without_ implementing a custom checkpoint file? If I
can get this done, it would save me more than half of the index disk space.

Igor

 

Reply via email to