Hi everyone,

I've been working on a news system which is similar to google news. We have
a set of news web-pages which are crawled using nutch-0.9. I cluster the
pages using carrot-clustering plugin. To get the recent pages I need to
search using the date of the page. I modified the query-parser plugins of
nutch using that of lucene so that it supports queries like
[date:yyyymmdd-yyyymmdd  queryterm]  where queryterm is the query.

But a news system will not have any queries, it just displays the lastest
news articles in clusters from the whole index. I want to retrieve the
latest news pages based on their  date.
I index the dates of every document as a lucene field.

My requirement is I'll just give the date range [date:yyyymmdd-yyyymmdd ] as
query and I should get the  documents whose date is in the range
specified[no query term other than date]results. But nutch is'nt giving any
results.It requires a query.

 Can anyone help me with this where I can get the documents based on the
modified date but no query will be given.

Any help in this regard would be greatly appreciated.

Thanks in anticipation
Aditya Veluguri

Reply via email to