Markus gave me a little hint, but he's not available today. And This is an
urgent issue.

The question is simple (nutch 1.5.1 and solr 3.6.1 working together):

- The URL patterns in regex-urlfilter.txt control the behavior of crawling,
i.e., which pages to visit (or not to visit)
- What I need to do is to specificy **which pages to be indexed by solr**
(this is a subset of the pages visited) --> I wonder whether there is a
place to specify such URL patterns.

Reply via email to