Hi

Do you want to set different type of rules to different type of files? I
find regex-urlfilter plugin did not provide this feature and other
*-urlfilter plugins also did not provide this feature.

Maybe you can add a interface like

protected Reader[] getRulesReaders(Configuration conf) throws IOException

to get multi-readers for all configure files in RegexURLFilterBase class.


On Tue, Aug 19, 2014 at 1:42 AM, Ali Nazemian <[email protected]> wrote:

> Dear all,
> Hi,
> I use nutch 1.8 for crawl some web sites. For this purpose I want to change
> nutch in a way that different regex-urlfilter file loads for different
> types of file. For example one for html files and another for image files.
> (jpg/jpeg, ... ) Does nutch consider such situation? Or I should change
> some line of codes? (probably regex-urlfilter plugin)
> Best regards.
>
> --
> A.Nazemian
>



-- 
Don't Grow Old, Grow Up... :-)

Reply via email to