Hi,

 

I´d like to change the crawl-urlfilter.txt and nutch-site.xml depending on
the crawl. At the moment, I only use the “nutch crawl” command in a little
self-made .sh-script. In future, I´ll be in need of the other commands like
“nutch inject, fetch, …” too.

 

I think of something like “nutch crawl …. –urlfilter my_url_filter_file
–conffile my_nutch_site_xml_file”.

 

Am I right to make changes in the
org/apache/nutch/util/NutchConfiguration.java? If yes, how can I pass the
arguments?

 

If not, where do I have to modify the code to achieve this? I am not very
familiar with Java but I think I understand the code If I know where to go.

 

Thanks for every help!

 

Felix.

 

Reply via email to