Re: String menu
But the crawl-urlfilter.txt not accept only characters instead of strings? If accepted, as I write? # Skip URLs containing certain characters as probable queries, etc.. -[...@=] Could be? # Skip URLs containing certain characters as probable queries, etc.. - [ menu] Thanks QueroVc wrote: Please could someone tell me how to not get the crawl URLs that contain the word menu. Thanks -- View this message in context: http://old.nabble.com/String-%22menu%22-tp27693743p27714490.html Sent from the Nutch - User mailing list archive at Nabble.com.
Re: String menu
QueroVc schrieb: But the crawl-urlfilter.txt not accept only characters instead of strings? If accepted, as I write? # Skip URLs containing certain characters as probable queries, etc.. -[...@=] Could be? # Skip URLs containing certain characters as probable queries, etc.. - [ menu] -menu these are regular expressions. http://www.regular-expressions.info/ Thanks QueroVc wrote: Please could someone tell me how to not get the crawl URLs that contain the word menu. Thanks
String menu
Please could someone tell me how to not get the crawl URLs that contain the word menu. Thanks -- View this message in context: http://old.nabble.com/String-%22menu%22-tp27692447p27692447.html Sent from the Nutch - User mailing list archive at Nabble.com.
String menu
Please could someone tell me how to not get the crawl URLs that contain the word menu. Thanks -- View this message in context: http://old.nabble.com/String-%22menu%22-tp27693743p27693743.html Sent from the Nutch - User mailing list archive at Nabble.com.
Re: String menu
you can edit regex-urlfilter.txt to exclude those urls if you use fetch command, or crawl-urlfilter.txt if you use crawl command. QueroVc schrieb: Please could someone tell me how to not get the crawl URLs that contain the word menu. Thanks