Re: String menu

2010-03-01 Thread QueroVc

But doesn't crawl-urlfilter.txt accept only single characters rather than strings?

If strings are accepted, how do I write the rule?

# Skip URLs containing certain characters as probable queries, etc.
-[...@=]

Could it be written like this?

# Skip URLs containing certain characters as probable queries, etc.
- [ menu]

Thanks


QueroVc wrote:
 
 Could someone please tell me how to keep the crawl from fetching URLs
 that contain the word "menu"?
 
 Thanks
 

-- 
View this message in context: 
http://old.nabble.com/String-%22menu%22-tp27693743p27714490.html
Sent from the Nutch - User mailing list archive at Nabble.com.



Re: String menu

2010-03-01 Thread reinhard schwab
QueroVc wrote:
 But doesn't crawl-urlfilter.txt accept only single characters rather than strings?

 If strings are accepted, how do I write the rule?

 # Skip URLs containing certain characters as probable queries, etc.
 -[...@=]

 Could it be written like this?

 # Skip URLs containing certain characters as probable queries, etc.
 - [ menu]
   
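-menu

These are regular expressions, so the rule above skips any URL that
contains "menu". Square brackets define a character class, which is why
the default rule lists single characters.
http://www.regular-expressions.info/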
 Thanks


 QueroVc wrote:
   
 Could someone please tell me how to keep the crawl from fetching URLs
 that contain the word "menu"?

 Thanks
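
To illustrate why -menu works while a bracketed form does not, here is a rough Python sketch of how such a rule list could be evaluated. It is not Nutch's code: the names RULES, parse_rules and accepts are made up for this example, the catch-all "+." line is assumed, and the sketch assumes the first matching rule decides and that a URL matching no rule is rejected.

```python
import re

# Hypothetical rule list in the urlfilter syntax: a leading '+' accepts,
# a leading '-' rejects, and the rest of the line is a regular expression.
RULES = """
# skip URLs that contain the string "menu"
-menu
# accept everything else (catch-all assumed for this sketch)
+.
"""


def parse_rules(text):
    """Turn filter lines into a list of (accept, compiled_pattern) pairs."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # ignore blank lines and comments
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(url, rules):
    """Apply rules in order; the first pattern found in the URL decides."""
    for accept, pattern in rules:
        if pattern.search(url):
            return accept
    return False  # assumption: a URL that matches no rule is rejected


if __name__ == "__main__":
    rules = parse_rules(RULES)
    for url in ("http://example.com/menu/item.html",
                "http://example.com/about.html"):
        print(url, accepts(url, rules))
```

Under these assumptions the order of the lines matters, so the -menu line has to come before any broader accept rule. Note also that a pattern such as [menu] is a character class matching any single one of the letters m, e, n or u, so it would hit almost every URL rather than only those containing the word.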

 

   



String menu

2010-02-22 Thread QueroVc

Could someone please tell me how to keep the crawl from fetching URLs that
contain the word "menu"?

Thanks



Re: String menu

2010-02-22 Thread reinhard schwab
You can exclude those URLs by editing regex-urlfilter.txt if you use the
fetch command, or crawl-urlfilter.txt if you use the crawl command.

QueroVc wrote:
 Could someone please tell me how to keep the crawl from fetching URLs that
 contain the word "menu"?

 Thanks