Yes, all sites, when the url contains with your filter will be removed.
You can try it with:
bin/nutch prune /srv/segments/ -dryrun -queries qry.txt -output del.txt
With the "-dryrun" you can't delete anything. This is only show what
will deleted without it.
Gal Nitzan wrotte:
Hi Ferenc,
Thank you for the information, regrettably I didn't figure it out yet.
Do you mean to write a text file on which every line contains
+url:sample.com , and it shall remove that site from the index?
Thanks,
Gal
[EMAIL PROTECTED] wrote:
e.g.:
+url:sex-com
You can try, how from sex.com to +url:sex-com with: bin/nutch
org.apache.nutch.searcher.Query
Regards,
Ferenc
Gal Nitzan wrotte:
Hi,
Does anyone know how to use the prune option?
what should be in the [-queries filename] file?
Regards,
Gal
.
-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server.
Download it for free - -and be entered to win a 42" plasma tv or your very
own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general