Ilya Basin <[email protected]> writes:

> Here's my script to download IBM javadocs:
>
> (
>     rm -rf wget-test
>     mkdir wget-test
>     cd wget-test
>     
> starturl="http://www-01.ibm.com/support/knowledgecenter/api/content/SSZLC2_7.0.0/com.ibm.commerce.api.doc/allclasses-noframe.html";
>     wget -d -r -R robots.txt --page-requisites -nH --cut-dirs=5 --no-parent 
> "$starturl" 2>&1 | tee wget.log
> )
>
> regardless of '-R' option, wget downloads robots.txt and refuses to
> follow links starting with "/support/knowledgecenter/api/".

No need to use any workaround, you should be able to achieve the same
behavior with "-e robots=off" as documented.

Regards,
Giuseppe

Reply via email to