Thanks averyone for the contributions.
Ultimately, our purpose is to process documents from the site into our
search database, so probably the most important thing is to limit the
number of files being processed. The case of the URLs in the html
probably wouldn't cause us much concern, but I
Does --page-requisites load content from other hosts as well, or must
I explicitly issue a --span-hosts with it?
The manpage unambiguously says about --span-hosts Enable spanning
across hosts when doing recursive retrieving, but at the --span-
hosts section it does not mention whether wget
a simple url-rewriting conf should fix the problem, wihout touch the file system
everything can be done server side
Best Regards
On Thu, Jun 19, 2008 at 6:29 AM, Coombe, Allan David (DPS)
[EMAIL PROTECTED] wrote:
Thanks averyone for the contributions.
Ultimately, our purpose is to process
without touching the file system
On Thu, Jun 19, 2008 at 9:23 AM, mm w [EMAIL PROTECTED] wrote:
a simple url-rewriting conf should fix the problem, wihout touch the file
system
everything can be done server side
Best Regards
On Thu, Jun 19, 2008 at 6:29 AM, Coombe, Allan David (DPS)
mm w wrote:
a simple url-rewriting conf should fix the problem, wihout touch the file
system
everything can be done server side
Why do you assume the user of wget has any control over the server from which
content is being downloaded?
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
Stefan Nowak wrote:
Does --page-requisites load content from other hosts as well, or must I
explicitly issue a --span-hosts with it?
The manpage unambiguously says about --span-hosts Enable spanning
across hosts when doing recursive retrieving,
not al, but in this particular case I pretty sure they have
On Thu, Jun 19, 2008 at 10:42 AM, Tony Lewis [EMAIL PROTECTED] wrote:
mm w wrote:
a simple url-rewriting conf should fix the problem, wihout touch the file
system
everything can be done server side
Why do you assume the user of
Thanks for the clarification -- man-page versus authorative info-manual.
If you allow me to give you some feedback:
I honestly overread the information in the manpage, as I thought the
bottom paragraphs are only some examples and special cases.
I think the fact, that --page-requisites