Gabriel Somlo wrote:
If I try to add --convert-links into the mix, the referencing link
does get rewritten, but ends up looking like

"../site.com/article.cgi?25.html"

which is designed for offline viewing via "file://", and is unsuitable
for actually hosting both the referencing and referenced sites as
virtual servers in a web server within the sandbox.
Are you using --span-hosts ? Otherwise wget won't be crawling pages to
a different host and thus won't produce a relative url down to the hostname.

I think that not using --span-hosts will suit your use case.



If not, assuming I can come up with a patch, would there be any
interest in upstreaming this type of additional functionality ?
However, you may provide a patch for making links relative to the hostname in
such case (you would need to add a parameter for --convert-links to enable
that alternative conversion).




Bug notice: just listing the domains to span on --domains doesn't seem to work
Deciding whether to enqueue "http://www.example.org/script.js";.
This is not the same hostname as the parent's (www.example.org and www.example.com).
Decided NOT to load it.



Reply via email to