Jason Sharpee wrote:
:    It would even be more handy, IMO, to also be able to specify the remote
: link depth at a specific local depth value.
: 
: ie.  For Linux Today:
: 
: Local-Depth=2
: Local-Depth-Level=1 Remote-Depth=0
: Local-Depth-Level=2 Remote-Depth=1
: 
:     Would grab the local contents page without banner's offsite links, and
: when a story link is traversed to depth 2, it would then allow spidering
: out to the complete story link.
: 
: Would this be usefull or could this be handled another existing way?

        I had a similar problem here (think downloading comments from
slashdot, but only those with score >=4. Or recoding from utf-8
to iso 8859-2. Or sending cokies or Accept-Language: or Accept-Charset:
HTTP headers).

        I think the best solution would be to use wget or some other generic
web spider to mirror this, and then include the file:// URLs
in your home.html.

-Yenya

-- 
| Jan "Yenya" Kasprzak  <kas at {fi.muni.cz - work | yenya.net - private}> |
| GPG: ID 1024/D3498839      Fingerprint 0D99A7FB206605D7 8B35FCDE05B18A5E |
| http://www.fi.muni.cz/~kas/   Czech Linux Homepage: http://www.linux.cz/ |
|\    Pruning the incoming mailbox after being 10 days off-line. Sorry    /|
| \   for the delayed reply.                                             / |

Reply via email to