On Sun, 30 Sep 2001, Geoff Hutchison wrote:

re. logging broken links
>
> I don't really see this as a needed feature to ht://Dig itself since
> you can do this as a script running on top of the htdig output using
> the -s flag. See
> <http://www.htdig.org/files/contrib/scripts/showdead.pl>
> <http://www.htdig.org/files/contrib/scripts/report_missing_pages.pl>

I played with this briefly.
Htdig listed a broken link (404) on the same server, and a 404 link to
another server.

It didn't list a 502 (connection refused) or 500 (unknown host).
It also doesn't collect any e-mail or owner metadata, so if you
wanted to mail authors or sort the broken-link list by author, you'd
need either a database mapping sites/URL fragments to authors, or to
re-spider the list of referring documents to gather author
information.
My robot produces e.g. http://www.triumf.ca/trsearch/errors2.html
(but I'm too lazy to fix my links - [EMAIL PROTECTED])
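
The first option above, a lookup from URL fragments to authors, could be
sketched roughly as follows. This is only an illustration in Python, not
part of ht://Dig or the contrib scripts: the OWNERS table, the addresses
in it, and the (referrer, dead link, status) tuple layout are all
assumptions made up for the example.

```python
from collections import defaultdict

# Hypothetical table mapping URL prefixes to page owners.
# Nothing like this ships with ht://Dig; you would maintain it yourself.
OWNERS = {
    "http://www.example.org/": "webmaster@example.org",
    "http://www.example.org/physics/": "alice@example.org",
}


def owner_of(url, owners=OWNERS, default="unknown"):
    """Return the owner whose URL prefix is the longest match for url."""
    best = ""
    for prefix in owners:
        if url.startswith(prefix) and len(prefix) > len(best):
            best = prefix
    return owners.get(best, default)


def group_by_owner(broken):
    """Group broken links by the owner of the referring page.

    broken is an iterable of (referring_url, dead_url, status) tuples,
    an assumed layout for whatever the link checker emits.
    """
    report = defaultdict(list)
    for referrer, dead, status in broken:
        report[owner_of(referrer)].append((referrer, dead, status))
    return dict(report)
```

Referring pages that match no prefix end up under "unknown", which is
roughly the set you would still have to re-spider for author information.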




-- 
Andrew Daviel
Are you always losing things?  - http://huzizit.com



_______________________________________________
htdig-dev mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/htdig-dev
