>Finally I will create a class called a Crawler (or maybe I'll use
>Retriever) which coordinates the traversal of the doc tree. Its only
>callback from the Parsable will be got_href, which obviously it needs
I think you'd also want a got_redirect in some form, to handle
META HTTP-EQUIV refreshes and so on.
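
To make that concrete, here's a rough sketch of what I have in mind. It's
not taken from the tree: only got_href and got_redirect come from this
thread, while ParseListener, report_meta_refresh and the queue member are
names I made up for illustration. It assumes the parser hands over the
CONTENT value of a META refresh tag (e.g. "5; URL=http://...") and that a
C++11 compiler is available.

```cpp
#include <cassert>
#include <cctype>
#include <cstdlib>
#include <string>
#include <vector>

// Hypothetical callback interface between the Parsable and its owner.
// Only got_href and got_redirect are names from the discussion.
class ParseListener {
public:
    virtual ~ParseListener() {}
    virtual void got_href(const std::string &url) = 0;
    virtual void got_redirect(const std::string &url, int delay_seconds) = 0;
};

// Hypothetical helper: given the CONTENT value of a
// <META HTTP-EQUIV="refresh" CONTENT="5; URL=..."> tag, report the
// target through got_redirect. Returns false when no target is given
// (a plain "reload this page" refresh).
bool report_meta_refresh(const std::string &content, ParseListener &listener)
{
    std::string::size_type semi = content.find(';');
    if (semi == std::string::npos)
        return false;

    // Text before the semicolon is the delay; clamp negatives to zero.
    int delay = std::atoi(content.substr(0, semi).c_str());
    if (delay < 0)
        delay = 0;

    // Skip whitespace after the semicolon.
    std::string::size_type i = semi + 1;
    while (i < content.size() &&
           std::isspace(static_cast<unsigned char>(content[i])))
        ++i;

    // Skip an optional, case-insensitive "URL=" prefix.
    if (content.size() - i >= 4) {
        std::string prefix = content.substr(i, 4);
        for (std::string::size_type k = 0; k < prefix.size(); ++k)
            prefix[k] = static_cast<char>(
                std::tolower(static_cast<unsigned char>(prefix[k])));
        if (prefix == "url=")
            i += 4;
    }

    std::string url = content.substr(i);
    if (url.empty())
        return false;

    listener.got_redirect(url, delay);
    return true;
}

// Minimal Crawler that just queues whatever it is told about.
class Crawler : public ParseListener {
public:
    std::vector<std::string> queue;  // URLs still to be visited

    void got_href(const std::string &url) override
    {
        queue.push_back(url);
    }

    void got_redirect(const std::string &url, int delay_seconds) override
    {
        assert(delay_seconds >= 0);
        queue.push_back(url);  // this sketch ignores the delay
    }
};
```

A real Crawler would probably treat a redirect differently from an
ordinary link (say, by replacing the current document's URL rather than
just queueing the target), but the point is only that the Parsable needs
a second way to call back.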
>subdirectory. I'm not looking forward to when it comes time to merge
>my work back into the main branch, especially if other people have
>been working much in this subdirectory.
I don't think there has really been a ton of development on the 3-2-x
branch recently, esp. in the htdig/ directory. But you'd be surprised
how well CVS merges can go.
>I hope to have something to share within the next week or so--before
Any further progress? As I said a while ago, my next project is in
htsearch/, after attending to some misc. cleanups, so I don't expect
there to be many conflicts.
-Geoff