I've just checked in some more parser fixes. Four changes:

* the spider is now depth-first
* identification of in-line anchors should now *work*
* list items now use Unicode "bullet" characters, falling back to "o"
* that darn Redhat urllib bug should go away

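For anyone curious about the bullet fallback, here is a minimal sketch of the idea. It is not the checked-in code: the function name `bullet_for` and its `encoding` parameter are made up for illustration, and it assumes the choice is made by testing whether the output encoding can represent U+2022.

```python
def bullet_for(encoding):
    """Pick a list-item marker that the given output encoding can represent.

    Hypothetical helper for illustration only; the real parser may decide
    this differently.
    """
    bullet = "\u2022"  # U+2022 BULLET
    try:
        bullet.encode(encoding)
    except (UnicodeEncodeError, LookupError):
        # The encoding cannot represent the bullet, or is unknown:
        # fall back to a plain ASCII "o".
        return "o"
    return bullet


if __name__ == "__main__":
    for enc in ("utf-8", "ascii"):
        print(enc, "->", ascii(bullet_for(enc)))
```

Trying the encode and catching the failure avoids keeping a table of which encodings contain the bullet character.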
Please give it a whirl and let me know what I've broken :-).

Bill
