I am crawling my own site, which includes an ancient MovableType installation. When it gets to http://xcski.com/movabletype/mt.cgi, it produces an invalid "outlink" (seen by an exception in the crawl, and in the following readseg dump): Outlinks: 8 outlink: toUrl: http://xcski.com/movabletype/text/css anchor: outlink: toUrl: http://xcski.com/movabletype/</style> anchor: outlink: toUrl: http://xcski.com/movabletype/mt.cgi?__mode=start_recover ancho r: outlink: toUrl: http://xcski.com/movabletype/styles.css anchor: outlink: toUrl: http://xcski.com/movabletype/images/mt-logo.gif anchor: outlink: toUrl: http://xcski.com/movabletype/images/spacer.gif anchor: outlink: toUrl: http://xcski.com/movabletype/images/spacer.gif anchor: outlink: toUrl: http://xcski.com/movabletype/mt.cgi# anchor: Forgot your passw ord?
Looking through the text returned by just doing a wget on that URL, I don't see any href that's anywhere near a </style>, so I can't figure out why it's doing that. -- http://www.linkedin.com/in/paultomblin
