I am crawling my own site, which includes an ancient MovableType
installation.  When it gets to http://xcski.com/movabletype/mt.cgi, it
produces an invalid "outlink" (seen by an exception in the crawl, and
in the following readseg dump):
Outlinks: 8
  outlink: toUrl: http://xcski.com/movabletype/text/css anchor:
  outlink: toUrl: http://xcski.com/movabletype/</style> anchor:
  outlink: toUrl: http://xcski.com/movabletype/mt.cgi?__mode=start_recover ancho
r:
  outlink: toUrl: http://xcski.com/movabletype/styles.css anchor:
  outlink: toUrl: http://xcski.com/movabletype/images/mt-logo.gif anchor:
  outlink: toUrl: http://xcski.com/movabletype/images/spacer.gif anchor:
  outlink: toUrl: http://xcski.com/movabletype/images/spacer.gif anchor:
  outlink: toUrl: http://xcski.com/movabletype/mt.cgi# anchor: Forgot your passw
ord?

Looking through the text returned by just doing a wget on that URL, I
don't see any href that's anywhere near a </style>, so I can't figure
out why it's doing that.


-- 
http://www.linkedin.com/in/paultomblin

Reply via email to