Just noticed a new (to me) Geocities obfuscation technique that uses embedded relative path segments:

http://geocities.com/./qryz/../cristinasantiago49/?q=u-og3sygmores7rhqzn5ba

That breaks my own subsite extraction code. :(
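For anyone who wants to see how that URL collapses, here is a minimal sketch in Python (not my actual extraction code; the function name `subsite` is made up for illustration). It assumes the subsite is simply the first path segment once "." and ".." segments are resolved, and it leans on `posixpath.normpath`, which resolves those segments lexically without touching the filesystem. It does not decode percent-encoded dots such as "%2e%2e", so treat it as a starting point only.

```python
import posixpath
from urllib.parse import urlsplit

def subsite(url):
    """Return the first path segment after collapsing "." and ".." segments.

    Hypothetical helper: assumes the subsite is the first path segment.
    """
    path = urlsplit(url).path
    # normpath("") would return ".", so treat an empty path as the root.
    collapsed = posixpath.normpath(path) if path else "/"
    segments = [s for s in collapsed.split("/") if s]
    return segments[0] if segments else None

url = "http://geocities.com/./qryz/../cristinasantiago49/?q=u-og3sygmores7rhqzn5ba"
print(subsite(url))  # cristinasantiago49
```

The "./" is a no-op and "qryz/.." cancels itself out, so the request ends up at /cristinasantiago49/ just as if the junk were not there.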
The pedantic part of my brain wants to rewrite my code to auto-adjust for relative paths, so I can continue testing the subsite against Uribl's great subsite list.

The expedient part of my brain is thinking that either a ".." or a "/./" in a URL is a pretty shiny sign of spam (or major mailing list stupidity), so I'm going to start with those as simple rules.

Other than borked mailing lists, can anyone recall seeing either of those patterns in a legitimate emailed URL?

Stay dry,
- "Chip"
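
A rough sketch of the simple-rule idea, again in Python with made-up names (`DOT_SEGMENT`, `has_dot_segment`). One deliberate difference from a plain substring test: it only fires when "." or ".." is a whole path segment, so something like "file..bak" or a ".." in the query string is ignored. Whether that narrower match is the right trade-off is an assumption, not something I have tested against real mail.

```python
import re
from urllib.parse import urlsplit

# Matches a path segment that is exactly "." or "..".
DOT_SEGMENT = re.compile(r"(?:^|/)\.{1,2}(?=/|$)")

def has_dot_segment(url):
    """True if the URL's path contains a "." or ".." segment (hypothetical rule)."""
    # Only the path is checked; query string and fragment are ignored.
    return DOT_SEGMENT.search(urlsplit(url).path) is not None

print(has_dot_segment("http://geocities.com/./qryz/../cristinasantiago49/?q=x"))  # True
print(has_dot_segment("http://example.com/index.html"))  # False
```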