BasicURLNormalizer should collapse runs of slashes with a single slash
----------------------------------------------------------------------

                 Key: NUTCH-620
                 URL: https://issues.apache.org/jira/browse/NUTCH-620
             Project: Nutch
          Issue Type: Bug
          Components: fetcher
    Affects Versions: 0.9.0
         Environment: JDK 1.6 update 5, Tomcat 6, Windows Server 2003, 
            Reporter: Mark DeSpain
             Fix For: 1.0.0


The BasicURLNormalizer should collapse runs of slash characters '/' with a 
single slash.  

For example,  the following URLs should be normalized to 
http://lucene.apache.org/nutch/about.html

* http://lucene.apache.org/nutch//about.html 

* http://lucene.apache.org//nutch/about.html 

* http://lucene.apache.org/////nutch////about.html (an exaggerated example)


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to