On 11/12/2004 09:02 AM, Matthias Jaekle wrote:
Hi,
I had this problem with old nutch versions.
Did you checkout the newest nutch version from cvs?
This should be fixed in the current version.
Matthias


I don't see any code in the BasicUrlNormalizer that would do this. Is it possible that what was fixed for you didn't have to do with URL normalization but rather URL parsing? Meaning for you, Nutch was previously not "parsing" the URLs properly when it was encountering them?


I believe code to normalize these URLs should be put in BasicUrlNormalizer.java (and add relevent tests).


Luke Baker


------------------------------------------------------- This SF.Net email is sponsored by: Sybase ASE Linux Express Edition - download now for FREE LinuxWorld Reader's Choice Award Winner for best database on Linux. http://ads.osdn.com/?ad_id=5588&alloc_id=12065&op=click _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to