I have developed a patch as well as some tests.

May I send the two updated files to the list - so that someone can review
and comitt? 

> -----Original Message-----
> From: [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] On 
> Behalf Of Luke Baker
> Sent: Freitag, 12. November 2004 15:23
> To: [EMAIL PROTECTED]
> Subject: Re: [Nutch-dev] [SPAM] url normalization
> 
> On 11/12/2004 09:02 AM, Matthias Jaekle wrote:
> > Hi,
> > I had this problem with old nutch versions.
> > Did you checkout the newest nutch version from cvs?
> > This should be fixed in the current version.
> > Matthias
> > 
> 
> I don't see any code in the BasicUrlNormalizer that would do 
> this.  Is it possible that what was fixed for you didn't have 
> to do with URL normalization but rather URL parsing?  Meaning 
> for you, Nutch was previously not "parsing" the URLs properly 
> when it was encountering them?
> 
> I believe code to normalize these URLs should be put in 
> BasicUrlNormalizer.java (and add relevent tests).
> 
> 
> Luke Baker
> 
> 
> -------------------------------------------------------
> This SF.Net email is sponsored by:
> Sybase ASE Linux Express Edition - download now for FREE
> LinuxWorld Reader's Choice Award Winner for best database on Linux.
> http://ads.osdn.com/?ad_id=5588&alloc_id=12065&op=click
> _______________________________________________
> Nutch-developers mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
> 
> 





-------------------------------------------------------
This SF.Net email is sponsored by: InterSystems CACHE
FREE OODBMS DOWNLOAD - A multidimensional database that combines
robust object and relational technologies, making it a perfect match
for Java, C++,COM, XML, ODBC and JDBC. www.intersystems.com/match8
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to