Sven: Yes...I would say simply attach the files. One of the committers should add it to CVS in the next day or two.
-----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Sven Wende Sent: Friday, November 12, 2004 8:11 PM To: [EMAIL PROTECTED] Subject: [JNK] RE: [Nutch-dev] [SPAM] url normalization I have developed a patch as well as some tests. May I send the two updated files to the list - so that someone can review and comitt? > -----Original Message----- > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED] On Behalf Of > Luke Baker > Sent: Freitag, 12. November 2004 15:23 > To: [EMAIL PROTECTED] > Subject: Re: [Nutch-dev] [SPAM] url normalization > > On 11/12/2004 09:02 AM, Matthias Jaekle wrote: > > Hi, > > I had this problem with old nutch versions. > > Did you checkout the newest nutch version from cvs? > > This should be fixed in the current version. > > Matthias > > > > I don't see any code in the BasicUrlNormalizer that would do this. Is > it possible that what was fixed for you didn't have to do with URL > normalization but rather URL parsing? Meaning for you, Nutch was > previously not "parsing" the URLs properly when it was encountering > them? > > I believe code to normalize these URLs should be put in > BasicUrlNormalizer.java (and add relevent tests). > > > Luke Baker > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Sybase ASE Linux Express Edition - download now for FREE LinuxWorld > Reader's Choice Award Winner for best database on Linux. > http://ads.osdn.com/?ad_id=5588&alloc_id=12065&op=click > _______________________________________________ > Nutch-developers mailing list > [EMAIL PROTECTED] > https://lists.sourceforge.net/lists/listinfo/nutch-developers > > ------------------------------------------------------- This SF.Net email is sponsored by: InterSystems CACHE FREE OODBMS DOWNLOAD - A multidimensional database that combines robust object and relational technologies, making it a perfect match for Java, C++,COM, XML, ODBC and JDBC. www.intersystems.com/match8 _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers ------------------------------------------------------- This SF.Net email is sponsored by: InterSystems CACHE FREE OODBMS DOWNLOAD - A multidimensional database that combines robust object and relational technologies, making it a perfect match for Java, C++,COM, XML, ODBC and JDBC. www.intersystems.com/match8 _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers