Author: otis Date: Thu Aug 10 22:07:41 2006 New Revision: 430672 URL: http://svn.apache.org/viewvc?rev=430672&view=rev Log: - Added png|PNG
Modified: lucene/nutch/trunk/conf/crawl-urlfilter.txt.template lucene/nutch/trunk/conf/regex-urlfilter.txt.template Modified: lucene/nutch/trunk/conf/crawl-urlfilter.txt.template URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/conf/crawl-urlfilter.txt.template?rev=430672&r1=430671&r2=430672&view=diff ============================================================================== --- lucene/nutch/trunk/conf/crawl-urlfilter.txt.template (original) +++ lucene/nutch/trunk/conf/crawl-urlfilter.txt.template Thu Aug 10 22:07:41 2006 @@ -12,7 +12,7 @@ -^(file|ftp|mailto): # skip image and other suffixes we can't yet parse --\.(gif|GIF|jpg|JPG|ico|ICO|css|sit|eps|wmf|zip|ppt|mpg|xls|gz|rpm|tgz|mov|MOV|exe|png)$ +-\.(gif|GIF|jpg|JPG|png|PNG|ico|ICO|css|sit|eps|wmf|zip|ppt|mpg|xls|gz|rpm|tgz|mov|MOV|exe)$ # skip URLs containing certain characters as probable queries, etc. [EMAIL PROTECTED] Modified: lucene/nutch/trunk/conf/regex-urlfilter.txt.template URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/conf/regex-urlfilter.txt.template?rev=430672&r1=430671&r2=430672&view=diff ============================================================================== --- lucene/nutch/trunk/conf/regex-urlfilter.txt.template (original) +++ lucene/nutch/trunk/conf/regex-urlfilter.txt.template Thu Aug 10 22:07:41 2006 @@ -10,7 +10,7 @@ -^(file|ftp|mailto): # skip image and other suffixes we can't yet parse --\.(gif|GIF|jpg|JPG|ico|ICO|css|sit|eps|wmf|zip|ppt|mpg|xls|gz|rpm|tgz|mov|MOV|exe)$ +-\.(gif|GIF|jpg|JPG|png|PNG|ico|ICO|css|sit|eps|wmf|zip|ppt|mpg|xls|gz|rpm|tgz|mov|MOV|exe)$ # skip URLs containing certain characters as probable queries, etc. [EMAIL PROTECTED] ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-cvs mailing list Nutch-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-cvs