Hi Folks,
I updated to the latest SVN revision (385691) today, and I am now seeing a
Null Pointer exception in the AnalyzerFactory.java class. It seems that in
some cases, the method:
private Extension getExtension(String lang) { Extension extension =
(Extension) this.conf.getObject(lang); if (extension == null) {
extension = findExtension(lang); if (extension != null) {
this.conf.setObject(lang, extension); } } return extension; }
Has a null "lang" parameter passed to it, which causes a NullPointer
exception at line: 81 in
src/java/org/apache/nutch/analyzer/AnalyzerFactory.java
I found that if I checked for null in the lang variable, and returned null
if lang == null, that my crawl finished. Here is a small patch that will fix
the crawl:
Index:
/Users/mattmann/src/nutch/src/java/org/apache/nutch/analysis/AnalyzerFactory
.java ===================================================================
---
/Users/mattmann/src/nutch/src/java/org/apache/nutch/analysis/AnalyzerFactory
.java (revision 385691) +++
/Users/mattmann/src/nutch/src/java/org/apache/nutch/analysis/AnalyzerFactory
.java (working copy) @@ -78,14 +78,19 @@ private Extension
getExtension(String lang) { - Extension extension = (Extension)
this.conf.getObject(lang); - if (extension == null) { - extension =
findExtension(lang); - if (extension != null) { -
this.conf.setObject(lang, extension); - } - } - return extension;
+ if(lang == null){ + return null; + } + else{ +
Extension extension = (Extension) this.conf.getObject(lang); + if
(extension == null) { + extension = findExtension(lang); +
if (extension != null) { + this.conf.setObject(lang, extension);
+ } + } + return extension; + } }
private Extension findExtension(String lang) {
NOTE: not sure if returning null is the right thing to do here, but hey, at
least it made my crawl finish! :-)
Cheers,
Chris
______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED]
Staff Member
Modeling and Data Management Systems Section (387)
Data Management Systems and Technologies Group
_________________________________________________
Jet Propulsion Laboratory Pasadena, CA
Office: 171-266B Mailstop: 171-246
_______________________________________________________
Disclaimer: The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.