I'm trying to apply timeout.patch to Nutch branch-1.1, but I get the error:

wget https://issues.apache.org/jira/secure/attachment/12448699/timeout.patch
patch -p0 < timeout.patch


patching file src/java/org/apache/nutch/parse/ParseCallable.java
patching file src/java/org/apache/nutch/parse/ParseUtil.java
Hunk #2 FAILED at 48.
Hunk #3 succeeded at 86 (offset -2 lines).
Hunk #5 succeeded at 141 (offset -2 lines).
1 out of 5 hunks FAILED -- saving rejects to file
src/java/org/apache/nutch/parse/ParseUtil.java.rej

ParseUtil.java.rej
***************
*** 42,47 ****
    public static final Log LOG = LogFactory.getLog(ParseUtil.class);
    private ParserFactory parserFactory;
    private Configuration conf;
    
    /**
     * 
--- 48,54 ----
    public static final Log LOG = LogFactory.getLog(ParseUtil.class);
    private ParserFactory parserFactory;
    private Configuration conf;
+   private int MAX_PARSE_TIME = 30; // 30 seconds should be enough for
anybody...
    
    /**
     * 

The original ParseUtil.java does not have the line : private Configuration
conf;

  /* our log stream */
  public static final Log LOG = LogFactory.getLog(ParseUtil.class);
  private ParserFactory parserFactory;
  
  /**


I have tried to manually patch it, but I don't think that works, because the
first time I hit a timeout I get a error that kills the process

2010-07-21 15:39:28,642 DEBUG parse.ParseUtil - Parsing
[http://1420wackmorningshow.blogspot.com/feeds/posts/default] with
[org.apache.nutch.parse.tika.tikapar...@3909ea96]
2010-07-21 15:39:28,642 DEBUG tika.TikaParser - Using Tika parser
org.apache.tika.parser.xml.DcXMLParser for mime-type application/xml
2010-07-21 15:39:38,643 WARN  parse.ParseUtil - TIMEOUT parsing
http://1420wackmorningshow.blogspot.com/feeds/posts/default with
org.apache.nutch.parse.tika.tikapar...@3909ea96
2010-07-21 15:39:38,644 WARN  parse.ParseUtil - Unable to successfully parse
content http://1420wackmorningshow.blogspot.com/feeds/posts/default of type
application/xml
2010-07-21 15:39:38,645 WARN  mapred.LocalJobRunner - job_local_0001
java.lang.NullPointerException
        at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:91)
        at org.apache.nutch.parse.ParseSegment.map(ParseSegment.java:41)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
        at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177)


Should this patch work with Branch 1.1?

Thanks
Brad



Reply via email to