[jira] [Resolved] (NUTCH-1640) OOM in ParseSegment Phase

2013-10-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1640. -- Resolution: Fixed Committed revision 1529802. Thanks Mitesh. OOM in ParseSegment Phase

[jira] [Commented] (NUTCH-1562) Order of execution for scoring filters

2013-10-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788039#comment-13788039 ] Julien Nioche commented on NUTCH-1562: -- Hi Seb You are right about the order from

[jira] [Resolved] (NUTCH-1562) Order of execution for scoring filters

2013-10-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche resolved NUTCH-1562. -- Resolution: Fixed Committed revision 1529813. Order of execution for scoring filters

[jira] [Updated] (NUTCH-1588) Port NUTCH-1245 URL gone with 404 after db.fetch.interval.max stays db_unfetched in CrawlDb and is generated over and over again to 2.x

2013-10-07 Thread Talat UYARER (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Talat UYARER updated NUTCH-1588: Attachment: NUTCH-1588-final.patch I updated coding's style. Thanks for notice, Sebastian Nagel

[jira] [Commented] (NUTCH-1562) Order of execution for scoring filters

2013-10-07 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788059#comment-13788059 ] Hudson commented on NUTCH-1562: --- SUCCESS: Integrated in Nutch-trunk #2380 (See

[jira] [Updated] (NUTCH-1606) Check that Factory classes use the cache in a thread safe way

2013-10-07 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1606: - Attachment: NUTCH-1606.patch Synchronized methods on ObjectCache + calls from

[jira] [Created] (NUTCH-1652) Avoid instanciation of MimeUtil for each Content object created

2013-10-07 Thread Julien Nioche (JIRA)
Julien Nioche created NUTCH-1652: Summary: Avoid instanciation of MimeUtil for each Content object created Key: NUTCH-1652 URL: https://issues.apache.org/jira/browse/NUTCH-1652 Project: Nutch

[jira] [Commented] (NUTCH-961) Expose Tika's boilerpipe support

2013-10-07 Thread Nguyen Manh Tien (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788911#comment-13788911 ] Nguyen Manh Tien commented on NUTCH-961: I used patch NUTCH-961-2.1-v2.patch for