[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13418999#comment-13418999 ] Markus Jelsma commented on NUTCH-1433: -- {code} 2012-07-20 10:15:49,402 WARN

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new gora configuration

2012-07-20 Thread Ferdy Galema (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ferdy Galema updated NUTCH-1365: Attachment: NUTCH-1365-v3.patch Small improvement of the patch by showing the crawlId name in the

[jira] [Updated] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated NUTCH-1433: - Attachment: NUTCH-1433-trunk-2.patch Dependency to juniversalchardet needed in root ivy.xml

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419014#comment-13419014 ] Julien Nioche commented on NUTCH-1433: -- Markus : I can't reproduce this issue. Are

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419018#comment-13419018 ] Markus Jelsma commented on NUTCH-1433: -- Hmm, i did clean the build on trunk! Anyway,

[jira] [Created] (NUTCH-1434) Indexer to delete robots noIndex

2012-07-20 Thread Markus Jelsma (JIRA)
Markus Jelsma created NUTCH-1434: Summary: Indexer to delete robots noIndex Key: NUTCH-1434 URL: https://issues.apache.org/jira/browse/NUTCH-1434 Project: Nutch Issue Type: New Feature

[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419070#comment-13419070 ] Markus Jelsma commented on NUTCH-1341: -- Any comments on this one?

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419069#comment-13419069 ] Markus Jelsma commented on NUTCH-1388: -- Comments? Optionally

[jira] [Commented] (NUTCH-1341) NotModified time set to now but page not modified

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419084#comment-13419084 ] Julien Nioche commented on NUTCH-1341: -- Looks like a reasonable thing to do

[jira] [Commented] (NUTCH-1434) Indexer to delete robots noIndex

2012-07-20 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1434?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419125#comment-13419125 ] Lewis John McGibbney commented on NUTCH-1434: - Can we clarify exactly what we

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419123#comment-13419123 ] Markus Jelsma commented on NUTCH-1388: -- That would not make injecting flexible. We

[jira] [Commented] (NUTCH-1430) Freegenerator records overwrite CrawlDB records with AdaptiveFetchSchedule

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1430?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419153#comment-13419153 ] Markus Jelsma commented on NUTCH-1430: -- Now that i've got|had your attention anyway,

[jira] [Resolved] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-07-20 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1388. -- Resolution: Fixed Committed for 1.6 in rev. 1363741. Thanks for reviewing!

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419175#comment-13419175 ] Julien Nioche commented on NUTCH-1433: -- Committed in trunk : revision 1363794.

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-07-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419205#comment-13419205 ] Hudson commented on NUTCH-1388: --- Integrated in nutch-trunk-maven #359 (See

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419206#comment-13419206 ] Hudson commented on NUTCH-1433: --- Integrated in nutch-trunk-maven #359 (See

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419260#comment-13419260 ] Julien Nioche commented on NUTCH-1433: -- Anyone to test the patch for 2.x?

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419258#comment-13419258 ] Julien Nioche commented on NUTCH-1433: -- Hmm, probably had a problem with the ivy

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419307#comment-13419307 ] Hudson commented on NUTCH-1433: --- Integrated in nutch-trunk-maven #360 (See

[jira] [Commented] (NUTCH-1388) Optionally maintain custom fetch interval despite AdaptiveFetchSchedule

2012-07-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419744#comment-13419744 ] Hudson commented on NUTCH-1388: --- Integrated in Nutch-trunk #1903 (See

[jira] [Commented] (NUTCH-1433) Upgrade to Tika 1.2

2012-07-20 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1433?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13419745#comment-13419745 ] Hudson commented on NUTCH-1433: --- Integrated in Nutch-trunk #1903 (See