[jira] [Created] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Matthias Agethle (JIRA)
Matthias Agethle created NUTCH-1384: --- Summary: Typo in ParseSegment's run-method Key: NUTCH-1384 URL: https://issues.apache.org/jira/browse/NUTCH-1384 Project: Nutch Issue Type: Bug

[jira] [Updated] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Andy Xue (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Xue updated NUTCH-1385: Attachment: nutch-1385.txt More robust plug-in order properties in nutch-site.xml

[jira] [Created] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Andy Xue (JIRA)
Andy Xue created NUTCH-1385: --- Summary: More robust plug-in order properties in nutch-site.xml Key: NUTCH-1385 URL: https://issues.apache.org/jira/browse/NUTCH-1385 Project: Nutch Issue Type:

[jira] [Commented] (NUTCH-1383) IndexingFiltersChecker to show error message instead of null pointer exception

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13292668#comment-13292668 ] Markus Jelsma commented on NUTCH-1383: -- +1 IndexingFiltersChecker

[jira] [Updated] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1384: - Fix Version/s: 1.6 Typo in ParseSegment's run-method -

[jira] [Updated] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1385: - Fix Version/s: 1.6 More robust plug-in order properties in nutch-site.xml

[jira] [Updated] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Andy Xue (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Xue updated NUTCH-1385: Description: When listing multiple scoring filters in certain properties (listed below) in

[jira] [Updated] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Andy Xue (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Xue updated NUTCH-1385: Description: When listing multiple scoring filters in certain properties (listed below) in

[jira] [Assigned] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma reassigned NUTCH-1384: Assignee: Markus Jelsma Typo in ParseSegment's run-method

[jira] [Resolved] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1385. -- Resolution: Fixed Committed for 1.6 in rev. 1348764. Thanks Andy. More

[jira] [Resolved] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1384. -- Resolution: Fixed Committed for 1.6 in rev. 1348766. Thanks Matthias Typo in

[jira] [Resolved] (NUTCH-1262) Map `duplicating` content-types to a single type

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma resolved NUTCH-1262. -- Resolution: Fixed Committed for 1.6 in rev. 1348785. Thanks. Map

[jira] [Updated] (NUTCH-1262) Map `duplicating` content-types to a single type

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Markus Jelsma updated NUTCH-1262: - Description: Similar or duplicating content-types can end-up differently in an index. With, for

[jira] [Commented] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13292815#comment-13292815 ] Lewis John McGibbney commented on NUTCH-1385: - Excellent Andy. Thanks for your

[jira] [Commented] (NUTCH-1262) Map `duplicating` content-types to a single type

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293039#comment-13293039 ] Hudson commented on NUTCH-1262: --- Integrated in nutch-trunk-maven #306 (See

[jira] [Commented] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293040#comment-13293040 ] Hudson commented on NUTCH-1385: --- Integrated in nutch-trunk-maven #306 (See

[jira] [Commented] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293041#comment-13293041 ] Hudson commented on NUTCH-1384: --- Integrated in nutch-trunk-maven #306 (See

[jira] [Resolved] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2012-06-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1360. - Resolution: Fixed Committed @revision 1348993 in trunk as well.

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293085#comment-13293085 ] Hudson commented on NUTCH-1360: --- Integrated in nutch-trunk-maven #307 (See

[jira] [Updated] (NUTCH-1364) Add a counter for malformed urls

2012-06-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1364: Attachment: NUTCH-1364-trunk.patch Patch ffor trunk Add a

[jira] [Commented] (NUTCH-1364) Add a counter for malformed urls

2012-06-11 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293138#comment-13293138 ] Markus Jelsma commented on NUTCH-1364: -- I never saw a malformed URL in the generator

[jira] [Updated] (NUTCH-1364) Add a counter in Generator for malformed urls

2012-06-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney updated NUTCH-1364: Summary: Add a counter in Generator for malformed urls (was: Add a counter for

[jira] [Resolved] (NUTCH-1364) Add a counter in Generator for malformed urls

2012-06-11 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lewis John McGibbney resolved NUTCH-1364. - Resolution: Fixed Committed @revision 1349076 in trunk Thanks Markus for

[jira] [Commented] (NUTCH-1364) Add a counter in Generator for malformed urls

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293245#comment-13293245 ] Hudson commented on NUTCH-1364: --- Integrated in nutch-trunk-maven #308 (See

[jira] [Commented] (NUTCH-1360) Suport the storing of IP address connected to when web crawling

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1329#comment-1329 ] Hudson commented on NUTCH-1360: --- Integrated in Nutch-trunk #1868 (See

[jira] [Commented] (NUTCH-1385) More robust plug-in order properties in nutch-site.xml

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293336#comment-13293336 ] Hudson commented on NUTCH-1385: --- Integrated in Nutch-trunk #1868 (See

[jira] [Commented] (NUTCH-1262) Map `duplicating` content-types to a single type

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293334#comment-13293334 ] Hudson commented on NUTCH-1262: --- Integrated in Nutch-trunk #1868 (See

[jira] [Commented] (NUTCH-1384) Typo in ParseSegment's run-method

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293337#comment-13293337 ] Hudson commented on NUTCH-1384: --- Integrated in Nutch-trunk #1868 (See

[jira] [Commented] (NUTCH-1364) Add a counter in Generator for malformed urls

2012-06-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/NUTCH-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13293335#comment-13293335 ] Hudson commented on NUTCH-1364: --- Integrated in Nutch-trunk #1868 (See