[
https://issues.apache.org/jira/browse/NUTCH-2049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14706789#comment-14706789
]
Sebastian Nagel commented on NUTCH-2049:
----------------------------------------
+1 to commit, as said, looking on performance of the unit tests can be done
later, some details below.
{noformat}
% time ant clean runtime test
(before)
Total time: 5 minutes 34 seconds
real 5m35.133s
user 7m30.968s
sys 0m21.528s
(after patching)
Total time: 6 minutes 39 seconds
real 6m39.794s
user 9m31.444s
sys 0m26.780s
{noformat}
These tests show significant differences, `-' before, `+' after patching:
{noformat}
[junit] Running org.apache.nutch.crawl.TestGenerator
- [junit] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
32.846 sec
+ [junit] Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
36.279 sec
...
[junit] Running org.apache.nutch.fetcher.TestFetcher
- [junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
12.068 sec
+ [junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
14.273 sec
...
[junit] Running org.apache.nutch.parse.TestParserFactory
- [junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
2.783 sec
+ [junit] Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
4.038 sec
...
[junit] Running org.apache.nutch.segment.TestSegmentMerger
- [junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
75.408 sec
+ [junit] Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
91.652 sec
[junit] Running org.apache.nutch.segment.TestSegmentMergerCrawlDatums
- [junit] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
69.821 sec
+ [junit] Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed:
84.443 sec
{noformat}
> Upgrade Trunk to Hadoop > 2.4 stable
> ------------------------------------
>
> Key: NUTCH-2049
> URL: https://issues.apache.org/jira/browse/NUTCH-2049
> Project: Nutch
> Issue Type: Improvement
> Components: build
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Labels: memex
> Fix For: 1.11
>
> Attachments: NUTCH-2049.patch, NUTCH-2049v2.patch, NUTCH-2049v3.patch
>
>
> Convo here - http://www.mail-archive.com/dev%40nutch.apache.org/msg18225.html
> I am +1 for taking trunk (or a branch of trunk) to explicit dependency on >
> Hadoop 2.6.
> We can run our tests, we can validate, we can fix.
> I will be doing validation on 2.X in paralegal as this is what I use on my
> own projects.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)