[
https://issues.apache.org/jira/browse/LUCENE-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13259762#comment-13259762
]
Michael McCandless commented on LUCENE-3994:
--------------------------------------------
bq. so all the time is spent in GZIPInputStream.inflateBytes!
Ugh, nice find Robert!
I think for nightly hudson we should just pre-gunzip the file?
I was also curious if this is substantially slowing down tests for the
checked-in lines file ... it's much smaller so much less seeking. I ran a few
tests (ran all lucene tests, using the python runner, with compressed vs
uncompressed) and it seems to be in the noise...
> some nightly tests take hours
> -----------------------------
>
> Key: LUCENE-3994
> URL: https://issues.apache.org/jira/browse/LUCENE-3994
> Project: Lucene - Java
> Issue Type: Bug
> Components: general/build
> Affects Versions: 4.0
> Reporter: Robert Muir
> Fix For: 4.0
>
> Attachments: LUCENE-3994.patch
>
>
> The nightly builds are taking 4-7 hours.
> This is caused by a few bad apples (can be seen
> https://builds.apache.org/job/Lucene-trunk/1896/testReport/).
> The top 5 are (all in analysis):
> * TestSynonymMapFilter: 1 hr 54 min
> * TestRandomChains: 1 hr 22 min
> * TestRemoveDuplicatesTokenFilter: 32 min
> * TestMappingCharFilter: 28 min
> * TestWordDelimiterFilter: 22 min
> so thats 4.5 hours right there for that run....
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]