[jira] [Updated] (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

David Smiley updated LUCENE-2735:
    Fix Version/s: (was: 4.7)
                   4.8

First Cut at GroupVarInt with FixedIntBlockIndexInput / Output

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Core
Issue Type: Improvement
Components: core/index
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Simon Willnauer
Priority: Minor
Fix For: 4.8
Attachments: LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735_alt.patch

I have hacked together a FixedIntBlockIndex impl with Group VarInt encoding - this does way worse than standard codec in benchmarks but I guess that is mainly due to the FixedIntBlockIndex limitations. Once LUCENE-2723 is in / or builds with trunk again I will update and run some tests. The isolated microbenchmark shows that there could be improvements over vint even in java though and I am sure we can make it faster impl. wise.

--
This message was sent by Atlassian JIRA (v6.2#6252)

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
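For readers unfamiliar with the baseline the description compares against, here is a minimal sketch of VInt coding: seven payload bits per byte, low-order bits first, with the high bit set when more bytes follow. The class name `VIntSketch` and its buffer handling are hypothetical and are not taken from the attached patches. The decoder has to test the continuation bit on every byte, which is the per-byte branching that group encodings are usually meant to avoid.

```java
/** Hypothetical illustration of VInt coding; not code from the LUCENE-2735 patches. */
final class VIntSketch {
    private final byte[] buf;
    private int pos;

    VIntSketch(int capacity) {
        buf = new byte[capacity];
    }

    /** Writes v using 1 to 5 bytes, 7 bits per byte, low-order bits first. */
    void writeVInt(int v) {
        while ((v & ~0x7F) != 0) {
            buf[pos++] = (byte) ((v & 0x7F) | 0x80); // high bit set: more bytes follow
            v >>>= 7;
        }
        buf[pos++] = (byte) v; // last byte has the high bit clear
    }

    /** Reads one value; note the continuation-bit check on every byte. */
    int readVInt() {
        byte b = buf[pos++];
        int v = b & 0x7F;
        for (int shift = 7; (b & 0x80) != 0; shift += 7) {
            b = buf[pos++];
            v |= (b & 0x7F) << shift;
        }
        return v;
    }

    void rewind() {
        pos = 0;
    }

    int position() {
        return pos;
    }
}
```

In this sketch the value 300 occupies two bytes, and a value with the top bit set (such as -1 treated as unsigned) occupies five.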
[jira] [Updated] (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Uwe Schindler updated LUCENE-2735:
    Fix Version/s: (was: 4.3)
                   4.4

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Core
Issue Type: Improvement
Components: core/index
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Simon Willnauer
Priority: Minor
Fix For: 4.4
Attachments: LUCENE-2735_alt.patch, LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735.patch
[jira] [Updated] (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steve Rowe updated LUCENE-2735:
    Fix Version/s: (was: 4.1)
                   4.2

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Core
Issue Type: Improvement
Components: core/index
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Simon Willnauer
Priority: Minor
Fix For: 4.2
Attachments: LUCENE-2735_alt.patch, LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735.patch
[jira] Updated: (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer updated LUCENE-2735:
    Attachment: LUCENE-2735.patch

Here is a new patch with a fixed test case and an updated microbenchmark.

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Affects Versions: 4.0
Reporter: Simon Willnauer
Assignee: Simon Willnauer
Priority: Minor
Fix For: 4.0
Attachments: LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735_alt.patch
[jira] Updated: (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Simon Willnauer updated LUCENE-2735:
    Attachment: LUCENE-2735.patch

Here is a rough patch.

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Affects Versions: 4.0
Reporter: Simon Willnauer
Priority: Minor
Fix For: 4.0
Attachments: LUCENE-2735.patch
[jira] Updated: (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yonik Seeley updated LUCENE-2735:
    Attachment: LUCENE-2735_alt.patch

Here's an update that removes the array lookups in favor of calculating the lengths, along with a bunch of other small optimizations that may or may not matter (such as removing unnecessary masks, and checking for single-byte values first rather than 4-byte values during encoding).

The lookup table would have taken substantial memory: 256*(64+4*4) == 20K. That is a good fraction of the L1 cache (perhaps not detectable in a micro-benchmark, but perhaps significant in a full application).

Anyway - I made no attempt at benchmarking it, so YMMV.

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Affects Versions: 4.0
Reporter: Simon Willnauer
Priority: Minor
Fix For: 4.0
Attachments: LUCENE-2735.patch, LUCENE-2735_alt.patch
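The approach described in the comment above (computing each value's byte length instead of reading it from a table, and testing the single-byte case first) might look roughly like the sketch below. This is an illustration only, not the contents of LUCENE-2735_alt.patch: the class `GroupVarIntSketch`, its method names, the little-endian byte order, and the placement of the 2-bit length fields in the tag byte are all assumptions. Each group of four ints is written as one tag byte holding four (length - 1) fields, followed by 1 to 4 bytes per value.

```java
/** Hypothetical Group VarInt sketch; layout details are assumptions, not the patch's format. */
final class GroupVarIntSketch {

    /** Bytes needed for v (treated as unsigned), computed with masks rather than a table. */
    static int byteLength(int v) {
        if ((v & 0xFFFFFF00) == 0) return 1; // single-byte case checked first
        if ((v & 0xFFFF0000) == 0) return 2;
        if ((v & 0xFF000000) == 0) return 3;
        return 4;
    }

    /** Encodes values[off..off+3] into out starting at pos; returns the position after the group. */
    static int encodeGroup(int[] values, int off, byte[] out, int pos) {
        int tagPos = pos++; // reserve the tag byte, fill it in once all lengths are known
        int tag = 0;
        for (int i = 0; i < 4; i++) {
            int v = values[off + i];
            int len = byteLength(v);
            tag |= (len - 1) << (i * 2); // 2 bits per value
            for (int b = 0; b < len; b++) {
                out[pos++] = (byte) (v >>> (8 * b)); // low-order byte first
            }
        }
        out[tagPos] = (byte) tag;
        return pos;
    }

    /** Decodes one group from in at pos into values[off..off+3]; returns the position after the group. */
    static int decodeGroup(byte[] in, int pos, int[] values, int off) {
        int tag = in[pos++] & 0xFF;
        for (int i = 0; i < 4; i++) {
            int len = ((tag >>> (i * 2)) & 3) + 1; // length derived from the tag, no table
            int v = 0;
            for (int b = 0; b < len; b++) {
                v |= (in[pos++] & 0xFF) << (8 * b);
            }
            values[off + i] = v;
        }
        return pos;
    }
}
```

A group needs at most 17 bytes (one tag byte plus four 4-byte values), so callers of this sketch would size the output buffer accordingly. Whether computed lengths beat a table lookup in practice is exactly the open question the comment raises, since no benchmark was run.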
[jira] Updated: (LUCENE-2735) First Cut at GroupVarInt with FixedIntBlockIndexInput / Output
[ https://issues.apache.org/jira/browse/LUCENE-2735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael McCandless updated LUCENE-2735:
    Attachment: LUCENE-2735.patch

I made a codec for GVint (attached), but a few tests fail with spooky exceptions, e.g. TestPhraseQuery.testRandomPhrases, TestCodecs.testRandomPostings and TestIndicesEquals.testInstantiatedIndexWriter (in contrib/instantiated which, somehow, is really good at finding sneaky codec problems!).

Key: LUCENE-2735
URL: https://issues.apache.org/jira/browse/LUCENE-2735
Project: Lucene - Java
Issue Type: Improvement
Components: Index
Affects Versions: 4.0
Reporter: Simon Willnauer
Priority: Minor
Fix For: 4.0
Attachments: LUCENE-2735.patch, LUCENE-2735.patch, LUCENE-2735_alt.patch