[GitHub] lucene-solr pull request #527: LUCENE-8609: Allow getting consistent docstat...

2018-12-13 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/527 LUCENE-8609: Allow getting consistent docstats from IndexWriter Today we have #numDocs() and #maxDoc() on IndexWriter. This is enough to get all stats for the current index but it's subject

[GitHub] lucene-solr issue #526: LUCENE-8608: Extract utility class to iterate over t...

2018-12-13 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/526 @mikemccand I pushed changes. I think it's ready --- - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org

[GitHub] lucene-solr pull request #526: LUCENE-8608: Extract utility class to iterate...

2018-12-12 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/526 LUCENE-8608: Extract utility class to iterate over terms docs Today we re-implement the same algorithm in various places when we want to consume all docs for a set/list of terms

[GitHub] lucene-solr pull request #523: LUCENE-8602: Share TermsEnum if possible whil...

2018-12-11 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/523 LUCENE-8602: Share TermsEnum if possible while applying DV updates Today we pull a new terms enum when we apply DV updates even though the field stays the same which is the common case

[GitHub] lucene-solr issue #522: LUCENE-8599: Use sparse bitset to store docs in Sing...

2018-12-10 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/522 @jpountz I will get this in as is and then explore what we can do as a followup with DocIdSetBuilder. We do need additional stats to make efficient use of it IMO and this already yields

[GitHub] lucene-solr pull request #522: LUCENE-8599: Use sparse bitset to store docs ...

2018-12-10 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/522 LUCENE-8599: Use sparse bitset to store docs in SingleValueDocValuesFieldUpdates Using a sparse bitset in SingleValueDocValuesFieldUpdates allows storing which documents have an update much

[GitHub] lucene-solr issue #521: LUCENE-8598: Improve field updates packed values

2018-12-10 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/521 > It's interesting that you found out that most of the time is spent with packed ints when sorting, likely because of swaps: the sorting impl that is being used (InPlaceMergeSorter) is the i

[GitHub] lucene-solr pull request #521: LUCENE-8598: Improve field updates packed val...

2018-12-10 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/521#discussion_r240234021 --- Diff: lucene/core/src/java/org/apache/lucene/index/NumericDocValuesFieldUpdates.java --- @@ -53,14 +55,28 @@ BytesRef binaryValue

[GitHub] lucene-solr pull request #521: LUCENE-8598: Improve field updates packed val...

2018-12-09 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/521#discussion_r240047328 --- Diff: lucene/core/src/test/org/apache/lucene/index/Benchmark.java --- @@ -0,0 +1,94 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] lucene-solr pull request #521: LUCENE-8598: Improve field updates packed val...

2018-12-09 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/521 LUCENE-8598: Improve field updates packed values DocValuesFieldUpdates are using compact settings for packed ints that cause dramatic slowdowns when the updates are finished and sorted

[GitHub] lucene-solr pull request #517: LUCENE-8595: Fix interleaved DV update and re...

2018-12-06 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/517 LUCENE-8595: Fix interleaved DV update and reset This change fixes a bug where an interleaved update and reset of a value on the same doc in the same updates package loses an update

[GitHub] lucene-solr issue #516: LUCENE-8594: DV update are broken for updates on new...

2018-12-06 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/516 @jpountz I pushed changes

[GitHub] lucene-solr pull request #516: LUCENE-8594: DV update are broken for updates...

2018-12-06 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/516#discussion_r239409024 --- Diff: lucene/backward-codecs/src/test/org/apache/lucene/index/TestBackwardsCompatibility.java --- @@ -1587,6 +1590,76 @@ public void

[GitHub] lucene-solr pull request #516: LUCENE-8594: DV update are broken for updates...

2018-12-06 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/516#discussion_r239408501 --- Diff: lucene/backward-codecs/src/test/org/apache/lucene/index/TestBackwardsCompatibility.java --- @@ -1587,6 +1590,76 @@ public void

[GitHub] lucene-solr pull request #516: LUCENE-8594: DV update are broken for updates...

2018-12-06 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/516#discussion_r239408275 --- Diff: lucene/backward-codecs/src/test/org/apache/lucene/index/TestBackwardsCompatibility.java --- @@ -1587,6 +1590,76 @@ public void

[GitHub] lucene-solr issue #516: LUCENE-8594: DV update are broken for updates on new...

2018-12-06 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/516 this was found in https://github.com/elastic/elasticsearch/pull/36286

[GitHub] lucene-solr pull request #516: LUCENE-8594: DV update are broken for updates...

2018-12-06 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/516 LUCENE-8594: DV update are broken for updates on new field A segment written with Lucene70Codec fails if it tries to update a DV field that didn't exist in the index before it was upgraded

[GitHub] lucene-solr pull request #515: LUCENE-8593?: Specialize single value numeric...

2018-12-05 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/515 LUCENE-8593?: Specialize single value numeric DV updates The case when all values are the same on a numeric field update is common for soft_deletes. With the new infrastructure

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239136654 --- Diff: lucene/core/src/java/org/apache/lucene/index/FieldUpdatesBuffer.java --- @@ -0,0 +1,280 @@ +/* + * Licensed to the Apache Software

[GitHub] lucene-solr issue #513: LUCENE-8590: Optimize DocValues update datastructure...

2018-12-05 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/513 > Any sense of how much of RAM reduction this is over current impl? I ran a very simple measurement on top of `BufferedUpdates` adding 10k random updates with a constant seed. H

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239062234 --- Diff: lucene/core/src/java/org/apache/lucene/index/FieldUpdatesBuffer.java --- @@ -0,0 +1,235 @@ +/* + * Licensed to the Apache Software

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239062190 --- Diff: lucene/core/src/java/org/apache/lucene/index/FieldUpdatesBuffer.java --- @@ -0,0 +1,235 @@ +/* + * Licensed to the Apache Software

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239036539 --- Diff: lucene/core/src/java/org/apache/lucene/index/FieldUpdatesBuffer.java --- @@ -0,0 +1,235 @@ +/* + * Licensed to the Apache Software

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239036461 --- Diff: lucene/core/src/java/org/apache/lucene/index/FieldUpdatesBuffer.java --- @@ -0,0 +1,235 @@ +/* + * Licensed to the Apache Software

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-05 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/513#discussion_r239036263 --- Diff: lucene/core/src/java/org/apache/lucene/index/BufferedUpdates.java --- @@ -288,15 +186,24 @@ void clear() { deleteTerms.clear

[GitHub] lucene-solr issue #513: LUCENE-8590: Optimize DocValues update datastructure...

2018-12-04 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/513 @mikemccand can you take a look?

[GitHub] lucene-solr pull request #513: LUCENE-8590: Optimize DocValues update datast...

2018-12-04 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/513 LUCENE-8590: Optimize DocValues update datastructures Today we are using a LinkedHashMap to buffer doc-values updates in BufferedUpdates. This on the one hand uses an Object based

[GitHub] lucene-solr issue #503: LUCENE-8571: Don't block on FrozenBufferedUpdates#ap...

2018-11-25 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/503 @msokolov I measured pretty significant lock contention on this lock without my change. With the change it went down significantly. Refreshes got 30% faster for that benchmark (small docs

[GitHub] lucene-solr pull request #503: LUCENE-8571: Don't block on FrozenBufferedUpd...

2018-11-21 Thread s1monw
Github user s1monw closed the pull request at: https://github.com/apache/lucene-solr/pull/503

[GitHub] lucene-solr issue #503: LUCENE-8571: Don't block on FrozenBufferedUpdates#ap...

2018-11-21 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/503 pushed

[GitHub] lucene-solr issue #503: LUCENE-8571: Don't block on FrozenBufferedUpdates#ap...

2018-11-21 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/503 I recommend looking at the diff with `?w=1` -> https://github.com/apache/lucene-solr/pull/503/files?w=1

[GitHub] lucene-solr issue #503: LUCENE-8571: Don't block on FrozenBufferedUpdates#ap...

2018-11-21 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/503 @mikemccand can you take a look

[GitHub] lucene-solr pull request #503: LUCENE-8571: Don't block on FrozenBufferedUpd...

2018-11-21 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/503 LUCENE-8571: Don't block on FrozenBufferedUpdates#apply during IW#processEvents While indexing we try to apply frozen deletes packages concurrently on indexing threads if necessary

[GitHub] lucene-solr pull request #502: LUCENE-8569: Never count soft-deletes if read...

2018-11-20 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/502#discussion_r235273221 --- Diff: lucene/core/src/java/org/apache/lucene/index/IndexWriter.java --- @@ -4398,42 +4426,41 @@ private int mergeMiddle(MergePolicy.OneMerge merge

[GitHub] lucene-solr issue #502: LUCENE-8569: Never count soft-deletes if reader has ...

2018-11-20 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/502 @mikemccand can you take a look at this?

[GitHub] lucene-solr pull request #502: LUCENE-8569: Never count soft-deletes if read...

2018-11-20 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/502 LUCENE-8569: Never count soft-deletes if reader has no hard-deletes Today we count the actual soft-deletes during a merge which is unnecessary if there are no hard-deletes present

[GitHub] lucene-solr issue #480: LUCENE-8535: Drop out of the box Block-Join highligh...

2018-10-18 Thread s1monw
Github user s1monw commented on the issue: https://github.com/apache/lucene-solr/pull/480 @jimczi pushed some changes

[GitHub] lucene-solr pull request #480: LUCENE-8535: Drop out of the box Block-Join h...

2018-10-18 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/480 LUCENE-8535: Drop out of the box Block-Join highlight support Highlighter doesn't support ToParent and ToChildBlockJoinQuery out of the box anymore. In order to highlight on Block-Join

[GitHub] lucene-solr pull request #454: LUCENE-8503: Simplify unwrapping Filter*Reade...

2018-09-17 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/454 LUCENE-8503: Simplify unwrapping Filter*Reader Today we have 3 different kinds of FilterIndexReader. While FilterDirectoryReader and FilterLeafReader are simple to distinguish

[GitHub] lucene-solr pull request #453: LUCENE-8458: Allow access to delegate in Filt...

2018-09-17 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/453 LUCENE-8458: Allow access to delegate in FilterCodecReader FilterCodecReader doesn't allow access to its delegate like other filter readers. This adds a new getDelegate method to access

[GitHub] lucene-solr pull request #445: LUCENE-8484: Allow SubReaderWrapper to filter...

2018-09-04 Thread s1monw
GitHub user s1monw opened a pull request: https://github.com/apache/lucene-solr/pull/445 LUCENE-8484: Allow SubReaderWrapper to filter out entire readers Today we can only wrap readers in SubReaderWrapper but never filter them out entirely. This causes an invariant for soft

[GitHub] lucene-solr pull request #372: LUCENE-8267: removed references to memory cod...

2018-05-08 Thread s1monw
Github user s1monw commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/372#discussion_r186633149 --- Diff: solr/core/src/test/org/apache/solr/uninverting/TestFieldCacheVsDocValues.java --- @@ -619,6 +619,6 @@ private void assertEquals(long numOrds