[jira] [Commented] (LUCENE-10545) Allow Github PR link in CHANGES.html

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529748#comment-17529748 ] Tomoko Uchida commented on LUCENE-10545: Note that this does not replace Jira issue with Github

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529735#comment-17529735 ] Tomoko Uchida commented on LUCENE-10543: I opened [LUCENE-10393] and a PR for it. Can you

[jira] [Comment Edited] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529735#comment-17529735 ] Tomoko Uchida edited comment on LUCENE-10543 at 4/29/22 3:21 AM: - I

[GitHub] [lucene] mocobeta commented on a diff in pull request #854: LUCENE-10393: allow to link github pr from changes

2022-04-28 Thread GitBox
mocobeta commented on code in PR #854: URL: https://github.com/apache/lucene/pull/854#discussion_r861437113 ## lucene/CHANGES.txt: ## @@ -53,7 +53,7 @@ Other * LUCENE-10253: The @BadApple annotation has been removed from the test framework. (Adrien Grand) -* LUCENE-10393:

[GitHub] [lucene] mocobeta commented on a diff in pull request #854: LUCENE-10393: allow to link github pr from changes

2022-04-28 Thread GitBox
mocobeta commented on code in PR #854: URL: https://github.com/apache/lucene/pull/854#discussion_r861437113 ## lucene/CHANGES.txt: ## @@ -53,7 +53,7 @@ Other * LUCENE-10253: The @BadApple annotation has been removed from the test framework. (Adrien Grand) -* LUCENE-10393:

[jira] [Commented] (LUCENE-10545) Allow Github PR link in CHANGES.html

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529732#comment-17529732 ] Tomoko Uchida commented on LUCENE-10545: This PR shows a concrete example to show how it'll

[jira] [Updated] (LUCENE-10545) Allow Github PR link in CHANGES.html

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida updated LUCENE-10545: --- Description: {{changes2html}} already supports links to Github PRs (LUCENE-5383), but the

[jira] [Updated] (LUCENE-10545) Allow Github PR link in CHANGES.html

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomoko Uchida updated LUCENE-10545: --- Description: {{changes2html}} already supports links to Github PRs ([LUCENE-5383]), but

[GitHub] [lucene] mocobeta commented on a diff in pull request #854: LUCENE-10393: allow to link github pr from changes

2022-04-28 Thread GitBox
mocobeta commented on code in PR #854: URL: https://github.com/apache/lucene/pull/854#discussion_r861432053 ## gradle/documentation/changes-to-html/changes2html.pl: ## @@ -572,7 +572,7 @@ $item =~ s{((LUCENE|SOLR|INFRA)\s+(\d{3,}))} {$1}gi; # Link

[GitHub] [lucene] mocobeta commented on a diff in pull request #854: LUCENE-10393: allow to link github pr from changes

2022-04-28 Thread GitBox
mocobeta commented on code in PR #854: URL: https://github.com/apache/lucene/pull/854#discussion_r861431387 ## lucene/CHANGES.txt: ## @@ -53,7 +53,7 @@ Other * LUCENE-10253: The @BadApple annotation has been removed from the test framework. (Adrien Grand) -* LUCENE-10393:

[GitHub] [lucene] mocobeta opened a new pull request, #854: LUCENE-10393: allow to link github pr from changes

2022-04-28 Thread GitBox
mocobeta opened a new pull request, #854: URL: https://github.com/apache/lucene/pull/854 changes2html already supports links to Github PRs, but the link is obsoleted so a small modification is needed to make it work again. With this change, developers can directly link to their pull

[jira] [Created] (LUCENE-10545) Allow Github PR link in CHANGES.html

2022-04-28 Thread Tomoko Uchida (Jira)
Tomoko Uchida created LUCENE-10545: -- Summary: Allow Github PR link in CHANGES.html Key: LUCENE-10545 URL: https://issues.apache.org/jira/browse/LUCENE-10545 Project: Lucene - Core Issue

[GitHub] [lucene] Yuti-G commented on pull request #843: LUCENE-10538: TopN is not being used in getTopChildren

2022-04-28 Thread GitBox
Yuti-G commented on PR #843: URL: https://github.com/apache/lucene/pull/843#issuecomment-1112806474 Thanks @gsmiller for confirming! I will create another Jira issue to propose adding getAllChildren to Facets, and revisit this issue after getting feedback from the community. Thanks again

[jira] [Commented] (LUCENE-10493) Can we unify the viterbi search logic in the tokenizers of kuromoji and nori?

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10493?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529710#comment-17529710 ] ASF subversion and git services commented on LUCENE-10493: -- Commit

[GitHub] [lucene] mocobeta merged pull request #846: LUCENE-10493: move n-best logic to analysis-common

2022-04-28 Thread GitBox
mocobeta merged PR #846: URL: https://github.com/apache/lucene/pull/846 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [lucene] mocobeta commented on pull request #853: LUCENE-10524 Add benchmark suite details to CONTRIBUTING.md

2022-04-28 Thread GitBox
mocobeta commented on PR #853: URL: https://github.com/apache/lucene/pull/853#issuecomment-1112798859 @gautamworah96 This looks good to me! I wonder if it makes sense to mention the off-the-shelf https://github.com/apache/lucene/tree/main/lucene/benchmark module here too (I didn't find

[GitHub] [lucene] gsmiller commented on pull request #843: LUCENE-10538: TopN is not being used in getTopChildren

2022-04-28 Thread GitBox
gsmiller commented on PR #843: URL: https://github.com/apache/lucene/pull/843#issuecomment-1112797399 > Are you suggesting that I add another getAllChildren API to retain the current behavior of getTopChildren, and then fix getTopChildren by returning ranges sorted by counts, or do you

[GitHub] [lucene] gsmiller commented on pull request #848: LUCENE-10530: Avoid floating point precision bug in TestTaxonomyFacetAssociations

2022-04-28 Thread GitBox
gsmiller commented on PR #848: URL: https://github.com/apache/lucene/pull/848#issuecomment-1112794825 @vigyasharma yeah exactly. So the faceting implementation was summing the doc values in a different order from the test case. I wish I could do something similar for the overall

[GitHub] [lucene] gsmiller commented on a diff in pull request #848: LUCENE-10530: Avoid floating point precision bug in TestTaxonomyFacetAssociations

2022-04-28 Thread GitBox
gsmiller commented on code in PR #848: URL: https://github.com/apache/lucene/pull/848#discussion_r861408137 ## lucene/facet/src/test/org/apache/lucene/facet/taxonomy/TestTaxonomyFacetAssociations.java: ## @@ -500,10 +500,7 @@ private void validateFloats(

[GitHub] [lucene] mocobeta commented on pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
mocobeta commented on PR #850: URL: https://github.com/apache/lucene/pull/850#issuecomment-1112791650 I traced method calls of `BaseTokenStreamTestCase.checkRandomData(Random random, Analyzer a, int iterations, int maxWordLength)` with larger values than 255 for `maxWordLength`. Those

[jira] [Commented] (LUCENE-10524) Augment CONTRIBUTING.md guide with instructions on how/when to benchmark

2022-04-28 Thread Gautam Worah (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529692#comment-17529692 ] Gautam Worah commented on LUCENE-10524: --- I've submitted a pretty basic patch. It may be better to

[jira] [Commented] (LUCENE-10292) AnalyzingInfixSuggester thread safety: lookup() fails during (re)build()

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529677#comment-17529677 ] ASF subversion and git services commented on LUCENE-10292: -- Commit

[GitHub] [lucene] vigyasharma commented on a diff in pull request #848: LUCENE-10530: Avoid floating point precision bug in TestTaxonomyFacetAssociations

2022-04-28 Thread GitBox
vigyasharma commented on code in PR #848: URL: https://github.com/apache/lucene/pull/848#discussion_r861353455 ## lucene/facet/src/test/org/apache/lucene/facet/taxonomy/TestTaxonomyFacetAssociations.java: ## @@ -500,10 +500,7 @@ private void validateFloats(

[jira] [Commented] (LUCENE-10292) AnalyzingInfixSuggester thread safety: lookup() fails during (re)build()

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529676#comment-17529676 ] ASF subversion and git services commented on LUCENE-10292: -- Commit

[GitHub] [lucene] gautamworah96 opened a new pull request, #853: LUCENE-10524 Add benchmark suite details to CONTRIBUTING.md

2022-04-28 Thread GitBox
gautamworah96 opened a new pull request, #853: URL: https://github.com/apache/lucene/pull/853 Today, new contributors are usually unaware of where luceneutil benchmarks are and when/how to run them. Committers usually end up pointing contributors to the benchmarks package when they make

[jira] [Resolved] (LUCENE-9476) Add a bulk ordinal->FacetLabel API

2022-04-28 Thread Gautam Worah (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gautam Worah resolved LUCENE-9476. -- Resolution: Fixed The PR was merged into the Lucene 9 branch and released in version Lucene

[jira] [Commented] (LUCENE-10292) AnalyzingInfixSuggester thread safety: lookup() fails during (re)build()

2022-04-28 Thread Chris M. Hostetter (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529668#comment-17529668 ] Chris M. Hostetter commented on LUCENE-10292: - {quote} I'm still not sure whether these

[GitHub] [lucene] Yuti-G commented on pull request #843: LUCENE-10538: TopN is not being used in getTopChildren

2022-04-28 Thread GitBox
Yuti-G commented on PR #843: URL: https://github.com/apache/lucene/pull/843#issuecomment-1112633027 Hi @gsmiller, thanks for your feedback! Yes, I was proposing keeping the existing behavior of getTopChildren but utilizing the top-n param, as I thought it would safe because we would not

[GitHub] [lucene] gsmiller commented on pull request #843: LUCENE-10538: TopN is not being used in getTopChildren

2022-04-28 Thread GitBox
gsmiller commented on PR #843: URL: https://github.com/apache/lucene/pull/843#issuecomment-1112602017 Thanks for all your thoughts @Yuti-G! I think I still disagree with changing the current behavior. If we want to implement the contract of `getTopChildren` properly, we'd need to sort by

[GitHub] [lucene] dweiss commented on pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
dweiss commented on PR #850: URL: https://github.com/apache/lucene/pull/850#issuecomment-1112594924 I ran nightly tests as well, they passed. BUILD SUCCESSFUL in 24m 6s 832 actionable tasks: 480 executed, 352 up-to-date -- This is an automated message from the Apache Git Service. To

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529611#comment-17529611 ] ASF subversion and git services commented on LUCENE-10088: -- Commit

[jira] [Commented] (LUCENE-10088) Too many open files in TestIndexWriterMergePolicy.testStressUpdateSameDocumentWithMergeOnGetReader

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529603#comment-17529603 ] ASF subversion and git services commented on LUCENE-10088: -- Commit

[GitHub] [lucene] dweiss merged pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
dweiss merged PR #851: URL: https://github.com/apache/lucene/pull/851 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [lucene] dweiss commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
dweiss commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r861243269 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[jira] [Resolved] (LUCENE-10518) FieldInfos consistency check can refuse to open Lucene 8 index

2022-04-28 Thread Nhat Nguyen (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nhat Nguyen resolved LUCENE-10518. -- Fix Version/s: 9.1.1 9.2 Resolution: Fixed > FieldInfos

[jira] [Commented] (LUCENE-10542) FieldSource exists implementation can avoid value retrieval

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529580#comment-17529580 ] Robert Muir commented on LUCENE-10542: -- Hi [~krisden], I think there might be more optimizations

[jira] [Commented] (LUCENE-10518) FieldInfos consistency check can refuse to open Lucene 8 index

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529578#comment-17529578 ] ASF subversion and git services commented on LUCENE-10518: -- Commit

[GitHub] [lucene] dnhatn merged pull request #852: LUCENE-10518: Relax field consistency check for old indices (#842)

2022-04-28 Thread GitBox
dnhatn merged PR #852: URL: https://github.com/apache/lucene/pull/852 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[jira] [Updated] (LUCENE-10542) FieldSource exists implementation can avoid value retrieval

2022-04-28 Thread Kevin Risden (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kevin Risden updated LUCENE-10542: -- Description: While looking at LUCENE-10534, found that *FieldSource exists implementation

[jira] [Commented] (LUCENE-10534) MinFloatFunction / MaxFloatFunction exists check can be slow

2022-04-28 Thread Kevin Risden (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529572#comment-17529572 ] Kevin Risden commented on LUCENE-10534: --- Updated metrics - there is a benefit to the new

[GitHub] [lucene] dnhatn opened a new pull request, #852: LUCENE-10518: Relax field consistency check for old indices (#842)

2022-04-28 Thread GitBox
dnhatn opened a new pull request, #852: URL: https://github.com/apache/lucene/pull/852 This change relaxes the field consistency check for old indices as we didn't enforce that in the previous versions. This commit also disables the optimization that relies on the field consistency for

[jira] [Commented] (LUCENE-10518) FieldInfos consistency check can refuse to open Lucene 8 index

2022-04-28 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529522#comment-17529522 ] ASF subversion and git services commented on LUCENE-10518: -- Commit

[GitHub] [lucene] dnhatn merged pull request #842: LUCENE-10518: Relax field consistency check for old indices

2022-04-28 Thread GitBox
dnhatn merged PR #842: URL: https://github.com/apache/lucene/pull/842 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [lucene] dnhatn commented on pull request #842: LUCENE-10518: Relax field consistency check for old indices

2022-04-28 Thread GitBox
dnhatn commented on PR #842: URL: https://github.com/apache/lucene/pull/842#issuecomment-1112420113 @mayya-sharipova @jpountz Thanks for reviews. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [lucene] dnhatn commented on a diff in pull request #842: LUCENE-10518: Relax field consistency check for old indices

2022-04-28 Thread GitBox
dnhatn commented on code in PR #842: URL: https://github.com/apache/lucene/pull/842#discussion_r861098734 ## lucene/core/src/java/org/apache/lucene/index/FieldInfos.java: ## @@ -350,6 +360,11 @@ static final class FieldNumbers { this.omitNorms = new HashMap<>();

[jira] [Commented] (LUCENE-10534) MinFloatFunction / MaxFloatFunction exists check can be slow

2022-04-28 Thread Kevin Risden (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529513#comment-17529513 ] Kevin Risden commented on LUCENE-10534: --- I reviewed the jmh tests again and realized I didn't

[GitHub] [lucene] mocobeta commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
mocobeta commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r861050340 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] uschindler commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
uschindler commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r861032318 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] mocobeta commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
mocobeta commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r861028741 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[jira] [Commented] (LUCENE-10544) Should ExitableTermsEnum wrap postings and impacts?

2022-04-28 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529481#comment-17529481 ] Greg Miller commented on LUCENE-10544: -- Thanks [~jpountz]. One issue with the collector approach

[jira] [Commented] (LUCENE-10544) Should ExitableTermsEnum wrap postings and impacts?

2022-04-28 Thread Adrien Grand (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529470#comment-17529470 ] Adrien Grand commented on LUCENE-10544: --- I suspect that this is due to the fact that the original

[jira] [Updated] (LUCENE-10544) Should ExitableTermsEnum wrap postings and impacts?

2022-04-28 Thread Greg Miller (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Greg Miller updated LUCENE-10544: - Description: While looking into options for LUCENE-10151, I noticed that

[jira] [Created] (LUCENE-10544) Should ExitableTermsEnum wrap postings and impacts?

2022-04-28 Thread Greg Miller (Jira)
Greg Miller created LUCENE-10544: Summary: Should ExitableTermsEnum wrap postings and impacts? Key: LUCENE-10544 URL: https://issues.apache.org/jira/browse/LUCENE-10544 Project: Lucene - Core

[GitHub] [lucene] mikemccand commented on pull request #663: Lucene-10188: Give SortedSetDocValues a docValueCount()?

2022-04-28 Thread GitBox
mikemccand commented on PR #663: URL: https://github.com/apache/lucene/pull/663#issuecomment-1112261833 Thanks @spike-liu -- this looks great to me! I'll leave it open for at least 48 hours to see if anyone else wants to approve/comment, else lazy consensus kicks in! -- This is an

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529452#comment-17529452 ] Michael McCandless commented on LUCENE-10543: - Also, I love this new "Achieve XYZ

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529437#comment-17529437 ] Michael McCandless commented on LUCENE-10543: - +1 to work out a migration plan to switch to

[GitHub] [lucene] gsmiller commented on a diff in pull request #849: LUCENE-10530: Avoid floating point precision bug in TestTaxonomyFacetAssociations

2022-04-28 Thread GitBox
gsmiller commented on code in PR #849: URL: https://github.com/apache/lucene/pull/849#discussion_r860876168 ## lucene/facet/src/test/org/apache/lucene/facet/taxonomy/TestTaxonomyFacetAssociations.java: ## @@ -142,6 +146,34 @@ public static void beforeClass() throws Exception {

[GitHub] [lucene] gsmiller commented on pull request #848: LUCENE-10530: Avoid floating point precision bug in TestTaxonomyFacetAssociations

2022-04-28 Thread GitBox
gsmiller commented on PR #848: URL: https://github.com/apache/lucene/pull/848#issuecomment-1112187182 @gautamworah96 thanks for looking! The mismatch in the original Jira was a slightly different check, which is the one now set to a delta of `0`. The fix proposed here is to ensure the

[jira] [Commented] (LUCENE-10541) What to do about massive terms in our Wikipedia EN LineFileDocs?

2022-04-28 Thread Michael McCandless (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529416#comment-17529416 ] Michael McCandless commented on LUCENE-10541: - {quote}enwiki lines contains 2 million

[GitHub] [lucene] mocobeta commented on pull request #846: LUCENE-10493: move n-best logic to analysis-common

2022-04-28 Thread GitBox
mocobeta commented on PR #846: URL: https://github.com/apache/lucene/pull/846#issuecomment-1112127870 @mikemccand thank you for revewing! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [lucene] jpountz commented on pull request #780: LUCENE-10496: avoid unnecessary attempts to evaluate skipping doc if index sort and search sort are in opposite direction

2022-04-28 Thread GitBox
jpountz commented on PR #780: URL: https://github.com/apache/lucene/pull/780#issuecomment-1112127776 Why do you sort the index if you need to sort in both directions? Is it for range queries? Could you use points instead? Sorry I don't have good ideas for making it less intrusive.

[GitHub] [lucene] wjp719 commented on pull request #780: LUCENE-10496: avoid unnecessary attempts to evaluate skipping doc if index sort and search sort are in opposite direction

2022-04-28 Thread GitBox
wjp719 commented on PR #780: URL: https://github.com/apache/lucene/pull/780#issuecomment-1112123414 @jpountz Ok, tanks. As users ofter need to search on both directions but index data can only be index sorted by one directions. This situation may be common for users. > If you can

[GitHub] [lucene] jpountz commented on pull request #780: LUCENE-10496: avoid unnecessary attempts to evaluate skipping doc if index sort and search sort are in opposite direction

2022-04-28 Thread GitBox
jpountz commented on PR #780: URL: https://github.com/apache/lucene/pull/780#issuecomment-1112113792 If you can find an approach that is less intrusive, maybe I would reconsider, but it still looks to me like we're adding complexity to work around a bad user decisions, which doesn't feel

[GitHub] [lucene] jpountz commented on a diff in pull request #842: LUCENE-10518: Relax field consistency check for old indices

2022-04-28 Thread GitBox
jpountz commented on code in PR #842: URL: https://github.com/apache/lucene/pull/842#discussion_r860789641 ## lucene/core/src/java/org/apache/lucene/index/FieldInfos.java: ## @@ -350,6 +360,11 @@ static final class FieldNumbers { this.omitNorms = new HashMap<>();

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529343#comment-17529343 ] Tomoko Uchida commented on LUCENE-10543: I just found that it looks like {{changes2html.pl}}

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Tomoko Uchida (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529333#comment-17529333 ] Tomoko Uchida commented on LUCENE-10543: bq. Creating a PR still requires signing up for Apache

[GitHub] [lucene] uschindler commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
uschindler commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860662889 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] uschindler commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
uschindler commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860657767 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] mocobeta commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
mocobeta commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860642306 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] mocobeta commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
mocobeta commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860642306 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] dweiss commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
dweiss commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860641028 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] rmuir commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
rmuir commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860635303 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] rmuir commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
rmuir commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860631960 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] rmuir commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
rmuir commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860628445 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] rmuir commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
rmuir commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860626938 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] dweiss commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
dweiss commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860625375 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit the

[GitHub] [lucene] uschindler commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
uschindler commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860623990 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] uschindler commented on a diff in pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
uschindler commented on code in PR #850: URL: https://github.com/apache/lucene/pull/850#discussion_r860622081 ## lucene/test-framework/src/java/org/apache/lucene/tests/analysis/MockTokenizer.java: ## @@ -66,11 +67,11 @@ public class MockTokenizer extends Tokenizer { * Limit

[GitHub] [lucene] rmuir commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
rmuir commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860621619 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] rmuir commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
rmuir commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860619792 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] uschindler commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
uschindler commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860618388 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] dweiss commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
dweiss commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860616950 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] uschindler commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
uschindler commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860615990 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[GitHub] [lucene] uschindler commented on a diff in pull request #851: LUCENE-10088: backport simple text codec suppression and fs handle limit

2022-04-28 Thread GitBox
uschindler commented on code in PR #851: URL: https://github.com/apache/lucene/pull/851#discussion_r860615393 ## lucene/test-framework/src/java/org/apache/lucene/tests/mockfile/HandleLimitFS.java: ## @@ -17,16 +17,38 @@ package org.apache.lucene.tests.mockfile; import

[jira] [Commented] (LUCENE-10541) What to do about massive terms in our Wikipedia EN LineFileDocs?

2022-04-28 Thread Uwe Schindler (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529286#comment-17529286 ] Uwe Schindler commented on LUCENE-10541: I think another option might be to use

[GitHub] [lucene] uschindler commented on pull request #850: LUCENE-10541: limit the default length of MockTokenizer tokens to 255.

2022-04-28 Thread GitBox
uschindler commented on PR #850: URL: https://github.com/apache/lucene/pull/850#issuecomment-877517 Maybe we should set the limit to the maximum term size. This would allow us to test also longer terms. -- This is an automated message from the Apache Git Service. To respond to the

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529270#comment-17529270 ] Robert Muir commented on LUCENE-10543: -- It was entirely too difficult to find the issue! I knew it

[jira] [Commented] (LUCENE-9871) Achieve build system perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-9871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529263#comment-17529263 ] Robert Muir commented on LUCENE-9871: - just a reminder, haven't forgot about java version

[jira] [Commented] (LUCENE-10541) What to do about massive terms in our Wikipedia EN LineFileDocs?

2022-04-28 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529261#comment-17529261 ] Dawid Weiss commented on LUCENE-10541: -- Filed a PR at https://github.com/apache/lucene/pull/850.

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529255#comment-17529255 ] Dawid Weiss commented on LUCENE-10543: -- ("with progress"... yeah, that's why LUCENE-9871 is still

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529245#comment-17529245 ] Robert Muir commented on LUCENE-10543: -- Believe it or not, there's actually no "link" to the

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529240#comment-17529240 ] Robert Muir commented on LUCENE-10543: -- another idea, add a simple "fork me on github" to the

[jira] [Commented] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529236#comment-17529236 ] Robert Muir commented on LUCENE-10543: -- another idea is to use github wiki functionality vs the

[jira] [Created] (LUCENE-10543) Achieve contribution workflow perfection (with progress)

2022-04-28 Thread Robert Muir (Jira)
Robert Muir created LUCENE-10543: Summary: Achieve contribution workflow perfection (with progress) Key: LUCENE-10543 URL: https://issues.apache.org/jira/browse/LUCENE-10543 Project: Lucene - Core

[jira] [Commented] (LUCENE-10292) AnalyzingInfixSuggester thread safety: lookup() fails during (re)build()

2022-04-28 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529223#comment-17529223 ] Dawid Weiss commented on LUCENE-10292: -- Thanks Chris. I'm still not sure whether these tests make

[jira] [Commented] (LUCENE-10531) Mark testLukeCanBeLaunched @Nightly test and make a dedicated Github CI workflow for it

2022-04-28 Thread Dawid Weiss (Jira)
[ https://issues.apache.org/jira/browse/LUCENE-10531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17529218#comment-17529218 ] Dawid Weiss commented on LUCENE-10531: -- Fine with me. > Mark testLukeCanBeLaunched @Nightly test

[GitHub] [lucene] dweiss commented on a diff in pull request #844: LUCENE-10539: Return a stream of completions from FSTCompletion.

2022-04-28 Thread GitBox
dweiss commented on code in PR #844: URL: https://github.com/apache/lucene/pull/844#discussion_r860523184 ## lucene/suggest/src/java/org/apache/lucene/search/suggest/fst/FSTCompletion.java: ## @@ -184,110 +195,174 @@ public List lookup(CharSequence key, int num) {