[jira] [Commented] (LUCENE-8204) ReqOptSumScorer should leverage sub scorers' per-block max scores

2018-08-08 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16573030#comment-16573030 ] Jim Ferenczi commented on LUCENE-8204: -- Thanks Adrien, I pushed a new patch that addresses your

[jira] [Resolved] (LUCENE-8439) DisjunctionMaxScorer should leverage sub scorers' per-block max scores

2018-08-08 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi resolved LUCENE-8439. -- Resolution: Fixed Fix Version/s: master (8.0) Thanks [~jpountz] ! >

[jira] [Commented] (LUCENE-8448) Slowdown of nested boolean queries after LUCENE-8060

2018-08-14 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16579721#comment-16579721 ] Jim Ferenczi commented on LUCENE-8448: -- We've tried several things with Adrien to optimize the

[jira] [Updated] (LUCENE-8448) Slowdown of nested boolean queries after LUCENE-8060

2018-08-14 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8448: - Attachment: LUCENE-8448.patch > Slowdown of nested boolean queries after LUCENE-8060 >

[jira] [Commented] (LUCENE-8466) FrozenBufferedUpdates#apply*Deletes is incorrect when index sorting is enabled

2018-08-27 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16594050#comment-16594050 ] Jim Ferenczi commented on LUCENE-8466: -- Thanks Tomás and sorry Vish for not adding you in the first

[jira] [Resolved] (LUCENE-8466) FrozenBufferedUpdates#apply*Deletes is incorrect when index sorting is enabled

2018-08-27 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi resolved LUCENE-8466. -- Resolution: Fixed Fix Version/s: master (8.0) 7.5 Thanks Adrien and

[jira] [Updated] (LUCENE-8466) FrozenBufferedUpdates#apply*Deletes is incorrect when index sorting is enabled

2018-08-27 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8466: - Attachment: LUCENE-8466.patch > FrozenBufferedUpdates#apply*Deletes is incorrect when index

[jira] [Commented] (LUCENE-8466) FrozenBufferedUpdates#apply*Deletes is incorrect when index sorting is enabled

2018-08-27 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16593610#comment-16593610 ] Jim Ferenczi commented on LUCENE-8466: -- Here is a patch that fixes delete by query. It seems that

[jira] [Commented] (LUCENE-8306) Allow iteration over the term positions of a Match

2018-07-20 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16550491#comment-16550491 ] Jim Ferenczi commented on LUCENE-8306: -- +1, thanks [~romseygeek] , the patch looks good.  > Allow

[jira] [Resolved] (LUCENE-8402) TestPriorityQueue failures

2018-07-20 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi resolved LUCENE-8402. -- Resolution: Fixed I removed the invalid assertions, thanks [~thetaphi]. > TestPriorityQueue

[jira] [Commented] (LUCENE-8401) Add PassageBuilder to help construct highlights using MatchesIterator

2018-07-18 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16547486#comment-16547486 ] Jim Ferenczi commented on LUCENE-8401: -- I like the approach here. A few comments: * The text

[jira] [Commented] (LUCENE-8306) Allow iteration over the term positions of a Match

2018-07-18 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16548104#comment-16548104 ] Jim Ferenczi commented on LUCENE-8306: -- Would it be easier if getSubMatches returns null when

[jira] [Comment Edited] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545634#comment-16545634 ] Jim Ferenczi edited comment on LUCENE-8402 at 7/16/18 7:09 PM: --- Since it

[jira] [Commented] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545634#comment-16545634 ] Jim Ferenczi commented on LUCENE-8402: -- Since it is a deprecated function I don't think we should

[jira] [Issue Comment Deleted] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8402: - Comment: was deleted (was: Found by the build in

[jira] [Updated] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8402: - Attachment: LUCENE-8402.patch > TestPriorityQueue failures > -- > >

[jira] [Commented] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545121#comment-16545121 ] Jim Ferenczi commented on LUCENE-8402: -- Here is a patch that removes the assertions around reused

[jira] [Commented] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545158#comment-16545158 ] Jim Ferenczi commented on LUCENE-8402: -- Found by the build in

[jira] [Commented] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545159#comment-16545159 ] Jim Ferenczi commented on LUCENE-8402: -- Found by the build in

[jira] [Created] (LUCENE-8402) TestPriorityQueue failures

2018-07-16 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8402: Summary: TestPriorityQueue failures Key: LUCENE-8402 URL: https://issues.apache.org/jira/browse/LUCENE-8402 Project: Lucene - Core Issue Type: Test

[jira] [Updated] (LUCENE-8204) ReqOptSumScorer should leverage sub scorers' per-block max scores

2018-07-25 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8204: - Attachment: LUCENE-8204.patch > ReqOptSumScorer should leverage sub scorers' per-block max

[jira] [Commented] (LUCENE-8204) ReqOptSumScorer should leverage sub scorers' per-block max scores

2018-07-25 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16555729#comment-16555729 ] Jim Ferenczi commented on LUCENE-8204: -- Here is a patch that implements the block skipping logic. I

[jira] [Commented] (LUCENE-8476) Optimizations in UserDictionary (KoreanAnalyzer)

2018-09-04 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16603349#comment-16603349 ] Jim Ferenczi commented on LUCENE-8476: -- Thanks [~danmuzi] ! The new patch looks good, I'll commit

[jira] [Commented] (SOLR-12655) Add Korean analyzer JAR file (NORI) and schema.xml example to Solr

2018-09-05 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604160#comment-16604160 ] Jim Ferenczi commented on SOLR-12655: - [~y100421] we use the mecab-ko-dic-2.0.3-20170922 version for

[jira] [Commented] (LUCENE-8382) Don't propagate calls to setMinCompetitiveScore in MultiCollector

2018-07-04 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16532681#comment-16532681 ] Jim Ferenczi commented on LUCENE-8382: -- +1 > Don't propagate calls to setMinCompetitiveScore in

[jira] [Closed] (LUCENE-7638) Optimize graph query produced by QueryBuilder

2018-01-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi closed LUCENE-7638. > Optimize graph query produced by QueryBuilder > - > >

[jira] [Closed] (LUCENE-7699) Apply graph articulation points optimization to phrase graph queries

2018-01-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi closed LUCENE-7699. > Apply graph articulation points optimization to phrase graph queries >

[jira] [Assigned] (LUCENE-8137) GraphTokenStreamFiniteStrings does not handle position inc > 1 in multi-word synoyms

2018-01-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi reassigned LUCENE-8137: Assignee: Jim Ferenczi > GraphTokenStreamFiniteStrings does not handle position inc > 1

[jira] [Created] (LUCENE-8137) GraphTokenStreamFiniteStrings does not handle position inc > 1 in multi-word synoyms

2018-01-24 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8137: Summary: GraphTokenStreamFiniteStrings does not handle position inc > 1 in multi-word synoyms Key: LUCENE-8137 URL: https://issues.apache.org/jira/browse/LUCENE-8137

[jira] [Resolved] (LUCENE-8199) TestBackwardsCompatibility#testAllVersionsTested should fail if the version of a bwc index is missing

2018-03-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi resolved LUCENE-8199. -- Resolution: Won't Fix > TestBackwardsCompatibility#testAllVersionsTested should fail if the

[jira] [Commented] (LUCENE-8199) TestBackwardsCompatibility#testAllVersionsTested should fail if the version of a bwc index is missing

2018-03-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16395179#comment-16395179 ] Jim Ferenczi commented on LUCENE-8199: -- Argh, scratch that, this is only true for bugfix releases.

[jira] [Closed] (LUCENE-8199) TestBackwardsCompatibility#testAllVersionsTested should fail if the version of a bwc index is missing

2018-03-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi closed LUCENE-8199. > TestBackwardsCompatibility#testAllVersionsTested should fail if the version > of a bwc index is

[jira] [Created] (LUCENE-8199) TestBackwardsCompatibility#testAllVersionsTested should fail if the version of a bwc index is missing

2018-03-12 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8199: Summary: TestBackwardsCompatibility#testAllVersionsTested should fail if the version of a bwc index is missing Key: LUCENE-8199 URL:

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-03-08 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16391723#comment-16391723 ] Jim Ferenczi commented on LUCENE-8196: -- {quote} I was a bit annoyed to see the field masking hack

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-03-09 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393188#comment-16393188 ] Jim Ferenczi commented on LUCENE-8196: -- {quote} I'd rather keep the API as it is, with the field

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-10 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16432078#comment-16432078 ] Jim Ferenczi commented on LUCENE-8231: -- I attached a new patch that fixes an issue with offsets of

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-10 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435420#comment-16435420 ] Jim Ferenczi commented on LUCENE-8231: -- Thanks Robert. I attached a new patch that changes the enum

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436001#comment-16436001 ] Jim Ferenczi commented on LUCENE-8231: -- I agree this will also simplify the understanding of these

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436019#comment-16436019 ] Jim Ferenczi commented on LUCENE-8231: -- No because FilteringTokenFilter doesn't handle

[jira] [Created] (LUCENE-8250) Should FilteringTokenFilter handle positionLength

2018-04-12 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8250: Summary: Should FilteringTokenFilter handle positionLength Key: LUCENE-8250 URL: https://issues.apache.org/jira/browse/LUCENE-8250 Project: Lucene - Core

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436042#comment-16436042 ] Jim Ferenczi commented on LUCENE-8231: -- I think that the Japanese analyzer has the same issue and

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435922#comment-16435922 ] Jim Ferenczi commented on LUCENE-8231: -- Sure, I added two more ctr in the last patch, one with

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436066#comment-16436066 ] Jim Ferenczi commented on LUCENE-8231: -- Ok I'll restore the KoreanPartOfSpeechStopFilter then and we

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435802#comment-16435802 ] Jim Ferenczi commented on LUCENE-8231: -- Right, I changed the Analyzer but not the Tokenizer. I

[jira] [Comment Edited] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436123#comment-16436123 ] Jim Ferenczi edited comment on LUCENE-8231 at 4/12/18 6:42 PM: --- I attached

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436123#comment-16436123 ] Jim Ferenczi commented on LUCENE-8231: -- I attached a new patch that restores the

[jira] [Updated] (LUCENE-8250) Should FilteringTokenFilter handle positionLength

2018-04-13 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8250: - Attachment: LUCENE-8250.patch > Should FilteringTokenFilter handle positionLength >

[jira] [Commented] (LUCENE-8250) Should FilteringTokenFilter handle positionLength

2018-04-13 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436960#comment-16436960 ] Jim Ferenczi commented on LUCENE-8250: -- I attached a small test that I hope illustrates the issue.

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-13 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437065#comment-16437065 ] Jim Ferenczi commented on LUCENE-8231: -- Thanks a lot Robert ! Any objections to backport to 7x ? >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-12 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435755#comment-16435755 ] Jim Ferenczi commented on LUCENE-8231: -- I attached a new patch that passes precommit checks. The

[jira] [Resolved] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-13 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi resolved LUCENE-8231. -- Resolution: Fixed Fix Version/s: master (8.0) 7.4 Thanks Robert and

[jira] [Commented] (LUCENE-8255) Can we make index sorting work for soft deletes

2018-04-16 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16439948#comment-16439948 ] Jim Ferenczi commented on LUCENE-8255: -- {quote} This also means that sorting such a segment on merge

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-04-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16449409#comment-16449409 ] Jim Ferenczi commented on LUCENE-8196: -- I don't think we should prevent anything ;). *unordered* is

[jira] [Updated] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-04-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8196: - Attachment: LUCENE-8196-debug.patch > Add IntervalQuery and IntervalsSource to expose minimum

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-04-24 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16450039#comment-16450039 ] Jim Ferenczi commented on LUCENE-8196: -- I don't think an operator can prevent anything here, a query

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-30 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420467#comment-16420467 ] Jim Ferenczi commented on LUCENE-8231: -- I attached a new patch that adds a better compression for

[jira] [Comment Edited] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-30 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420467#comment-16420467 ] Jim Ferenczi edited comment on LUCENE-8231 at 3/30/18 12:59 PM: I

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-30 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: (was: LUCENE-8231.patch) > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Created] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8231: Summary: Godori, a Korean analyzer based on mecab-ko-dic Key: LUCENE-8231 URL: https://issues.apache.org/jira/browse/LUCENE-8231 Project: Lucene - Core

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: (was: LUCENE-8231) > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-28 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231 > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418695#comment-16418695 ] Jim Ferenczi commented on LUCENE-8196: -- +1 > Add IntervalQuery and IntervalsSource to expose

[jira] [Commented] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418690#comment-16418690 ] Jim Ferenczi commented on LUCENE-8231: -- Thanks for looking Robert ! {quote} Should there be a

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Summary: Nori, a Korean analyzer based on mecab-ko-dic (was: Godori, a Korean analyzer based on

[jira] [Updated] (LUCENE-8231) Godori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Godori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Description: There is a dictionary similar to IPADIC but for Korean called mecab-ko-dic: It is

[jira] [Commented] (LUCENE-8229) Add a method to Weight to retrieve matches for a single document

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418708#comment-16418708 ] Jim Ferenczi commented on LUCENE-8229: -- I like the proposal here. For simple queries it makes the

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1641#comment-1641 ] Jim Ferenczi commented on LUCENE-8231: -- {quote} and looking more, you'd need full byte range to do

[jira] [Comment Edited] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999 ] Jim Ferenczi edited comment on LUCENE-8231 at 4/2/18 7:33 AM: -- Hi Robert,

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999 ] Jim Ferenczi commented on LUCENE-8231: -- Hi Robert, thanks for your testing and suggestions ! I

[jira] [Comment Edited] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999 ] Jim Ferenczi edited comment on LUCENE-8231 at 4/2/18 8:32 AM: -- Hi Robert,

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: (was: LUCENE-8231.patch) > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-02 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422791#comment-16422791 ] Jim Ferenczi commented on LUCENE-8231: -- I attached a new patch with lots of cleanups and fixes. I

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231-remap-hangul.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16419178#comment-16419178 ] Jim Ferenczi commented on LUCENE-8231: -- Sure I attached a new patch (LUCENE-8231-remap-hangul.patch)

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-03-29 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16419121#comment-16419121 ] Jim Ferenczi commented on LUCENE-8231: -- I tried this approach and generated a new FST with the remap

[jira] [Commented] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-04 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16425218#comment-16425218 ] Jim Ferenczi commented on LUCENE-8231: -- Hi Robert, I pushed another iteration that moves the

[jira] [Updated] (LUCENE-8231) Nori, a Korean analyzer based on mecab-ko-dic

2018-04-04 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Ferenczi updated LUCENE-8231: - Attachment: LUCENE-8231.patch > Nori, a Korean analyzer based on mecab-ko-dic >

[jira] [Commented] (LUCENE-8202) Add a FixedShingleFilter

2018-03-22 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16409390#comment-16409390 ] Jim Ferenczi commented on LUCENE-8202: -- sure +1 for the exception, I don't think that this limit

[jira] [Commented] (LUCENE-8202) Add a FixedShingleFilter

2018-03-22 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16409382#comment-16409382 ] Jim Ferenczi commented on LUCENE-8202: -- +1 to set position length to 1, this is a fixed size shingle

[jira] [Commented] (LUCENE-8202) Add a FixedShingleFilter

2018-03-21 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16407723#comment-16407723 ] Jim Ferenczi commented on LUCENE-8202: -- +1, thanks Alan. > Add a FixedShingleFilter >

[jira] [Commented] (LUCENE-8196) Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

2018-03-19 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16404532#comment-16404532 ] Jim Ferenczi commented on LUCENE-8196: -- +1 too, there are some places where you could initialize the

[jira] [Commented] (LUCENE-8182) BoostingQuery applies the wrong boost to the query score

2018-03-01 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16382845#comment-16382845 ] Jim Ferenczi commented on LUCENE-8182: -- Thanks [~hossman] . I pushed a commit to add the missing

[jira] [Created] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8526: Summary: StandardTokenizer doesn't separate hangul characters from other non-CJK chars Key: LUCENE-8526 URL: https://issues.apache.org/jira/browse/LUCENE-8526

[jira] [Commented] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640245#comment-16640245 ] Jim Ferenczi commented on LUCENE-8526: -- Ok thanks for explaining [~steve_rowe]. I thought that

[jira] [Commented] (LUCENE-8526) StandardTokenizer doesn't separate hangul characters from other non-CJK chars

2018-10-05 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640282#comment-16640282 ] Jim Ferenczi commented on LUCENE-8526: -- Sounds great [~steve_rowe]. I'll prepare a patch. >

[jira] [Created] (LUCENE-8529) Use the completion key to tiebreak completion suggestion

2018-10-11 Thread Jim Ferenczi (JIRA)
Jim Ferenczi created LUCENE-8529: Summary: Use the completion key to tiebreak completion suggestion Key: LUCENE-8529 URL: https://issues.apache.org/jira/browse/LUCENE-8529 Project: Lucene - Core

[jira] [Commented] (LUCENE-8531) QueryBuilder hard-codes inOrder=true for generated sloppy span near queries

2018-10-15 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16650813#comment-16650813 ] Jim Ferenczi commented on LUCENE-8531: -- (Multi)PhraseQuery-s allows some reordering but the

[jira] [Commented] (LUCENE-8535) Should we drop support for highlighting block-join queries

2018-10-17 Thread Jim Ferenczi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654045#comment-16654045 ] Jim Ferenczi commented on LUCENE-8535: -- +1 to support this through the extension points. We can add
