[
https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16573030#comment-16573030
]
Jim Ferenczi commented on LUCENE-8204:
--
Thanks Adrien, I pushed a new patch that addresses your
[
https://issues.apache.org/jira/browse/LUCENE-8439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi resolved LUCENE-8439.
--
Resolution: Fixed
Fix Version/s: master (8.0)
Thanks [~jpountz] !
>
[
https://issues.apache.org/jira/browse/LUCENE-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16579721#comment-16579721
]
Jim Ferenczi commented on LUCENE-8448:
--
We've tried several things with Adrien to optimize the
[
https://issues.apache.org/jira/browse/LUCENE-8448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8448:
-
Attachment: LUCENE-8448.patch
> Slowdown of nested boolean queries after LUCENE-8060
>
[
https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16594050#comment-16594050
]
Jim Ferenczi commented on LUCENE-8466:
--
Thanks Tomás and sorry Vish for not adding you in the first
[
https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi resolved LUCENE-8466.
--
Resolution: Fixed
Fix Version/s: master (8.0)
7.5
Thanks Adrien and
[
https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8466:
-
Attachment: LUCENE-8466.patch
> FrozenBufferedUpdates#apply*Deletes is incorrect when index
[
https://issues.apache.org/jira/browse/LUCENE-8466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16593610#comment-16593610
]
Jim Ferenczi commented on LUCENE-8466:
--
Here is a patch that fixes delete by query. It seems that
[
https://issues.apache.org/jira/browse/LUCENE-8306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16550491#comment-16550491
]
Jim Ferenczi commented on LUCENE-8306:
--
+1, thanks [~romseygeek] , the patch looks good.
> Allow
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi resolved LUCENE-8402.
--
Resolution: Fixed
I removed the invalid assertions, thanks [~thetaphi].
> TestPriorityQueue
[
https://issues.apache.org/jira/browse/LUCENE-8401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16547486#comment-16547486
]
Jim Ferenczi commented on LUCENE-8401:
--
I like the approach here. A few comments:
* The text
[
https://issues.apache.org/jira/browse/LUCENE-8306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16548104#comment-16548104
]
Jim Ferenczi commented on LUCENE-8306:
--
Would it be easier if getSubMatches returns null when
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545634#comment-16545634
]
Jim Ferenczi edited comment on LUCENE-8402 at 7/16/18 7:09 PM:
---
Since it
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545634#comment-16545634
]
Jim Ferenczi commented on LUCENE-8402:
--
Since it is a deprecated function I don't think we should
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8402:
-
Comment: was deleted
(was: Found by the build in
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8402:
-
Attachment: LUCENE-8402.patch
> TestPriorityQueue failures
> --
>
>
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545121#comment-16545121
]
Jim Ferenczi commented on LUCENE-8402:
--
Here is a patch that removes the assertions around reused
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545158#comment-16545158
]
Jim Ferenczi commented on LUCENE-8402:
--
Found by the build in
[
https://issues.apache.org/jira/browse/LUCENE-8402?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16545159#comment-16545159
]
Jim Ferenczi commented on LUCENE-8402:
--
Found by the build in
Jim Ferenczi created LUCENE-8402:
Summary: TestPriorityQueue failures
Key: LUCENE-8402
URL: https://issues.apache.org/jira/browse/LUCENE-8402
Project: Lucene - Core
Issue Type: Test
[
https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8204:
-
Attachment: LUCENE-8204.patch
> ReqOptSumScorer should leverage sub scorers' per-block max
[
https://issues.apache.org/jira/browse/LUCENE-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16555729#comment-16555729
]
Jim Ferenczi commented on LUCENE-8204:
--
Here is a patch that implements the block skipping logic. I
[
https://issues.apache.org/jira/browse/LUCENE-8476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16603349#comment-16603349
]
Jim Ferenczi commented on LUCENE-8476:
--
Thanks [~danmuzi] ! The new patch looks good, I'll commit
[
https://issues.apache.org/jira/browse/SOLR-12655?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604160#comment-16604160
]
Jim Ferenczi commented on SOLR-12655:
-
[~y100421] we use the mecab-ko-dic-2.0.3-20170922 version for
[
https://issues.apache.org/jira/browse/LUCENE-8382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16532681#comment-16532681
]
Jim Ferenczi commented on LUCENE-8382:
--
+1
> Don't propagate calls to setMinCompetitiveScore in
[
https://issues.apache.org/jira/browse/LUCENE-7638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi closed LUCENE-7638.
> Optimize graph query produced by QueryBuilder
> -
>
>
[
https://issues.apache.org/jira/browse/LUCENE-7699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi closed LUCENE-7699.
> Apply graph articulation points optimization to phrase graph queries
>
[
https://issues.apache.org/jira/browse/LUCENE-8137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi reassigned LUCENE-8137:
Assignee: Jim Ferenczi
> GraphTokenStreamFiniteStrings does not handle position inc > 1
Jim Ferenczi created LUCENE-8137:
Summary: GraphTokenStreamFiniteStrings does not handle position
inc > 1 in multi-word synonyms
Key: LUCENE-8137
URL: https://issues.apache.org/jira/browse/LUCENE-8137
[
https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi resolved LUCENE-8199.
--
Resolution: Won't Fix
> TestBackwardsCompatibility#testAllVersionsTested should fail if the
[
https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16395179#comment-16395179
]
Jim Ferenczi commented on LUCENE-8199:
--
Argh, scratch that, this is only true for bugfix releases.
[
https://issues.apache.org/jira/browse/LUCENE-8199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi closed LUCENE-8199.
> TestBackwardsCompatibility#testAllVersionsTested should fail if the version
> of a bwc index is
Jim Ferenczi created LUCENE-8199:
Summary: TestBackwardsCompatibility#testAllVersionsTested should
fail if the version of a bwc index is missing
Key: LUCENE-8199
URL:
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16391723#comment-16391723
]
Jim Ferenczi commented on LUCENE-8196:
--
{quote}
I was a bit annoyed to see the field masking hack
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393188#comment-16393188
]
Jim Ferenczi commented on LUCENE-8196:
--
{quote}
I'd rather keep the API as it is, with the field
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16432078#comment-16432078
]
Jim Ferenczi commented on LUCENE-8231:
--
I attached a new patch that fixes an issue with offsets of
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435420#comment-16435420
]
Jim Ferenczi commented on LUCENE-8231:
--
Thanks Robert.
I attached a new patch that changes the enum
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436001#comment-16436001
]
Jim Ferenczi commented on LUCENE-8231:
--
I agree this will also simplify the understanding of these
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436019#comment-16436019
]
Jim Ferenczi commented on LUCENE-8231:
--
No because FilteringTokenFilter doesn't handle
Jim Ferenczi created LUCENE-8250:
Summary: Should FilteringTokenFilter handle positionLength
Key: LUCENE-8250
URL: https://issues.apache.org/jira/browse/LUCENE-8250
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436042#comment-16436042
]
Jim Ferenczi commented on LUCENE-8231:
--
I think that the Japanese analyzer has the same issue and
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435922#comment-16435922
]
Jim Ferenczi commented on LUCENE-8231:
--
Sure, I added two more ctr in the last patch, one with
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436066#comment-16436066
]
Jim Ferenczi commented on LUCENE-8231:
--
Ok I'll restore the KoreanPartOfSpeechStopFilter then and we
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435802#comment-16435802
]
Jim Ferenczi commented on LUCENE-8231:
--
Right, I changed the Analyzer but not the Tokenizer. I
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436123#comment-16436123
]
Jim Ferenczi edited comment on LUCENE-8231 at 4/12/18 6:42 PM:
---
I attached
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436123#comment-16436123
]
Jim Ferenczi commented on LUCENE-8231:
--
I attached a new patch that restores the
[
https://issues.apache.org/jira/browse/LUCENE-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8250:
-
Attachment: LUCENE-8250.patch
> Should FilteringTokenFilter handle positionLength
>
[
https://issues.apache.org/jira/browse/LUCENE-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436960#comment-16436960
]
Jim Ferenczi commented on LUCENE-8250:
--
I attached a small test that I hope illustrates the issue.
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437065#comment-16437065
]
Jim Ferenczi commented on LUCENE-8231:
--
Thanks a lot Robert ! Any objections to backporting to 7x ?
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16435755#comment-16435755
]
Jim Ferenczi commented on LUCENE-8231:
--
I attached a new patch that passes precommit checks. The
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi resolved LUCENE-8231.
--
Resolution: Fixed
Fix Version/s: master (8.0)
7.4
Thanks Robert and
[
https://issues.apache.org/jira/browse/LUCENE-8255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16439948#comment-16439948
]
Jim Ferenczi commented on LUCENE-8255:
--
{quote}
This also means that sorting such a segment on merge
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16449409#comment-16449409
]
Jim Ferenczi commented on LUCENE-8196:
--
I don't think we should prevent anything ;). *unordered* is
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8196:
-
Attachment: LUCENE-8196-debug.patch
> Add IntervalQuery and IntervalsSource to expose minimum
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16450039#comment-16450039
]
Jim Ferenczi commented on LUCENE-8196:
--
I don't think an operator can prevent anything here, a query
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420467#comment-16420467
]
Jim Ferenczi commented on LUCENE-8231:
--
I attached a new patch that adds a better compression for
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16420467#comment-16420467
]
Jim Ferenczi edited comment on LUCENE-8231 at 3/30/18 12:59 PM:
I
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: (was: LUCENE-8231.patch)
> Godori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Godori, a Korean analyzer based on mecab-ko-dic
>
Jim Ferenczi created LUCENE-8231:
Summary: Godori, a Korean analyzer based on mecab-ko-dic
Key: LUCENE-8231
URL: https://issues.apache.org/jira/browse/LUCENE-8231
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: (was: LUCENE-8231)
> Godori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Godori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231
> Godori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418695#comment-16418695
]
Jim Ferenczi commented on LUCENE-8196:
--
+1
> Add IntervalQuery and IntervalsSource to expose
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418690#comment-16418690
]
Jim Ferenczi commented on LUCENE-8231:
--
Thanks for looking Robert !
{quote}
Should there be a
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Summary: Nori, a Korean analyzer based on mecab-ko-dic (was: Godori, a
Korean analyzer based on
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Godori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Description:
There is a dictionary similar to IPADIC but for Korean called mecab-ko-dic:
It is
[
https://issues.apache.org/jira/browse/LUCENE-8229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16418708#comment-16418708
]
Jim Ferenczi commented on LUCENE-8229:
--
I like the proposal here. For simple queries it makes the
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1641#comment-1641
]
Jim Ferenczi commented on LUCENE-8231:
--
{quote}
and looking more, you'd need full byte range to do
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999
]
Jim Ferenczi edited comment on LUCENE-8231 at 4/2/18 7:33 AM:
--
Hi Robert,
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999
]
Jim Ferenczi commented on LUCENE-8231:
--
Hi Robert, thanks for your testing and suggestions !
I
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16421999#comment-16421999
]
Jim Ferenczi edited comment on LUCENE-8231 at 4/2/18 8:32 AM:
--
Hi Robert,
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: (was: LUCENE-8231.patch)
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16422791#comment-16422791
]
Jim Ferenczi commented on LUCENE-8231:
--
I attached a new patch with lots of cleanups and fixes. I
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231-remap-hangul.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16419178#comment-16419178
]
Jim Ferenczi commented on LUCENE-8231:
--
Sure I attached a new patch (LUCENE-8231-remap-hangul.patch)
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16419121#comment-16419121
]
Jim Ferenczi commented on LUCENE-8231:
--
I tried this approach and generated a new FST with the remap
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16425218#comment-16425218
]
Jim Ferenczi commented on LUCENE-8231:
--
Hi Robert,
I pushed another iteration that moves the
[
https://issues.apache.org/jira/browse/LUCENE-8231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jim Ferenczi updated LUCENE-8231:
-
Attachment: LUCENE-8231.patch
> Nori, a Korean analyzer based on mecab-ko-dic
>
[
https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16409390#comment-16409390
]
Jim Ferenczi commented on LUCENE-8202:
--
sure +1 for the exception, I don't think that this limit
[
https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16409382#comment-16409382
]
Jim Ferenczi commented on LUCENE-8202:
--
+1 to set position length to 1, this is a fixed size shingle
[
https://issues.apache.org/jira/browse/LUCENE-8202?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16407723#comment-16407723
]
Jim Ferenczi commented on LUCENE-8202:
--
+1, thanks Alan.
> Add a FixedShingleFilter
>
[
https://issues.apache.org/jira/browse/LUCENE-8196?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16404532#comment-16404532
]
Jim Ferenczi commented on LUCENE-8196:
--
+1 too, there are some places where you could initialize the
[
https://issues.apache.org/jira/browse/LUCENE-8182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16382845#comment-16382845
]
Jim Ferenczi commented on LUCENE-8182:
--
Thanks [~hossman] . I pushed a commit to add the missing
Jim Ferenczi created LUCENE-8526:
Summary: StandardTokenizer doesn't separate hangul characters from
other non-CJK chars
Key: LUCENE-8526
URL: https://issues.apache.org/jira/browse/LUCENE-8526
[
https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640245#comment-16640245
]
Jim Ferenczi commented on LUCENE-8526:
--
Ok thanks for explaining [~steve_rowe]. I thought that
[
https://issues.apache.org/jira/browse/LUCENE-8526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16640282#comment-16640282
]
Jim Ferenczi commented on LUCENE-8526:
--
Sounds great [~steve_rowe]. I'll prepare a patch.
>
Jim Ferenczi created LUCENE-8529:
Summary: Use the completion key to tiebreak completion suggestion
Key: LUCENE-8529
URL: https://issues.apache.org/jira/browse/LUCENE-8529
Project: Lucene - Core
[
https://issues.apache.org/jira/browse/LUCENE-8531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16650813#comment-16650813
]
Jim Ferenczi commented on LUCENE-8531:
--
(Multi)PhraseQuery-s allow some reordering but the
[
https://issues.apache.org/jira/browse/LUCENE-8535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654045#comment-16654045
]
Jim Ferenczi commented on LUCENE-8535:
--
+1 to support this through the extension points. We can add