[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16196975#comment-16196975
]
ASF GitHub Bot commented on LUCENE-7287:
GitHub user arysin closed the pull request at:
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15630171#comment-15630171
]
Cassandra Targett commented on LUCENE-7287:
---
I missed it last go-around. I don't know if I will
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15628837#comment-15628837
]
Andriy Rysin commented on LUCENE-7287:
--
Cassandra, it looks like 6.2 is out; could you please add
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15365063#comment-15365063
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Michael, much appreciated!
> New lemma-tizer plugin
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15364098#comment-15364098
]
Michael McCandless commented on LUCENE-7287:
[~arysin], I pushed the normalization changes
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15364096#comment-15364096
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15364093#comment-15364093
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15360116#comment-15360116
]
Michael McCandless commented on LUCENE-7287:
[~arysin] thank you! I'll merge this likely
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15358188#comment-15358188
]
Andriy Rysin commented on LUCENE-7287:
--
Hey [~mikemccand], can we please merge the pull request
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15348927#comment-15348927
]
Andriy Rysin commented on LUCENE-7287:
--
OK, I was able to run Solr with the Ukrainian analyzer and I can
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15348895#comment-15348895
]
ASF GitHub Bot commented on LUCENE-7287:
GitHub user arysin opened a pull request:
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15348585#comment-15348585
]
Andriy Rysin commented on LUCENE-7287:
--
I've created the dictionary that collapses token+lemma in
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15347153#comment-15347153
]
Andriy Rysin commented on LUCENE-7287:
--
Ok, then I'll prepare the changes as part of this ticket.
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346949#comment-15346949
]
Ahmet Arslan commented on LUCENE-7287:
--
This is a new feature that has never been released; a new ticket may
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346878#comment-15346878
]
Andriy Rysin commented on LUCENE-7287:
--
Hmm, that does not look right. Yes we can either use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346875#comment-15346875
]
Ahmet Arslan commented on LUCENE-7287:
--
Hi,
Multiple tokens are OK, but multiple identical tokens look
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346862#comment-15346862
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Ahmet!
Shall I create mappings_uk.txt so we can use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346857#comment-15346857
]
Ahmet Arslan commented on LUCENE-7287:
--
Please see screenshots in the attachments section at the
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346816#comment-15346816
]
Ahmet Arslan commented on LUCENE-7287:
--
Hi,
I was able to run the analyzer successfully. Without
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15346413#comment-15346413
]
Andriy Rysin commented on LUCENE-7287:
--
Sure, I can add a comment, but I guess I need to test the
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344484#comment-15344484
]
Cassandra Targett commented on LUCENE-7287:
---
If you do that (make a comment on the page with
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344403#comment-15344403
]
Ahmet Arslan commented on LUCENE-7287:
--
Only committers have rights to edit the Confluence wiki.
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344396#comment-15344396
]
Andriy Rysin commented on LUCENE-7287:
--
I've logged into cwiki but I don't seem to have rights to
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344290#comment-15344290
]
Ahmet Arslan commented on LUCENE-7287:
--
I think you, as the author of Ukrainian. Thanks!
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15344252#comment-15344252
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks Ahmet, that looks good! Would you add/push those
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15343877#comment-15343877
]
Ahmet Arslan commented on LUCENE-7287:
--
So, Solr field type counterpart of this analyzer would be
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15343258#comment-15343258
]
Andriy Rysin commented on LUCENE-7287:
--
I don't know much about Solr, but I think
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15342944#comment-15342944
]
Ahmet Arslan commented on LUCENE-7287:
--
Can we use this analyzer in Solr?
{code:xml}
{code}
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341899#comment-15341899
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341896#comment-15341896
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341848#comment-15341848
]
Michael McCandless commented on LUCENE-7287:
[~thetaphi] oh yeah I'll fix that!
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341653#comment-15341653
]
Uwe Schindler commented on LUCENE-7287:
---
Mike: Can you remove the absolute path here?
{code:java}
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341444#comment-15341444
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341432#comment-15341432
]
ASF subversion and git services commented on LUCENE-7287:
-
Commit
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15341398#comment-15341398
]
Michael McCandless commented on LUCENE-7287:
Thanks [~arysin], I'll tweak the javadocs for
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340996#comment-15340996
]
Andriy Rysin commented on LUCENE-7287:
--
Looks cool, thanks a lot Michael!
I wonder if we should add
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338468#comment-15338468
]
Michael McCandless commented on LUCENE-7287:
bq. Or we could put it under analysis/morfologik
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337982#comment-15337982
]
Andriy Rysin commented on LUCENE-7287:
--
I guess it does not fit under analysis/common as it depends
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336280#comment-15336280
]
Michael McCandless commented on LUCENE-7287:
[~arysin] I think this looks nice, thank you! I
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333888#comment-15333888
]
Andriy Rysin commented on LUCENE-7287:
--
[~mikemccand], [~iorixxx] does this implementation look good
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326753#comment-15326753
]
Andriy Rysin commented on LUCENE-7287:
--
Thanks for the hint, I've changed the code to use
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326628#comment-15326628
]
Ahmet Arslan commented on LUCENE-7287:
--
Maybe MappingCharFilter could be used instead of a token
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326624#comment-15326624
]
Andriy Rysin commented on LUCENE-7287:
--
I've added a token filter for Unicode apostrophes and stress
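(Illustration, not part of the original comment.) The kind of normalization discussed here can be sketched in plain Java without any Lucene dependency. The class name, the method name, and the exact set of mapped characters below are assumptions made for the example; the truncated snippet does not say which characters the actual filter handles, and the thread later suggests Lucene's MappingCharFilter rather than hand-written code like this.

```java
import java.util.Map;

public class ApostropheNormalizer {
    // Hypothetical mapping table (an assumption, not taken from the thread):
    // typographic apostrophes become the ASCII apostrophe, and the combining
    // acute accent used as a stress mark is dropped.
    private static final Map<Character, String> MAPPINGS = Map.of(
        '\u2019', "'",   // U+2019 RIGHT SINGLE QUOTATION MARK
        '\u02BC', "'",   // U+02BC MODIFIER LETTER APOSTROPHE
        '\u0301', ""     // U+0301 COMBINING ACUTE ACCENT (stress mark)
    );

    // Replaces every mapped character and copies all others unchanged.
    public static String normalize(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            String replacement = MAPPINGS.get(c);
            out.append(replacement != null ? replacement : String.valueOf(c));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // A word written with a typographic apostrophe.
        System.out.println(normalize("п\u2019ять"));
        // A word carrying a stress mark on the first vowel.
        System.out.println(normalize("за\u0301мок"));
    }
}
```

Doing this at the character level, before tokenization, means the tokenizer and the dictionary lookup both see the same normalized text, which is the usual argument for a char filter over a token filter.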
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15326066#comment-15326066
]
Andriy Rysin commented on LUCENE-7287:
--
OK, guys, I've created a little project with Ukrainian
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15324651#comment-15324651
]
Michael McCandless commented on LUCENE-7287:
Thanks for the detailed analysis [~arysin]! On
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15323800#comment-15323800
]
Andriy Rysin commented on LUCENE-7287:
--
OK, I've imported lucene-solr and the Ukrainian analyzer
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15318587#comment-15318587
]
Michael McCandless commented on LUCENE-7287:
That sounds like a great solution [~arysin]!
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15317807#comment-15317807
]
Andriy Rysin commented on LUCENE-7287:
--
I just realized that Lucene includes a Morfologik analyzer
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306947#comment-15306947
]
Andriy Rysin commented on LUCENE-7287:
--
From my point of view we can use dict_uk as a source for
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306770#comment-15306770
]
Dmitry Chaplinsky commented on LUCENE-7287:
---
I really want this project to happen.
[~iorixxx],
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304009#comment-15304009
]
Andriy Rysin commented on LUCENE-7287:
--
BTW how does hunspell stemming work for "exceptions"? There
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15304000#comment-15304000
]
Andriy Rysin commented on LUCENE-7287:
--
So do we need to build a hunspell dictionary (this may take me
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15299381#comment-15299381
]
Ahmet Arslan commented on LUCENE-7287:
--
This looks like a wrapper for string to string mapping. No
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15299149#comment-15299149
]
Michael McCandless commented on LUCENE-7287:
bq. There's no alternative open dictionary for
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298680#comment-15298680
]
Andriy Rysin commented on LUCENE-7287:
--
There's no alternative open dictionary for Ukrainian with
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298624#comment-15298624
]
Michael McCandless commented on LUCENE-7287:
bq. The dictionary originally is coming from
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298505#comment-15298505
]
Dmytro Hambal commented on LUCENE-7287:
---
[~mikemccand] speaking of this data file, we had an idea
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15298470#comment-15298470
]
Andriy Rysin commented on LUCENE-7287:
--
Quick check via jvisualvm shows ~400MB used by the
[
https://issues.apache.org/jira/browse/LUCENE-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15293384#comment-15293384
]
Michael McCandless commented on LUCENE-7287:
Thanks [~mr_gambal], this sounds nice! The