[jira] [Commented] (LUCENE-7398) Nested Span Queries are buggy

2017-01-16 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15824303#comment-15824303 ] Artem Lukanin commented on LUCENE-7398: --- The patch has a bug. The following sentence is not found,

[jira] [Commented] (LUCENE-7398) Nested Span Queries are buggy

2017-01-16 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15823865#comment-15823865 ] Artem Lukanin commented on LUCENE-7398: --- Actually testNestedOrQuery4 works if I setSlop(3). I

[jira] [Commented] (LUCENE-7398) Nested Span Queries are buggy

2017-01-16 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-7398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15823633#comment-15823633 ] Artem Lukanin commented on LUCENE-7398: --- This issue describes only a partial problem, when 2

[jira] [Commented] (LUCENE-6590) Explore different ways to apply boosts

2016-12-22 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15769633#comment-15769633 ] Artem Lukanin commented on LUCENE-6590: --- Actually, we need SpanOrWeight. It was possible to create

[jira] [Commented] (LUCENE-6590) Explore different ways to apply boosts

2016-12-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-6590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15766752#comment-15766752 ] Artem Lukanin commented on LUCENE-6590: --- Now it is impossible to extend `SpanOrQuery` to make

[jira] [Commented] (SOLR-4533) Synonyms, created in custom filters are ignored after tokenizers.

2016-11-16 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15670656#comment-15670656 ] Artem Lukanin commented on SOLR-4533: - Sorry for not providing a test case 3 years ago, but I'm not in

[jira] [Created] (LUCENE-5433) Double boosting in BooleanQuery.rewrite

2014-02-04 Thread Artem Lukanin (JIRA)
Artem Lukanin created LUCENE-5433: - Summary: Double boosting in BooleanQuery.rewrite Key: LUCENE-5433 URL: https://issues.apache.org/jira/browse/LUCENE-5433 Project: Lucene - Core Issue

[jira] [Commented] (SOLR-5634) getNGroups returns null if group.format=simple and group.ngroups=true

2014-02-02 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-5634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13888927#comment-13888927 ] Artem Lukanin commented on SOLR-5634: - I am glad to help! getNGroups returns null if

[jira] [Created] (SOLR-5651) Make labels clickable and the output wrapped in UI

2014-01-21 Thread Artem Lukanin (JIRA)
Artem Lukanin created SOLR-5651: --- Summary: Make labels clickable and the output wrapped in UI Key: SOLR-5651 URL: https://issues.apache.org/jira/browse/SOLR-5651 Project: Solr Issue Type: Bug

[jira] [Updated] (SOLR-5651) Make labels clickable and the output wrapped in UI

2014-01-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-5651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated SOLR-5651: Attachment: SOLR-5651_clickable_labels.patch The patch resolves the issue. Make labels clickable

[jira] [Updated] (SOLR-5651) Make labels clickable and the output wrapped in UI

2014-01-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-5651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated SOLR-5651: Attachment: SOLR-5651_clickable_labels.patch added a space after checkboxes Make labels clickable

[jira] [Updated] (SOLR-5634) getNGroups returns null if group.format=simple and group.ngroups=true

2014-01-15 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-5634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated SOLR-5634: Attachment: SOLR-5634.patch Attached a PATCH, resolving the issue. getNGroups returns null if

[jira] [Created] (SOLR-5634) getNGroups returns null if group.format=simple and group.ngroups=true

2014-01-14 Thread Artem Lukanin (JIRA)
Artem Lukanin created SOLR-5634: --- Summary: getNGroups returns null if group.format=simple and group.ngroups=true Key: SOLR-5634 URL: https://issues.apache.org/jira/browse/SOLR-5634 Project: Solr

[jira] [Commented] (LUCENE-4963) Deprecate broken TokenFilter constructors

2013-07-26 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13720760#comment-13720760 ] Artem Lukanin commented on LUCENE-4963: --- There should be an option in

[jira] [Commented] (LUCENE-4963) Deprecate broken TokenFilter constructors

2013-07-26 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13720762#comment-13720762 ] Artem Lukanin commented on LUCENE-4963: --- There should be an option in

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13712488#comment-13712488 ] Artem Lukanin commented on LUCENE-5030: --- Great! Thanks for reviewing.

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13710797#comment-13710797 ] Artem Lukanin commented on LUCENE-5030: --- Then I have to override (and copy a lot of

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: LUCENE-5030.patch Moved the parameter from AnalyzingLookupFactory to

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13710889#comment-13710889 ] Artem Lukanin commented on LUCENE-5030: --- Michael, I got your idea. I will refactor

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: LUCENE-5030.patch The code is refactored not to touch AnalyzingSuggester. Please,

[jira] [Commented] (LUCENE-4845) Add AnalyzingInfixSuggester

2013-07-08 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13701814#comment-13701814 ] Artem Lukanin commented on LUCENE-4845: --- I guess, there should be an

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-04 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: LUCENE-5030.patch I have renamed the variables in comments and tests for

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-03 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699304#comment-13699304 ] Artem Lukanin commented on LUCENE-5030: --- in ant precommit I get this error:

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-02 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: LUCENE-5030.patch The javadocs are fixed. FuzzySuggester has to

[jira] [Commented] (LUCENE-5012) Make graph-based TokenFilters easier

2013-07-01 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13696626#comment-13696626 ] Artem Lukanin commented on LUCENE-5012: --- I guess WordDelimiterFilter is a good

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-07-01 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13697537#comment-13697537 ] Artem Lukanin commented on LUCENE-5030: --- Cool! FuzzySuggester has

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-27 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: LUCENE-5030.patch Done. Please, review LUCENE-5030.patch

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-27 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13694610#comment-13694610 ] Artem Lukanin commented on LUCENE-5030: --- BTW, for your {code}// TODO: is there a

[jira] [Updated] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-27 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5051: -- Description: If there are 2 abbreviation synonyms in the stream, they are not treated as

[jira] [Updated] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-27 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5051: -- Attachment: LUCENE-5051.patch I've added a test, demonstrating the bug.

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-26 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester_combo1.patch Sorry, I don't understand, why

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-26 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester_combo2.patch I have restored testStolenBytes completely

[jira] [Comment Edited] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-26 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13693867#comment-13693867 ] Artem Lukanin edited comment on LUCENE-5030 at 6/26/13 9:35 AM:

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-24 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691744#comment-13691744 ] Artem Lukanin commented on LUCENE-5030: --- I have added UNICODE_AWARE option in

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-24 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester_combo.patch I have uploaded a lucene/solr combo patch

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13690102#comment-13690102 ] Artem Lukanin commented on LUCENE-5030: --- I'm uploading 3 results of benchmarking:

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: benchmark-wo_convertion.txt benchmark-old.txt

[jira] [Comment Edited] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13690102#comment-13690102 ] Artem Lukanin edited comment on LUCENE-5030 at 6/21/13 7:49 AM:

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-21 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13690124#comment-13690124 ] Artem Lukanin commented on LUCENE-5030: --- OK, I will add a new option UNICODE_AWARE

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-20 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13688924#comment-13688924 ] Artem Lukanin commented on LUCENE-5030: --- I ran this command: {code}ant

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-20 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester.patch I used INFO_SEP and INFO_SEP2 for separators and

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-20 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13689231#comment-13689231 ] Artem Lukanin commented on LUCENE-5030: --- OK, in general the performance is worse

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-20 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13689296#comment-13689296 ] Artem Lukanin commented on LUCENE-5030: --- The last patch with INFO_SEP/2 was posted

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-19 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13687822#comment-13687822 ] Artem Lukanin commented on LUCENE-5030: --- I see, that some tests in

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-19 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester.patch now tests in FuzzySuggesterTest and

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-19 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13687917#comment-13687917 ] Artem Lukanin commented on LUCENE-5030: --- Possibly we should change it to INFO_SEP2

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13686447#comment-13686447 ] Artem Lukanin commented on LUCENE-5030: --- you already have private static final int

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester4.patch I have fixed testRandom, which repeats the logic

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester1.patch Now all the tests pass except testRandom when

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13685481#comment-13685481 ] Artem Lukanin commented on LUCENE-5030: --- Sorry for autoformatting, I will upload

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13685516#comment-13685516 ] Artem Lukanin commented on LUCENE-5030: --- BTW, if I replace it with

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester2.patch the patch without autoformatting

[jira] [Commented] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13685551#comment-13685551 ] Artem Lukanin commented on LUCENE-5030: --- I see, the patch still has autoformatting

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-17 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester3.patch with untouched trailing spaces

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-13 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Attachment: nonlatin_fuzzySuggester.patch I've added a test, which demonstrates the bug. I

[jira] [Created] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
Artem Lukanin created LUCENE-5051: - Summary: Incorrect abbreviation synonyms treating in WordDelimiterFilter Key: LUCENE-5051 URL: https://issues.apache.org/jira/browse/LUCENE-5051 Project: Lucene -

[jira] [Commented] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13680296#comment-13680296 ] Artem Lukanin commented on LUCENE-5051: --- Correct treatment: before and after

[jira] [Comment Edited] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13680296#comment-13680296 ] Artem Lukanin edited comment on LUCENE-5051 at 6/11/13 8:23 AM:

[jira] [Issue Comment Deleted] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5051: -- Comment: was deleted (was: Correct treatment: before and after WordDelimiterFilter: {code}

[jira] [Updated] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5051: -- Description: If there are 2 abbreviation synonyms in the stream, they are not treated as

[jira] [Updated] (LUCENE-5051) Incorrect abbreviation synonyms treating in WordDelimiterFilter

2013-06-11 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5051: -- Attachment: incorrect_synonym_treating_sample.patch a patch for Solr configs, which shows the

[jira] [Commented] (LUCENE-4991) QueryParser doesnt handle synonyms correctly for chinese

2013-06-07 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13677935#comment-13677935 ] Artem Lukanin commented on LUCENE-4991: --- I guess, this should fix my issue

[jira] [Created] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-03 Thread Artem Lukanin (JIRA)
Artem Lukanin created LUCENE-5030: - Summary: FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters Key: LUCENE-5030 URL:

[jira] [Updated] (LUCENE-5030) FuzzySuggester has to operate FSTs of Unicode-letters, not UTF-8, to work correctly for 1-byte (like English) and multi-byte (non-Latin) letters

2013-06-03 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-5030: -- Description: There is a limitation in the current FuzzySuggester implementation: it computes

[jira] [Comment Edited] (LUCENE-5012) Make graph-based TokenFilters easier

2013-05-24 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13666519#comment-13666519 ] Artem Lukanin edited comment on LUCENE-5012 at 5/24/13 6:19 PM:

[jira] [Commented] (LUCENE-5012) Make graph-based TokenFilters easier

2013-05-24 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13666519#comment-13666519 ] Artem Lukanin commented on LUCENE-5012: --- Great! I was asking several people about

[jira] [Comment Edited] (LUCENE-5012) Make graph-based TokenFilters easier

2013-05-24 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13666519#comment-13666519 ] Artem Lukanin edited comment on LUCENE-5012 at 5/24/13 6:19 PM:

[jira] [Commented] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604973#comment-13604973 ] Artem Lukanin commented on SOLR-4489: - The same issue with wordbreak in Solr 4.1:

[jira] [Commented] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604986#comment-13604986 ] Artem Lukanin commented on SOLR-4489: - with combineWords=false I have another exception

[jira] [Comment Edited] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604986#comment-13604986 ] Artem Lukanin edited comment on SOLR-4489 at 3/18/13 10:22 AM:

[jira] [Comment Edited] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604986#comment-13604986 ] Artem Lukanin edited comment on SOLR-4489 at 3/18/13 10:24 AM:

[jira] [Comment Edited] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604973#comment-13604973 ] Artem Lukanin edited comment on SOLR-4489 at 3/18/13 10:24 AM:

[jira] [Comment Edited] (SOLR-4489) StringIndexOutOfBoundsException in SpellCheckComponent

2013-03-18 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13604973#comment-13604973 ] Artem Lukanin edited comment on SOLR-4489 at 3/18/13 10:24 AM:

[jira] [Created] (SOLR-4533) Synonyms, created in custom filters are ignored after tokenizers.

2013-03-06 Thread Artem Lukanin (JIRA)
Artem Lukanin created SOLR-4533: --- Summary: Synonyms, created in custom filters are ignored after tokenizers. Key: SOLR-4533 URL: https://issues.apache.org/jira/browse/SOLR-4533 Project: Solr

[jira] [Updated] (SOLR-4533) Synonyms, created in custom filters are ignored after tokenizers.

2013-03-06 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated SOLR-4533: Attachment: synonyms.patch Fixed Synonyms, created in custom filters are ignored

[jira] [Commented] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-30 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13566327#comment-13566327 ] Artem Lukanin commented on SOLR-4375: - I failed to make a custom codec LUCENE_41CUSTOM,

[jira] [Comment Edited] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-30 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13566327#comment-13566327 ] Artem Lukanin edited comment on SOLR-4375 at 1/30/13 9:38 AM: --

[jira] [Comment Edited] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-30 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13566327#comment-13566327 ] Artem Lukanin edited comment on SOLR-4375 at 1/30/13 9:41 AM: --

[jira] [Commented] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-30 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13566358#comment-13566358 ] Artem Lukanin commented on SOLR-4375: - Yes, default is good. Just it would be better if

[jira] [Commented] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-30 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13566376#comment-13566376 ] Artem Lukanin commented on SOLR-4375: - Thank you. I will try to do it.

[jira] [Commented] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-29 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13565506#comment-13565506 ] Artem Lukanin commented on SOLR-4375: - I don't use highlights (there are no hl=true in

[jira] [Commented] (SOLR-4375) Add config or schema option to turn off compressed stored fields in the Lucene 4.1 index format

2013-01-29 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13565548#comment-13565548 ] Artem Lukanin commented on SOLR-4375: - Yes, sure, here are my numbers for mean response

[jira] [Commented] (LUCENE-4321) java.io.FilterReader considered harmful

2013-01-15 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13553580#comment-13553580 ] Artem Lukanin commented on LUCENE-4321: --- Oh, indeed!

[jira] [Updated] (LUCENE-4321) java.io.FilterReader considered harmful

2013-01-14 Thread Artem Lukanin (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Artem Lukanin updated LUCENE-4321: -- Attachment: NoRandomReadMockTokenizer.java I had to extend MockTokenizer, because I read the