[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335455#comment-16335455 ] Adrien Grand commented on LUCENE-8132: -- No, the hyphenation decompounder would have to be the first

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Holger Bruch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335427#comment-16335427 ] Holger Bruch commented on LUCENE-8132: -- I’m not as deeply in Lucene as you are. What would be the

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334273#comment-16334273 ] Robert Muir commented on LUCENE-8132: - Thats what HyphenationDecompoundTokenFilter already does. I

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334080#comment-16334080 ] Adrien Grand commented on LUCENE-8132: -- I haven't though about concrete use-cases, but for instance

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334074#comment-16334074 ] Robert Muir commented on LUCENE-8132: - why do you need to decompound more than once? The

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334065#comment-16334065 ] Adrien Grand commented on LUCENE-8132: -- I'm not sure how practical this would be: some tokenizers

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334027#comment-16334027 ] Robert Muir commented on LUCENE-8132: - Maybe the right solution is just to fix it correctly and

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-22 Thread Holger Bruch (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16334016#comment-16334016 ] Holger Bruch commented on LUCENE-8132: -- Ok, seems hard to get right for all cases. I wonder, if the

[jira] [Commented] (LUCENE-8132) HyphenationDecompoundTokenFilter does not set position/offset attributes correctly

2018-01-21 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16333962#comment-16333962 ] Adrien Grand commented on LUCENE-8132: -- I agree this sounds wrong. Unfortunately, inserting