[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12729373#action_12729373
]
KuroSaka TeruHiko commented on LUCENE-1629:
---
WordTokenizer extends Tokenizer,
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12729381#action_12729381
]
Robert Muir commented on LUCENE-1629:
-
bq. Shouldn't WordTokenizer rather extends
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714670#action_12714670
]
Mingfai Ma commented on LUCENE-1629:
re. 平假名 and 片假名 in Japanese
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714285#action_12714285
]
Otis Gospodnetic commented on LUCENE-1629:
--
I just got to look at this code and I
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714293#action_12714293
]
Xiaoping Gao commented on LUCENE-1629:
--
I think the algorithm of Hidden Markov Model
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710712#action_12710712
]
Xiaoping Gao commented on LUCENE-1629:
--
The dictionary is loaded in to 2 classes:
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710118#action_12710118
]
Koji Sekiguchi commented on LUCENE-1629:
bq. koji, have you considered using icu
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710147#action_12710147
]
Mingfai Ma commented on LUCENE-1629:
i'm not sure if the character mapping is a
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709796#action_12709796
]
Mingfai Ma commented on LUCENE-1629:
hi Xiaoping,
I'm interested to get the Chinese
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709867#action_12709867
]
Xiaoping Gao commented on LUCENE-1629:
--
Hello Mingfai!
coredict.mem is converted
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709866#action_12709866
]
Xiaoping Gao commented on LUCENE-1629:
--
Hello Mingfai!
coredict.mem is converted
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709880#action_12709880
]
Robert Muir commented on LUCENE-1629:
-
if you acquire the big5 resources, do you think
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709885#action_12709885
]
Robert Muir commented on LUCENE-1629:
-
another potential issue with big5 i want to
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709974#action_12709974
]
Mingfai Ma commented on LUCENE-1629:
could we use CC-CEDICT's dictionary instead? it
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710035#action_12710035
]
Koji Sekiguchi commented on LUCENE-1629:
Just an FYI. There have been a working
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710070#action_12710070
]
Robert Muir commented on LUCENE-1629:
-
koji, have you considered using icu transforms
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709345#action_12709345
]
Michael McCandless commented on LUCENE-1629:
Awesome! I've applied your
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709352#action_12709352
]
Uwe Schindler commented on LUCENE-1629:
---
Fine!
Should I commit the ArabicAnalyzer
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709355#action_12709355
]
Michael McCandless commented on LUCENE-1629:
bq. Should I commit the
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709357#action_12709357
]
Michael McCandless commented on LUCENE-1629:
bq. The analyzer (and many more)
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709416#action_12709416
]
Xiaoping Gao commented on LUCENE-1629:
--
Test successful on my laptop now! Thank all
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709425#action_12709425
]
Uwe Schindler commented on LUCENE-1629:
---
Hi Xiaoping,
Thanks! The code is now
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708878#action_12708878
]
Michael McCandless commented on LUCENE-1629:
(Shooting in the dark, here,
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708887#action_12708887
]
Uwe Schindler commented on LUCENE-1629:
---
I wonder, why this build fragment did not
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708889#action_12708889
]
Michael McCandless commented on LUCENE-1629:
That fragment is under
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708892#action_12708892
]
Uwe Schindler commented on LUCENE-1629:
---
I will look into it this evening and
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708894#action_12708894
]
Michael McCandless commented on LUCENE-1629:
OK, I agree, separation of
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708909#action_12708909
]
Uwe Schindler commented on LUCENE-1629:
---
Its only needed to have the src/resources
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708912#action_12708912
]
Erik Hatcher commented on LUCENE-1629:
--
My initial thought is to move the copy
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708340#action_12708340
]
Uwe Schindler commented on LUCENE-1629:
---
I know this, the problem with th lucene
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707969#action_12707969
]
Michael McCandless commented on LUCENE-1629:
When I run ant test in
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708009#action_12708009
]
Xiaoping Gao commented on LUCENE-1629:
--
On Mon, May 11, 2009 at 6:57 PM, Michael
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708032#action_12708032
]
Michael McCandless commented on LUCENE-1629:
I do have the file, but at
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708060#action_12708060
]
Uwe Schindler commented on LUCENE-1629:
---
Did the jar ANT task also adds the non
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708067#action_12708067
]
Xiaoping Gao commented on LUCENE-1629:
--
I think Schindler should be right.
I modified
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708139#action_12708139
]
Uwe Schindler commented on LUCENE-1629:
---
I did some checks now, it is the problem of
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708181#action_12708181
]
Michael McCandless commented on LUCENE-1629:
bq. The simpliest would be to
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708320#action_12708320
]
Xiaoping Gao commented on LUCENE-1629:
--
I think it is unacceptable to ask every
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707649#action_12707649
]
Michael McCandless commented on LUCENE-1629:
Xiaoping, could you turn the
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707683#action_12707683
]
Xiaoping Gao commented on LUCENE-1629:
--
to Robert Muir:
The dictionary only supports
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707703#action_12707703
]
Robert Muir commented on LUCENE-1629:
-
Xiaoping, thanks. I see they didn't get great
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707278#action_12707278
]
Michael McCandless commented on LUCENE-1629:
bq. all the code working on
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707280#action_12707280
]
Michael McCandless commented on LUCENE-1629:
When I apply the patch and then
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706782#action_12706782
]
Michael McCandless commented on LUCENE-1629:
Patch looks good -- thanks
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706887#action_12706887
]
Uwe Schindler commented on LUCENE-1629:
---
Hi Xiaoping,
looks good, but I have some
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706928#action_12706928
]
Xiaoping Gao commented on LUCENE-1629:
--
to McCandless:
There is lots of code
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706948#action_12706948
]
Robert Muir commented on LUCENE-1629:
-
Hi,
I see in the paper that lexical resources
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707042#action_12707042
]
Michael McCandless commented on LUCENE-1629:
bq. There is lots of code
I'd prefer it to stay 1.4 for now and would be willing to make the
change, if needed.
-- DM
On May 7, 2009, at 3:04 PM, Michael McCandless (JIRA) wrote:
[
[
https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707235#action_12707235
]
Xiaoping Gao commented on LUCENE-1629:
--
I have ported the code to Java1.4 today,
50 matches
Mail list logo