[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-07-09 Thread KuroSaka TeruHiko (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12729373#action_12729373 ] KuroSaka TeruHiko commented on LUCENE-1629: --- WordTokenizer extends Tokenizer,

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-07-09 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12729381#action_12729381 ] Robert Muir commented on LUCENE-1629: - bq. Shouldn't WordTokenizer rather extend

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-29 Thread Mingfai Ma (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714670#action_12714670 ] Mingfai Ma commented on LUCENE-1629: re. hiragana (平假名) and katakana (片假名) in Japanese

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-28 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714285#action_12714285 ] Otis Gospodnetic commented on LUCENE-1629: -- I just got to look at this code and I

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-28 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12714293#action_12714293 ] Xiaoping Gao commented on LUCENE-1629: -- I think the algorithm of Hidden Markov Model

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-19 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710712#action_12710712 ] Xiaoping Gao commented on LUCENE-1629: -- The dictionary is loaded into 2 classes:

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-16 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710118#action_12710118 ] Koji Sekiguchi commented on LUCENE-1629: bq. Koji, have you considered using ICU

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-16 Thread Mingfai Ma (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710147#action_12710147 ] Mingfai Ma commented on LUCENE-1629: I'm not sure if the character mapping is a

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Mingfai Ma (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709796#action_12709796 ] Mingfai Ma commented on LUCENE-1629: Hi Xiaoping, I'm interested in getting the Chinese

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709867#action_12709867 ] Xiaoping Gao commented on LUCENE-1629: -- Hello Mingfai! coredict.mem is converted

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709866#action_12709866 ] Xiaoping Gao commented on LUCENE-1629: -- Hello Mingfai! coredict.mem is converted

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709880#action_12709880 ] Robert Muir commented on LUCENE-1629: - If you acquire the Big5 resources, do you think

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709885#action_12709885 ] Robert Muir commented on LUCENE-1629: - Another potential issue with Big5 I want to

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Mingfai Ma (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709974#action_12709974 ] Mingfai Ma commented on LUCENE-1629: Could we use CC-CEDICT's dictionary instead? It

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710035#action_12710035 ] Koji Sekiguchi commented on LUCENE-1629: Just an FYI. There has been a working

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12710070#action_12710070 ] Robert Muir commented on LUCENE-1629: - Koji, have you considered using ICU transforms

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709345#action_12709345 ] Michael McCandless commented on LUCENE-1629: Awesome! I've applied your

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709352#action_12709352 ] Uwe Schindler commented on LUCENE-1629: --- Fine! Should I commit the ArabicAnalyzer

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709355#action_12709355 ] Michael McCandless commented on LUCENE-1629: bq. Should I commit the

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709357#action_12709357 ] Michael McCandless commented on LUCENE-1629: bq. The analyzer (and many more)

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709416#action_12709416 ] Xiaoping Gao commented on LUCENE-1629: -- Test successful on my laptop now! Thank all

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-14 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12709425#action_12709425 ] Uwe Schindler commented on LUCENE-1629: --- Hi Xiaoping, Thanks! The code is now

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708878#action_12708878 ] Michael McCandless commented on LUCENE-1629: (Shooting in the dark, here,

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708887#action_12708887 ] Uwe Schindler commented on LUCENE-1629: --- I wonder why this build fragment did not

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708889#action_12708889 ] Michael McCandless commented on LUCENE-1629: That fragment is under

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708892#action_12708892 ] Uwe Schindler commented on LUCENE-1629: --- I will look into it this evening and

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708894#action_12708894 ] Michael McCandless commented on LUCENE-1629: OK, I agree, separation of

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708909#action_12708909 ] Uwe Schindler commented on LUCENE-1629: --- It's only needed to have the src/resources

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-13 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708912#action_12708912 ] Erik Hatcher commented on LUCENE-1629: -- My initial thought is to move the copy

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-12 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708340#action_12708340 ] Uwe Schindler commented on LUCENE-1629: --- I know this; the problem with the Lucene

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707969#action_12707969 ] Michael McCandless commented on LUCENE-1629: When I run ant test in

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708009#action_12708009 ] Xiaoping Gao commented on LUCENE-1629: -- On Mon, May 11, 2009 at 6:57 PM, Michael

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708032#action_12708032 ] Michael McCandless commented on LUCENE-1629: I do have the file, but at

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708060#action_12708060 ] Uwe Schindler commented on LUCENE-1629: --- Did the jar ANT task also add the non

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708067#action_12708067 ] Xiaoping Gao commented on LUCENE-1629: -- I think Schindler should be right. I modified

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708139#action_12708139 ] Uwe Schindler commented on LUCENE-1629: --- I did some checks now, it is the problem of

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708181#action_12708181 ] Michael McCandless commented on LUCENE-1629: bq. The simplest would be to

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-11 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12708320#action_12708320 ] Xiaoping Gao commented on LUCENE-1629: -- I think it is unacceptable to ask every

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707649#action_12707649 ] Michael McCandless commented on LUCENE-1629: Xiaoping, could you turn the

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-09 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707683#action_12707683 ] Xiaoping Gao commented on LUCENE-1629: -- to Robert Muir: The dictionary only supports

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-09 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707703#action_12707703 ] Robert Muir commented on LUCENE-1629: - Xiaoping, thanks. I see they didn't get great

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-08 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707278#action_12707278 ] Michael McCandless commented on LUCENE-1629: bq. all the code working on

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-08 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707280#action_12707280 ] Michael McCandless commented on LUCENE-1629: When I apply the patch and then

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706782#action_12706782 ] Michael McCandless commented on LUCENE-1629: Patch looks good -- thanks

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706887#action_12706887 ] Uwe Schindler commented on LUCENE-1629: --- Hi Xiaoping, looks good, but I have some

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706928#action_12706928 ] Xiaoping Gao commented on LUCENE-1629: -- to McCandless: There is lots of code

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12706948#action_12706948 ] Robert Muir commented on LUCENE-1629: - Hi, I see in the paper that lexical resources

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707042#action_12707042 ] Michael McCandless commented on LUCENE-1629: bq. There is lots of code

Re: [jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread DM Smith
I'd prefer it to stay 1.4 for now and would be willing to make the change, if needed. -- DM On May 7, 2009, at 3:04 PM, Michael McCandless (JIRA) wrote: [

[jira] Commented: (LUCENE-1629) contrib intelligent Analyzer for Chinese

2009-05-07 Thread Xiaoping Gao (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1629?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12707235#action_12707235 ] Xiaoping Gao commented on LUCENE-1629: -- I have ported the code to Java 1.4 today,