Hi,

> Recently, I was study the UIMA, through documents, understand the UIMA can 
> support Chinese, but how the UIMA support Chinese not mentioned in the 
> document, so support for the UIMA Chinese this piece is more confused for me 
> , I hope you can give me some detailed document or material, help me to solve 
> my confused! 
> Thank you very much!

we have some support for Chinese in DKPro Core. 

DKPro Core ASL [1]:

- Tokenizer/segmenter using LanguageTool [2, 3]
- Part-of-speech tagger using TreeTagger [4, 5] (TreeTagger is research only)

DKPro Core GPL [6]:

- Part-of-speech tagger using Stanford NLP [7,8]
- Parser using Stanford NLP [7,9]
- Parser using Berkeley Parser [10, 11]

Some of these components may only be available in the SVN trunk version.

That said, we do not really work on Chinese data, so this is rather a 
proof-of-concept (checking character set works, models are loaded properly, 
etc). If you try it out and have feedback, please tell us :)

-- Richard

[1] http://code.google.com/p/dkpro-core-asl

[2] http://www.languagetool.org

[3] 
http://code.google.com/p/dkpro-core-asl/source/browse/de.tudarmstadt.ukp.dkpro.core-asl/trunk/de.tudarmstadt.ukp.dkpro.core.languagetool-asl/src/test/java/de/tudarmstadt/ukp/dkpro/core/languagetool/LanguageToolSegmenterTest.java

[4] http://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/

[5] 
http://code.google.com/p/dkpro-core-asl/source/browse/de.tudarmstadt.ukp.dkpro.core-asl/trunk/de.tudarmstadt.ukp.dkpro.core.treetagger-asl/src/test/java/de/tudarmstadt/ukp/dkpro/core/treetagger/TreeTaggerPosLemmaTT4JTest.java#128

[6] http://code.google.com/p/dkpro-core-gpl

[7] http://www-nlp.stanford.edu/software/index.shtml

[8] 
http://code.google.com/p/dkpro-core-gpl/source/browse/de.tudarmstadt.ukp.dkpro.core-gpl/trunk/de.tudarmstadt.ukp.dkpro.core.stanfordnlp-gpl/src/test/java/de/tudarmstadt/ukp/dkpro/core/stanfordnlp/StanfordPosTaggerTest.java#64

[9] 
http://code.google.com/p/dkpro-core-gpl/source/browse/de.tudarmstadt.ukp.dkpro.core-gpl/trunk/de.tudarmstadt.ukp.dkpro.core.stanfordnlp-gpl/src/test/java/de/tudarmstadt/ukp/dkpro/core/stanfordnlp/StanfordParserTest.java#316

[10] http://code.google.com/p/berkeleyparser/

[11] 
http://code.google.com/p/dkpro-core-gpl/source/browse/de.tudarmstadt.ukp.dkpro.core-gpl/trunk/de.tudarmstadt.ukp.dkpro.core.berkeleyparser-gpl/src/test/java/de/tudarmstadt/ukp/dkpro/core/berkeleyparser/BerkeleyParserTest.java#114

Reply via email to