Hi Sawyer,

In this mail I will just provide an update to the original information in [1]

* Paoding: As Paoding is not compatible to Solr 4 those
implementations can no longer be used with the Stanbol SNAPSHOT. This
means that you have to use Smartcn.
* If you want to test with DBpedia you can use the index available at
http://dev.iks-project.eu/downloads/stanbol-indices/dbpedia-3.8/chinese/smartcn/
* If you what to index your own dataset with Chinese languages please
also follow the information in [1]. But make sure to use the Smartcn
implementations.

A demo for DBpedia is available at the nightly build server at

     http://dev.iks-project.eu:8081/enhancer/chain/dbpedia-proper-noun

As mentioned in [1] quality is currently not really good, because NLP
is limited to sentence detection and tokenization. However I am
currently working on the integration of Stanford NLP with Apache
Stanbol [2]. While this integration currently only supports English I
will extend this to also support the other languages supported by
Stanford NLP. This also includes POS tagging and NER for Chinese. My
current plan is to finish this work in the first or second week of
June.

With this available Linking results should be better. But still the
Entity Linking engines are not optimized for JCK languages. Currently
they make some assumptions that are nice for letter based languages,
but do not necessarily work well for JCK languages.

best
Rupert

[1] http://stanbol.markmail.org/thread/gqrqnl3ght2pgmob
[2] https://github.com/westei/stanbol-stanfordnlp

On Fri, May 17, 2013 at 10:38 AM, Sawyer Chen <[email protected]> wrote:
> Hi,
>
> I have seen the chinese enhance chain on the demo site a few months ago and
> that seems not available now. But I'm curious of how to setup a Chinese
> enhance Chain.
> Please point out the starting direction and/or readme/doc of creating a
> chinese enhance chain which I can dig into this.
> Sorry for repost if this problem has been answered before.
>
> Sawyer



-- 
| Rupert Westenthaler             [email protected]
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to