Re: [Kim-discussion] Problem with Large KB Gazetteer and chinese caracters

2012-04-05 Thread Philip Alexiev
Hello Fabian, The LKB gazetteer uses its own tokenization, which is generally - whitespace based. This is the reason why it won't work over asian texts. Unfortunately we no longer support it. All the best, Philip On 4 Apr 2012, at 10:59 AM, Fabian Cretton wrote: Dear all, I am using

Re: [Kim-discussion] Rép. : Re: Problem with Large KB Gazetteer and chinese caracters

2012-04-05 Thread Philip Alexiev
Hi Fabian, Using any tokenizer with support for Gate is easily applicable in the ancestor of the LKB gazetteer - the LD gazetteer. We are currently discussing the future and licensing of this component. Expect outcome from us the following weeks. All the best, Philip On 5 Apr 2012, at 3:54

Re: [Kim-discussion] Rép. : Re: Problem with Large KB Gazetteer and chinese caracters

2012-04-05 Thread Fabian Cretton
Hello Philip, Do you mean the Linked Data Gazetteer you told me about in another post (so a successor of the LarkgeKB) ? I am currently working on a research project, where I need to implement those features in the next 2-3 weeks. You told me in another post: The matter is being discussed at