Hey Sebastian,

you can also have a look at [1], which is what we used to create 
internationalized models. What you will need is (1) stop words [that's easy] 
(2) tokenization [that's a bit more critical]. We support Korean tokenization 
out of the box via java.text but since I suppose Korean is hard to tokenize, it 
will likely give better results to have a supervised OpenNLP tokenizer 
tokenizer.

Best,
Jo


[1] https://github.com/jodaiber/model-quickstarter


Am 19.04.2013 um 14:16 schrieb Pablo N. Mendes:

> 
> All of the documentation is work in progress. Please feel free to improve the 
> confusing points and fix the outdated information.
> 
> We have another side of the docs that have been updated more recently:
> https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Internationalization-(DB-backed-core)
> 
> It uses a different pipeline from the link you pointed out. But this may be 
> easier to work with Korean at the current state of affairs.
> 
> Cheers,
> Pablo
> 
> 
> On Fri, Apr 19, 2013 at 2:08 PM, Sebastian Hellmann 
> <[email protected]> wrote:
> Dear list,
> I am not sure, whether this shouldn't go to the developers list....
> 
> Can somebody tell me what the best way is to create a Korean Spotlight.
> http://wiki.dbpedia.org/Internationalization
> Seems to be outdated. It also confused us a lot. I hope there is a
> simpler way with more straightforward instructions.
> 
> All the best,
> Sebastian
> 
> 
> 
> 
> --
> Dipl. Inf. Sebastian Hellmann
> Department of Computer Science, University of Leipzig
> Projects: http://nlp2rdf.org , http://linguistics.okfn.org ,
> http://dbpedia.org/Wiktionary , http://dbpedia.org
> Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
> Research Group: http://aksw.org
> 
> ------------------------------------------------------------------------------
> Precog is a next-generation analytics platform capable of advanced
> analytics on semi-structured data. The platform includes APIs for building
> apps and a phenomenal toolset for data science. Developers can use
> our toolset for easy data analysis & visualization. Get a free account!
> http://www2.precog.com/precogplatform/slashdotnewsletter
> _______________________________________________
> Dbp-spotlight-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
> 
> 
> 
> -- 
> 
> Pablo N. Mendes
> http://pablomendes.com
> ------------------------------------------------------------------------------
> Precog is a next-generation analytics platform capable of advanced
> analytics on semi-structured data. The platform includes APIs for building
> apps and a phenomenal toolset for data science. Developers can use
> our toolset for easy data analysis & visualization. Get a free account!
> http://www2.precog.com/precogplatform/slashdotnewsletter_______________________________________________
> Dbp-spotlight-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

Reply via email to