> > I'd like to add this idea: > > > > > > task: dictionary induction from wikis > > difficulty: 3. medium > > description: Extract dictionaries from linguistic wikis > > rationale: Wiki dictionaries and encyclopedias (e.g. omegawiki, > > wiktionary, wikipedia) contain information (e.g. bilingual equivalences, > > morphological features, conjugations, etc) that could be exploited to > > speed up the development of dictionaries for Apertium. This task aims at > > automatically building dictionaries by extracting different pieces of > > information from wiki structures such as interlingual links, infoboxes, > > etc. > > requirements: SQL, mediawiki syntax, perl, maybe C++ or Java > > > > FWIW, there's a branch of dbpedia that has started to do something > similar: > http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/e406efd61660 > > At the moment, it only targets de.wiktionary, but (where possible) > it's being designed to have templates independent of the code, so it > should be about as easy to adapt to new wiktionaries as dbpedia proper > is to new wikipedias (i.e., there are (unavoidably) some things that > need to be added to the code, but most information comes from the > templates). -- mostly Scala, some Java > > There's also a Freedict-related project to extract TEI dictionaries > from the Russian wiktionary: > http://wiktionary-export.nataraj.su/en/about.html -- Perl > > There's also a Java-based parser for wikimedia-style wikis: > http://code.google.com/p/jwpl/
Thanks for the info. I've posted the idea adding some dbpedia-related bits > > Would anyone else be interested as mentor? > > How about you? :) If someone's interested in working on the dbpedia > framework, I've done some work on it, and would be happy to mentor > that. Sure, I've added your name as well :) a ------------------------------------------------------------------------------ What You Don't Know About Data Connectivity CAN Hurt You This paper provides an overview of data connectivity, details its effect on application quality, and explores various alternative solutions. http://p.sf.net/sfu/progress-d2d _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
