On 7 March 2011 16:56, Jimmy O'Regan <[email protected]> wrote:
> On 7 March 2011 16:28, Antonio Toral <[email protected]> wrote:
>> Hi,
>>
>> I'd like to add this idea:
>>
>>
>> task: dictionary induction from wikis
>> difficulty: 3. medium
>> description: Extract dictionaries from linguistic wikis
>> rationale: Wiki dictionaries and encyclopedias (e.g. omegawiki,
>> wiktionary, wikipedia) contain information (e.g. bilingual equivalences,
>> morphological features, conjugations, etc) that could be exploited to
>> speed up the development of dictionaries for Apertium. This task aims at
>> automatically building dictionaries by extracting different pieces of
>> information from wiki structures such as interlingual links, infoboxes,
>> etc.
>> requirements: SQL, mediawiki syntax, perl, maybe C++ or Java
>>
>
> FWIW, there's a branch of dbpedia that has started to do something
> similar:
> http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/e406efd61660
>

BTW, I mentioned dbpedia to you at freerbmt as a way of checking the
type of proper names. Here's a sample sparql query to get place names:
http://dbpedia.org/snorql/?query=SELECT+*+WHERE+{%0D%0A%3Fsubject+rdf:type+%3Chttp://dbpedia.org/ontology/Place%3E.%0D%0A%3Fsubject+rdfs:label+%3Flabelen.%0D%0A%3Fsubject+rdfs:label+%3Flabeles.%0D%0AFILTER+(lang(%3Flabelen)+%3D+%22en%22+%26%26+lang(%3Flabeles)+%3D+%22es%22)%0D%0A}+LIMIT+20

You could, say, change http://dbpedia.org/ontology/Place to
http://dbpedia.org/ontology/Monarch or
http://dbpedia.org/ontology/Pope to get a list of translatable names
of people, etc.

-- 
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.

_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
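
For readability, here is the percent-encoded query from the message above in decoded form, wrapped in a small Python sketch that swaps the ontology class (Place, Monarch, Pope, ...). Several things here are assumptions and not from the message: the endpoint https://dbpedia.org/sparql and its `format` parameter, the helper names build_query, build_url and label_pairs, and the explicit PREFIX lines (the snorql page supplies those prefixes itself). Treat it as an illustration, not as the way the query has to be run.

```python
from urllib.parse import urlencode

# Assumption: public DBpedia SPARQL endpoint; the message only links to snorql.
ENDPOINT = "https://dbpedia.org/sparql"

# Decoded form of the query in the message, with the ontology class and the
# LIMIT left as placeholders. PREFIX lines are added so it is self-contained.
QUERY_TEMPLATE = """\
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT * WHERE {{
  ?subject rdf:type <http://dbpedia.org/ontology/{cls}> .
  ?subject rdfs:label ?labelen .
  ?subject rdfs:label ?labeles .
  FILTER (lang(?labelen) = "en" && lang(?labeles) = "es")
}} LIMIT {limit}
"""


def build_query(cls="Place", limit=20):
    """Return the SPARQL text for one DBpedia ontology class (hypothetical helper)."""
    return QUERY_TEMPLATE.format(cls=cls, limit=int(limit))


def build_url(query, endpoint=ENDPOINT):
    """Return a GET URL asking the endpoint for SPARQL JSON results."""
    params = {"query": query, "format": "application/sparql-results+json"}
    return endpoint + "?" + urlencode(params)


def label_pairs(results):
    """Extract (English, Spanish) label pairs from a SPARQL JSON result dict."""
    return [
        (row["labelen"]["value"], row["labeles"]["value"])
        for row in results["results"]["bindings"]
    ]
```

To try it, fetch build_url(build_query("Monarch")) with any HTTP client, parse the body as JSON, and pass the result to label_pairs. Each pair is a candidate en-es proper-name entry that would still need checking before it goes into a dictionary.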
