Hi, also consider that each wiktionary aims to contains every single word for each human language.
So you can't do the assumption of reading only italian Wiktionary to have a list of all the italian words. The best approach is to read all Wiktionaries looking each one for italian words. In my case I used both english and italian wiktionaries to retrieve a huge list of italian words (other editions were irrelevant). Cheers, Riccardo 2015-11-17 10:09 GMT+01:00 Dimitris Kontokostas <[email protected]>: > Hi Raphael > > The Wiktionary mappings are unmaintained for quite some time now. > You are more than welcome to update them and match the current wiktionary > structure or look at other options such as dbnary. > > Best, > Dimitris > > > On Mon, Nov 16, 2015 at 5:59 PM, Raphael Boyer <[email protected]> > wrote: > >> >> Dear all, >> >> I tried recently to extract data from a french Wiktionary dump with the >> extractor of the community on github. >> >> But this create strange data like this, for every word. >> Few data about every word, word from other languages are parsed too. I >> don't know if it's normal. >> >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://usefulinc.com/ns/doap#creator> < >> http://de.wiktionary.org/w/index.php?title=encyclopédie&action=history> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.monnet-project.eu/lemon#sense> < >> http://wiktionary.dbpedia.org/resource/encyclopédie> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.w3.org/2000/01/rdf-schema#label> "encyclopédie"^^< >> http://www.w3.org/2001/XMLSchema#string> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.w3.org/2000/01/rdf-schema#seeAlso> < >> http://de.wiktionary.org/wiki/encyclopédie> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://wiktionary.dbpedia.org/terms/LexicalEntity> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://www.monnet-project.eu/lemon#LexicalSense> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://www.monnet-project.eu/lemon#LexicalEntry> . >> <http://wiktionary.dbpedia.org/resource/encyclopédie> < >> http://wiktionary.dbpedia.org/terms/statistics> "7-139"^^< >> http://www.w3.org/2001/XMLSchema#string> . >> >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://usefulinc.com/ns/doap#creator> < >> http://de.wiktionary.org/w/index.php?title=accueil&action=history> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.monnet-project.eu/lemon#sense> < >> http://wiktionary.dbpedia.org/resource/accueil> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.w3.org/2000/01/rdf-schema#label> "accueil"^^< >> http://www.w3.org/2001/XMLSchema#string> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.w3.org/2000/01/rdf-schema#seeAlso> < >> http://de.wiktionary.org/wiki/accueil> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://www.monnet-project.eu/lemon#LexicalEntry> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://www.monnet-project.eu/lemon#LexicalSense> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://www.w3.org/1999/02/22-rdf-syntax-ns#type> < >> http://wiktionary.dbpedia.org/terms/LexicalEntity> . >> <http://wiktionary.dbpedia.org/resource/accueil> < >> http://wiktionary.dbpedia.org/terms/statistics> "7-112"^^< >> http://www.w3.org/2001/XMLSchema#string> . >> >> If you have any idea of what is happening. >> >> With regards >> >> Raphaël Boyer >> WIMMICS TEAM >> INRIA France >> >> >> ------------------------------------------------------------------------------ >> Presto, an open source distributed SQL query engine for big data, >> initially >> developed by Facebook, enables you to easily query your data on Hadoop in >> a >> more interactive manner. Teradata is also now providing full enterprise >> support for Presto. Download a free open source copy now. >> http://pubads.g.doubleclick.net/gampad/clk?id=250295911&iu=/4140 >> _______________________________________________ >> Dbpedia-discussion mailing list >> [email protected] >> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion >> >> > > > -- > Kontokostas Dimitris > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Dbpedia-discussion mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion > >
------------------------------------------------------------------------------
_______________________________________________ Dbpedia-discussion mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
