http://storage.googleapis.com/books/ngrams/books/datasetsv2.html
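
As a rough illustration of how a word list like the one linked above could feed the report asked about below, here is a minimal Python sketch. It is only a sketch under stated assumptions: the function names `aggregate_counts` and `missing_words` are hypothetical, the input is assumed to be tab-separated lines of the form word, year, match count, volume count, and the set of existing Wiktionary titles is assumed to have been obtained separately (for example from a list of page titles). Matching is exact and case-sensitive.

```python
from collections import Counter


def aggregate_counts(lines):
    """Sum the match counts per word over all years.

    Assumed line layout (an assumption, not checked against the real files):
    word <TAB> year <TAB> match_count <TAB> volume_count
    """
    totals = Counter()
    for line in lines:
        parts = line.rstrip("\n").split("\t")
        if len(parts) < 3:
            continue  # skip lines that do not fit the assumed layout
        word, match_count = parts[0], parts[2]
        totals[word] += int(match_count)
    return totals


def missing_words(totals, existing_titles, min_count=1):
    """Return (word, count) pairs for words absent from existing_titles,
    most frequent first, ties broken alphabetically."""
    existing = set(existing_titles)
    report = [
        (word, count)
        for word, count in totals.items()
        if count >= min_count and word not in existing
    ]
    report.sort(key=lambda pair: (-pair[1], pair[0]))
    return report


# Made-up sample data, for illustration only.
sample_lines = [
    "cat\t2000\t50\t10",
    "cat\t2001\t70\t12",
    "selfie\t2008\t30\t5",
    "blorp\t2005\t2\t1",
]
sample_titles = ["cat", "dog"]  # stand-in for the titles already in a Wiktionary

report = missing_words(aggregate_counts(sample_lines), sample_titles)
for word, count in report:
    print(f"{word}\t{count}")
```

A bot could then post `report` to a wiki page; raising `min_count` is one simple way to filter out rare noise before doing so.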

2013/7/23 Bináris <[email protected]>

> Once you have a "list of words which are used on the web" (this must come
> from an outside source; it cannot be produced within Wiktionary), the
> easiest way is to run a bot, e.g. Pywikipedia.
>
> 2013/7/23 Mathieu Stumpf <[email protected]>
>
> > Hello,
> >
> > Here is what I would like to do: generate reports which give, for a
> > given language, a list of words which are used on the web, each with a
> > number estimating its occurrences, but which are not in a given Wiktionary.
> >
> > How would you recommend implementing that within the Wikimedia
> > infrastructure?
> >
> > --
> > Association Culture-Libre
> > http://www.culture-libre.org/
> >
> > _______________________________________________
> > Wikitech-l mailing list
> > [email protected]
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
> --
> Bináris
