I'd like to announce the preliminary availability of a very fascinating dataset
http://blog.databaseanimals.com/wikipedia-pagelinks-in-amazon-s3 The gist is that the page counts contain hourly usage information for every page in all the wikipedias, wiktionaries, wikimedia commons, etc. This 3TB data set is in the us-east-1 zone and can be easily worked on with Amazon MapReduce. Because this data is preliminary I'm not ready to make it requester-paid, but if you write to me I can authorize your AWS keys for access to it. -- Paul Houle Expert on Freebase, DBpedia, Hadoop and RDF (607) 539 6254 paul.houle on Skype ontol...@gmail.com ------------------------------------------------------------------------------ Sponsored by Intel(R) XDK Develop, test and display web and hybrid apps with a single code base. Download it for free now! http://pubads.g.doubleclick.net/gampad/clk?id=111408631&iu=/4140/ostg.clktrk _______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion