I'd like to announce the preliminary availability of a very fascinating dataset

http://blog.databaseanimals.com/wikipedia-pagelinks-in-amazon-s3

The gist is that the page counts contain hourly usage information for
every page in all the wikipedias,  wiktionaries,  wikimedia commons,
etc.

This 3TB data set is in the us-east-1 zone and can be easily worked on
with Amazon MapReduce.  Because this data is preliminary I'm not ready
to make it requester-paid,  but if you write to me I can authorize
your AWS keys for access to it.

-- 
Paul Houle
Expert on Freebase, DBpedia, Hadoop and RDF
(607) 539 6254    paul.houle on Skype   ontol...@gmail.com

------------------------------------------------------------------------------
Sponsored by Intel(R) XDK 
Develop, test and display web and hybrid apps with a single code base.
Download it for free now!
http://pubads.g.doubleclick.net/gampad/clk?id=111408631&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to