The main limitation is that MongoDB has only rudimentary support for parallelism. I'm trying to design a system that various departments can use as a data source, and the statistics on the Editor Trends page show MongoDB maxed out for days to dump en.wiki. I'd like more ability to grow capacity, especially long-term.
On Sun, Feb 13, 2011 at 15:43, Steven Walling <[email protected]> wrote:
> On Sun, Feb 13, 2011 at 3:32 PM, David Strauss <[email protected]> wrote:
>>
>> > Edit history in an accessible form -- create a queryable NoSQL form of data dumps
>>
>> I'd like to get this started ASAP. I think we can set up a bridge to
>> synchronize directly from MediaWiki to a tool like Cassandra. It will
>> provide a superior source for both XML dumps and analysis.
>
> See http://strategy.wikimedia.org/wiki/Editor_Trends_Study/Software for an
> already ongoing project very similar to this notion.
>
> _______________________________________________
> Wiki-research-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l

--
David Strauss | [email protected] | +1 512 577 5827 [mobile]
