I'm also curious how this will play out over time.

Will we have clusters of peer CouchDB servers that have one View Controller each or one CouchDB server with many mapreduce controllers?

I'm thinking in big picture there are two scalability issues:

1. Huge datasets. The concept of sharding would apply here.
2. Complex maps. In some cases it might be faster to do these on multiple machines with one CouchDB server.

I'm not one to suggest any architectural solutions at this point, just thinking out loud at this point.

Cortland Klein <[EMAIL PROTECTED]> +1 408 506 9791
http://pixelcort.com/

Sent from my iPhone

On Apr 30, 2008, at 3:52 PM, "Krzysztof Kulewski" <[EMAIL PROTECTED] > wrote:

Hello,
I have a question for which I couldn't find solution on docs available online. Please help: Lets suppose that I have one couchdb server storing text documents. I want to do some map reduce on it. Map part take very long time per one doc, so I want to distribute maps between 20 map servers (btw. maybe unreliable). And then somebody will do the reduce part. As I can see, "CouchDB delegates computation of Views to external query servers." But I want to use 20 separate machines for doing map part to obtain 20x speed-up. How to distribute this map part? Is there any pretty solution out of the box?
BR,
Krzysztof

Reply via email to