Hi Riak folks,

I'm new to Riak and have been reading up the wiki, watching videos and getting 
the development environment up and running.

I'll be using Riak as a key-value pair storage system, and run various 
MapReduce operations over the datastore for data analysis using the Python 
client. I have a few questions on the usage of python client.

1. For a 3-node cluster with number of copies for each document set to two, is 
there a way in which I wish to  distribute fetch requests uniformly across the 
three nodes?  I realise riak internally fetches the record from whichever node 
the data is available.
    This is the code I use to fetch values from keys:
            client = riak.RiakClient(host, port)
            bucket = self.client.bucket(bucketName)
            for key in keys:
                post = self.bucket.get(key)
     This would send all fetches to the same node. Is it better to put a 
loadbalancer in front of the three nodes to distribute the requests uniformly, 
or is creating a client object lightweight enough that I can dynamically 
connect to whichever client?

2. My familiarity with erlang and Javascript is minimal. Would somebody answer 
if the following things are doable/advisable to do?
     a. Most of the processes I want to run on the data are in python and php. 
Is it possible to call these processes from erlang/javascript?
     b. I've been looking for MapReduce tutorials for python client, but 
couldn't find any. Can somebody point to a sample MapReduce from python client, 
for example, to retrieve multiple records from Riak? (Given a set of keys 
[1,2,3,4], retrieve the records for these from Riak using MapReduce).

--
Thanks in advance,
Ishwar.
_______________________________________________
riak-users mailing list
[email protected]
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to