RE: DataCenters each with their own local data source

Mathieu Lalonde Tue, 22 Nov 2011 18:15:52 -0800

Hi,

Thanks for the quick reply.  Sorry if my question was not clear.  I have tried to 
provide more info below.


________________________________
> Date: Tue, 22 Nov 2011 20:43:33 -0500 
> Subject: Re: DataCenters each with their own local data source 
> From: md.jahangi...@gmail.com 
> To: user@cassandra.apache.org 
>  
> Distributing writes to all D.C.s? or reads?

Writes should be distributed within the local D.C.
Reads could be D.C.-specific, or could query all D.C.s.

>  
> If each D.C. has data specific to that particular geo, why do you have  
> to read from remote D.C. ? 

Users want to aggregate information from different D.C.s.

>  
> You can easily incorporate logic to redirect an operation (either  
> write or read) to the appropriate (local) D.C.

In the middle tier, or does Cassandra have explicit support for that?

>  
> Still wondering why you want to do so. I am assuming you want to store  
> data as per I.P. (geo location data). Anyway, it was not very clear  
> from your question what you are trying to do. 

Each data center has a local data source that we can't afford to replicate 
across data centers.
Users may be interested in querying multiple data centers.
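
For illustration, a middle tier that treats each data center as its own
autonomous cluster could fan a read out to every data center and merge the
answers. The following is only a minimal Python sketch under stated
assumptions: query_all_datacenters, merge_rows, and the per-D.C. client
callables are hypothetical names, not part of Cassandra or any driver, and
rows are assumed to be (timestamp, value) pairs.

```python
from concurrent.futures import ThreadPoolExecutor


def query_all_datacenters(clients, query):
    """Scatter a read to every data center and gather the answers.

    `clients` maps a data center name to a callable that takes the query
    and returns a list of (timestamp, value) rows. These callables are
    hypothetical stand-ins for whatever client talks to each local cluster.
    """
    results, errors = {}, {}
    with ThreadPoolExecutor(max_workers=max(1, len(clients))) as pool:
        # Submit one request per data center so they run concurrently.
        futures = {dc: pool.submit(fn, query) for dc, fn in clients.items()}
        for dc, future in futures.items():
            try:
                results[dc] = future.result()
            except Exception as exc:
                # An unreachable data center should not fail the whole read.
                errors[dc] = exc
    return results, errors


def merge_rows(results):
    """Flatten per-D.C. rows into (timestamp, dc, value), sorted by timestamp."""
    merged = [
        (timestamp, dc, value)
        for dc, rows in results.items()
        for timestamp, value in rows
    ]
    return sorted(merged)
```

Writes would not go through this fan-out; under the same assumptions they
would simply be sent to the local data center's client. A real middle tier
would also need per-request timeouts, which are left out here for brevity.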

>  
> Thanks, 
> Jahangir Mohammed. 
>  
> On Tue, Nov 22, 2011 at 7:57 PM, Mathieu Lalonde  
> <mrlalo...@live.ca> wrote: 
>  
>  
> Hi, 
>  
> I am wondering if Cassandra's features and datacenter awareness can  
> help me with my scalability problems. 
>  
> Suppose that I have 10-20 data centers, each with its own local  
> (massive) source of time series data.  I would like: 
> - to avoid replication across data centers (this seems doable based on:  
> http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Different-KeySpaces-for-different-nodes-in-the-same-ring-td5096393.html#a5096568 ) 
> - writes for local data to be done on the local data center (not sure  
> about that one) 
> - reads from a master data center to any remote data centers (not sure  
> about that one either) 
>  
> It sounds like I am trying to use Cassandra in a very different way  
> than it was intended to be used. 
> Should I simply have a middle-tier that takes care of distributing  
> reads to multiple data centers and treat each data center as its own  
> autonomous cluster? 
>  
> Thanks! 
> Matt 
>  
>  
>
