The add seconds to the bucket.  Also, the data will get cached-it's not going 
to hit disk on every read.

Look at the key cache settings on the table.  Also, in 2.1 you have even more 
control over caching.

--
Colin
320-221-9531


> On Jun 7, 2014, at 4:30 PM, Kevin Burton <bur...@spinn3r.com> wrote:
> 
> 
>> On Sat, Jun 7, 2014 at 1:34 PM, Colin <colpcl...@gmail.com> wrote:
>> Maybe it makes sense to describe what you're trying to accomplish in more 
>> detail.
> 
> Essentially , I'm appending writes of recent data by our crawler and sending 
> that data to our customers.
>  
> They need to sync to up to date writes…we need to get them writes within 
> seconds. 
> 
>> A common bucketing approach is along the lines of year, month, day, hour, 
>> minute, etc and then use a timeuuid as a cluster column.  
> 
> I mean that is acceptable.. but that means for that 1 minute interval, all 
> writes are going to that one node (and its replicas)
> 
> So that means the total cluster throughput is bottlenecked on the max disk 
> throughput.
> 
> Same thing for reads… unless our customers are lagged, they are all going to 
> stampede and ALL of them are going to read data from one node, in a one 
> minute timeframe.
> 
> That's no fun..  that will easily DoS our cluster.
>  
>> Depending upon the semantics of the transport protocol you plan on 
>> utilizing, either the client code keep track of pagination, or the app 
>> server could, if you utilized some type of request/reply/ack flow.  You 
>> could keep sequence numbers for each client, and begin streaming data to 
>> them or allowing query upon reconnect, etc.
>> 
>> But again, more details of the use case might prove useful.
> 
> I think if we were to just 100 buckets it would probably work just fine.  
> We're probably not going to be more than 100 nodes in the next year and if we 
> are that's still reasonable performance.  
> 
> I mean if each box has a 400GB SSD that's 40TB of VERY fast data. 
> 
> Kevin
> 
> -- 
> Founder/CEO Spinn3r.com
> Location: San Francisco, CA
> Skype: burtonator
> blog: http://burtonator.wordpress.com
> … or check out my Google+ profile
> 
> War is peace. Freedom is slavery. Ignorance is strength. Corporations are 
> people.

Reply via email to