I *think* i may have isolated this issue to a client version- though it 
doesn't make sense to me why the sqoop plugin isn't working. I'm going to 
try upgrading my client libs to the newest version.

On Wednesday, March 12, 2014 4:03:07 PM UTC-4, Corey Nolet wrote:
>
> Would it possible for someone to provide me with an effective example on 
> how to use the TapClient in couchbase/memcached with a couchbase server 
> installation?
>
> I've been banging my head against the wall for days on this. I need to be 
> able to dump out my couchbase keys/values every hour into HDFS so I can 
> map/reduce over them. I'm using CDH3u4 and the Sqoop connector is freezing 
> up when it begins its map/reduce job. I do not have the luxury of updating 
> to the Sqoop CDH4 version unfortunately but I've seen people complaining of 
> the same problems with that version.
>
> What I've tried is using the TapClient with both the Couchbase libraries 
> and the spy memcached libraries in java. Even with exponential backoff, I 
> can't seem to get the TapClient to return a message where I can pull off a 
> key and a value (it appears I get 'null" for getNextmessage() even with an 
> appropriate timeout of 5 minutes).
>
> What can I do to get this to work? I've been using Couchbase behind 
> Twitter Storm to help with caching for CEP. I've also been using it as a 
> real-time query engine of the underlying CEP cache with ElasticSearch for 
> my customer. If I can't dump the data out to HDFS directly, then I may need 
> to look at other options. I am trying to stay away from views because I 
> want to hit memory directly. I'd also like to preserve data locality if 
> possible (connect directly to memcached or tell couchbase exactly which 
> node(s) i'd like to retrieve keys from.
>
> What are my options here?
>
>
> I'm wondering if BigCouch would allow me to do this effectively.
>
> Thanks much!
>
>
> On Monday, March 10, 2014 11:52:57 PM UTC-4, Corey Nolet wrote:
>>
>> I recently tried the Sqoop connector for Couchbase 2 and it doesn't 
>> appear to be working as expected. I have written my own InputFormat here:
>>
>>
>> https://github.com/cjnolet/cloud-toolkit/blob/master/src/main/java/org/calrissian/hadoop/memcached/MemcachedInputFormat.java
>>
>> I haven't gotten a chance to test it yet but I wanted to know if MOXI 
>> would make it hard to get the locality that Im expecting from each of the 
>> memcached instances. When I connect to a memcached instance (backing 
>> couchbase) on port 11211, will each of those memcached instances give me 
>> ALL of the keys in couchbase? or will they only give me the keys that they 
>> contain separately?
>>
>>
>> Thanks!
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Couchbase" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to couchbase+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to