Hi all,

I am trying to figure out how to write the GoraCI.Verifier job for the
Gora-DynamoDB. This is because Amazon DynamoDB does not have MapReduce
native support. The only MR support it has is through Apache Hive but
Gora doesn't support it yet so this is not a way to go. While reading
about EMR integration with DynamoDB [1] I saw that there is a specific
class that handles the integration between these two, this is class is
"org.apache.hadoop.hive.dynamodb.DynamoDBStorageHandler" but I haven't
been able to find this to review it.
The problem is that due to the fact that Amazon DynamoDB is not file
backed, Hadoop's input formats do not fit very well in this case. What
I was thinking was on using a dummy input format to start reading from
Amazon-DynamoDB and locally persisting it while scaling the tests. I
am not sure if this would be the best approach and I would really
appreciate to hear some opinions (:


Renato M.






[1] http://aws.amazon.com/articles/28549

Reply via email to