Trying to discover source for the DynamoDBInputFormat.
Not appearing in:

- https://github.com/aws/aws-sdk-java
- https://github.com/apache/hive

Then came across 
http://stackoverflow.com/questions/17077774/jar-containing-org-apache-hadoop-hive-dynamodb.
Unsure whether this represents the latest situation…

ian 


On 4 Jul 2014, at 16:58, Nick Pentreath <nick.pentre...@gmail.com> wrote:

> I should qualify by saying there is boto support for dynamodb - but not for 
> the inputFormat. You could roll your own python-based connection but this 
> involves figuring out how to split the data in dynamo - inputFormat takes 
> care of this so should be the easier approach 
> —
> Sent from Mailbox
> 
> 
> On Fri, Jul 4, 2014 at 8:51 AM, Ian Wilkinson <ia...@me.com> wrote:
> 
> Excellent. Let me get browsing on this.
> 
> 
> Huge thanks,
> ian
> 
> 
> On 4 Jul 2014, at 16:47, Nick Pentreath <nick.pentre...@gmail.com> wrote:
> 
>> No boto support for that. 
>> 
>> In master there is Python support for loading Hadoop inputFormat. Not sure 
>> if it will be in 1.0.1 or 1.1
>> 
>> I master docs under the programming guide are instructions and also under 
>> examples project there are pyspark examples of using Cassandra and HBase. 
>> These should hopefully give you enough to get started. 
>> 
>> Depending on how easy it is to use the dynamo DB format, you may have to 
>> write a custom converter (see the mentioned examples for storm details).
>> 
>> Sent from my iPhone
>> 
>> On 4 Jul 2014, at 08:38, Ian Wilkinson <ia...@me.com> wrote:
>> 
>>> Hi Nick,
>>> 
>>> I’m going to be working with python primarily. Are you aware of
>>> comparable boto support?
>>> 
>>> ian
>>> 
>>> On 4 Jul 2014, at 16:32, Nick Pentreath <nick.pentre...@gmail.com> wrote:
>>> 
>>>> You should be able to use DynamoDBInputFormat (I think this should be part 
>>>> of AWS libraries for Java) and create a HadoopRDD from that.
>>>> 
>>>> 
>>>> On Fri, Jul 4, 2014 at 8:28 AM, Ian Wilkinson <ia...@me.com> wrote:
>>>> Hi,
>>>> 
>>>> I noticed mention of DynamoDB as input source in
>>>> http://ampcamp.berkeley.edu/wp-content/uploads/2012/06/matei-zaharia-amp-camp-2012-advanced-spark.pdf.
>>>> 
>>>> Unfortunately, Google is not coming to my rescue on finding
>>>> further mention for this support.
>>>> 
>>>> Any pointers would be well received.
>>>> 
>>>> Big thanks,
>>>> ian
>>>> 
>>> 
> 
> 

Reply via email to