[jira] [Commented] (STORM-1839) Kinesis Spout

ASF GitHub Bot (JIRA) Tue, 26 Jul 2016 10:31:16 -0700

    [ 
https://issues.apache.org/jira/browse/STORM-1839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15394137#comment-15394137
 ]


ASF GitHub Bot commented on STORM-1839:
---------------------------------------

Github user harshach commented on a diff in the pull request:

    https://github.com/apache/storm/pull/1586#discussion_r72297573
  
    --- Diff: external/storm-kinesis/README.md ---
    @@ -0,0 +1,139 @@
    +#Storm Kinesis Spout
    +Provides core storm spout for consuming data from a stream in Amazon 
Kinesis Streams. It stores the sequence numbers that can be committed in 
zookeeper and 
    +starts consuming records after that sequence number on restart by default. 
Below is the code sample to create a sample topology that uses the spout. Each 
    +object used in configuring the spout is explained below. Ideally, the 
number of spout tasks should be equal to number of shards in kinesis. However 
each task 
    +can read from more than one shard.
    +
    +```java
    +public class KinesisSpoutTopology {
    +    public static void main (String args[]) throws 
InvalidTopologyException, AuthorizationException, AlreadyAliveException {
    +        String topologyName = args[0];
    +        RecordToTupleMapper recordToTupleMapper = new 
TestRecordToTupleMapper();
    +        KinesisConnectionInfo kinesisConnectionInfo = new 
KinesisConnectionInfo(new CredentialsProviderChain(), new 
ClientConfiguration(), Regions.US_WEST_2,
    +                1000);
    +        org.apache.storm.kinesis.spout.Config config = new 
org.apache.storm.kinesis.spout.Config(args[1], ShardIteratorType.TRIM_HORIZON,
    +                recordToTupleMapper, new Date(), new 
ExponentialBackoffRetrier(), new ZkInfo(), kinesisConnectionInfo, 10000L);
    +        KinesisSpout kinesisSpout = new KinesisSpout(config);
    +        TopologyBuilder topologyBuilder = new TopologyBuilder();
    +        topologyBuilder.setSpout("spout", kinesisSpout, 3);
    +        topologyBuilder.setBolt("bolt", new KinesisBoltTest(), 
1).shuffleGrouping("spout");
    +        Config topologyConfig = new Config();
    +        topologyConfig.setDebug(true);
    +        topologyConfig.setNumWorkers(3);
    +        StormSubmitter.submitTopology(topologyName, topologyConfig, 
topologyBuilder.createTopology());
    +    }
    +}
    +```
    +As you can see above the spout takes an object of Config in its 
constructor. The constructor of Config takes 8 objects as explained below.
    +
    +#### `String` streamName
    +name of kinesis stream to consume data from
    +
    +#### `ShardIteratorType` shardIteratorType
    +3 types are supported - TRIM_HORIZON(beginning of shard), LATEST and 
AT_TIMESTAMP. By default this argument is ignored if state for shards 
    +is found in zookeeper. Hence they will apply the first time a topology is 
started. If you want to use any of these in subsequent runs of the topology, 
you 
    +will need to clear the state of zookeeper node used for storing sequence 
numbers
    +#### `RecordToTupleMapper` recordToTupleMapper
    --- End diff --
    
    line break


> Kinesis Spout
> -------------
>
>                 Key: STORM-1839
>                 URL: https://issues.apache.org/jira/browse/STORM-1839
>             Project: Apache Storm
>          Issue Type: Improvement
>            Reporter: Sriharsha Chintalapani
>            Assignee: Priyank Shah
>
> As Storm is increasingly used in Cloud environments. It will great to have a 
> Kinesis Spout integration in Apache Storm.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (STORM-1839) Kinesis Spout

Reply via email to