Jakob Homan created SAMZA-263:
---------------------------------

             Summary: Create SystemConsumer and SystemProducer for HDFS
                 Key: SAMZA-263
                 URL: https://issues.apache.org/jira/browse/SAMZA-263
             Project: Samza
          Issue Type: Improvement
            Reporter: Jakob Homan
            Assignee: Jakob Homan


It would be nice to be able to read/write from HDFS, particularly for 
bootstrapping purposes.  A few points:

* Per the discussion [about 
leveldb|https://issues.apache.org/jira/browse/SAMZA-236?focusedCommentId=13985982&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13985982]
 this support should be separated into its own package and project (jar) for 
easy testing and severability.
* Similar to the Kafka RegexTopicGenerator, we can enumerate (recursively or 
not) the files in an HDFS directory during job startup.
* Connectivity with HCatalog would be interesting as well, but should be 
handled in a separate JIRA.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to