Hi Madhu, Welcome! I suggest subscribing to the dev@ mailing list and using the same email address when sending to the list, to avoid your email being caught in moderation.
It would be great to have a connector for Apache Hive. Keep in mind that several folks have expressed interest in using and contributing this connector. As far as I know, nobody is *actively* working on it, so you should be good to go. Please use BEAM-1158 [1] to coordinate this work with any other interested contributor. Note that there are several different ways of connecting Beam and Hive. The simplest one is to write HiveIO that which would run a Hive query and process Hive's results in Beam. Another would be to use Beam within Hive to compute the results of a Hive query. Finally, one could possibly write a Hive-based DSL on top of a Beam SDK. All of these approaches are valid and somewhat orthogonal one to another. I'm assuming you are after the first one. If so, and if you plan to follow already established patterns in other connectors, you don't necessarily need a design document. Otherwise, please start with a design document. We have linked a template in the Contribution Guide [2, 3]. Once again, welcome and let us know if we can help in any way! Davor [1] https://issues.apache.org/jira/browse/BEAM-1158 [2] https://beam.apache.org/contribute/contribution-guide/ [3] https://docs.google.com/document/d/1qYQPGtabN5-E4MjHsecqqC7PXvJtXvZukPfLXQ8rHJs On Mon, Feb 6, 2017 at 4:27 PM, Madhusudan Borkar <[email protected]> wrote: > Hello, > > I am Big Data Architect working at eTouch Systems. We are GCP partners. We > are planning to contribute to Beam by developing a connector for Apache > Hive as a data source. > I understand that before any development work begins, we need to submit our > design to Beam community. I would like to request you to please share a > "design template" document for the same. We will submit our design > document, using the template. > > > Thank you. > > best regards > Madhu Borkar >
