[ 
https://issues.apache.org/jira/browse/BEAM-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17440724#comment-17440724
 ] 

Kenneth Knowles commented on BEAM-13141:
----------------------------------------

Making it possible to build a job without access to the HBase cluster is a 
good idea. Would you be interested in making this change [~prathapreddy22]?

> Support to submit Jobs using HBaseIO to DataflowRunner without local access 
> to HBase Cluster
> --------------------------------------------------------------------------------------------
>
>                 Key: BEAM-13141
>                 URL: https://issues.apache.org/jira/browse/BEAM-13141
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-java-core
>            Reporter: Prathap Kumar Parvathareddy
>            Priority: P3
>
> +*Context*+
> As of today, HBaseIO interacts with the HBase cluster while building the 
> execution graph, to validate that the table exists, calculate splits, etc.: 
> https://github.com/apache/beam/blob/master/sdks/java/io/hbase/src/main/java/org/apache/beam/sdk/io/hbase/HBaseIO.java#L237
> In certain scenarios, Dataflow jobs are launched from systems that do not 
> have network access to the HBase cluster during the graph construction stage; 
> the cluster is reachable only at execution time on Google Cloud. However, 
> because the current HBaseIO implementation requires local access, the job can 
> be launched only from systems that have network access to the HBase cluster.
> *+Requirement+*
>  Modify HBaseIO to accept a flag (say hasLocalAccess) and, if the flag is set 
> to false, defer validation, split calculation, etc. to job execution time 
> rather than job construction time.
>  
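To make the requested behaviour concrete, here is a minimal sketch of the deferral idea in plain Java. It does not use the real HBaseIO or Beam APIs: the class TableRead, the methods validateAtConstruction and setUpAtExecution, and the tableExists predicate are all hypothetical stand-ins, and only the flag name hasLocalAccess comes from the issue text. It assumes the cluster check can be moved unchanged from graph construction to worker start-up.

```java
import java.util.function.Predicate;

public class DeferredValidationSketch {

    /** Hypothetical stand-in for an HBase read transform; NOT the real HBaseIO API. */
    static final class TableRead {
        private final String tableId;
        private final boolean hasLocalAccess;        // flag name suggested in the issue
        private final Predicate<String> tableExists; // stand-in for a call to the cluster

        TableRead(String tableId, boolean hasLocalAccess, Predicate<String> tableExists) {
            this.tableId = tableId;
            this.hasLocalAccess = hasLocalAccess;
            this.tableExists = tableExists;
        }

        /** Runs on the launching machine while the job graph is being built. */
        void validateAtConstruction() {
            if (hasLocalAccess) {
                checkTableExists();
            }
            // Without local access, nothing here touches the cluster.
        }

        /** Runs on a worker at execution time, where the cluster is reachable. */
        void setUpAtExecution() {
            if (!hasLocalAccess) {
                checkTableExists(); // the check that was skipped at construction time
            }
        }

        private void checkTableExists() {
            if (!tableExists.test(tableId)) {
                throw new IllegalStateException("Table " + tableId + " does not exist");
            }
        }
    }

    public static void main(String[] args) {
        // Simulate a launcher that cannot reach the cluster at all.
        Predicate<String> unreachable = id -> {
            throw new IllegalStateException("cluster not reachable from launcher");
        };
        TableRead read = new TableRead("my-table", false, unreachable);
        read.validateAtConstruction(); // does not throw: the lookup is deferred
        System.out.println("graph construction finished without contacting the cluster");
    }
}
```

With hasLocalAccess set to true the sketch behaves like the current implementation described above and fails at construction time if the cluster cannot be reached. Split calculation could be deferred in the same way, but that is not shown here.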



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
