Ismaël Mejía created BEAM-9554:
----------------------------------
Summary: Improve connection reuse on HBaseIO.ReadAll
Key: BEAM-9554
URL: https://issues.apache.org/jira/browse/BEAM-9554
Project: Beam
Issue Type: Improvement
Components: io-java-hbase
Reporter: Ismaël Mejía
Assignee: Ismaël Mejía
The recent refactor of HBase.ReadAll in BEAM-9279 creates new connections in
the @ProcessElement method (once per element), in the case that a pipeline is
used on streaming mode this could be costly so we should find a way to cache
and reuse connections to avoid both slow start of reads and saturating the
clusters.
Notice that this is an ongoing issue for DoFn based IOs that manifested first
on Writes for JdbcIO BEAM-7230 and was recently discussed too in the context of
the CassandraIO refactor:
https://github.com/apache/beam/pull/10546#issuecomment-580619044
--
This message was sent by Atlassian Jira
(v8.3.4#803005)