On Mon, Jul 25, 2022 at 12:53 PM Lina Mårtensson via dev <
dev@beam.apache.org> wrote:

> Hi dev,
>
> We're starting to incorporate BigTable in our stack and I've delighted
> my co-workers with how easy it was to create some BigTables with
> Beam... but there doesn't appear to be a reader for BigTable in
> Python.
>
> First off, is there a good reason why not/any reason why it would be
> difficult?
>

There was a previous effort to implement a Python BT source, but it was
not completed:
https://github.com/apache/beam/pull/11295#issuecomment-646378304


>
> I could write one, but before I start, I'd love some input to make it
> easier.
>
> It appears that there would be two options: either write one in
> Python, or try to set one up with x-language from Java which I see is
> done e.g. with the Spanner IO Connector.
> Any recommendation on which one to pick or potential pitfalls in either
> choice?
>
> If I write one in Python, what should I think about?
> It is not obvious to me how to achieve parallelization, so any tips
> here would be welcome.
>

I would strongly prefer developing a Python wrapper for the existing Java
BT source using Beam's Multi-language Pipelines framework over developing a
new Python source.
https://beam.apache.org/documentation/programming-guide/#multi-language-pipelines
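
As a rough sketch of what such a wrapper could look like: the URN, the
config field names, and the helper name read_from_bigtable below are all
hypothetical (not an existing Beam API), and the Java side would need to
register a matching external transform. The Gradle target for the expansion
service is also an assumption.

```python
from dataclasses import asdict, dataclass

# Hypothetical URN: the Java side would have to register a Bigtable read
# transform under this (or some other) identifier.
HYPOTHETICAL_URN = "beam:external:java:bigtable:read:v1"


@dataclass(frozen=True)
class BigtableReadConfig:
    """Hypothetical configuration; the field names are assumptions."""
    project_id: str
    instance_id: str
    table_id: str


def read_from_bigtable(config, expansion_service=None):
    """Return a PTransform that delegates the read to the Java SDK."""
    # Imported lazily so the config class is usable without Beam installed.
    from apache_beam.transforms.external import (
        BeamJarExpansionService,
        ExternalTransform,
        ImplicitSchemaPayloadBuilder,
    )

    if expansion_service is None:
        # Assumed Gradle target for the GCP IO expansion service jar.
        expansion_service = BeamJarExpansionService(
            "sdks:java:io:google-cloud-platform:expansion-service:shadowJar")

    # The config fields are encoded as a schema'd row and sent to the
    # expansion service, which builds the Java transform from them.
    return ExternalTransform(
        HYPOTHETICAL_URN,
        ImplicitSchemaPayloadBuilder(asdict(config)),
        expansion_service,
    )
```

Usage would then be roughly
p | read_from_bigtable(BigtableReadConfig("my-project", "my-instance", "my-table")),
with splitting and parallelization left to the existing Java source rather
than reimplemented in Python.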

Thanks,
Cham



>
> Thanks!
> -Lina
>
