So I know going into this question that there will be varying opinions.
However I've noticed some things since starting with beam full time a few
weeks ago.

1. Python is second party SDK to Beam and doesn't seem to be to feature
parity with Java.
2. Even on supporting modules like fastavro Python still doesn't match up
with Java features.
3. Almost all tutorials and documentation around Beam and Big Data are done
in Java making it harder to learn the Python side of things.

So with these observations I'm curious. Is it just the age of the Python
SDK as the reason behind the lack of feature parity?

I'm also curious, are there any noticeable performance differences with
using Python SDK vs Java SDK in dataflow?

Thanks,
Shannon

Reply via email to