So I know going into this question that there will be varying opinions. However I've noticed some things since starting with beam full time a few weeks ago.
1. Python is second party SDK to Beam and doesn't seem to be to feature parity with Java. 2. Even on supporting modules like fastavro Python still doesn't match up with Java features. 3. Almost all tutorials and documentation around Beam and Big Data are done in Java making it harder to learn the Python side of things. So with these observations I'm curious. Is it just the age of the Python SDK as the reason behind the lack of feature parity? I'm also curious, are there any noticeable performance differences with using Python SDK vs Java SDK in dataflow? Thanks, Shannon
