Thanks for reporting. This seems to be due to a known Python bug: https://bugs.python.org/issue1777412
Seems like above bug has not been fixed for Python 2.x line. I created https://issues.apache.org/jira/browse/BEAM-4171 for tracking the Beam issue and possibly updating the Beam SDK to not use 'strftime'. Thanks, Cham On Tue, Apr 24, 2018 at 2:33 PM Yatish Gupta <[email protected]> wrote: > Hey, > > I am getting an exception when running beam on BigQuery rows that contain > Timestamps less than 1900. I get a similar error when using the > DirectRunner. > > File > "/usr/local/lib/python2.7/dist-packages/dataflow_worker/nativebigqueryavroio.py", > line 83, in _fix_field_value > return dt.strftime('%Y-%m-%d %H:%M:%S.%f UTC') > ValueError: year=1851 is before 1900; the datetime strftime() methods > require year >= 1900 > > How do I read these rows without changing my schema? > > Thanks! > >
