[
https://issues.apache.org/jira/browse/BEAM-9154?focusedWorklogId=490232&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-490232
]
ASF GitHub Bot logged work on BEAM-9154:
----------------------------------------
Author: ASF GitHub Bot
Created on: 24/Sep/20 14:20
Start Date: 24/Sep/20 14:20
Worklog Time Spent: 10m
Work Description: kamilwu commented on a change in pull request #12929:
URL: https://github.com/apache/beam/pull/12929#discussion_r494358966
##########
File path: .test-infra/jenkins/job_LoadTests_Combine_Flink_Python.groovy
##########
@@ -171,7 +171,7 @@ PhraseTriggeringPostCommitBuilder.postCommitJob(
loadTestJob(delegate, CommonTestProperties.TriggeringContext.PR,
'streaming')
}
-CronJobBuilder.cronJob('beam_LoadTests_Python_Combine_Flink_Streaming', 'H 16
* * *', this) {
+CronJobBuilder.cronJob('beam_LoadTests_Python_Combine_Flink_Streaming', 'H 17
* * *', this) {
Review comment:
Yes. This job is flaky, because it's being executed with its batch
version at the same time on the same Jenkins worker. Our Flink cluster requires
local port forwarding. When two jobs set up port forwarding at the same time,
one of them fails eventually. I hope this change will fix that.
It is not connected with the Chicago Taxi Example. I can create a separate
PR if you think that would be better.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 490232)
Time Spent: 50m (was: 40m)
> Move Chicago Taxi Example to Python 3
> -------------------------------------
>
> Key: BEAM-9154
> URL: https://issues.apache.org/jira/browse/BEAM-9154
> Project: Beam
> Issue Type: Improvement
> Components: testing
> Reporter: Kamil Wasilewski
> Assignee: Kamil Wasilewski
> Priority: P1
> Time Spent: 50m
> Remaining Estimate: 0h
>
> The Chicago Taxi Example[1] should be moved to the latest version of Python
> supported by Beam (currently it's Python 3.7).
> At the moment, the following error occurs when running the benchmark on
> Python 3.7 (requires futher investigation):
> {code:java}
> Traceback (most recent call last):
> File "preprocess.py", line 259, in <module>
> main()
> File "preprocess.py", line 254, in main
> project=known_args.metric_reporting_project
> File "preprocess.py", line 155, in transform_data
> ('Analyze' >> tft_beam.AnalyzeDataset(preprocessing_fn)))
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/transforms/ptransform.py",
> line 987, in __ror__
> return self.transform.__ror__(pvalueish, self.label)
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/transforms/ptransform.py",
> line 547, in __ror__
> result = p.apply(self, pvalueish, label)
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/pipeline.py", line
> 532, in apply
> return self.apply(transform, pvalueish)
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/pipeline.py", line
> 573, in apply
> pvalueish_result = self.runner.apply(transform, pvalueish, self._options)
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/runners/runner.py",
> line 193, in apply
> return m(transform, input, options)
> File
> "/Users/kamilwasilewski/proj/beam/sdks/python/apache_beam/runners/runner.py",
> line 223, in apply_PTransform
> return transform.expand(input)
> File
> "/Users/kamilwasilewski/proj/beam/build/gradleenv/2022703441/lib/python3.7/site-packages/tensorflow_transform/beam/impl.py",
> line 825, in expand
> input_metadata))
> File
> "/Users/kamilwasilewski/proj/beam/build/gradleenv/2022703441/lib/python3.7/site-packages/tensorflow_transform/beam/impl.py",
> line 716, in expand
> output_signature = self._preprocessing_fn(copied_inputs)
> File "preprocess.py", line 102, in preprocessing_fn
> _fill_in_missing(inputs[key]),
> KeyError: 'company'
> {code}
> [1] sdks/python/apache_beam/testing/benchmarks/chicago_taxi
--
This message was sent by Atlassian Jira
(v8.3.4#803005)