[ 
https://issues.apache.org/jira/browse/BEAM-9550?focusedWorklogId=411076&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-411076
 ]

ASF GitHub Bot logged work on BEAM-9550:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 27/Mar/20 13:46
            Start Date: 27/Mar/20 13:46
    Worklog Time Spent: 10m 
      Work Description: kamilwu commented on issue #11193: [BEAM-9550] Increase 
JVM Metaspace size for the TaskExecutors.
URL: https://github.com/apache/beam/pull/11193#issuecomment-605009385
 
 
   @mxm Thanks, the option names are now correct and properly recognized by 
Flink.
   
   I had one more problem. GroupByKey (as well as coGroupByKey) tests got stuck 
at some point and their progress didn't change despite the fact that the job 
was running. It seems changing the execution mode to `BATCH_FORCED` solved the 
problem. It is fine to keep this mode enabled permanently? 
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 411076)
    Time Spent: 4h 50m  (was: 4h 40m)

> beam_PostCommit_Python_Chicago_Taxi_Flink OOM
> ---------------------------------------------
>
>                 Key: BEAM-9550
>                 URL: https://issues.apache.org/jira/browse/BEAM-9550
>             Project: Beam
>          Issue Type: Bug
>          Components: runner-flink, test-failures
>            Reporter: Kyle Weaver
>            Assignee: Kamil Wasilewski
>            Priority: Major
>              Labels: currently-failing
>          Time Spent: 4h 50m
>  Remaining Estimate: 0h
>
> https://builds.apache.org/job/beam_PostCommit_Python_Chicago_Taxi_Flink/
> The following error has been occurring consistently for several days:
> 07:57:26 ERROR:root:java.lang.OutOfMemoryError: Metaspace
> 07:57:27 Traceback (most recent call last):
> 07:57:27   File "tfdv_analyze_and_validate.py", line 227, in <module>
> 07:57:27     main()
> 07:57:27   File "tfdv_analyze_and_validate.py", line 212, in main
> 07:57:27     project=known_args.metric_reporting_project)
> 07:57:27   File "tfdv_analyze_and_validate.py", line 132, in compute_stats
> 07:57:27     result.wait_until_finish()
> 07:57:27   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python_Chicago_Taxi_Flink/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/apache_beam/runners/portability/portable_runner.py",
>  line 545, in wait_until_finish
> 07:57:27     (self._job_id, self._state, self._last_error_message()))
> 07:57:27 RuntimeError: Pipeline 
> chicago-taxi-tfdv-20200317-144954-eval_9742ac2b-26bf-4d1d-835e-572d4efacfcb 
> failed in state FAILED: java.lang.OutOfMemoryError: Metaspace



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to