See <https://builds.apache.org/job/beam_LoadTests_Java_GBK_Dataflow_Batch/396/display/redirect?page=changes>
Changes: [relax] switch cogbk to use Beam transform [relax] finish join [relax] support side-input joins [relax] support side-input joins [relax] spotless [relax] make FieldAccessDescriptor always be field-insertion order [relax] fix side-input joins [relax] fix bug [relax] remove obsolete test [relax] add javadoc [relax] add unit tests [relax] update sql transform ------------------------------------------ [...truncated 73.87 KB...] > Task :model:fn-execution:shadowJar UP-TO-DATE > Task :sdks:java:core:compileJava UP-TO-DATE > Task :sdks:java:core:classes UP-TO-DATE > Task :sdks:java:core:shadowJar UP-TO-DATE > Task :sdks:java:extensions:protobuf:extractIncludeProto UP-TO-DATE > Task :sdks:java:io:synthetic:compileJava UP-TO-DATE > Task :sdks:java:io:synthetic:classes UP-TO-DATE > Task :sdks:java:io:synthetic:jar UP-TO-DATE > Task :sdks:java:extensions:protobuf:generateProto NO-SOURCE > Task :sdks:java:io:kinesis:compileJava UP-TO-DATE > Task :sdks:java:io:kinesis:classes UP-TO-DATE > Task :vendor:sdks-java-extensions-protobuf:compileJava UP-TO-DATE > Task :vendor:sdks-java-extensions-protobuf:classes UP-TO-DATE > Task :runners:local-java:compileJava UP-TO-DATE > Task :runners:local-java:classes UP-TO-DATE > Task :vendor:sdks-java-extensions-protobuf:shadowJar UP-TO-DATE > Task :runners:local-java:jar UP-TO-DATE > Task :sdks:java:fn-execution:compileJava UP-TO-DATE > Task :sdks:java:fn-execution:classes UP-TO-DATE > Task :sdks:java:fn-execution:jar UP-TO-DATE > Task :sdks:java:extensions:protobuf:compileJava UP-TO-DATE > Task :sdks:java:extensions:protobuf:classes UP-TO-DATE > Task :sdks:java:extensions:protobuf:jar UP-TO-DATE > Task :sdks:java:io:kinesis:jar UP-TO-DATE > Task :sdks:java:extensions:google-cloud-platform-core:compileJava UP-TO-DATE > Task :sdks:java:extensions:google-cloud-platform-core:classes UP-TO-DATE > Task :sdks:java:extensions:google-cloud-platform-core:jar UP-TO-DATE > Task :runners:core-construction-java:compileJava UP-TO-DATE > Task :runners:core-construction-java:classes UP-TO-DATE > Task :runners:core-construction-java:jar UP-TO-DATE > Task :sdks:java:expansion-service:compileJava UP-TO-DATE > Task :sdks:java:expansion-service:classes UP-TO-DATE > Task :sdks:java:testing:test-utils:compileJava UP-TO-DATE > Task :sdks:java:testing:test-utils:classes UP-TO-DATE > Task :sdks:java:expansion-service:jar UP-TO-DATE > Task :sdks:java:testing:test-utils:jar UP-TO-DATE > Task :sdks:java:io:kafka:compileJava UP-TO-DATE > Task :sdks:java:io:kafka:classes UP-TO-DATE > Task :sdks:java:io:kafka:jar UP-TO-DATE > Task :runners:core-java:compileJava UP-TO-DATE > Task :runners:core-java:classes UP-TO-DATE > Task :runners:core-java:jar UP-TO-DATE > Task :sdks:java:harness:compileJava UP-TO-DATE > Task :sdks:java:harness:classes UP-TO-DATE > Task :sdks:java:harness:jar UP-TO-DATE > Task :sdks:java:harness:shadowJar UP-TO-DATE > Task :sdks:java:io:google-cloud-platform:compileJava UP-TO-DATE > Task :sdks:java:io:google-cloud-platform:classes UP-TO-DATE > Task :runners:java-fn-execution:compileJava UP-TO-DATE > Task :runners:java-fn-execution:classes UP-TO-DATE > Task :sdks:java:io:google-cloud-platform:jar UP-TO-DATE > Task :runners:java-fn-execution:jar UP-TO-DATE > Task :runners:direct-java:compileJava UP-TO-DATE > Task :runners:direct-java:classes UP-TO-DATE > Task :runners:direct-java:shadowJar UP-TO-DATE > Task :runners:google-cloud-dataflow-java:compileJava UP-TO-DATE > Task :runners:google-cloud-dataflow-java:classes UP-TO-DATE > Task :runners:google-cloud-dataflow-java:jar UP-TO-DATE > Task :sdks:java:testing:load-tests:compileJava UP-TO-DATE > Task :sdks:java:testing:load-tests:classes UP-TO-DATE > Task :sdks:java:testing:load-tests:jar UP-TO-DATE > Task :runners:google-cloud-dataflow-java:worker:legacy-worker:compileJava > UP-TO-DATE > Task :runners:google-cloud-dataflow-java:worker:legacy-worker:classes > UP-TO-DATE > Task :runners:google-cloud-dataflow-java:worker:legacy-worker:shadowJar > UP-TO-DATE > Task :sdks:java:testing:load-tests:run Mar 23, 2020 2:23:50 PM org.apache.beam.runners.dataflow.options.DataflowPipelineOptions$StagingLocationFactory create INFO: No stagingLocation provided, falling back to gcpTempLocation Mar 23, 2020 2:23:50 PM org.apache.beam.runners.dataflow.DataflowRunner fromOptions INFO: PipelineOptions.filesToStage was not specified. Defaulting to files from the classpath: will stage 172 files. Enable logging at DEBUG level to see which files will be staged. Mar 23, 2020 2:23:51 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: Executing pipeline on the Dataflow Service, which will have billing implications related to Google Compute Engine usage and other Google Cloud Services. Mar 23, 2020 2:23:51 PM org.apache.beam.runners.dataflow.util.PackageUtil stageClasspathElements INFO: Uploading 173 files from PipelineOptions.filesToStage to staging location to prepare for execution. Mar 23, 2020 2:23:52 PM org.apache.beam.runners.dataflow.util.PackageUtil stageClasspathElements INFO: Staging files complete: 173 files cached, 0 files newly uploaded in 0 seconds Mar 23, 2020 2:23:52 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Read input as step s1 Mar 23, 2020 2:23:52 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Collect start time metrics as step s2 Mar 23, 2020 2:23:52 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Total bytes monitor as step s3 Mar 23, 2020 2:23:52 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Window.Into()/Window.Assign as step s4 Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Group by key (0) as step s5 Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Ungroup and reiterate (0) as step s6 Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.DataflowPipelineTranslator$Translator addStep INFO: Adding Collect end time metrics (0) as step s7 Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: Staging pipeline description to gs://temp-storage-for-perf-tests/loadtests/staging/ Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: Dataflow SDK version: 2.21.0-SNAPSHOT Mar 23, 2020 2:23:53 PM org.apache.beam.runners.dataflow.options.DataflowPipelineOptions$DefaultGcpRegionFactory create WARNING: Region will default to us-central1. Future releases of Beam will require the user to set the region explicitly. https://cloud.google.com/compute/docs/regions-zones/regions-zones Mar 23, 2020 2:23:54 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: To access the Dataflow monitoring console, please navigate to https://console.cloud.google.com/dataflow/jobs/us-central1/2020-03-23_07_23_53-10944696645350614513?project=apache-beam-testing Mar 23, 2020 2:23:54 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: Submitted job: 2020-03-23_07_23_53-10944696645350614513 Mar 23, 2020 2:23:54 PM org.apache.beam.runners.dataflow.DataflowRunner run INFO: To cancel the job using the 'gcloud' tool, run: > gcloud dataflow jobs --project=apache-beam-testing cancel > --region=us-central1 2020-03-23_07_23_53-10944696645350614513 Mar 23, 2020 2:23:57 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process WARNING: 2020-03-23T14:23:56.853Z: The workflow name is not a valid Cloud Label. Labels applied to Cloud resources (such as GCE Instances) for monitoring will be labeled with this modified job name: load0tests0java0dataflow0batch0gbk02-jenkins-0323142350-f3-4jrn. For the best monitoring experience, please name your job with a valid Cloud Label. For details, see: https://cloud.google.com/compute/docs/labeling-resources#restrictions Mar 23, 2020 2:23:57 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:23:56.943Z: Checking permissions granted to controller Service Account. Mar 23, 2020 2:24:02 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.021Z: Worker configuration: n1-standard-1 in us-central1-f. Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.662Z: Expanding CoGroupByKey operations into optimizable parts. Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.746Z: Expanding GroupByKey operations into optimizable parts. Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.790Z: Lifting ValueCombiningMappingFns into MergeBucketsMappingFns Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.902Z: Fusing adjacent ParDo, Read, Write, and Flatten operations Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.932Z: Fusing consumer Collect start time metrics into Read input Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.959Z: Fusing consumer Total bytes monitor into Collect start time metrics Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:02.997Z: Fusing consumer Window.Into()/Window.Assign into Total bytes monitor Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.029Z: Fusing consumer Group by key (0)/Reify into Window.Into()/Window.Assign Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.068Z: Fusing consumer Group by key (0)/Write into Group by key (0)/Reify Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.101Z: Fusing consumer Group by key (0)/GroupByWindow into Group by key (0)/Read Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.138Z: Fusing consumer Ungroup and reiterate (0) into Group by key (0)/GroupByWindow Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.170Z: Fusing consumer Collect end time metrics (0) into Ungroup and reiterate (0) Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.608Z: Executing operation Group by key (0)/Create Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.683Z: Starting 5 workers in us-central1-f... Mar 23, 2020 2:24:03 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.717Z: Finished operation Group by key (0)/Create Mar 23, 2020 2:24:05 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:03.849Z: Executing operation Read input+Collect start time metrics+Total bytes monitor+Window.Into()/Window.Assign+Group by key (0)/Reify+Group by key (0)/Write Mar 23, 2020 2:24:08 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process WARNING: 2020-03-23T14:24:07.185Z: Your project already contains 100 Dataflow-created metric descriptors and Stackdriver will not create new Dataflow custom metrics for this job. Each unique user-defined metric name (independent of the DoFn in which it is defined) produces a new metric descriptor. To delete old / unused metric descriptors see https://developers.google.com/apis-explorer/#p/monitoring/v3/monitoring.projects.metricDescriptors.list and https://developers.google.com/apis-explorer/#p/monitoring/v3/monitoring.projects.metricDescriptors.delete Mar 23, 2020 2:24:30 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:30.276Z: Autoscaling: Raised the number of workers to 3 based on the rate of progress in the currently running step(s). Mar 23, 2020 2:24:30 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:30.315Z: Resized worker pool to 3, though goal was 5. This could be a quota issue. Mar 23, 2020 2:24:38 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:35.785Z: Autoscaling: Raised the number of workers to 5 based on the rate of progress in the currently running step(s). Mar 23, 2020 2:24:49 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:48.534Z: Workers have started successfully. Mar 23, 2020 2:24:49 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:24:48.565Z: Workers have started successfully. Mar 23, 2020 2:25:38 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:25:37.825Z: Finished operation Read input+Collect start time metrics+Total bytes monitor+Window.Into()/Window.Assign+Group by key (0)/Reify+Group by key (0)/Write Mar 23, 2020 2:25:38 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:25:37.897Z: Executing operation Group by key (0)/Close Mar 23, 2020 2:25:38 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:25:37.950Z: Finished operation Group by key (0)/Close Mar 23, 2020 2:25:38 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:25:38.036Z: Executing operation Group by key (0)/Read+Group by key (0)/GroupByWindow+Ungroup and reiterate (0)+Collect end time metrics (0) Mar 23, 2020 2:26:08 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:26:07.997Z: Finished operation Group by key (0)/Read+Group by key (0)/GroupByWindow+Ungroup and reiterate (0)+Collect end time metrics (0) Mar 23, 2020 2:26:10 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:26:08.187Z: Cleaning up. Mar 23, 2020 2:26:10 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:26:08.262Z: Stopping worker pool... Mar 23, 2020 2:27:52 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:27:52.175Z: Autoscaling: Resized worker pool from 5 to 0. Mar 23, 2020 2:27:52 PM org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process INFO: 2020-03-23T14:27:52.219Z: Worker pool stopped. Mar 23, 2020 2:27:58 PM org.apache.beam.runners.dataflow.DataflowPipelineJob logTerminalState INFO: Job 2020-03-23_07_23_53-10944696645350614513 finished with status DONE. Load test results for test (ID): 56e76d4d-2ae1-449e-b003-96f490aa67ba and timestamp: 2020-03-23T14:23:51.007000000Z: Metric: Value: runtime_sec 66.961 total_bytes_count 2.0E9 Deprecated Gradle features were used in this build, making it incompatible with Gradle 6.0. Use '--warning-mode all' to show the individual deprecation warnings. See https://docs.gradle.org/5.2.1/userguide/command_line_interface.html#sec:command_line_warnings BUILD SUCCESSFUL in 4m 14s 72 actionable tasks: 1 executed, 71 up-to-date Publishing build scan... https://gradle.com/s/c6veljmv3sv6i [beam_LoadTests_Java_GBK_Dataflow_Batch] $ /bin/bash -xe /tmp/jenkins3271378743378694178.sh + echo src Load test: 2GB of 100kB records src src Load test: 2GB of 100kB records src [Gradle] - Launching build. [src] $ <https://builds.apache.org/job/beam_LoadTests_Java_GBK_Dataflow_Batch/ws/src/gradlew> -PloadTest.mainClass=org.apache.beam.sdk.loadtests.GroupByKeyLoadTest -Prunner=:runners:google-cloud-dataflow-java '-PloadTest.args=--project=apache-beam-testing --appName=load_tests_Java_Dataflow_batch_GBK_3 --tempLocation=gs://temp-storage-for-perf-tests/loadtests --publishToBigQuery=true --bigQueryDataset=load_test --bigQueryTable=java_dataflow_batch_GBK_3 --sourceOptions={"numRecords":20000,"keySizeBytes":10000,"valueSizeBytes":90000} --fanout=1 --iterations=1 --numWorkers=5 --autoscalingAlgorithm=NONE --streaming=false --runner=DataflowRunner' --continue --max-workers=12 -Dorg.gradle.jvmargs=-Xms2g -Dorg.gradle.jvmargs=-Xmx4g :sdks:java:testing:load-tests:run > Task :buildSrc:compileJava NO-SOURCE > Task :buildSrc:compileGroovy UP-TO-DATE > Task :buildSrc:pluginDescriptors UP-TO-DATE > Task :buildSrc:processResources UP-TO-DATE > Task :buildSrc:classes UP-TO-DATE > Task :buildSrc:jar UP-TO-DATE > Task :buildSrc:assemble UP-TO-DATE > Task :buildSrc:spotlessGroovy UP-TO-DATE > Task :buildSrc:spotlessGroovyCheck UP-TO-DATE > Task :buildSrc:spotlessGroovyGradle UP-TO-DATE > Task :buildSrc:spotlessGroovyGradleCheck UP-TO-DATE > Task :buildSrc:spotlessCheck UP-TO-DATE > Task :buildSrc:pluginUnderTestMetadata UP-TO-DATE > Task :buildSrc:compileTestJava NO-SOURCE > Task :buildSrc:compileTestGroovy NO-SOURCE > Task :buildSrc:processTestResources NO-SOURCE > Task :buildSrc:testClasses UP-TO-DATE > Task :buildSrc:test NO-SOURCE > Task :buildSrc:validateTaskProperties UP-TO-DATE > Task :buildSrc:check UP-TO-DATE > Task :buildSrc:build UP-TO-DATE Configuration on demand is an incubating feature. > Configure project :sdks:java:container Found go 1.12 in /usr/bin/go, use it. FAILURE: Build failed with an exception. * What went wrong: Could not determine the dependencies of task ':model:job-management:shadowJar'. > Could not resolve all dependencies for configuration > ':model:job-management:runtimeClasspath'. > Could not resolve io.grpc:grpc-api:[1.26.0]. Required by: project :model:job-management > io.grpc:grpc-auth:1.26.0 project :model:job-management > io.grpc:grpc-core:1.26.0 > Failed to list versions for io.grpc:grpc-api. > Unable to load Maven meta-data from https://oss.sonatype.org/content/repositories/staging/io/grpc/grpc-api/maven-metadata.xml. > Could not HEAD 'https://oss.sonatype.org/content/repositories/staging/io/grpc/grpc-api/maven-metadata.xml'. > Read timed out * Try: Run with --stacktrace option to get the stack trace. Run with --info or --debug option to get more log output. Run with --scan to get full insights. * Get more help at https://help.gradle.org Deprecated Gradle features were used in this build, making it incompatible with Gradle 6.0. Use '--warning-mode all' to show the individual deprecation warnings. See https://docs.gradle.org/5.2.1/userguide/command_line_interface.html#sec:command_line_warnings BUILD FAILED in 2m 5s Publishing build scan... https://gradle.com/s/px5c5vx2nn7we Build step 'Invoke Gradle script' changed build result to FAILURE Build step 'Invoke Gradle script' marked build as failure --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
