See
<https://ci-beam.apache.org/job/beam_PerformanceTests_ParquetIOIT/6346/display/redirect?page=changes>
Changes:
[Robert Bradshaw] Batch encoding and decoding of schema data.
[Robert Bradshaw] Add microbenchmark for batch row encoding.
[Robert Bradshaw] Add batch testing for standard row coders.
[noreply] Relax `pip` check in setup.py to allow installation via other package
[noreply] replaced tabs with spaces in readme file (#23446)
[noreply] [Playground] [Backend] Adding the tags field to the example response
[noreply] [Playground] [Backend] Edited the function for getting executable name
[noreply] Fix type inference for set/delete attr. (#23242)
[noreply] Support VR test including TestStream for Spark runner in streaming
mode
------------------------------------------
[...truncated 396.41 KB...]
INFO: 2022-09-30T22:25:15.298Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Combine.perKey(Singleton)/Combine.GroupedValues
into
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Combine.perKey(Singleton)/GroupByKey/Read
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.339Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Combine.perKey(Singleton)/Combine.GroupedValues/Extract
into
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Combine.perKey(Singleton)/Combine.GroupedValues
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.377Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Values/Values/Map
into
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Combine.perKey(Singleton)/Combine.GroupedValues/Extract
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.410Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/BatchViewOverrides.GroupByWindowHashAsKeyAndWindowAsSortKey/ParDo(UseWindowHashAsKeyAndWindowAsSortKey)
into
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/Values/Values/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.442Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/BatchViewOverrides.GroupByWindowHashAsKeyAndWindowAsSortKey/BatchViewOverrides.GroupByKeyAndSortValuesOnly/Write
into
View.AsSingleton/Combine.GloballyAsSingletonView/BatchViewOverrides.GroupByWindowHashAsKeyAndWindowAsSortKey/ParDo(UseWindowHashAsKeyAndWindowAsSortKey)
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.472Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/ParDo(IsmRecordForSingularValuePerWindow)
into
View.AsSingleton/Combine.GloballyAsSingletonView/BatchViewOverrides.GroupByWindowHashAsKeyAndWindowAsSortKey/BatchViewOverrides.GroupByKeyAndSortValuesOnly/Read
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.505Z: Unzipping flatten s60-u108 for input
s62.org.apache.beam.sdk.values.PCollection.<init>:405#77397181cd44e5f-c106
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.539Z: Fusing unzipped copy of
PAssert$0/GroupGlobally/GroupByKey/Reify, through flatten
PAssert$0/GroupGlobally/Flatten.PCollections/Unzipped-1, into producer
PAssert$0/GroupGlobally/WithKeys/AddKeys/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.571Z: Unzipping flatten s56 for input
s51.org.apache.beam.sdk.values.PCollection.<init>:405#fe542dba50f8fd8a
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.607Z: Fusing unzipped copy of
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/WithKeys/AddKeys/Map,
through flatten Calculate hashcode/Flatten.PCollections, into producer
Calculate hashcode/Values/Values/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.641Z: Unzipping flatten s56-c110 for input
s51.org.apache.beam.sdk.values.PCollection.<init>:405#fe542dba50f8fd8a
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.674Z: Fusing unzipped copy of
PAssert$0/GroupGlobally/Reify.Window/ParDo(Anonymous), through flatten
Calculate hashcode/Flatten.PCollections, into producer Calculate
hashcode/Values/Values/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.708Z: Unzipping flatten s9 for input
s5.writtenRecords
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.741Z: Fusing unzipped copy of Write Parquet
files/WriteFiles/GatherTempFileResults/Add void key/AddKeys/Map, through
flatten Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/Flatten.PCollections, into
producer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnshardedBundles
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.779Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Reify into Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/Window.Into()/Window.Assign
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.819Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Write into Write
Parquet files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Reify
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.850Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/GroupByWindow into
Write Parquet files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Read
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.890Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/ExpandIterable into Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/GroupByWindow
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.924Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Drop key/Values/Map into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/ExpandIterable
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.957Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Gather bundles into Write Parquet
files/WriteFiles/GatherTempFileResults/Drop key/Values/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:15.996Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Pair with random
key into Write Parquet files/WriteFiles/GatherTempFileResults/Gather bundles
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.029Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Pair with random
key
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.063Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.101Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Write
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.133Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Read
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.167Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.198Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Values/Values/Map
into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.233Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Finalize into Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle.ViaRandomKey/Values/Values/Map
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.267Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Pair with
random key into Write Parquet files/WriteFiles/FinalizeTempFileBundles/Finalize
Sep 30, 2022 10:25:16 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.302Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Pair with
random key
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.334Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.371Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Write
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.404Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Read
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.430Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.465Z: Fusing consumer Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Values/Values/Map
into Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.502Z: Fusing consumer Gather write end times into
Write Parquet
files/WriteFiles/FinalizeTempFileBundles/Reshuffle.ViaRandomKey/Values/Values/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.537Z: Fusing consumer Get file names/Values/Map
into Gather write end times
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.568Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Pair with random key into Find files/Match
filepatterns
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.603Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign into Find
files/Reshuffle.ViaRandomKey/Pair with random key
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.638Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify into Find
files/Reshuffle.ViaRandomKey/Reshuffle/Window.Into()/Window.Assign
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.663Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Write into Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Reify
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.697Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow into Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/Read
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.731Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable into Find
files/Reshuffle.ViaRandomKey/Reshuffle/GroupByKey/GroupByWindow
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.771Z: Fusing consumer Find
files/Reshuffle.ViaRandomKey/Values/Values/Map into Find
files/Reshuffle.ViaRandomKey/Reshuffle/ExpandIterable
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.806Z: Fusing consumer Read matched
files/ParDo(ToReadableFile) into Find
files/Reshuffle.ViaRandomKey/Values/Values/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.838Z: Fusing consumer Gather read start time into
Read matched files/ParDo(ToReadableFile)
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.860Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Pair with initial restriction into
Gather read start time
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.894Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Split restriction into Read
parquet files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Pair with initial
restriction
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.934Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Explode windows into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Split restriction
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:16.978Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Assign unique key/AddKeys/Map into
Read parquet files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Explode windows
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.017Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/Window.Into()/Window.Assign
into Read parquet files/ParDo(SplitRead)/ParMultiDo(SplitRead)/Assign unique
key/AddKeys/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.055Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/Reify
into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/Window.Into()/Window.Assign
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.092Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/Write
into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/Reify
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.139Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/GroupByWindow
into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/Read
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.182Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/ExpandIterable
into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/GroupByKey/GroupByWindow
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.219Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Drop
key/Values/Map into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Reshuffle/ExpandIterable
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.260Z: Fusing consumer Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/NaiveProcess
into Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/Drop
key/Values/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.308Z: Fusing consumer Gather read end time into
Read parquet
files/ParDo(SplitRead)/ParMultiDo(SplitRead)/ProcessKeyedElements/NaiveProcess
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.346Z: Fusing consumer Map records to strings/Map
into Gather read end time
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.402Z: Fusing consumer Calculate
hashcode/WithKeys/AddKeys/Map into Map records to strings/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.463Z: Fusing consumer Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey+Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues/Partial into Calculate
hashcode/WithKeys/AddKeys/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.529Z: Fusing consumer Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey/Reify into Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey+Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues/Partial
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.586Z: Fusing consumer Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey/Write into Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey/Reify
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.625Z: Fusing consumer Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues into Calculate
hashcode/Combine.perKey(Hashing)/GroupByKey/Read
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.666Z: Fusing consumer Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues/Extract into Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.703Z: Fusing consumer Calculate
hashcode/Values/Values/Map into Calculate
hashcode/Combine.perKey(Hashing)/Combine.GroupedValues/Extract
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.737Z: Unzipping flatten s9-u138 for input
s10.org.apache.beam.sdk.values.PCollection.<init>:405#2587af97b4865538-c136
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.775Z: Fusing unzipped copy of Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/Window.Into()/Window.Assign,
through flatten Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/Flatten.PCollections/Unzipped-1,
into producer Write Parquet files/WriteFiles/GatherTempFileResults/Add void
key/AddKeys/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.804Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Add void key/AddKeys/Map into Write
Parquet files/WriteFiles/WriteUnshardedBundlesToTempFiles/DropShardNum
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.843Z: Fusing consumer Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/Window.Into()/Window.Assign
into Write Parquet files/WriteFiles/GatherTempFileResults/Add void
key/AddKeys/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.871Z: Fusing consumer Produce text lines into
Generate sequence/Read(BoundedCountingSource)
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.904Z: Fusing consumer Produce Avro records into
Produce text lines
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.940Z: Fusing consumer Gather write start times
into Produce Avro records
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:17.971Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnshardedBundles into
Gather write start times
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.029Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Reify into
Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnshardedBundles
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.073Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Write into
Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Reify
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.112Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/GroupByWindow
into Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Read
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.145Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnwritten into Write
Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/GroupByWindow
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.177Z: Fusing consumer Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/DropShardNum into Write
Parquet files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnwritten
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.209Z: Fusing consumer
PAssert$0/GroupGlobally/Reify.Window/ParDo(Anonymous) into Calculate
hashcode/ProduceDefault
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.242Z: Fusing consumer
View.AsSingleton/Combine.GloballyAsSingletonView/Combine.globally(Singleton)/WithKeys/AddKeys/Map
into Calculate hashcode/ProduceDefault
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.276Z: Fusing consumer Calculate
hashcode/ProduceDefault into Calculate hashcode/CreateVoid/Read(CreateSource)
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.304Z: Fusing consumer
PAssert$0/GroupGlobally/GroupByKey/Reify into
PAssert$0/GroupGlobally/WithKeys/AddKeys/Map
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.337Z: Fusing consumer
PAssert$0/GroupGlobally/WithKeys/AddKeys/Map into
PAssert$0/GroupGlobally/Create.Values/Read(CreateSource)
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.814Z: Executing operation Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Create
Sep 30, 2022 10:25:19 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:18.905Z: Starting 5 ****s in us-central1-a...
Sep 30, 2022 10:25:22 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:19.395Z: Finished operation Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Create
Sep 30, 2022 10:25:22 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:19.558Z: Executing operation Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Create
Sep 30, 2022 10:25:22 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:19.858Z: Finished operation Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Create
Sep 30, 2022 10:25:22 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:20.007Z: Executing operation Generate
sequence/Read(BoundedCountingSource)+Produce text lines+Produce Avro
records+Gather write start times+Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnshardedBundles+Write
Parquet files/WriteFiles/GatherTempFileResults/Add void key/AddKeys/Map+Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/Window.Into()/Window.Assign+Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Reify+Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Write+Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Reify+Write
Parquet files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Write
Sep 30, 2022 10:25:31 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:25:29.487Z: Your project already contains 100
Dataflow-created metric descriptors, so new user metrics of the form
custom.googleapis.com/* will not be created. However, all user metrics are also
available in the metric dataflow.googleapis.com/job/user_counter. If you rely
on the custom metrics, you can delete old / unused metric descriptors. See
https://developers.google.com/apis-explorer/#p/monitoring/v3/monitoring.projects.metricDescriptors.list
and
https://developers.google.com/apis-explorer/#p/monitoring/v3/monitoring.projects.metricDescriptors.delete
Sep 30, 2022 10:26:02 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
SEVERE: 2022-09-30T22:25:59.939Z: Startup of the **** pool in zone
us-central1-a failed to bring up any of the desired 5 ****s. Please refer to
https://cloud.google.com/dataflow/docs/guides/common-errors#****-pool-failure
for help troubleshooting. QUOTA_EXCEEDED: Instance
'parquetioit0writethenread-09301524-t6pm-harness-7h4k' creation failed: Quota
'IN_USE_ADDRESSES' exceeded. Limit: 1200.0 in region us-central1.
Sep 30, 2022 10:26:02 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
SEVERE: 2022-09-30T22:25:59.972Z: Workflow failed.
Sep 30, 2022 10:26:02 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:26:00.062Z: Finished operation Generate
sequence/Read(BoundedCountingSource)+Produce text lines+Produce Avro
records+Gather write start times+Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/WriteUnshardedBundles+Write
Parquet files/WriteFiles/GatherTempFileResults/Add void key/AddKeys/Map+Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/Window.Into()/Window.Assign+Write
Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Reify+Write Parquet
files/WriteFiles/GatherTempFileResults/Reshuffle/GroupByKey/Write+Write Parquet
files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Reify+Write
Parquet files/WriteFiles/WriteUnshardedBundlesToTempFiles/GroupUnwritten/Write
Sep 30, 2022 10:26:02 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:26:00.165Z: Cleaning up.
Sep 30, 2022 10:26:02 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:26:00.271Z: Stopping **** pool...
Sep 30, 2022 10:26:23 PM
org.apache.beam.runners.dataflow.util.MonitoringUtil$LoggingHandler process
INFO: 2022-09-30T22:26:22.363Z: Worker pool stopped.
Sep 30, 2022 10:27:02 PM
org.apache.beam.runners.dataflow.DataflowPipelineJob logTerminalState
INFO: Job 2022-09-30_15_24_53-14441705117576631157 failed with status
FAILED.
org.apache.beam.sdk.io.parquet.ParquetIOIT > writeThenReadAll STANDARD_OUT
Load test results for test (ID): 257cdf40-60ee-4b06-b866-7a1fd8c57a92 and
timestamp: 2022-09-30T22:27:02.105000000Z:
Metric: Value:
read_time 0.0
dataset_size 1.08737E9
run_time 0.0
write_time 0.0
Gradle Test Executor 2 finished executing tests.
> Task :sdks:java:io:file-based-io-tests:integrationTest FAILED
org.apache.beam.sdk.io.parquet.ParquetIOIT > writeThenReadAll FAILED
java.lang.AssertionError: Values should be different. Actual: FAILED
at org.junit.Assert.fail(Assert.java:89)
at org.junit.Assert.failEquals(Assert.java:187)
at org.junit.Assert.assertNotEquals(Assert.java:163)
at org.junit.Assert.assertNotEquals(Assert.java:177)
at
org.apache.beam.sdk.io.parquet.ParquetIOIT.writeThenReadAll(ParquetIOIT.java:171)
1 test completed, 1 failed
Finished generating test XML results (0.026 secs) into:
<https://ci-beam.apache.org/job/beam_PerformanceTests_ParquetIOIT/ws/src/sdks/java/io/file-based-io-tests/build/test-results/integrationTest>
Generating HTML test report...
Finished generating test html results (0.028 secs) into:
<https://ci-beam.apache.org/job/beam_PerformanceTests_ParquetIOIT/ws/src/sdks/java/io/file-based-io-tests/build/reports/tests/integrationTest>
:sdks:java:io:file-based-io-tests:integrationTest (Thread[Execution **** Thread
6,5,main]) completed. Took 2 mins 25.877 secs.
FAILURE: Build failed with an exception.
* What went wrong:
Execution failed for task ':sdks:java:io:file-based-io-tests:integrationTest'.
> There were failing tests. See the report at:
> file://<https://ci-beam.apache.org/job/beam_PerformanceTests_ParquetIOIT/ws/src/sdks/java/io/file-based-io-tests/build/reports/tests/integrationTest/index.html>
* Try:
> Run with --stacktrace option to get the stack trace.
> Run with --debug option to get more log output.
* Get more help at https://help.gradle.org
Deprecated Gradle features were used in this build, making it incompatible with
Gradle 8.0.
You can use '--warning-mode all' to show the individual deprecation warnings
and determine if they come from your own scripts or plugins.
See
https://docs.gradle.org/7.5.1/userguide/command_line_interface.html#sec:command_line_warnings
BUILD FAILED in 3m 36s
148 actionable tasks: 91 executed, 55 from cache, 2 up-to-date
Publishing build scan...
https://gradle.com/s/ukupnmv66kjm4
Stopped 1 **** daemon(s).
Build step 'Invoke Gradle script' changed build result to FAILURE
Build step 'Invoke Gradle script' marked build as failure
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]