This is your daily summary of Beam's current P1 issues, not including flaky
tests.
See https://beam.apache.org/contribute/issue-priorities/#p1-critical for
the meaning and expectations around P1 issues.
https://github.com/apache/beam/issues/22011: [Bug]:
org.apache.beam.sdk.io.aws2.kinesis.KinesisIOWriteTest.testWriteFailure flaky
https://github.com/apache/beam/issues/22010: [Bug]:
org.apache.beam.runners.flink.FlinkRunnerTest.testEnsureStdoutStdErrIsRestored
flaky
https://github.com/apache/beam/issues/22009: [Bug]:
org.apache.beam.examples.subprocess.ExampleEchoPipelineTest.testExampleEchoPipeline
flaky
https://github.com/apache/beam/issues/22008: [Bug]:
org.apache.beam.sdk.io.gcp.spanner.SpannerIOWriteExceptionHandlingTest.testExceptionHandlingForWriteGrouped
flaky
https://github.com/apache/beam/issues/21999: [Bug]:
org.apache.beam.sdk.io.gcp.spanner.SpannerIOReadTest.runBatchQueryTestWithFailures
flaky
https://github.com/apache/beam/issues/21948: [Bug]: KinesisIO javadoc is no
longer up-to-date
https://github.com/apache/beam/issues/21946: [Bug]: No way to read or write to
file when running Beam in Flink
https://github.com/apache/beam/issues/21935: [Bug]: Reject illformed GBK Coders
https://github.com/apache/beam/issues/21897: [Feature Request]: Flink runner
savepoint backward compatibility
https://github.com/apache/beam/issues/21893: [Bug]: BigQuery Storage Write API
implementation does not support table partitioning
https://github.com/apache/beam/issues/21794: Dataflow runner creates a new
timer whenever the output timestamp is change
https://github.com/apache/beam/issues/21715: Data missing when using
CassandraIO.Read
https://github.com/apache/beam/issues/21713: 404s in BigQueryIO don't get
output to Failed Inserts PCollection
https://github.com/apache/beam/issues/21711: Python Streaming job failing to
drain with BigQueryIO write errors
https://github.com/apache/beam/issues/21703: pubsublite.ReadWriteIT failing in
beam_PostCommit_Java_DataflowV1 and V2
https://github.com/apache/beam/issues/21702: SpannerWriteIT failing in beam
PostCommit Java V1
https://github.com/apache/beam/issues/21700:
--dataflowServiceOptions=use_runner_v2 is broken
https://github.com/apache/beam/issues/21695: DataflowPipelineResult does not
raise exception for unsuccessful states.
https://github.com/apache/beam/issues/21694: BigQuery Storage API insert with
writeResult retry and write to error table
https://github.com/apache/beam/issues/21478: KafkaIO.read.withDynamicRead()
doesn't pick up new TopicPartitions
https://github.com/apache/beam/issues/21477: Add integration testing for BQ
Storage API write modes
https://github.com/apache/beam/issues/21476: WriteToBigQuery Dynamic table
destinations returns wrong tableId
https://github.com/apache/beam/issues/21475: Beam x-lang Dataflow tests failing
due to _InactiveRpcError
https://github.com/apache/beam/issues/21473: PVR_Spark2_Streaming perma-red
https://github.com/apache/beam/issues/21465: Kafka commit offset drop data on
failure for runners that have non-checkpointing shuffle
https://github.com/apache/beam/issues/21268: Race between member variable being
accessed due to leaking uninitialized state via OutboundObserverFactory
https://github.com/apache/beam/issues/21267: WriteToBigQuery submits a
duplicate BQ load job if a 503 error code is returned from googleapi
https://github.com/apache/beam/issues/21265:
apache_beam.runners.portability.fn_api_runner.translations_test.TranslationsTest.test_run_packable_combine_globally
'apache_beam.coders.coder_impl._AbstractIterable' object is not reversible
https://github.com/apache/beam/issues/21263: (Broken Pipe induced) Bricked
Dataflow Pipeline
https://github.com/apache/beam/issues/21262: Python AfterAny, AfterAll do not
follow spec
https://github.com/apache/beam/issues/21260: Python DirectRunner does not emit
data at GC time
https://github.com/apache/beam/issues/21259: Consumer group with random prefix
https://github.com/apache/beam/issues/21257: Either Create or DirectRunner
fails to produce all elements to the following transform
https://github.com/apache/beam/issues/21123: Multiple jobs running on Flink
session cluster reuse the persistent Python environment.
https://github.com/apache/beam/issues/21117: "Java IO IT Tests" - missing data
in grafana
https://github.com/apache/beam/issues/21115: JdbcIO date conversion is
sensitive to OS
https://github.com/apache/beam/issues/21111: Java creates an incorrect pipeline
proto when core-construction-java jar is not in the CLASSPATH
https://github.com/apache/beam/issues/21109: SDF BoundedSource seems to execute
significantly slower than 'normal' BoundedSource
https://github.com/apache/beam/issues/21108: java.io.InvalidClassException With
Flink Kafka
https://github.com/apache/beam/issues/20979: Portable runners should be able to
issue checkpoints to Splittable DoFn
https://github.com/apache/beam/issues/20978: PubsubIO.readAvroGenericRecord
creates SchemaCoder that fails to decode some Avro logical types
https://github.com/apache/beam/issues/20818: XmlIO.Read does not handle XML
encoding per spec
https://github.com/apache/beam/issues/20814: JmsIO is not acknowledging
messages correctly
https://github.com/apache/beam/issues/20813: No trigger early repeatedly for
session windows
https://github.com/apache/beam/issues/20812: Cross-language consistency
(RequiresStableInputs) is quietly broken (at least on portable flink runner)
https://github.com/apache/beam/issues/20692: Timer with dataflow runner can be
set multiple times (dataflow runner)
https://github.com/apache/beam/issues/20689: Kafka commitOffsetsInFinalize OOM
on Flink
https://github.com/apache/beam/issues/20532: Support for coder argument in
WriteToBigQuery
https://github.com/apache/beam/issues/20531: FileBasedSink: allow setting temp
directory provider per dynamic destination
https://github.com/apache/beam/issues/20530: Make non-portable Splittable DoFn
the only option when executing Java "Read" transforms
https://github.com/apache/beam/issues/20528: python
CombineGlobally().with_fanout() cause duplicate combine results for sliding
windows
https://github.com/apache/beam/issues/20333: beam_PerformanceTests_Kafka_IO
failing due to " provided port is already allocated"
https://github.com/apache/beam/issues/20332: FileIO writeDynamic with
AvroIO.sink not writing all data
https://github.com/apache/beam/issues/20330: Remove insecure ssl options from
MongoDBIO
https://github.com/apache/beam/issues/20109: SortValues should fail if
SecondaryKey coder is not deterministic
https://github.com/apache/beam/issues/20108: Python direct runner doesn't emit
empty pane when it should
https://github.com/apache/beam/issues/20009: Environment-sensitive provisioning
for Dataflow
https://github.com/apache/beam/issues/19971: [SQL] Some Hive tests throw
NullPointerException, but get marked as passing (Direct Runner)
https://github.com/apache/beam/issues/19817: datetime and decimal should be
logical types
https://github.com/apache/beam/issues/17950: [Bug]: Java Precommit permared