fastavro Cython fixes

2023-07-17 Thread Eddie Zhou
Hi folks,

I see some active work on https://github.com/apache/beam/issues/27526 for
Cython 3.0 fixes, but wondering if anyone knows the fastavro maintainers
and can prioritize the fix proposed in
https://github.com/fastavro/fastavro/issues/701

Thanks!

Best,
Eddie


Re: [ANNOUNCE] Beam 2.49.0 Released

2023-07-17 Thread Ahmet Altay via dev
Congratulations! Thanks to the release manager and all contributors!

On Mon, Jul 17, 2023 at 8:54 AM Yi Hu via user  wrote:

> The Apache Beam Team is pleased to announce the release of version 2.49.0.
>
> You can download the release here:
>
> https://beam.apache.org/get-started/downloads/ (website daily update
> pending)
>
> This release includes bug fixes, features, and improvements detailed on the
> Beam Blog: https://beam.apache.org/blog/beam-2.49.0/ (website daily
> update pending)
> and the Github release page
> https://github.com/apache/beam/releases/tag/v2.49.0
>
> Thanks to everyone who contributed to this release, and we hope you enjoy
> using Beam 2.49.0.
>
> -- Yi, on behalf of the Apache Beam Team.
>
>
> --
>
> Yi Hu, (he/him/his)
>
> Software Engineer
>
>
>


[ANNOUNCE] Beam 2.49.0 Released

2023-07-17 Thread Yi Hu via dev
The Apache Beam Team is pleased to announce the release of version 2.49.0.

You can download the release here:

https://beam.apache.org/get-started/downloads/ (website daily update
pending)

This release includes bug fixes, features, and improvements detailed on the
Beam Blog: https://beam.apache.org/blog/beam-2.49.0/ (website daily update
pending)
and the Github release page
https://github.com/apache/beam/releases/tag/v2.49.0

Thanks to everyone who contributed to this release, and we hope you enjoy
using Beam 2.49.0.

-- Yi, on behalf of the Apache Beam Team.


-- 

Yi Hu, (he/him/his)

Software Engineer


Re: [VOTE] Release 2.49.0, release candidate #2

2023-07-17 Thread Yi Hu via dev
Could a PMC member please help finalizing the release (
https://beam.apache.org/contribute/release-guide/#pmc-only-finalization),
mainly deploy the source release from staging (
https://dist.apache.org/repos/dist/dev/beam/2.49.0/) to release (will be
https://dist.apache.org/repos/dist/release/beam/2.49.0/). Thanks!


On Mon, Jul 17, 2023 at 7:28 AM Yi Hu  wrote:

> I'm happy to announce that we have unanimously approved this release.
>
> There are 8 approving votes, 4 of which are binding:
> * approver 1: Jan Lukavský
> * approver 2: Robert Bradshaw
> * approver 3: Chamikara Jayalath
> * approver 4: Ahmet Altay
>
> There are no disapproving votes.
>
> Thanks everyone!
>
> Note: there is an ongoing issue such that some reply emails not get
> delivered to certain email address (like gmail). Check the complete thread
> here: https://lists.apache.org/thread/r7r5q5mq7rqjrfbf8nj90smrdkss0sbf
>
>


[GitHub] [beam-site] Abacn merged pull request #646: Publish 2.49.0 release

2023-07-17 Thread via GitHub


Abacn merged PR #646:
URL: https://github.com/apache/beam-site/pull/646


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@beam.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[GitHub] [beam-site] Abacn commented on pull request #646: Publish 2.49.0 release

2023-07-17 Thread via GitHub


Abacn commented on PR #646:
URL: https://github.com/apache/beam-site/pull/646#issuecomment-1637991080

   R: @riteshghorse @jrmccluskey 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@beam.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [VOTE] Release 2.49.0, release candidate #2

2023-07-17 Thread Yi Hu via dev
I'm happy to announce that we have unanimously approved this release.

There are 8 approving votes, 4 of which are binding:
* approver 1: Jan Lukavský
* approver 2: Robert Bradshaw
* approver 3: Chamikara Jayalath
* approver 4: Ahmet Altay

There are no disapproving votes.

Thanks everyone!

Note: there is an ongoing issue such that some reply emails not get
delivered to certain email address (like gmail). Check the complete thread
here: https://lists.apache.org/thread/r7r5q5mq7rqjrfbf8nj90smrdkss0sbf

On Fri, Jul 14, 2023 at 4:50 PM Valentyn Tymofieiev via dev <
dev@beam.apache.org> wrote:

> +1. Tested a few python pipelines on Dataflow Runner V1 and Runner V2.
>
>
>
> On Thu, Jul 13, 2023 at 12:54 PM Svetak Sundhar via dev <
> dev@beam.apache.org> wrote:
>
>> +1 (Non-Binding)
>>
>> Python quickstart Dataflow runner.
>>
>>
>> Svetak Sundhar
>>
>>   Data Engineer
>> s vetaksund...@google.com
>>
>>
>>
>> On Thu, Jul 13, 2023 at 5:03 AM Jan Lukavský  wrote:
>>
>>> +1 (binding)
>>>
>>> Tested Java SDK with FlinkRunner.
>>>
>>>  Jan
>>> On 7/13/23 02:30, Bruno Volpato via dev wrote:
>>>
>>> +1 (non-binding).
>>>
>>> Tested with https://github.com/GoogleCloudPlatform/DataflowTemplates
>>> (Java SDK 11, Dataflow runner).
>>>
>>> Thanks Yi!
>>>
>>> On Tue, Jul 11, 2023 at 4:23 PM Yi Hu via dev 
>>> wrote:
>>>
 Hi everyone,
 Please review and vote on the release candidate #2 for the version
 2.49.0, as follows:
 [ ] +1, Approve the release
 [ ] -1, Do not approve the release (please provide specific comments)


 Reviewers are encouraged to test their own use cases with the release
 candidate, and vote +1 if
 no issues are found. Only PMC member votes will count towards the final
 vote, but votes from all
 community members is encouraged and helpful for finding regressions;
 you can either test your own
 use cases or use cases from the validation sheet [10].

 The complete staging area is available for your review, which includes:
 * GitHub Release notes [1],
 * the official Apache source release to be deployed to dist.apache.org
 [2], which is signed with the key with
 fingerprint either CB6974C8170405CB (y...@apache.org) or
 D20316F712213422 (GitHub Action automated) [3],
 * all artifacts to be deployed to the Maven Central Repository [4],
 * source code tag "v2.49.0-RC2" [5],
 * website pull request listing the release [6], the blog post [6], and
 publishing the API reference manual [7].
 * Java artifacts were built with Gradle GRADLE_VERSION and
 OpenJDK/Oracle JDK JDK_VERSION.

>>> nit: versions were missing.
>
>> * Python artifacts are deployed along with the source release to the
 dist.apache.org [2] and PyPI [8].
 * Go artifacts and documentation are available at pkg.go.dev [9]
 * Validation sheet with a tab for 2.49.0 release to help with
 validation [10].
 * Docker images published to Docker Hub [11].
 * PR to run tests against release branch [12].

 The vote will be open for at least 72 hours. It is adopted by majority
 approval, with at least 3 PMC affirmative votes.

 For guidelines on how to try the release in your projects, check out
 our blog post at /blog/validate-beam-release/.

 Thanks,
 Release Manager

 [1] https://github.com/apache/beam/milestone/13
 [2] https://dist.apache.org/repos/dist/dev/beam/2.49.0/
 [3] https://dist.apache.org/repos/dist/release/beam/KEYS
 [4]
 https://repository.apache.org/content/repositories/orgapachebeam-1349/
 [5] https://github.com/apache/beam/tree/v2.49.0-RC2
 [6] https://github.com/apache/beam/pull/27374 (unchanged since RC1)
 [7] https://github.com/apache/beam-site/pull/646  (unchanged since RC1)
 [8] https://pypi.org/project/apache-beam/2.49.0rc2/
 [9]
 https://pkg.go.dev/github.com/apache/beam/sdks/v2@v2.49.0-RC2/go/pkg/beam
 [10]
 https://docs.google.com/spreadsheets/d/1qk-N5vjXvbcEk68GjbkSZTR8AGqyNUM-oLFo_ZXBpJw/edit#gid=934901728
 [11] https://hub.docker.com/search?q=apache%2Fbeam&type=image
 [12] https://github.com/apache/beam/pull/27307

 --

 Yi Hu, (he/him/his)

 Software Engineer





Beam High Priority Issue Report (39)

2023-07-17 Thread beamactions
This is your daily summary of Beam's current high priority issues that may need 
attention.

See https://beam.apache.org/contribute/issue-priorities for the meaning and 
expectations around issue priorities.

Unassigned P1 Issues:

https://github.com/apache/beam/issues/27486 [Bug]: Read from datastore with 
inequality filters
https://github.com/apache/beam/issues/27428 [Failing Test]: 
beam_PostRelease_Python_Candidate script broken
https://github.com/apache/beam/issues/27320 [Failing Test]: 
beam_Release_Gradle_Build permared
https://github.com/apache/beam/issues/27315 [Failing Test]: PubsubReadIT 
timeout pollForResultForDuration
https://github.com/apache/beam/issues/27314 [Failing Test]: 
bigquery.StorageApiSinkCreateIfNeededIT.testCreateManyTables[1]
https://github.com/apache/beam/issues/27312 [Bug]: JmsIO create connection 
based on the number of threads
https://github.com/apache/beam/issues/27238 [Bug]: Window trigger has lag when 
using Kafka and GroupByKey on Dataflow Runner
https://github.com/apache/beam/issues/26981 [Bug]: Getting an error related to 
SchemaCoder after upgrading to 2.48
https://github.com/apache/beam/issues/26969 [Failing Test]: Python PostCommit 
is failing due to exceeded rate limits
https://github.com/apache/beam/issues/26911 [Bug]: UNNEST ARRAY with a nested 
ROW (described below)
https://github.com/apache/beam/issues/26547 [Failing Test]: 
beam_PostCommit_Java_DataflowV2
https://github.com/apache/beam/issues/26354 [Bug]: BigQueryIO direct read not 
reading all rows when set --setEnableBundling=true
https://github.com/apache/beam/issues/26343 [Bug]: 
apache_beam.io.gcp.bigquery_read_it_test.ReadAllBQTests.test_read_queries is 
flaky
https://github.com/apache/beam/issues/26329 [Bug]: BigQuerySourceBase does not 
propagate a Coder to AvroSource
https://github.com/apache/beam/issues/26041 [Bug]: Unable to create 
exactly-once Flink pipeline with stream source and file sink
https://github.com/apache/beam/issues/25975 [Bug]: Reducing parallelism in 
FlinkRunner leads to a data loss
https://github.com/apache/beam/issues/24776 [Bug]: Race condition in Python SDK 
Harness ProcessBundleProgress
https://github.com/apache/beam/issues/24389 [Failing Test]: 
HadoopFormatIOElasticTest.classMethod ExceptionInInitializerError 
ContainerFetchException
https://github.com/apache/beam/issues/24313 [Flaky]: 
apache_beam/runners/portability/portable_runner_test.py::PortableRunnerTestWithSubprocesses::test_pardo_state_with_custom_key_coder
https://github.com/apache/beam/issues/23944  beam_PreCommit_Python_Cron 
regularily failing - test_pardo_large_input flaky
https://github.com/apache/beam/issues/23709 [Flake]: Spark batch flakes in 
ParDoLifecycleTest.testTeardownCalledAfterExceptionInProcessElement and 
ParDoLifecycleTest.testTeardownCalledAfterExceptionInStartBundle
https://github.com/apache/beam/issues/23525 [Bug]: Default PubsubMessage coder 
will drop message id and orderingKey
https://github.com/apache/beam/issues/22913 [Bug]: 
beam_PostCommit_Java_ValidatesRunner_Flink is flakes in 
org.apache.beam.sdk.transforms.GroupByKeyTest$BasicTests.testAfterProcessingTimeContinuationTriggerUsingState
https://github.com/apache/beam/issues/22605 [Bug]: Beam Python failure for 
dataflow_exercise_metrics_pipeline_test.ExerciseMetricsPipelineTest.test_metrics_it
https://github.com/apache/beam/issues/21714 
PulsarIOTest.testReadFromSimpleTopic is very flaky
https://github.com/apache/beam/issues/21708 beam_PostCommit_Java_DataflowV2, 
testBigQueryStorageWrite30MProto failing consistently
https://github.com/apache/beam/issues/21706 Flaky timeout in github Python unit 
test action 
StatefulDoFnOnDirectRunnerTest.test_dynamic_timer_clear_then_set_timer
https://github.com/apache/beam/issues/21643 FnRunnerTest with non-trivial 
(order 1000 elements) numpy input flakes in non-cython environment
https://github.com/apache/beam/issues/21476 WriteToBigQuery Dynamic table 
destinations returns wrong tableId
https://github.com/apache/beam/issues/21469 beam_PostCommit_XVR_Flink flaky: 
Connection refused
https://github.com/apache/beam/issues/21424 Java VR (Dataflow, V2, Streaming) 
failing: ParDoTest$TimestampTests/OnWindowExpirationTests
https://github.com/apache/beam/issues/21262 Python AfterAny, AfterAll do not 
follow spec
https://github.com/apache/beam/issues/21260 Python DirectRunner does not emit 
data at GC time
https://github.com/apache/beam/issues/21121 
apache_beam.examples.streaming_wordcount_it_test.StreamingWordCountIT.test_streaming_wordcount_it
 flakey
https://github.com/apache/beam/issues/21104 Flaky: 
apache_beam.runners.portability.fn_api_runner.fn_runner_test.FnApiRunnerTestWithGrpcAndMultiWorkers
https://github.com/apache/beam/issues/20976 
apache_beam.runners.portability.flink_runner_test.FlinkRunnerTestOptimized.test_flink_metrics
 is flaky
https://github.com/apache/beam/issues/20108 Python direct runner doesn't emit 
empty pane when it should
https://github.com/apache/beam/issues/19814 Flink