Huang Xingbo created FLINK-29461:
------------------------------------
Summary: ProcessDataStreamStreamingTests.test_process_function unstable
Key: FLINK-29461
URL: https://issues.apache.org/jira/browse/FLINK-29461
Project: Flink
Issue Type: Bug
Components: API / Python
Affects Versions: 1.16.0
Reporter: Huang Xingbo
{code:java}
2022-09-29T02:10:45.3571648Z Sep 29 02:10:45 self = <pyflink.datastream.tests.test_data_stream.ProcessDataStreamStreamingTests testMethod=test_process_function>
2022-09-29T02:10:45.3572279Z Sep 29 02:10:45
2022-09-29T02:10:45.3572810Z Sep 29 02:10:45 def test_process_function(self):
2022-09-29T02:10:45.3573495Z Sep 29 02:10:45 self.env.set_parallelism(1)
2022-09-29T02:10:45.3574148Z Sep 29 02:10:45 self.env.get_config().set_auto_watermark_interval(2000)
2022-09-29T02:10:45.3580634Z Sep 29 02:10:45 self.env.set_stream_time_characteristic(TimeCharacteristic.EventTime)
2022-09-29T02:10:45.3583194Z Sep 29 02:10:45 data_stream = self.env.from_collection([(1, '1603708211000'),
2022-09-29T02:10:45.3584515Z Sep 29 02:10:45 (2, '1603708224000'),
2022-09-29T02:10:45.3585957Z Sep 29 02:10:45 (3, '1603708226000'),
2022-09-29T02:10:45.3587132Z Sep 29 02:10:45 (4, '1603708289000')],
2022-09-29T02:10:45.3588094Z Sep 29 02:10:45 type_info=Types.ROW([Types.INT(), Types.STRING()]))
2022-09-29T02:10:45.3589090Z Sep 29 02:10:45
2022-09-29T02:10:45.3589949Z Sep 29 02:10:45 class MyProcessFunction(ProcessFunction):
2022-09-29T02:10:45.3590710Z Sep 29 02:10:45
2022-09-29T02:10:45.3591856Z Sep 29 02:10:45 def process_element(self, value, ctx):
2022-09-29T02:10:45.3592873Z Sep 29 02:10:45 current_timestamp = ctx.timestamp()
2022-09-29T02:10:45.3593862Z Sep 29 02:10:45 current_watermark = ctx.timer_service().current_watermark()
2022-09-29T02:10:45.3594915Z Sep 29 02:10:45 yield "current timestamp: {}, current watermark: {}, current_value: {}"\
2022-09-29T02:10:45.3596201Z Sep 29 02:10:45 .format(str(current_timestamp), str(current_watermark), str(value))
2022-09-29T02:10:45.3597089Z Sep 29 02:10:45
2022-09-29T02:10:45.3597942Z Sep 29 02:10:45 watermark_strategy = WatermarkStrategy.for_monotonous_timestamps()\
2022-09-29T02:10:45.3599260Z Sep 29 02:10:45 .with_timestamp_assigner(SecondColumnTimestampAssigner())
2022-09-29T02:10:45.3600611Z Sep 29 02:10:45 data_stream.assign_timestamps_and_watermarks(watermark_strategy)\
2022-09-29T02:10:45.3601877Z Sep 29 02:10:45 .process(MyProcessFunction(), output_type=Types.STRING()).add_sink(self.test_sink)
2022-09-29T02:10:45.3603527Z Sep 29 02:10:45 self.env.execute('test process function')
2022-09-29T02:10:45.3604445Z Sep 29 02:10:45 results = self.test_sink.get_results()
2022-09-29T02:10:45.3605684Z Sep 29 02:10:45 expected = ["current timestamp: 1603708211000, current watermark: "
2022-09-29T02:10:45.3607157Z Sep 29 02:10:45 "-9223372036854775808, current_value: Row(f0=1, f1='1603708211000')",
2022-09-29T02:10:45.3608256Z Sep 29 02:10:45 "current timestamp: 1603708224000, current watermark: "
2022-09-29T02:10:45.3609650Z Sep 29 02:10:45 "-9223372036854775808, current_value: Row(f0=2, f1='1603708224000')",
2022-09-29T02:10:45.3610854Z Sep 29 02:10:45 "current timestamp: 1603708226000, current watermark: "
2022-09-29T02:10:45.3612279Z Sep 29 02:10:45 "-9223372036854775808, current_value: Row(f0=3, f1='1603708226000')",
2022-09-29T02:10:45.3613382Z Sep 29 02:10:45 "current timestamp: 1603708289000, current watermark: "
2022-09-29T02:10:45.3615683Z Sep 29 02:10:45 "-9223372036854775808, current_value: Row(f0=4, f1='1603708289000')"]
2022-09-29T02:10:45.3617687Z Sep 29 02:10:45 > self.assert_equals_sorted(expected, results)
2022-09-29T02:10:45.3618620Z Sep 29 02:10:45
2022-09-29T02:10:45.3619425Z Sep 29 02:10:45 pyflink/datastream/tests/test_data_stream.py:986:
2022-09-29T02:10:45.3620424Z Sep 29 02:10:45 _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
2022-09-29T02:10:45.3621886Z Sep 29 02:10:45 pyflink/datastream/tests/test_data_stream.py:66: in assert_equals_sorted
2022-09-29T02:10:45.3622847Z Sep 29 02:10:45 self.assertEqual(expected, actual)
2022-09-29T02:10:45.3624658Z Sep 29 02:10:45 E AssertionError: Lists differ: ["cur[414 chars]ark: -9223372036854775808, current_value: Row([22 chars]0')"] != ["cur[414 chars]ark: 1603708225999, current_value: Row(f0=4, f[15 chars]0')"]
2022-09-29T02:10:45.3625881Z Sep 29 02:10:45 E
2022-09-29T02:10:45.3626591Z Sep 29 02:10:45 E First differing element 3:
2022-09-29T02:10:45.3627726Z Sep 29 02:10:45 E "curr[44 chars]ark: -9223372036854775808, current_value: Row([21 chars]00')"
2022-09-29T02:10:45.3628758Z Sep 29 02:10:45 E "curr[44 chars]ark: 1603708225999, current_value: Row(f0=4, f[14 chars]00')"
2022-09-29T02:10:45.3629276Z Sep 29 02:10:45 E
2022-09-29T02:10:45.3629842Z Sep 29 02:10:45 E Diff is 753 characters long. Set self.maxDiff to None to see it.
{code}
https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=41436&view=logs&j=9cada3cb-c1d3-5621-16da-0f718fb86602&t=c67e71ed-6451-5d26-8920-5a8cf9651901
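
One possible reading of the assertion diff in the log above, offered as an assumption rather than a confirmed root cause: the unexpected watermark 1603708225999 is exactly the third element's timestamp (1603708226000) minus 1, which is what a monotonous-timestamps strategy would emit if the periodic watermark timer (2000 ms in the test) happened to fire before the fourth element was processed. The sketch below is a simplified pure-Python model of that timing, not PyFlink code; `MonotonousWatermarkModel`, `run`, and `emit_before_index` are hypothetical names introduced only for illustration.

```python
# Simplified model (no Flink required) of a monotonous-timestamps watermark
# generator with periodic emission. All names here are hypothetical.
MIN_WATERMARK = -(2 ** 63)  # Long.MIN_VALUE, i.e. "no watermark emitted yet"


class MonotonousWatermarkModel:
    def __init__(self):
        self.max_timestamp = None
        self.current_watermark = MIN_WATERMARK

    def on_event(self, timestamp):
        # An event only updates the tracked maximum; nothing is emitted here.
        if self.max_timestamp is None or timestamp > self.max_timestamp:
            self.max_timestamp = timestamp

    def on_periodic_emit(self):
        # Models the auto-watermark timer firing: watermark = max timestamp - 1.
        if self.max_timestamp is not None:
            self.current_watermark = max(self.current_watermark,
                                         self.max_timestamp - 1)


def run(timestamps, emit_before_index=None):
    """Return the watermark each element observes when it is processed.

    emit_before_index: position at which the periodic timer is assumed to
    fire just before that element; None means the timer never fires
    while the job is running.
    """
    gen = MonotonousWatermarkModel()
    observed = []
    for i, ts in enumerate(timestamps):
        if i == emit_before_index:
            gen.on_periodic_emit()
        observed.append(gen.current_watermark)  # what the process function sees
        gen.on_event(ts)
    return observed


timestamps = [1603708211000, 1603708224000, 1603708226000, 1603708289000]
print(run(timestamps))                       # fast run: timer never fires
print(run(timestamps, emit_before_index=3))  # slow run: timer fires before element 4
```

In this model the fast run leaves every element at the minimum watermark, matching the test's expected list, while the slow run gives the fourth element a watermark of 1603708225999, matching the failure. If that reading is right, the expected output depends on the job finishing within one watermark interval.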