[
https://issues.apache.org/jira/browse/BEAM-14429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yichi Zhang updated BEAM-14429:
-------------------------------
Description: With the default 20 split, the num records produced by
Read.from(SyntheticUnboundedSource) is always larger than the numRecords
specified. the more splits the more actual number records produced is off. And
the Read step tends to take longer time with more splits.
> SyntheticUnboundedSource(with SDF) produce wrong number of records when
> initial split is larger than 1
> ------------------------------------------------------------------------------------------------------
>
> Key: BEAM-14429
> URL: https://issues.apache.org/jira/browse/BEAM-14429
> Project: Beam
> Issue Type: Bug
> Components: io-common
> Reporter: Yichi Zhang
> Priority: P2
> Time Spent: 40m
> Remaining Estimate: 0h
>
> With the default 20 split, the num records produced by
> Read.from(SyntheticUnboundedSource) is always larger than the numRecords
> specified. the more splits the more actual number records produced is off.
> And the Read step tends to take longer time with more splits.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)