[
https://issues.apache.org/jira/browse/BEAM-1892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sourabh Bajaj resolved BEAM-1892.
---------------------------------
Resolution: Fixed
Fix Version/s: Not applicable
> Log process during size estimation in filebasedsource
> -----------------------------------------------------
>
> Key: BEAM-1892
> URL: https://issues.apache.org/jira/browse/BEAM-1892
> Project: Beam
> Issue Type: Improvement
> Components: sdk-py
> Reporter: Sourabh Bajaj
> Assignee: Sourabh Bajaj
> Fix For: Not applicable
>
>
> http://stackoverflow.com/questions/43095445/how-to-iterate-all-files-in-google-cloud-storage-to-be-used-as-dataflow-input
> The user mentioned that there was no output and a huge delay in submitting
> the pipeline. The file size estimation process can be slow for really large
> datasets and this reports no process to the end user right now. We should be
> logging process and thresholding the pre submission size estimation as well.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)