[
https://issues.apache.org/jira/browse/BEAM-12741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Eugene Nikolaiev updated BEAM-12741:
------------------------------------
Comment: was deleted
(was: Closing this issue in favor of BEAM-12665)
> Read multiple files keeping track of file names (Python)
> --------------------------------------------------------
>
> Key: BEAM-12741
> URL: https://issues.apache.org/jira/browse/BEAM-12741
> Project: Beam
> Issue Type: Improvement
> Components: io-py-files
> Affects Versions: 2.31.0
> Reporter: Eugene Nikolaiev
> Priority: P3
> Labels: io, python, text
> Time Spent: 1h 50m
> Remaining Estimate: 0h
>
> When reading lines from text files with multiple patterns it is sometimes
> useful to keep track of the file names from which the lines originated.
> Example: read tab-delimited files and map their lines to column headers
> coming from separate files.
> It would be nice to have a {{ReadAllFromTextWithFilename}} transform, which
> modifies {{ReadAllFromText}} transform in a similar way as
> {{ReadFromTextWithFilename}} modifies the {{ReadFromText}} transform to
> produce tuples of file names paired with text lines.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)