[ 
https://issues.apache.org/jira/browse/BEAM-12741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eugene Nikolaiev updated BEAM-12741:
------------------------------------
    Comment: was deleted

(was: Closing this issue in favor of BEAM-12665)

> Read multiple files keeping track of file names (Python)
> --------------------------------------------------------
>
>                 Key: BEAM-12741
>                 URL: https://issues.apache.org/jira/browse/BEAM-12741
>             Project: Beam
>          Issue Type: Improvement
>          Components: io-py-files
>    Affects Versions: 2.31.0
>            Reporter: Eugene Nikolaiev
>            Priority: P3
>              Labels: io, python, text
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> When reading lines from text files with multiple patterns it is sometimes 
> useful to keep track of the file names from which the lines originated. 
> Example: read tab-delimited files and map their lines to column headers 
> coming from separate files.
> It would be nice to have a {{ReadAllFromTextWithFilename}} transform, which 
> modifies {{ReadAllFromText}} transform in a similar way as 
> {{ReadFromTextWithFilename}} modifies  the {{ReadFromText}} transform to 
> produce tuples of file names paired with text lines.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to