Eugene Nikolaiev created BEAM-12741:
---------------------------------------

             Summary: Read multiple files keeping track of file names (Python)
                 Key: BEAM-12741
                 URL: https://issues.apache.org/jira/browse/BEAM-12741
             Project: Beam
          Issue Type: Improvement
          Components: io-py-files
    Affects Versions: 2.31.0
            Reporter: Eugene Nikolaiev
             Fix For: 2.33.0


When reading lines from text files matched by multiple file patterns, it is 
sometimes useful to keep track of the file name each line came from. 
Example: read tab-delimited files and map their lines to column headers that 
come from separate files.

It would be nice to have a {{ReadAllFromTextWithFilename}} transform that 
modifies the {{ReadAllFromText}} transform in the same way that 
{{ReadFromTextWithFilename}} modifies the {{ReadFromText}} transform: it would 
produce tuples of file names paired with text lines.
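
To make the requested output shape concrete, here is a minimal plain-Python 
sketch of the semantics only. It does not use the Beam API. The helper names 
{{read_all_with_filename}}, {{rows_with_headers}} and {{demo}} are hypothetical, 
and the sketch assumes the column headers have already been loaded into a dict 
keyed by file base name.

```python
import glob
import os
import tempfile


def read_all_with_filename(patterns):
    """Yield (file_name, line) for each line of each file matching any pattern."""
    for pattern in patterns:
        for path in sorted(glob.glob(pattern)):
            with open(path, encoding="utf-8") as handle:
                for line in handle:
                    yield path, line.rstrip("\n")


def rows_with_headers(records, headers_by_file):
    """Pair each tab-delimited line with the headers of its source file."""
    for path, line in records:
        # Assumption: headers are looked up by the file's base name.
        headers = headers_by_file[os.path.basename(path)]
        yield dict(zip(headers, line.split("\t")))


def demo():
    """Write two small TSV files, then read them back with per-file headers."""
    with tempfile.TemporaryDirectory() as tmp:
        for name, body in [("a.tsv", "1\tx\n2\ty\n"), ("b.tsv", "z\t3\n")]:
            with open(os.path.join(tmp, name), "w", encoding="utf-8") as handle:
                handle.write(body)
        headers_by_file = {"a.tsv": ["id", "label"], "b.tsv": ["label", "id"]}
        patterns = [os.path.join(tmp, "a*.tsv"), os.path.join(tmp, "b*.tsv")]
        records = read_all_with_filename(patterns)
        return list(rows_with_headers(records, headers_by_file))
```

Without the file name attached to each line, the second step could not tell 
which header set applies, which is the motivation for the proposed transform.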



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
