[jira] [Updated] (FLINK-6417) Wildcard support for FileInputFormat

Flink Jira Bot (Jira) Sun, 19 Dec 2021 14:39:22 -0800


     [ 
https://issues.apache.org/jira/browse/FLINK-6417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Flink Jira Bot updated FLINK-6417:
----------------------------------
      Labels: auto-deprioritized-minor pull-request-available  (was: 
pull-request-available stale-minor)
    Priority: Not a Priority  (was: Minor)

This issue was labeled "stale-minor" 7 days ago and has not received any 
updates so it is being deprioritized. If this ticket is actually Minor, please 
raise the priority and ask a committer to assign you the issue or revive the 
public discussion.


> Wildcard support for FileInputFormat
> ------------------------------------
>
>                 Key: FLINK-6417
>                 URL: https://issues.apache.org/jira/browse/FLINK-6417
>             Project: Flink
>          Issue Type: New Feature
>          Components: API / DataSet
>            Reporter: Artiom Darie
>            Priority: Not a Priority
>              Labels: auto-deprioritized-minor, pull-request-available
>
> Add wildcard support while reading from s3://, hdfs://, file://, etc.
> h6. Examples:
> # {code} s3://bucket-name/*.gz {code}
> # {code} hdfs://path/*file-name*.csv {code}
> # {code} file://tmp/**/*.* {code}
> h6. Proposal
> # Use the existing method: {code}environment.readFile(...){code}
> # List all the files in the directories
> # Read files using existing: {code}ContinuousFileReaderOperator{code}
> h6. Concerns (Open for discussions)
> # Have multiple DataSource(s) created for each each file and then to join 
> them into a single DataSource
> # Have all the files into the same DataSource
> # Have the listing of the files on the driver and load on each task manager



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

[jira] [Updated] (FLINK-6417) Wildcard support for FileInputFormat

Reply via email to