Chirag Todarka created GOBBLIN-968:
--------------------------------------

             Summary: Honor file split size for HadoopFileInputSource
                 Key: GOBBLIN-968
                 URL: https://issues.apache.org/jira/browse/GOBBLIN-968
             Project: Apache Gobblin
          Issue Type: Bug
          Components: gobblin-core
    Affects Versions: 0.14.0
            Reporter: Chirag Todarka
            Assignee: Abhishek Tiwari


For HadoopFileInputSource it is splitting the source file only in default chunk 
size (which is 128MB) and not honoring the min and mix split size mentioned in 
pull file.

For example even if below entires are present in pull file it will still chunk 
the source file in smaller 134217728 bytes files and not in 268435456 bytes 
files.

{{source.hadoop.file.input.split.minsize=268435456}}
{{source.hadoop.file.input.split.maxsize=268435456}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to