Benoit Lacelle created PARQUET-1055:
---------------------------------------

             Summary: Improve the creation of ExecutorService when reading 
footers
                 Key: PARQUET-1055
                 URL: https://issues.apache.org/jira/browse/PARQUET-1055
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-mr
    Affects Versions: 1.9.0
            Reporter: Benoit Lacelle
            Priority: Minor
             Fix For: 1.9.1


Doing some benchmarks loading a large set of parquet files (3000+) from the 
local FS, we observed some inefficiencies in the number of created threads when 
reading footers.

By reading, the read the configuration parallelism in Hadoop configuration 
(defaulted to 5) and allocate 2 ExecuteService with each 5 threads to read 
footers. This is especially inefficient if there is less Callable to handle 
than the configured parallelism.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to