Pierre Belzile created ARROW-9863:
-------------------------------------

             Summary: [C++] [PARQUET] Optimize meta data recovery of ApplicationVersion
                 Key: ARROW-9863
                 URL: https://issues.apache.org/jira/browse/ARROW-9863
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
    Affects Versions: 1.0.0
            Reporter: Pierre Belzile


The class contains two large regexes that are compiled in the
ApplicationVersion::ApplicationVersion(const std::string) constructor, so they
are rebuilt every time that constructor runs. This is the constructor that is
used when reading files.

When I stopped a server in gdb while it was processing several files at once,
4 of its 8 threads were building those regexes!
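
One possible direction, as a minimal sketch only: compile the pattern once in a
function-local static and reuse it across calls. The names ParsedVersion,
VersionRegex and ParseCreatedBy, as well as the simplified pattern, are
hypothetical stand-ins for illustration and are not the actual Parquet code or
regexes.

```cpp
#include <regex>
#include <string>

// Hypothetical, simplified result type; not the real ApplicationVersion.
struct ParsedVersion {
  std::string application;
  int major = 0;
  int minor = 0;
  int patch = 0;
  bool valid = false;
};

// The regex is compiled once, on first use. Since C++11 the initialization of
// a function-local static is thread-safe, and matching against a const
// std::regex from several threads only reads the compiled pattern.
inline const std::regex& VersionRegex() {
  // Illustrative pattern only; the real patterns are larger.
  static const std::regex kRegex(
      R"(^(.*?)\s+version\s+(\d+)\.(\d+)\.(\d+))", std::regex::icase);
  return kRegex;
}

// Parses a "created_by"-style string using the shared, precompiled regex.
inline ParsedVersion ParseCreatedBy(const std::string& created_by) {
  ParsedVersion out;
  std::smatch m;
  if (std::regex_search(created_by, m, VersionRegex())) {
    out.application = m[1].str();
    out.major = std::stoi(m[2].str());
    out.minor = std::stoi(m[3].str());
    out.patch = std::stoi(m[4].str());
    out.valid = true;
  }
  return out;
}
```

With this shape, each call pays only for the match itself; the compilation cost
is paid once per process rather than once per file read.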


