Pierre Belzile created ARROW-9863:
-------------------------------------
Summary: [C++] [PARQUET] Optimize meta data recovery of
ApplicationVersion
Key: ARROW-9863
URL: https://issues.apache.org/jira/browse/ARROW-9863
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Affects Versions: 1.0.0
Reporter: Pierre Belzile
The class contains two large regexes which are compiled in the
ApplicationVersion::ApplicationVersion(const std:::string) constructor. This is
the constructor that is used when reading files.
I stopped a server in gdb that had been processing several files at once and 4
threads out of 8 were building those regexes!
--
This message was sent by Atlassian Jira
(v8.3.4#803005)