lei w created HUDI-4613:
---------------------------
Summary: Avoid the use of regular expressions when call
hoodieFileGroup#addLogFile function
Key: HUDI-4613
URL: https://issues.apache.org/jira/browse/HUDI-4613
Project: Apache Hudi
Issue Type: Improvement
Components: compaction
Reporter: lei w
When the number of logFile files exceeds a certain amount of data, the
construction of fsview will become very time-consuming. The reason is that the
LogFileComparator#compare method is frequently called when constructing a
filegroup, and regular expressions are used in this method.
{panel:title=build FileSystemView Log }
INFO view.AbstractTableFileSystemView: #files found in partition (20220810)
=60801, Time taken =28314
INFO view.AbstractTableFileSystemView: addFilesToView: NumFiles=60801,
NumFileGroups=200, FileGroupsCreationTime=34036, StoreTimeTaken=2
{panel}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)