[jira] [Updated] (HUDI-7576) Avoid recomputing partition path in AbstractFileSystemView

2024-04-13 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-7576:

Status: Patch Available  (was: In Progress)

> Avoid recomputing partition path in AbstractFileSystemView
> ----------------------------------------------------------
>
> Key: HUDI-7576
> URL: https://issues.apache.org/jira/browse/HUDI-7576
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Timothy Brown
> Assignee: Timothy Brown
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.15.0, 1.0.0
>
>
> We have observed a non-negligible amount of CPU spent simply computing the 
> partition paths of base and log files when building a file system view. We 
> should aim to improve the efficiency of these calls and reduce the number of 
> them.
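
As an illustration of the idea in the description, here is a minimal sketch of memoizing the partition path per parent directory, so that it is derived once per directory rather than once per base or log file. This is not the code from the Hudi patch: the class PartitionPathCache, its methods, and the plain-string path handling are all hypothetical, and the real AbstractFileSystemView works with Hudi's own path and file types.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper, not part of Hudi: caches the partition path
// (the directory relative to the table base path) per parent directory.
final class PartitionPathCache {
    private final String basePath;
    private final Map<String, String> byParentDir = new HashMap<>();
    private int computations = 0;

    PartitionPathCache(String basePath) {
        // Normalize so that "/tmp/table/" and "/tmp/table" behave the same.
        this.basePath = basePath.endsWith("/")
                ? basePath.substring(0, basePath.length() - 1)
                : basePath;
    }

    // Returns the partition path of a file, computing it at most once
    // for each distinct parent directory.
    String partitionPathOf(String filePath) {
        int lastSlash = filePath.lastIndexOf('/');
        String parentDir = lastSlash < 0 ? "" : filePath.substring(0, lastSlash);
        return byParentDir.computeIfAbsent(parentDir, this::relativize);
    }

    // The comparatively expensive step that the cache avoids repeating.
    private String relativize(String dir) {
        computations++;
        if (dir.equals(basePath)) {
            return ""; // file sits directly under the base path (non-partitioned)
        }
        if (!dir.startsWith(basePath + "/")) {
            throw new IllegalArgumentException(dir + " is not under " + basePath);
        }
        return dir.substring(basePath.length() + 1);
    }

    // Number of times a partition path was actually derived.
    int computations() {
        return computations;
    }
}
```

With this sketch, looking up a base file and a log file that live in the same directory triggers a single derivation; the second lookup is served from the map.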



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HUDI-7576) Avoid recomputing partition path in AbstractFileSystemView

2024-04-13 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-7576:

Status: In Progress  (was: Open)






[jira] [Updated] (HUDI-7576) Avoid recomputing partition path in AbstractFileSystemView

2024-04-13 Thread Ethan Guo (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-7576:

Fix Version/s: 0.15.0, 1.0.0






[jira] [Updated] (HUDI-7576) Avoid recomputing partition path in AbstractFileSystemView

2024-04-11 Thread Timothy Brown (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Timothy Brown updated HUDI-7576:

Summary: Avoid recomputing partition path in AbstractFileSystemView  (was: 
Add partitionPath to the HoodieBaseFile and HoodieLogFile objects)

> Avoid recomputing partition path in AbstractFileSystemView
> ----------------------------------------------------------
>
> Key: HUDI-7576
> URL: https://issues.apache.org/jira/browse/HUDI-7576
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Timothy Brown
> Assignee: Timothy Brown
> Priority: Major
> Labels: pull-request-available
>
> Adding this field to the classes will allow us to avoid repeatedly computing 
> the partition path per file in other parts of the code. This can cut down on 
> the CPU overhead associated with creating the FS View.
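
The earlier proposal quoted above (carrying the partition path as a field on the file objects) can be sketched roughly as follows. This is an assumption-laden illustration only: TrackedFile is a hypothetical stand-in for HoodieBaseFile and HoodieLogFile, and its constructor and groupByPartition helper do not reflect the actual Hudi API.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a base or log file object that stores its
// partition path at construction time instead of re-deriving it later.
final class TrackedFile {
    private final String path;
    private final String partitionPath;

    TrackedFile(String path, String partitionPath) {
        this.path = path;
        this.partitionPath = partitionPath;
    }

    String getPath() {
        return path;
    }

    String getPartitionPath() {
        return partitionPath;
    }

    // Groups files by partition using the stored field, so no path
    // parsing happens while the grouping is built.
    static Map<String, List<TrackedFile>> groupByPartition(List<TrackedFile> files) {
        Map<String, List<TrackedFile>> grouped = new LinkedHashMap<>();
        for (TrackedFile file : files) {
            grouped.computeIfAbsent(file.getPartitionPath(), k -> new ArrayList<>())
                   .add(file);
        }
        return grouped;
    }
}
```

The trade-off in this sketch is one extra reference per file object in exchange for skipping repeated path parsing in any code that later needs the partition.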





[jira] [Updated] (HUDI-7576) Avoid recomputing partition path in AbstractFileSystemView

2024-04-11 Thread Timothy Brown (Jira)


 [ 
https://issues.apache.org/jira/browse/HUDI-7576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Timothy Brown updated HUDI-7576:

Description: We have observed a non-negligible amount of CPU spent simply 
computing the partition paths of base and log files when building a file system 
view. We should aim to improve the efficiency of these calls and reduce the 
number of them.  (was: Adding this field to the classes will allow us to avoid 
repeatedly computing the partition path per file in other parts of the code. 
This can cut down on the CPU overhead associated with creating the FS View.)

> Avoid recomputing partition path in AbstractFileSystemView
> ----------------------------------------------------------
>
> Key: HUDI-7576
> URL: https://issues.apache.org/jira/browse/HUDI-7576
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Timothy Brown
> Assignee: Timothy Brown
> Priority: Major
> Labels: pull-request-available
>
> We have observed a non-negligible amount of CPU spent simply computing the 
> partition paths of base and log files when building a file system view. We 
> should aim to improve the efficiency of these calls and reduce the number of 
> them.


