[ 
https://issues.apache.org/jira/browse/HUDI-1292?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17397657#comment-17397657
 ] 

ASF GitHub Bot commented on HUDI-1292:
--------------------------------------

vinothchandar commented on pull request #3427:
URL: https://github.com/apache/hudi/pull/3427#issuecomment-897179236


   I am thinking about how to turn this off across different paths - spark 
datasource writer, deltastreamer, async compact,cleaner, clustering jobs. 
   
   From the code, I see the syncing happens on preWrite and postCommit. So the 
main issue is `WriteClient#compact()` and `WriteClient#cluster` calls invoking 
sync? Wondering if we should also check the write operation type and avoid 
syncing during these operations. then , as long as multi writer is turned on, 
it should work and be sane. 
   
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


> [Umbrella] RFC-15 : File Listing and Query Planning Optimizations 
> ------------------------------------------------------------------
>
>                 Key: HUDI-1292
>                 URL: https://issues.apache.org/jira/browse/HUDI-1292
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Spark Integration, Writer Core
>    Affects Versions: 0.9.0
>            Reporter: Vinoth Chandar
>            Assignee: Prashant Wason
>            Priority: Major
>              Labels: hudi-umbrellas, pull-request-available
>             Fix For: 0.10.0
>
>
> This is the umbrella ticket that tracks the overall implementation of RFC-15



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to