[ 
https://issues.apache.org/jira/browse/HUDI-2271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ethan Guo updated HUDI-2271:
----------------------------
    Description: 
Follow-up items after HUDI-1138:
 * Implement upgrade/downgrade, rollback regarding markers
 ** Rollback should be based on the markers in the file system, with the marker 
type indicated by MARKERS.type, or assumed to be "direct" if MARKERS.type is absent
 ** New write follows the `hoodie.write.markers.type` config to create markers
 * Verify multiple concurrent writers with timeline-server-based markers
 * Ensure the parallelism passed to the local engine context is reasonable
 * Fix `Task not serializable` due to new APIs in FSUtils for recursive, level 
by level listing (`java.io.NotSerializableException: 
org.apache.hudi.common.fs.FSUtils$$Lambda$4224/1845791682`)
 * Optimize fs.exists(dirPath) call in marker creation in DirectMarkerFiles
 * Optimize future timeouts due to I/O failure in the marker handler
 * Test new marker strategy for all engines (spark, flink and java) w/o any 
additional changes
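
The rollback rule in the first item can be illustrated with a small sketch. This is not Hudi's implementation: the class name `MarkerTypeResolver`, the method `resolve`, the use of a local `java.nio` path in place of Hudi's file system abstraction, and the assumption that MARKERS.type holds the marker type as plain text are all hypothetical and chosen only for illustration.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of Hudi: picks the marker type to use for rollback.
public class MarkerTypeResolver {
    // File name taken from the issue description; its content format is assumed.
    static final String MARKER_TYPE_FILE = "MARKERS.type";
    // Fallback named in the issue description when MARKERS.type is missing.
    static final String DEFAULT_MARKER_TYPE = "direct";

    // Returns the type recorded in MARKERS.type, or "direct" if the file is absent.
    static String resolve(Path markerDir) throws IOException {
        Path typeFile = markerDir.resolve(MARKER_TYPE_FILE);
        if (!Files.isRegularFile(typeFile)) {
            return DEFAULT_MARKER_TYPE;
        }
        byte[] bytes = Files.readAllBytes(typeFile);
        return new String(bytes, StandardCharsets.UTF_8).trim();
    }

    public static void main(String[] args) throws IOException {
        Path markerDir = Files.createTempDirectory("markers");
        // No MARKERS.type yet, so the fallback applies.
        System.out.println(resolve(markerDir));
        // Record a type the way a newer writer might (assumed format).
        Files.write(markerDir.resolve(MARKER_TYPE_FILE),
                "timeline_server_based".getBytes(StandardCharsets.UTF_8));
        System.out.println(resolve(markerDir));
    }
}
```

The point of the sketch is that rollback consults what is on the file system rather than the current `hoodie.write.markers.type` setting, so markers written before an upgrade or a config change are still handled with the type they were created with.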

  was:
Follow-up items after HUDI-1138:
 * Optimize fs.exists(dirPath) call in marker creation in DirectMarkerFiles
 * Test new marker strategy for all engines (spark, flink and java) w/o any 
additional changes


> Follow-up items for timeline-server-based marker files
> ------------------------------------------------------
>
>                 Key: HUDI-2271
>                 URL: https://issues.apache.org/jira/browse/HUDI-2271
>             Project: Apache Hudi
>          Issue Type: Improvement
>            Reporter: Ethan Guo
>            Priority: Major
>
> Follow-up items after HUDI-1138:
>  * Implement upgrade/downgrade, rollback regarding markers
>  ** Rollback should be based on the markers in the file system, with the marker 
> type indicated by MARKERS.type, or assumed to be "direct" if MARKERS.type is absent
>  ** New write follows the `hoodie.write.markers.type` config to create markers
>  * Verify multiple concurrent writers with timeline-server-based markers
>  * Ensure the parallelism passed to the local engine context is reasonable
>  * Fix `Task not serializable` due to new APIs in FSUtils for recursive, 
> level by level listing (`java.io.NotSerializableException: 
> org.apache.hudi.common.fs.FSUtils$$Lambda$4224/1845791682`)
>  * Optimize fs.exists(dirPath) call in marker creation in DirectMarkerFiles
>  * Optimize future timeouts due to I/O failure in the marker handler
>  * Test new marker strategy for all engines (spark, flink and java) w/o any 
> additional changes



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
