[ 
https://issues.apache.org/jira/browse/HUDI-648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17300188#comment-17300188
 ] 

liujinhui commented on HUDI-648:
--------------------------------

[~xushiyan]  [~vinoth] 

When querying the error records, we could either introduce a configuration, similar to
hoodie.datasource.query.table=error
basepath = "hdfs://xxx/datalake/data/tmp/";

or
directly specify the error table path:
basepath = "hdfs://xxx/datalake/data/tmp/.hoodie/errors";
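
To make the comparison concrete, here is a rough sketch of how the first option could resolve to the same location as the second. It is only an illustration: the helper resolve_load_path is hypothetical, and both the option key (assumed to be spelled hoodie.datasource.query.table) and the .hoodie/errors location are taken from this proposal, not from an existing Hudi API.

```python
# Hypothetical sketch: neither the option key nor the ".hoodie/errors"
# location is a confirmed Hudi API; both come from the proposal in this comment.
ERROR_TABLE_SUBPATH = ".hoodie/errors"


def resolve_load_path(base_path: str, options: dict) -> str:
    """Return the path a reader would load, given the table base path and options."""
    if options.get("hoodie.datasource.query.table") == "error":
        # Option 1: the reader derives the error table location from the base path.
        return base_path.rstrip("/") + "/" + ERROR_TABLE_SUBPATH
    # No error-table option set: read the data table itself.
    return base_path


base_path = "hdfs://xxx/datalake/data/tmp/"

# Option 1: keep the table base path and select the error table via configuration.
option1 = resolve_load_path(base_path, {"hoodie.datasource.query.table": "error"})

# Option 2: the user points the reader directly at the error table location.
option2 = "hdfs://xxx/datalake/data/tmp/.hoodie/errors"

print(option1 == option2)
```

With the first option the user never needs to know where the error table is stored, so the layout can change later; with the second, no new configuration is needed but the internal path becomes part of the user-facing contract.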


Which one is better? Please share your comments.

 

> Implement error log/table for Datasource/DeltaStreamer/WriteClient/Compaction 
> writes
> ------------------------------------------------------------------------------------
>
>                 Key: HUDI-648
>                 URL: https://issues.apache.org/jira/browse/HUDI-648
>             Project: Apache Hudi
>          Issue Type: New Feature
>          Components: DeltaStreamer, Spark Integration, Writer Core
>            Reporter: Vinoth Chandar
>            Assignee: liujinhui
>            Priority: Major
>         Attachments: image-2021-03-03-11-40-21-083.png
>
>
> We would like a way to hand the erroring records from writing or compaction 
> back to the users, in a separate table or log. This needs to work generically 
> across all the different writer paths.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
