[
https://issues.apache.org/jira/browse/HUDI-648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17300188#comment-17300188
]
liujinhui commented on HUDI-648:
--------------------------------
[~xushiyan] [~vinoth]
For querying the error records, there are two options. We could introduce a
configuration on top of the normal table base path, similar to
hoodie.datasource.query.table=error
basepath = "hdfs://xxx/datalake/data/tmp/";
or the user could directly specify the error table path:
basepath = "hdfs://xxx/datalake/data/tmp/.hoodie/errors";
Which one is better? Please share your comments.
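
To make the comparison concrete, here is a minimal sketch of how the two options could resolve to the same location. It is only an illustration: the helper resolve_query_path is hypothetical, and both the option key and the .hoodie/errors location are the proposals from this comment, not an existing Hudi API.

```python
# Hypothetical sketch: the option key and the error-table location are the
# proposals from this comment, not a confirmed Hudi API.
ERROR_TABLE_OPTION = "hoodie.datasource.query.table"  # proposed key (assumption)
ERROR_TABLE_SUBDIR = ".hoodie/errors"                 # proposed location (assumption)


def resolve_query_path(base_path: str, options: dict) -> str:
    """Return the path a reader would load under either proposal."""
    if options.get(ERROR_TABLE_OPTION) == "error":
        # Option 1: the user passes the table base path plus the option,
        # and the reader derives the error table location from it.
        return base_path.rstrip("/") + "/" + ERROR_TABLE_SUBDIR
    # Option 2 (or a normal query): the path is used exactly as given.
    return base_path


# Option 1: configuration plus the table base path.
via_option = resolve_query_path(
    "hdfs://xxx/datalake/data/tmp/", {ERROR_TABLE_OPTION: "error"}
)

# Option 2: the error table path is specified directly.
via_path = resolve_query_path("hdfs://xxx/datalake/data/tmp/.hoodie/errors", {})
```

With option 1 the user never needs to know where the error table lives, so the location can change later without breaking queries; option 2 needs no new configuration but exposes the internal layout.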
> Implement error log/table for Datasource/DeltaStreamer/WriteClient/Compaction
> writes
> ------------------------------------------------------------------------------------
>
> Key: HUDI-648
> URL: https://issues.apache.org/jira/browse/HUDI-648
> Project: Apache Hudi
> Issue Type: New Feature
> Components: DeltaStreamer, Spark Integration, Writer Core
> Reporter: Vinoth Chandar
> Assignee: liujinhui
> Priority: Major
> Attachments: image-2021-03-03-11-40-21-083.png
>
>
> We would like a way to hand the erroring records from writing or compaction
> back to the users, in a separate table or log. This needs to work generically
> across all the different writer paths.