[
https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17010282#comment-17010282
]
Vinoth Chandar commented on HUDI-507:
-------------------------------------
There is a PR open for CSV source.. May be can see if sharing code is possible
or borrow the approach from that?
> Support \ t split hdfs source
> -----------------------------
>
> Key: HUDI-507
> URL: https://issues.apache.org/jira/browse/HUDI-507
> Project: Apache Hudi (incubating)
> Issue Type: Improvement
> Components: Utilities
> Reporter: liujinhui
> Priority: Minor
> Fix For: 0.5.1
>
> Original Estimate: 240h
> Remaining Estimate: 240h
>
> hi,hudi
>
> Current Hudi data source does not support HDFS file data splitting with \ t
> separator
> I want to complete it and contribute to the community.
> The main change is the addition of the TextDFSSource class to provide
> support.
> The specific new logic is: split the hdfs data according to the delimiter,
> and then map it to the source.avsc pattern
>
> Or do some other symbol format as an extension
> thanks,
> liujh
>
> [~vinoth] Please help with suggestions
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)