[ https://issues.apache.org/jira/browse/HUDI-507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
liujinhui updated HUDI-507: --------------------------- Status: Open (was: New) > Support \ t split hdfs source > ----------------------------- > > Key: HUDI-507 > URL: https://issues.apache.org/jira/browse/HUDI-507 > Project: Apache Hudi (incubating) > Issue Type: Improvement > Components: Utilities > Reporter: liujinhui > Assignee: liujinhui > Priority: Minor > Fix For: 0.6.0 > > Original Estimate: 240h > Remaining Estimate: 240h > > hi,hudi > > Current Hudi data source does not support HDFS file data splitting with \ t > separator > I want to complete it and contribute to the community. > The main change is the addition of the TextDFSSource class to provide > support. > The specific new logic is: split the hdfs data according to the delimiter, > and then map it to the source.avsc pattern > > Or do some other symbol format as an extension > thanks, > liujh > > [~vinoth] Please help with suggestions > -- This message was sent by Atlassian Jira (v8.3.4#803005)