[ 
https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16976607#comment-16976607
 ] 

Vinoth Chandar commented on HUDI-76:
------------------------------------

[~guoyihua] Is there any reason this has to differ from the existing DFS source,
which uses the lastModifiedTime of the source dataset's files? I understand that
for large datasets this listing can get unwieldy, but changing it for CSV alone
seems too narrow in scope. Can we do what we do for the other DFS sources now
and tackle this in a separate PR?
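
For context, the modification-time approach mentioned above can be sketched
roughly as follows. This is a minimal illustration, not Hudi's actual
implementation: FileInfo, select_new_files, and the sample paths are
hypothetical names, and the file listing is passed in as plain data rather
than read from a real filesystem.

```python
from typing import List, NamedTuple, Optional, Tuple


class FileInfo(NamedTuple):
    """Hypothetical stand-in for one entry of a DFS file listing."""
    path: str
    mtime_ms: int  # last modification time, epoch milliseconds


def select_new_files(
    files: List[FileInfo], checkpoint_ms: Optional[int]
) -> Tuple[List[FileInfo], Optional[int]]:
    """Return files modified after the checkpoint, plus the next checkpoint.

    A checkpoint of None means nothing has been read yet, so every file
    is eligible.
    """
    eligible = sorted(
        (f for f in files if checkpoint_ms is None or f.mtime_ms > checkpoint_ms),
        key=lambda f: f.mtime_ms,
    )
    if not eligible:
        # Nothing new: keep the old checkpoint unchanged.
        return [], checkpoint_ms
    # The newest modification time in this batch becomes the next checkpoint.
    return eligible, eligible[-1].mtime_ms


listing = [
    FileInfo("/data/a.csv", 100),
    FileInfo("/data/b.csv", 250),
    FileInfo("/data/c.csv", 400),
]
batch, next_checkpoint = select_new_files(listing, checkpoint_ms=100)
```

Note that every call scans the full listing, which is the cost that grows
unwieldy on large datasets.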

> CSV Source support for Hudi Delta Streamer
> ------------------------------------------
>
>                 Key: HUDI-76
>                 URL: https://issues.apache.org/jira/browse/HUDI-76
>             Project: Apache Hudi (incubating)
>          Issue Type: Improvement
>          Components: deltastreamer, Incremental Pull
>            Reporter: Balaji Varadarajan
>            Assignee: Ethan Guo
>            Priority: Minor
>
> DeltaStreamer does not support pulling CSV data from sources (HDFS log
> files/Kafka). This ticket is to provide support for CSV sources.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
