xiarixiaoyao commented on pull request #3203:
URL: https://github.com/apache/hudi/pull/3203#issuecomment-940777833


   @danny0405 @nsivabalan @leesf 
   I have updated the code and addressed all comments:
   1. Restored HUDI-1969 and removed RealTimeMergedRecordReader.java, which is no
longer needed.
   2. Added more docs on
HoodieParquetRealtimeInputFormat.listStatusForIncrementalMode and split up this
function.
   3. Kept PathWithLogFileStatus unsplittable, just like Spark's
MergeOnReadIncrementalRelation. Tables that use the HBase index and tables
written by Flink should both work now.
   
   
   
   One last thing: we currently have no way to distinguish newly added data
from updated data in a log file, so the split has to stay unsplittable (the
same logic as MergeOnReadIncrementalRelation).
   If we can find a way to tell added data from updated data, we can remove
this limitation.
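
   To illustrate the idea, here is a minimal, self-contained sketch. It is not the code in this PR: `FileEntry`, `Split`, `isSplittable` and `planSplits` are hypothetical names that do not depend on Hadoop or Hudi. It only shows the rule described above: a file that carries log files is planned as one whole split, while a plain base file can still be cut into several splits.

```java
import java.util.ArrayList;
import java.util.List;

public class UnsplittableSketch {

    // Hypothetical stand-in for a base file that may have log files attached.
    static final class FileEntry {
        final String path;
        final long length;
        final boolean hasLogFiles;

        FileEntry(String path, long length, boolean hasLogFiles) {
            this.path = path;
            this.length = length;
            this.hasLogFiles = hasLogFiles;
        }
    }

    // Hypothetical split descriptor: a byte range of one file.
    static final class Split {
        final String path;
        final long start;
        final long length;

        Split(String path, long start, long length) {
            this.path = path;
            this.start = start;
            this.length = length;
        }
    }

    // Assumption: added and updated records in a log file cannot be told
    // apart, so any file with log files must be read as a whole.
    static boolean isSplittable(FileEntry file) {
        return !file.hasLogFiles;
    }

    static List<Split> planSplits(FileEntry file, long splitSize) {
        if (splitSize <= 0) {
            throw new IllegalArgumentException("splitSize must be positive");
        }
        List<Split> splits = new ArrayList<>();
        if (!isSplittable(file)) {
            // One split covering the whole file, so the log records are
            // merged exactly once.
            splits.add(new Split(file.path, 0, file.length));
            return splits;
        }
        for (long offset = 0; offset < file.length; offset += splitSize) {
            long size = Math.min(splitSize, file.length - offset);
            splits.add(new Split(file.path, offset, size));
        }
        return splits;
    }

    public static void main(String[] args) {
        FileEntry plain = new FileEntry("base.parquet", 300, false);
        FileEntry withLogs = new FileEntry("base-with-logs.parquet", 300, true);
        System.out.println("plain base file splits: " + planSplits(plain, 128).size());
        System.out.println("file with logs splits:  " + planSplits(withLogs, 128).size());
    }
}
```

   If added and updated records could be separated, `isSplittable` in this sketch could return true for files with log files as well, which is the limitation mentioned above.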

