n3nash commented on issue #2095: URL: https://github.com/apache/hudi/issues/2095#issuecomment-696212548
@vishalpathak1986 Currently, for MOR tables, Hudi writes inserts in a columnar file format (Parquet): all inserts go to Parquet files, while updates go to Avro log files. This is done for two reasons: a) if you only have inserts, your data is written in a columnar file format from the start and you don't have to compact it again; b) there is no index that can index log files. This feature should soon be supported through https://cwiki.apache.org/confluence/display/HUDI/RFC+-+08+%3A+Record+level+indexing+mechanisms+for+Hudi+datasets, or in the meantime you can try the [HBaseIndex](https://github.com/apache/hudi/blob/master/hudi-client/src/main/java/org/apache/hudi/index/hbase/HBaseIndex.java), which requires an HBase cluster.
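
If you want to try the HBase index, a rough sketch of the writer options could look like the following. The `hoodie.*` keys are the option names as I understand them from the Hudi configuration docs; the helper function, table name, field names, and ZooKeeper host are hypothetical placeholders, so please check them against the Hudi version you run.

```python
def hbase_index_options(table_name, zk_quorum, zk_port, hbase_table):
    """Build writer options for a MOR table that uses the HBase index.

    This helper is illustrative only; it is not part of Hudi.
    """
    return {
        "hoodie.table.name": table_name,
        "hoodie.datasource.write.table.type": "MERGE_ON_READ",
        "hoodie.datasource.write.operation": "upsert",
        # Hypothetical column names; replace with the fields of your dataset.
        "hoodie.datasource.write.recordkey.field": "id",
        "hoodie.datasource.write.precombine.field": "ts",
        "hoodie.datasource.write.partitionpath.field": "dt",
        # Switch the index to HBase and point it at the cluster.
        "hoodie.index.type": "HBASE",
        "hoodie.index.hbase.zkquorum": zk_quorum,
        "hoodie.index.hbase.zkport": str(zk_port),
        "hoodie.index.hbase.table": hbase_table,
    }


# Placeholder values, assumed for illustration.
options = hbase_index_options("my_mor_table", "zk-host-1", 2181, "hudi_index")

# With a SparkSession and a DataFrame `df` available, the write would look roughly like:
# df.write.format("hudi").options(**options).mode("append").save("/path/to/table")
```

The HBase table named in `hoodie.index.hbase.table` has to be reachable from the Spark executors, since the index lookup happens during the write.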
