lw309637554 edited a comment on pull request #2283: URL: https://github.com/apache/hudi/pull/2283#issuecomment-792845297
@pengzhiwei2018 @nsivabalan @vinothchandar 1、Now when use spark sql query hudi, need set "spark.sql.hive.convertMetastoreParquet=false" https://hudi.apache.org/docs/querying_data.html. But it is confused, many user maybe forget it . 2、If spark read the table use datasource have a big advantage: hive meta will be very light, the partition list and schema do not need to visit hive meta. How does databricks delta do ? 3、I suggest to persist the properties to hudi metatable. Hive meta just persist the table name and database name . May be we can research delta lake. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
