vinothchandar commented on a change in pull request #1333: [HUDI-589][DOCS] Fix querying_data page URL: https://github.com/apache/incubator-hudi/pull/1333#discussion_r379226624
########## File path: docs/_docs/2_3_querying_data.md ########## @@ -145,8 +161,13 @@ Additionally, `HoodieReadClient` offers the following functionality using Hudi's | filterExists() | Filter out already existing records from the provided RDD[HoodieRecord]. Useful for de-duplication | | checkExists(keys) | Check if the provided keys exist in a Hudi table | +### Read optimized query + +For read optimized queries, either Hive SerDe can be used by turning off convertMetastoreParquet as described above or Spark's built in support can be leveraged. +If using spark's built in support, additionally a path filter needs to be pushed into sparkContext as described earlier. ## Presto -Presto is a popular query engine, providing interactive query performance. Presto currently supports only read optimized queries on Hudi tables. -This requires the `hudi-presto-bundle` jar to be placed into `<presto_install>/plugin/hive-hadoop2/`, across the installation. +Presto is a popular query engine, providing interactive query performance. Presto currently supports snapshot queries on +COPY_On_WRITE and read optimized queries on MERGE_ON_READ Hudi tables. This requires the `hudi-presto-bundle` jar Review comment: COPY_ON_WRITE: typo ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
