paul-rogers commented on a change in pull request #1953: Add docs for Drill Metastore URL: https://github.com/apache/drill/pull/1953#discussion_r374451863
########## File path: _docs/performance-tuning/drill-metastore/030-drill-iceberg-metastore.md ########## @@ -0,0 +1,69 @@ +--- +title: "Drill Iceberg Metastore" +parent: "Drill Metastore" +date: 2020-01-31 +--- + +Drill uses Iceberg Metastore implementation based on [Iceberg tables](http://iceberg.incubator.apache.org). For Drill 1.17, + this is default Drill Metastore implementation. For details on how to configure Iceberg Metastore implementation and + its option descriptions, please refer to [Iceberg Metastore docs](https://github.com/apache/drill/blob/master/metastore/iceberg-metastore/README.md). + +{% include startnote.html %} +Iceberg table supports concurrent writes and transactions but they are only effective on file systems that support + atomic rename. +If the file system does not support atomic rename, it could lead to inconsistencies during concurrent writes. +{% include endnote.html %} + +### Iceberg Tables Location + +Iceberg tables will reside on the file system in the location based on +Iceberg Metastore base location `drill.metastore.iceberg.location.base_path` and component specific location. +If Iceberg Metastore base location is `/drill/metastore/iceberg` +and tables component location is `tables`. Iceberg table for tables component +will be located in `/drill/metastore/iceberg/tables` folder. + +Metastore metadata will be stored inside Iceberg table location provided +in the configuration file. Drill table metadata location will be constructed +based on specific component storage keys. For example, for `tables` component, +storage keys are storage plugin, workspace and table name: unique table identifier in Drill. Review comment: This is hard to understand. We just explained the "tables" folder. Perhaps say, "Within the tables directory, there is one file per analyzed table in the form `<storage-plugin-config-name>/<workspace-name>/<table-name>.whatever`. Note that you are very likely (or should be) using the storage plugin *config* name, not the storage plugin name. That is "my-hdfs", not "dfs." ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
