paul-rogers commented on a change in pull request #1986: Additional changes for 
Drill Metastore docs
URL: https://github.com/apache/drill/pull/1986#discussion_r384874809
 
 

 ##########
 File path: 
_docs/performance-tuning/drill-metastore/010-using-drill-metastore.md
 ##########
 @@ -364,7 +406,10 @@ apache drill> ANALYZE TABLE dfs.tmp.lineitem DROP 
METADATA;
 1 row selected (0.291 seconds)
 ```
 
-### Collecting metadata for specific set of columns
+### Collect metadata for specific set of columns
+
+Next lets gather metadata for a subset of the columns in the `lineitem` table. 
You would do this when you know
+ which columns will be present in filter conditions or data size or columns 
count is too large.
 
 Review comment:
   A bit awkward. Maybe something like: "You would do this to allow Drill to 
optimize `WHERE` conditions on certain columns. Also, if file size or the 
number of columns grows large, it can take too long to gather all statistics. 
Instead you can speed up analysis by gathering statistics only for selected 
columns: those actually used in the `WHERE` clause."

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

Reply via email to