jnturton commented on a change in pull request #886:
URL: https://github.com/apache/drill/pull/886#discussion_r787409132
##########
File path: _docs/performance-tuning/010-performance-tuning-introduction.md
##########
@@ -3,9 +3,9 @@ title: "Performance Tuning Introduction"
date:
parent: "Performance Tuning"
---
-You can apply performance tuning measures to improve how efficiently Drill
queries data. To significantly improve performance in Drill, you must have
knowledge about the underlying data and data sources, as well as familiarity
with how Drill executes queries.
+You can change system options in Drill to improve the query performance.
Before you improve performance in Drill, you must choose a layout of the data
and the choose an appropriate file format specific to your use case. For
example, for an analytic workload operating on historical time series data,
then choosing Parquet as the file format and a partitioning scheme that uses
time as a partitionining dimension would be a recommended approach. In the case
you are directly querying data data sources, you need to have an understanding
of the data source itself. Some familiarity with how Drill executes queries can
also help.
Review comment:
I think it's okay for this introductory page to be vague and high-level.
Partitioning is discussed in more detail in its own child page.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]