Github user xuchuanyin commented on a diff in the pull request:
https://github.com/apache/carbondata/pull/2693#discussion_r215976048
--- Diff: docs/configuration-parameters.md ---
@@ -70,7 +70,7 @@ This section provides the details of all the
configurations required for the Car
| carbon.enable.calculate.size | true | **For Load Operation**: Setting
this property calculates the size of the carbon data file (.carbondata) and
carbon index file (.carbonindex) for every load and updates the table status
file. **For Describe Formatted**: Setting this property calculates the total
size of the carbon data files and carbon index files for the respective table
and displays in describe formatted command.**NOTE:** This is useful to
determine the overall size of the carbondata table and also get an idea of how
the table is growing in order to take up other backup strategy decisions. |
| carbon.cutOffTimestamp | (none) | CarbonData has capability to generate
the Dictionary values for the timestamp columns from the data itself without
the need to store the computed dictionary values. This configuration sets the
start date for calculating the timestamp. Java counts the number of
milliseconds from start of "1970-01-01 00:00:00". This property is used to
customize the start of position. For example "2000-01-01 00:00:00". **NOTE:**
The date must be in the form ***carbon.timestamp.format***. CarbonData supports
storing data for upto 68 years.For example, if the cut-off time is 1970-01-01
05:30:00, then data upto 2038-01-01 05:30:00 will be supported by CarbonData. |
| carbon.timegranularity | SECOND | The configuration is used to specify
the data granularity level such as DAY, HOUR, MINUTE, or SECOND.This helps to
store more than 68 years of data into CarbonData. |
-| carbon.use.local.dir | false | CarbonData during data loading, writes
files to local temp directories before copying the files to HDFS.This
configuration is used to specify whether CarbonData can write locally to tmp
directory of the container or to the YARN application directory. |
+| carbon.use.local.dir | false | CarbonData,during data loading, writes
files to local temp directories before copying the files to HDFS.This
configuration is used to specify whether CarbonData can write locally to tmp
directory of the container or to the YARN application directory. |
| carbon.use.multiple.temp.dir | false | When multiple disks are present
in the system, YARN is generally configured with multiple disks to be used as
temp directories for managing the containers.This configuration specifies
whether to use multiple YARN local directories during data loading for disk IO
load balancing.Enable ***carbon.use.local.dir*** for this configuration to take
effect.**NOTE:** Data Loading is an IO intensive operation whose performance
can be limited by the disk IO threshold, particularly during multi table
concurrent data load.Configuring this parameter, balances the disk IO across
multiple disks there by improving the over all load performance. |
--- End diff --
I think this can be enabled by default, along with the previous
configuration, so that beginners get better performance.
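
A rough sketch of the intended effect, assuming the two settings meant here
are `carbon.use.local.dir` and `carbon.use.multiple.temp.dir` from the diff
above, and that they are set in `carbon.properties`. Both are documented with
a default of false, so today a user would have to enable them explicitly:

```properties
# Assumption: these are the two properties the comment refers to.
# Write load files to local temp directories before copying them to HDFS.
carbon.use.local.dir=true
# Spread temp files across multiple YARN local directories to balance disk IO.
# Per the documentation, this only takes effect when carbon.use.local.dir is enabled.
carbon.use.multiple.temp.dir=true
```

Whether both are safe as defaults likely depends on the cluster's disk layout,
so this is only meant to illustrate the suggestion.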
---