GitHub user Zhangshunyu opened a pull request:
https://github.com/apache/incubator-carbondata/pull/8
Record load performance statistics(configurable)
1)We can use a parameter "time.statistics.open" which can be configured by
user to determine whether the statistics will be recorded and calculated during
data loading, the default value is false.
2)We defined a dummy util, if we don't need to record the statistics, it
will do nothing.
For example, we set "time.statistics.open" to "true" and run CarbonExample,
the message as following:
Data load request has been received for table default.t3
============================== lru Cache Load Cost Time: 0.003(s)
]==========**********Compress One Node For One Thread pool-24-thread-1
Cost: 0.017
]========== TIME STATISTICS PartitionID: 0==========
]STATISTICS ->Raw data IO cost(load csv to dataFrame and generate block
dict): 0.354(s)
]STATISTICS ->Distinct Value IO cost(global dict shuffle and write dict
file): 0.153(s)
]STATISTICS -> |_maximum distinct column shuffle time: 0.046(s)
]STATISTICS -> |_maximum distinct column write dict file time: 0.087(s)
]STATISTICS ->Total cost of gen surrogate key, sort and write to temp
files: 0.45(s)
]STATISTICS -> |_read cost of raw csv file: 0.313(s)
]STATISTICS -> |_cost of transform to surrogate key: 0.353(s)
]STATISTICS -> |_io cost(sort rows and write to temp file): 0.32(s)
]STATISTICS ->IO cost(tansform to MDK, compress and write fact file):
0.69(s)
]==============================
]========== BLOCK_INFO ==========
]BLOCK_INFO ->Node host: localhost
]BLOCK_INFO ->The block count in this node: 1
]==============================
Data load is successful for default.t3
+-------+------+
|country|amount|
+-------+------+
| france| 101|
| china| 849|
+-------+------+
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Zhangshunyu/incubator-carbondata stat71
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/8.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #8
----
commit 6f86351cf7108d04708c9f1771fe1a3e18cb8af9
Author: Zhangshunyu <[email protected]>
Date: 2016-06-01T12:17:57Z
record statitics during data loading
commit 753f192e777482c1082b6dcf91a6fb5585cdf93c
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T02:51:26Z
rebase630
commit b1421f0329a28b9f18ad8afe1bc9d78acb307601
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T07:32:28Z
new structure
commit e868cda3f48d0e0de1fda2325b44791dafce581f
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T08:07:45Z
style
commit 903798149076dddc5be7e8991ab2a61e11226579
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T09:53:53Z
style
commit 1a3b750ee12e796c2bba19ba486da77f2bfc9922
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T10:01:09Z
style
commit c0da49d7ebf90150ad1cef3168968cd63b5561a7
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T10:07:18Z
style
commit ccf96a3576df6c1b2dd703038d579893a5a3bf26
Author: Zhangshunyu <[email protected]>
Date: 2016-06-30T10:15:51Z
style
commit 1f3ea224addfc46d59130872cb2fd1ab8a844b8c
Author: Zhangshunyu <[email protected]>
Date: 2016-07-01T01:04:39Z
modify the peramerter
commit 4d8ac7cf30bb08a50600bcabb7c62bcf21ca975b
Author: Zhangshunyu <[email protected]>
Date: 2016-07-01T03:16:30Z
remove set for nopar stat
commit 5938bfce96061eaf309d903ccc7f9686bc0a68a5
Author: Zhangshunyu <[email protected]>
Date: 2016-07-01T03:21:08Z
style
commit bd92401605ee3dedf971fef0b8fab008cea4b720
Author: Zhangshunyu <[email protected]>
Date: 2016-07-01T07:31:35Z
remove tree set
commit 708686761b0bd3dc1c0c97f94140692ce8ca9a97
Author: Zhangshunyu <[email protected]>
Date: 2016-07-01T07:38:49Z
fix style
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---