[06/50] carbondata git commit: [CARBONDATA-1827] S3 Carbon Implementation

2018-02-27 Thread jackylk
[CARBONDATA-1827] S3 Carbon Implementation 1.Provide support for s3 in carbondata. 2.Added S3Example to create carbon table on s3. 3.Added S3CSVExample to load carbon table using csv from s3. This closes #1805 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[43/50] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-27 Thread jackylk
[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row Pick up the no-sort fields in the row, pack them as a byte array, and skip parsing them during merge sort to reduce CPU consumption This closes #1792 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo

[02/50] carbondata git commit: [CARBONDATA-1968] Add external table support

2018-02-27 Thread jackylk
[CARBONDATA-1968] Add external table support This PR adds support for creating external table with existing carbondata files, using Hive syntax. CREATE EXTERNAL TABLE tableName STORED BY 'carbondata' LOCATION 'path' This closes #1749 Project:

[38/50] carbondata git commit: [HotFix][CheckStyle] Fix import related checkstyle

2018-02-27 Thread jackylk
[HotFix][CheckStyle] Fix import related checkstyle This closes #1952 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/aef478f9 Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/aef478f9 Diff:

[35/50] carbondata git commit: [CARBONDATA-2156] Add interface annotation

2018-02-27 Thread jackylk
[CARBONDATA-2156] Add interface annotation InterfaceAudience and InterfaceStability annotation should be added for user and developer 1.InterfaceAudience can be User and Developer 2.InterfaceStability can be Stable, Evolving, Unstable This closes #1968 Project:

[49/50] carbondata git commit: [CARBONDATA-1114][Tests] Fix bugs in tests in windows env

2018-02-27 Thread jackylk
[CARBONDATA-1114][Tests] Fix bugs in tests in windows env Fix bugs in tests that will cause failure under windows env This closes #1994 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/c45a44d9 Tree:

[03/50] carbondata git commit: [CARBONDATA-1992] Remove partitionId in CarbonTablePath

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/574f014c/processing/src/main/java/org/apache/carbondata/processing/loading/steps/DataWriterProcessorStepImpl.java -- diff --git

[39/50] carbondata git commit: [CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/9ecdd7fa/processing/src/main/java/org/apache/carbondata/processing/loading/model/CarbonLoadModelBuilder.java -- diff --git

[04/50] carbondata git commit: [CARBONDATA-1992] Remove partitionId in CarbonTablePath

2018-02-27 Thread jackylk
[CARBONDATA-1992] Remove partitionId in CarbonTablePath In CarbonTablePath, there is a deprecated partition id which is always 0, it should be removed to avoid confusion. This closes #1765 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[05/50] carbondata git commit: [CARBONDATA-1480]Min Max Index Example for DataMap

2018-02-27 Thread jackylk
[CARBONDATA-1480]Min Max Index Example for DataMap Datamap Example. Implementation of Min Max Index through Datamap, and using the index while pruning. This closes #1359 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[37/50] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-27 Thread jackylk
[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row Pick up the no-sort fields in the row, pack them as a byte array, and skip parsing them during merge sort to reduce CPU consumption This closes #1792 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo

[40/50] carbondata git commit: [CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/9ecdd7fa/integration/spark-common/src/main/scala/org/apache/carbondata/spark/util/DataLoadingUtil.scala -- diff --git

[48/50] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading

2018-02-27 Thread jackylk
[CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading Enhance data loading performance by specifying sort column bounds 1. Add row range number during convert-process-step 2. Dispatch rows to each sorter by range number 3. Sort/Write process step can be done

[10/50] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2e1f2b95/core/src/test/java/org/apache/carbondata/core/util/path/CarbonFormatDirectoryStructureTest.java -- diff --git

[30/50] carbondata git commit: Revert "[CARBONDATA-2023][DataLoad] Add size base block allocation in data loading"

2018-02-27 Thread jackylk
Revert "[CARBONDATA-2023][DataLoad] Add size base block allocation in data loading" This reverts commit 6dd8b038fc898dbf48ad30adfc870c19eb38e3d0. Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/53c9ac7f Tree:

[12/50] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2e3077c4/processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionExecutor.java -- diff --git

[46/50] carbondata git commit: [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface

2018-02-27 Thread jackylk
[CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface This closes #1986 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/20e6bd89 Tree:

[09/50] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2e1f2b95/integration/spark2/src/main/scala/org/apache/carbondata/spark/rdd/AggregateDataMapCompactor.scala -- diff --git

[33/50] carbondata git commit: Revert "[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row"

2018-02-27 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/0db9b81b/processing/src/main/java/org/apache/carbondata/processing/loading/sort/unsafe/holder/UnsafeSortTempFileChunkHolder.java -- diff --git

[50/50] carbondata git commit: [REBASE] Solve conflict after merging master

2018-02-27 Thread jackylk
[REBASE] Solve conflict after merging master Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/4067d215 Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/4067d215 Diff:

carbondata git commit: [HOTFIX] Fix timestamp format in testcase

2018-02-27 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore 37080e9a3 -> c738afbc2 [HOTFIX] Fix timestamp format in testcase In UT testcases, there is random failure because of thread local variable in DataTypeUtil. This PR corrected it. This closes #2007 Project:

[2/2] carbondata git commit: [CARBONDATA-2206] support lucene index datamap

2018-02-27 Thread jackylk
[CARBONDATA-2206] support lucene index datamap This PR is an initial effort to integrate lucene as an index datamap into carbondata. A new module called carbondata-lucene is added to support lucene datamap: 1.Add LuceneFineGrainDataMap, implement FineGrainDataMap interface. 2.Add

[1/2] carbondata git commit: [CARBONDATA-2206] support lucene index datamap

2018-02-27 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap 06c572dee -> 24ebfbbda http://git-wip-us.apache.org/repos/asf/carbondata/blob/24ebfbbd/processing/src/main/java/org/apache/carbondata/processing/store/writer/AbstractFactDataWriter.java

[1/2] carbondata git commit: [HOTFIX] Upgrade pom version from 1.3.0 to 1.4.0

2018-02-26 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap fd97f1db1 -> 06c572dee [HOTFIX] Upgrade pom version from 1.3.0 to 1.4.0 Upgrade pom version from 1.3.0 to 1.4.0 This closes #1961 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[2/2] carbondata git commit: [HOTFIX] upgrade to 1.4.0-SNAPSHOT for processing and sdk

2018-02-26 Thread jackylk
[HOTFIX] upgrade to 1.4.0-SNAPSHOT for processing and sdk Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/06c572de Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/06c572de Diff:

[carbondata] Git Push Summary [forced push!] [Forced Update!]

2018-02-26 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap f09b07025 -> ef3031d0c (forced update)

[48/49] carbondata git commit: [CARBONDATA-1114][Tests] Fix bugs in tests in windows env

2018-02-26 Thread jackylk
[CARBONDATA-1114][Tests] Fix bugs in tests in windows env Fix bugs in tests that will cause failure under windows env This closes #1994 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/de36a5d8 Tree:

[08/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/util/AbstractDataFileFooterConverter.java -- diff --git

[17/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/datastore/chunk/impl/FixedLengthDimensionDataChunk.java -- diff --git

[40/49] carbondata git commit: [CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module

2018-02-26 Thread jackylk
[CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module To allow building an assembly JAR for the store-sdk module, it should not depend on the carbon-spark module This closes #1970 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[09/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/scan/result/iterator/PartitionSpliterRawResultIterator.java -- diff --git

[41/49] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/93b2efdd/processing/src/main/java/org/apache/carbondata/processing/loading/sort/unsafe/holder/UnsafeSortTempFileChunkHolder.java -- diff --git

[23/49] carbondata git commit: [CARBONDATA-2080] [S3-Implementation] Propagated hadoopConf from driver to executor for s3 implementation in cluster mode.

2018-02-26 Thread jackylk
[CARBONDATA-2080] [S3-Implementation] Propagated hadoopConf from driver to executor for s3 implementation in cluster mode. Problem: hadoopConf was not propagated from the driver to the executors, so data loads failed in a distributed environment. Solution: Setting the Hadoop conf in

[19/49] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/4d453d4b/processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionUtil.java -- diff --git

[44/49] carbondata git commit: Support generating assembling JAR for store-sdk module

2018-02-26 Thread jackylk
Support generating assembling JAR for store-sdk module Support generating assembling JAR for store-sdk module and remove junit dependency This closes #1976 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/33a6d2bc

[22/49] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-26 Thread jackylk
[CARBONDATA-2025] Unify all path construction through CarbonTablePath static method Refactor CarbonTablePath: 1.Remove CarbonStorePath and use CarbonTablePath only. 2.Make CarbonTablePath a utility without object creation, it can avoid creating object before using it, thus code is cleaner

[02/49] carbondata git commit: [CARBONDATA-1992] Remove partitionId in CarbonTablePath

2018-02-26 Thread jackylk
[CARBONDATA-1992] Remove partitionId in CarbonTablePath In CarbonTablePath, there is a deprecated partition id which is always 0, it should be removed to avoid confusion. This closes #1765 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[36/49] carbondata git commit: [CARBONDATA-2156] Add interface annotation

2018-02-26 Thread jackylk
[CARBONDATA-2156] Add interface annotation InterfaceAudience and InterfaceStability annotation should be added for user and developer 1.InterfaceAudience can be User and Developer 2.InterfaceStability can be Stable, Evolving, Unstable This closes #1968 Project:

[39/49] carbondata git commit: [CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/06a71586/integration/spark-common/src/main/scala/org/apache/carbondata/spark/util/DataLoadingUtil.scala -- diff --git

[04/49] carbondata git commit: [CARBONDATA-1827] S3 Carbon Implementation

2018-02-26 Thread jackylk
[CARBONDATA-1827] S3 Carbon Implementation 1.Provide support for s3 in carbondata. 2.Added S3Example to create carbon table on s3. 3.Added S3CSVExample to load carbon table using csv from s3. This closes #1805 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[20/49] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/4d453d4b/integration/spark2/src/main/scala/org/apache/carbondata/spark/rdd/AggregateDataMapCompactor.scala -- diff --git

[35/49] carbondata git commit: Revert "[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row"

2018-02-26 Thread jackylk
Revert "[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row" This reverts commit de92ea9a123b17d903f2d1d4662299315c792954. Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/2058a472 Tree:

[42/49] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-26 Thread jackylk
[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row Pick up the no-sort fields in the row, pack them as a byte array, and skip parsing them during merge sort to reduce CPU consumption This closes #1792 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo

[25/49] carbondata git commit: [CARBONDATA-1544][Datamap] Datamap FineGrain implementation

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/e992013a/datamap/examples/src/minmaxdatamap/main/java/org/apache/carbondata/datamap/examples/MinMaxDataWriter.java -- diff --git

[38/49] carbondata git commit: [CARBONDATA-2159] Remove carbon-spark dependency in store-sdk module

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/06a71586/processing/src/main/java/org/apache/carbondata/processing/loading/model/CarbonLoadModelBuilder.java -- diff --git

[45/49] carbondata git commit: [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface

2018-02-26 Thread jackylk
[CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface This closes #1986 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/e0591671 Tree:

[37/49] carbondata git commit: [CARBONDATA-1997] Add CarbonWriter SDK API

2018-02-26 Thread jackylk
[CARBONDATA-1997] Add CarbonWriter SDK API Added a new module called store-sdk, and added a CarbonWriter API that can be used to write Carbondata files to a specified folder, without Spark and Hadoop dependency. Users can use this API in any environment. This closes #1967 Project:

[01/49] carbondata git commit: [CARBONDATA-1992] Remove partitionId in CarbonTablePath

2018-02-26 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore-rebase4 [created] 7466d6538 http://git-wip-us.apache.org/repos/asf/carbondata/blob/bf3602fc/processing/src/main/java/org/apache/carbondata/processing/loading/steps/DataWriterProcessorStepImpl.java

[27/49] carbondata git commit: [REBASE] Solve conflict after rebasing master

2018-02-26 Thread jackylk
[REBASE] Solve conflict after rebasing master Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/806b9841 Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/806b9841 Diff:

[07/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/test/java/org/apache/carbondata/core/util/CarbonUtilTest.java -- diff --git a/core/src/test/java/org/apache/carbondata/core/util/CarbonUtilTest.java

[11/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/scan/filter/executer/RowLevelRangeLessThanFiterExecuterImpl.java -- diff --git

[29/49] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-26 Thread jackylk
[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row Pick up the no-sort fields in the row, pack them as a byte array, and skip parsing them during merge sort to reduce CPU consumption This closes #1792 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo

[05/49] carbondata git commit: [CARBONDATA-1968] Add external table support

2018-02-26 Thread jackylk
[CARBONDATA-1968] Add external table support This PR adds support for creating external table with existing carbondata files, using Hive syntax. CREATE EXTERNAL TABLE tableName STORED BY 'carbondata' LOCATION 'path' This closes #1749 Project:

[15/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/metadata/blocklet/SegmentInfo.java -- diff --git

[43/49] carbondata git commit: [CARBONDATA-2023][DataLoad] Add size base block allocation in data loading

2018-02-26 Thread jackylk
[CARBONDATA-2023][DataLoad] Add size base block allocation in data loading Carbondata assigns blocks to nodes at the beginning of data loading. The previous block allocation strategy is block number based and suffers from skewed data if the sizes of input files differ a lot. We introduced a

[16/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/datastore/chunk/store/impl/unsafe/UnsafeFixedLengthDimensionDataChunkStore.java -- diff --git

[28/49] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2d779368/processing/src/main/java/org/apache/carbondata/processing/loading/sort/unsafe/holder/UnsafeSortTempFileChunkHolder.java -- diff --git

[24/49] carbondata git commit: [CARBONDATA-1480]Min Max Index Example for DataMap

2018-02-26 Thread jackylk
[CARBONDATA-1480]Min Max Index Example for DataMap Datamap Example. Implementation of Min Max Index through Datamap, and using the index while pruning. This closes #1359 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[26/49] carbondata git commit: [CARBONDATA-1544][Datamap] Datamap FineGrain implementation

2018-02-26 Thread jackylk
[CARBONDATA-1544][Datamap] Datamap FineGrain implementation Implemented interfaces for FG datamap and integrated to filterscanner to use the pruned bitset from FG datamap. FG Query flow as follows. 1.The user can add FG datamap to any table and implement their interfaces. 2. Any filter query

[21/49] carbondata git commit: [CARBONDATA-2025] Unify all path construction through CarbonTablePath static method

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/4d453d4b/core/src/test/java/org/apache/carbondata/core/util/path/CarbonFormatDirectoryStructureTest.java -- diff --git

[34/49] carbondata git commit: Revert "[CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row"

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2058a472/processing/src/main/java/org/apache/carbondata/processing/loading/sort/unsafe/holder/UnsafeSortTempFileChunkHolder.java -- diff --git

[47/49] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading

2018-02-26 Thread jackylk
[CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading Enhance data loading performance by specifying sort column bounds 1. Add row range number during convert-process-step 2. Dispatch rows to each sorter by range number 3. Sort/Write process step can be done

[30/49] carbondata git commit: [HotFix][CheckStyle] Fix import related checkstyle

2018-02-26 Thread jackylk
[HotFix][CheckStyle] Fix import related checkstyle This closes #1952 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/5cc6d362 Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/5cc6d362 Diff:

[06/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/processing/src/main/java/org/apache/carbondata/processing/merger/CarbonCompactionExecutor.java -- diff --git

[33/49] carbondata git commit: Revert "[CARBONDATA-2023][DataLoad] Add size base block allocation in data loading"

2018-02-26 Thread jackylk
Revert "[CARBONDATA-2023][DataLoad] Add size base block allocation in data loading" This reverts commit 6dd8b038fc898dbf48ad30adfc870c19eb38e3d0. Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/7a11f8e5 Tree:

[10/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/scan/processor/DataBlockIterator.java -- diff --git

[18/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
[CARBONDATA-2099] Refactor query scan process to improve readability Unified concepts in scan process flow: 1.QueryModel contains all parameters for scan, it is created by API in CarbonTable. (In future, CarbonTable will be the entry point for various table operations) 2.Use term ColumnChunk to

[12/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/scan/filter/executer/RowLevelFilterExecuterImpl.java -- diff --git

[03/49] carbondata git commit: [REBASE] Solve conflict after rebasing master

2018-02-26 Thread jackylk
[REBASE] Solve conflict after rebasing master Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit: http://git-wip-us.apache.org/repos/asf/carbondata/commit/7f508287 Tree: http://git-wip-us.apache.org/repos/asf/carbondata/tree/7f508287 Diff:

[13/49] carbondata git commit: [CARBONDATA-2099] Refactor query scan process to improve readability

2018-02-26 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/975725a4/core/src/main/java/org/apache/carbondata/core/scan/filter/executer/ExcludeColGroupFilterExecuterImpl.java -- diff --git

carbondata git commit: [CARBONDATA-1114][Tests] Fix bugs in tests in windows env

2018-02-26 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore ab9b4cf89 -> 37080e9a3 [CARBONDATA-1114][Tests] Fix bugs in tests in windows env Fix bugs in tests that will cause failure under windows env This closes #1994 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo

carbondata git commit: [CARBONDATA-2200] Fix bug of LIKE operation on streaming table

2018-02-25 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/master 088465f0c -> 7269c0627 [CARBONDATA-2200] Fix bug of LIKE operation on streaming table Fix bug of LIKE operation on streaming table, LIKE operation will be converted to StartsWith / EndsWith / Contains expression. Carbon will use

[10/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
[CARBONDATA-2189] Add DataMapProvider developer interface Add developer interface for 2 types of DataMap: 1.IndexDataMap: DataMap that leverages an index to accelerate filter query 2.MVDataMap: DataMap that leverages Materialized View to accelerate olap style query, like SPJG query (select,

[06/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2117c077/integration/spark-common-test/src/test/scala/org/apache/carbondata/spark/testsuite/datamap/TestDataMapCommand.scala -- diff --git

[07/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2117c077/hadoop/src/main/java/org/apache/carbondata/hadoop/api/CarbonTableInputFormat.java -- diff --git

[01/10] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading [Forced Update!]

2018-02-25 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap 5a625f4ce -> 2117c077b (forced update) http://git-wip-us.apache.org/repos/asf/carbondata/blob/ab9b4cf8/processing/src/main/java/org/apache/carbondata/processing/loading/sort/impl/UnsafeParallelReadMergeSorterWithBucketingImpl.java

[04/10] carbondata git commit: [CARBONDATA-1543] Supported DataMap chooser and expression for supporting multiple datamaps in single query

2018-02-25 Thread jackylk
[CARBONDATA-1543] Supported DataMap chooser and expression for supporting multiple datamaps in single query This PR supports 3 features. 1.Load datamaps from the DataMapSchema which are created through DDL. 2.DataMap Chooser: It chooses the datamap out of available datamaps based on simple

[09/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2117c077/core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockletDataMap.java -- diff --git

[08/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2117c077/core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockletIndexDataMap.java -- diff --git

[02/10] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading

2018-02-25 Thread jackylk
[CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading Enhance data loading performance by specifying sort column bounds 1. Add row range number during convert-process-step 2. Dispatch rows to each sorter by range number 3. Sort/Write process step can be done

[03/10] carbondata git commit: [CARBONDATA-1543] Supported DataMap chooser and expression for supporting multiple datamaps in single query

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/5733413e/integration/spark-common-cluster-test/src/test/scala/org/apache/carbondata/cluster/sdv/generated/MergeIndexTestCase.scala -- diff --git

[05/10] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-25 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/2117c077/integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/preaaggregate/PreAggregateTableHelper.scala -- diff --git

[2/2] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading

2018-02-25 Thread jackylk
[CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading Enhance data loading performance by specifying sort column bounds 1. Add row range number during convert-process-step 2. Dispatch rows to each sorter by range number 3. Sort/Write process step can be done

[1/2] carbondata git commit: [CARBONDATA-2091][DataLoad] Support specifying sort column bounds in data loading

2018-02-25 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore 88c0527f5 -> ab9b4cf89 http://git-wip-us.apache.org/repos/asf/carbondata/blob/ab9b4cf8/processing/src/main/java/org/apache/carbondata/processing/loading/sort/impl/UnsafeParallelReadMergeSorterWithBucketingImpl.java

[5/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/5a625f4c/core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockletDataMap.java -- diff --git

[3/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/5a625f4c/hadoop/src/main/java/org/apache/carbondata/hadoop/api/CarbonTableInputFormat.java -- diff --git

[6/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
[CARBONDATA-2189] Add DataMapProvider developer interface Add developer interface for 2 types of DataMap: 1.IndexDataMap: DataMap that leverages an index to accelerate filter query 2.MVDataMap: DataMap that leverages Materialized View to accelerate olap style query, like SPJG query (select,

[2/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/5a625f4c/integration/spark-common-test/src/test/scala/org/apache/carbondata/spark/testsuite/datamap/TestDataMapCommand.scala -- diff --git

[1/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap e616162c0 -> 5a625f4ce http://git-wip-us.apache.org/repos/asf/carbondata/blob/5a625f4c/integration/spark2/src/main/scala/org/apache/spark/sql/execution/command/preaaggregate/PreAggregateTableHelper.scala

[4/6] carbondata git commit: [CARBONDATA-2189] Add DataMapProvider developer interface

2018-02-23 Thread jackylk
http://git-wip-us.apache.org/repos/asf/carbondata/blob/5a625f4c/core/src/main/java/org/apache/carbondata/core/indexstore/blockletindex/BlockletIndexDataMap.java -- diff --git

carbondata git commit: [CARBONDATA-2151][Streaming] Fix filter query issue on streaming table

2018-02-23 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/master ceaddeacb -> 936037056 [CARBONDATA-2151][Streaming] Fix filter query issue on streaming table 1.Fix filter query issue for timestamp, date, decimal 2.Add more test case dataType: int, streaming, float, double, decimal, timestamp,

[2/2] carbondata git commit: [CARBONDATA-1543] Supported DataMap chooser and expression for supporting multiple datamaps in single query

2018-02-21 Thread jackylk
[CARBONDATA-1543] Supported DataMap chooser and expression for supporting multiple datamaps in single query This PR supports 3 features. 1.Load datamaps from the DataMapSchema which are created through DDL. 2.DataMap Chooser: It chooses the datamap out of available datamaps based on simple

carbondata git commit: [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface

2018-02-19 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap 0b1521710 -> 88c0527f5 [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface This closes #1986 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

carbondata git commit: [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface

2018-02-19 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore 0b1521710 -> 88c0527f5 [CARBONDATA-2186] Add InterfaceAudience.Internal to annotate internal interface This closes #1986 Project: http://git-wip-us.apache.org/repos/asf/carbondata/repo Commit:

[carbondata] Git Push Summary

2018-02-19 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap [created] 0b1521710

[carbondata] Git Push Summary

2018-02-17 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap [deleted] 0b1521710

[carbondata] Git Push Summary

2018-02-17 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/datamap [created] 0b1521710

carbondata git commit: Support generating assembling JAR for store-sdk module

2018-02-12 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore fd450b151 -> 0b1521710 Support generating assembling JAR for store-sdk module Support generating assembling JAR for store-sdk module and remove junit dependency This closes #1976 Project:

carbondata git commit: [CARBONDATA-2023][DataLoad] Add size base block allocation in data loading

2018-02-12 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore 937bdb867 -> fd450b151 [CARBONDATA-2023][DataLoad] Add size base block allocation in data loading Carbondata assigns blocks to nodes at the beginning of data loading. The previous block allocation strategy is block number based and it

[1/2] carbondata git commit: [CARBONDATA-2018][DataLoad] Optimization in reading/writing for sort temp row

2018-02-12 Thread jackylk
Repository: carbondata Updated Branches: refs/heads/carbonstore 0d50f6546 -> 937bdb867 http://git-wip-us.apache.org/repos/asf/carbondata/blob/937bdb86/processing/src/main/java/org/apache/carbondata/processing/loading/sort/unsafe/holder/UnsafeSortTempFileChunkHolder.java
