[GitHub] carbondata issue #2996: [WIP] Fix Rename-Fail & Datamap-creation-Fail

2019-01-06 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2996
  
retest this please


---


[GitHub] carbondata pull request #3028: [CARBONDATA-3205]Fix Get Local Dictionary for...

2018-12-27 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/3028

[CARBONDATA-3205]Fix Get Local Dictionary for empty Array of Struct

Problem:
When a table contains a nested complex schema  with Array of Struct as 
empty data, get dictionary for empty Array of Struct fails.

Solution:
Check if usedDictionary is empty, then return null

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
  Added a test case
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata nested

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/3028.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3028


commit 99e6233dcdb4b488d0f0366ac5966d4e9c4f895c
Author: Indhumathi27 
Date:   2018-12-27T12:02:48Z

[CARBONDATA-3205]Fix Get Local Dictionary for empty Array of Struct




---


[GitHub] carbondata pull request #2998: [WIP] Test Load using carbondata

2018-12-18 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2998

[WIP] Test Load using carbondata

Be sure to do all of the following checklist to help us incorporate 
your contribution quickly and easily:

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
Please provide details on 
- Whether new unit test cases have been added or why no new tests 
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance 
test report.
- Any additional information to help reviewers in testing this 
change.
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata testload

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2998.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2998


commit b113f3aba594fac43bef4bfecc6e92ad270e0327
Author: Indhumathi27 
Date:   2018-12-18T10:38:42Z

test using carbondata




---


[GitHub] carbondata pull request #2989: [CARBONDATA-3175]Fix Testcase failures in com...

2018-12-14 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2989

[CARBONDATA-3175]Fix Testcase failures in complex deleimiters

Fix Testcase failures in complex deleimiters:
Changed the complex delimiter in testcases

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done

 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata fixsdv

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2989.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2989


commit ccd2347f6413f0d92e326b4c1013b415eabafbc1
Author: Indhumathi27 
Date:   2018-12-14T10:56:00Z

[CARBONDATA-3175]Fix Testcase failures in complex deleimiters




---


[GitHub] carbondata pull request #2971: [TEST] Test loading performance of range_sort

2018-12-03 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2971#discussion_r238184022
  
--- Diff: 
integration/spark-common/src/main/scala/org/apache/carbondata/spark/load/DataLoadProcessorStepOnSpark.scala
 ---
@@ -305,4 +307,107 @@ object DataLoadProcessorStepOnSpark {
   e)
 }
   }
+
+  def sortAdnWriteFunc(
--- End diff --

Please change the method name from sortAdnWriteFunc to sortAndWriteFunc 


---


[GitHub] carbondata pull request #2821: [CARBONDATA-3017] Map DDL Support

2018-12-02 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2821#discussion_r238161031
  
--- Diff: 
integration/spark-common-test/src/test/scala/org/apache/carbondata/spark/testsuite/createTable/TestCreateDDLForComplexMapType.scala
 ---
@@ -0,0 +1,330 @@
+/*
+
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to You under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+*
+http://www.apache.org/licenses/LICENSE-2.0
+*
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or 
implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+*/
+package 
org.apache.carbondata.spark.testsuite.createTable.TestCreateDDLForComplexMapType
+
+import java.io.File
+
+import org.apache.hadoop.conf.Configuration
+import org.apache.spark.sql.Row
+import org.apache.spark.sql.test.util.QueryTest
+import org.scalatest.BeforeAndAfterAll
+
+class TestCreateDDLForComplexMapType extends QueryTest with 
BeforeAndAfterAll {
+  private val conf: Configuration = new Configuration(false)
+
+  val rootPath = new File(this.getClass.getResource("/").getPath
+  + "../../../..").getCanonicalPath
+
+  val path = 
s"$rootPath/examples/spark2/src/main/resources/mapDDLTestData.csv"
+
+  override def beforeAll(): Unit = {
+sql("DROP TABLE IF EXISTS carbon")
+  }
+
+  test("Single Map One Level") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map
+ | )
+ | STORED BY 'carbondata'
+ | """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map"))
+  }
+
+  test("Single Map with Two Nested Level") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY
+ |'carbondata'
+ |"""
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map>"))
+  }
+
+  test("Map Type with array type as value") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY 'carbondata'
+ |
+ """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map>"))
+  }
+
+  test("Map Type with struct type as value") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY
+ | 'carbondata'
+ | """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+assert(desc(0).get(1).asInstanceOf[String].trim
+  .equals("map>"))
+  }
+
+  test("Map Type as child to struct type") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField struct>
+ | )
+ | STORED BY
+ |'carbondata' """
+.stripMargin)
 

[GitHub] carbondata pull request #2821: [CARBONDATA-3017] Map DDL Support

2018-12-02 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2821#discussion_r238160123
  
--- Diff: 
integration/spark-common-test/src/test/scala/org/apache/carbondata/spark/testsuite/createTable/TestCreateDDLForComplexMapType.scala
 ---
@@ -0,0 +1,330 @@
+/*
+
+Licensed to the Apache Software Foundation (ASF) under one or more
+contributor license agreements. See the NOTICE file distributed with
+this work for additional information regarding copyright ownership.
+The ASF licenses this file to You under the Apache License, Version 2.0
+(the "License"); you may not use this file except in compliance with
+the License. You may obtain a copy of the License at
+*
+http://www.apache.org/licenses/LICENSE-2.0
+*
+Unless required by applicable law or agreed to in writing, software
+distributed under the License is distributed on an "AS IS" BASIS,
+WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or 
implied.
+See the License for the specific language governing permissions and
+limitations under the License.
+*/
+package 
org.apache.carbondata.spark.testsuite.createTable.TestCreateDDLForComplexMapType
+
+import java.io.File
+
+import org.apache.hadoop.conf.Configuration
+import org.apache.spark.sql.Row
+import org.apache.spark.sql.test.util.QueryTest
+import org.scalatest.BeforeAndAfterAll
+
+class TestCreateDDLForComplexMapType extends QueryTest with 
BeforeAndAfterAll {
+  private val conf: Configuration = new Configuration(false)
+
+  val rootPath = new File(this.getClass.getResource("/").getPath
+  + "../../../..").getCanonicalPath
+
+  val path = 
s"$rootPath/examples/spark2/src/main/resources/mapDDLTestData.csv"
+
+  override def beforeAll(): Unit = {
+sql("DROP TABLE IF EXISTS carbon")
+  }
+
+  test("Single Map One Level") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map
+ | )
+ | STORED BY 'carbondata'
+ | """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map"))
+  }
+
+  test("Single Map with Two Nested Level") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY
+ |'carbondata'
+ |"""
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map>"))
+  }
+
+  test("Map Type with array type as value") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY 'carbondata'
+ |
+ """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+
assert(desc(0).get(1).asInstanceOf[String].trim.equals("map>"))
+  }
+
+  test("Map Type with struct type as value") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField map>
+ | )
+ | STORED BY
+ | 'carbondata'
+ | """
+.stripMargin)
+val desc = sql(
+  s"""
+ | Describe Formatted
+ | carbon
+ | """.stripMargin).collect()
+assert(desc(0).get(1).asInstanceOf[String].trim
+  .equals("map>"))
+  }
+
+  test("Map Type as child to struct type") {
+sql("DROP TABLE IF EXISTS carbon")
+sql(
+  s"""
+ | CREATE TABLE carbon(
+ | mapField struct>
+ | )
+ | STORED BY
+ |'carbondata' """
+.stripMargin)
 

[GitHub] carbondata pull request #2963: [CARBONDATA-3139] Fix bugs in MinMaxDataMap e...

2018-12-02 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2963#discussion_r238156188
  
--- Diff: 
integration/spark2/src/test/scala/org/apache/carbondata/datamap/minmax/MinMaxDataMapFunctionSuite.scala
 ---
@@ -0,0 +1,415 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.carbondata.datamap.minmax
+
+import org.apache.spark.sql.test.util.QueryTest
+import org.scalatest.BeforeAndAfterAll
+
+import org.apache.carbondata.core.constants.CarbonCommonConstants
+import org.apache.carbondata.core.util.CarbonProperties
+
+class MinMaxDataMapFunctionSuite extends QueryTest with BeforeAndAfterAll {
+  private val minmaxDataMapFactoryName = 
"org.apache.carbondata.datamap.minmax.MinMaxDataMapFactory"
+  var originalStatEnabled = CarbonProperties.getInstance().getProperty(
+CarbonCommonConstants.ENABLE_QUERY_STATISTICS,
+CarbonCommonConstants.ENABLE_QUERY_STATISTICS_DEFAULT)
+
+  override protected def beforeAll(): Unit = {
+CarbonProperties.getInstance()
+  .addProperty(CarbonCommonConstants.ENABLE_QUERY_STATISTICS, "true")
+
CarbonProperties.getInstance().addProperty(CarbonCommonConstants.CARBON_DATE_FORMAT,
+  "-MM-dd")
+
CarbonProperties.getInstance().addProperty(CarbonCommonConstants.CARBON_TIMESTAMP_FORMAT,
+  "-MM-dd HH:mm:ss")
--- End diff --

Please check whether we can use the default timestamp/date format here.

org.apache.carbondata.core.constants.CarbonCommonConstants#CARBON_TIMESTAMP_DEFAULT_FORMAT


---


[GitHub] carbondata pull request #2963: [CARBONDATA-3139] Fix bugs in MinMaxDataMap e...

2018-12-02 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2963#discussion_r238152299
  
--- Diff: 
datamap/example/src/main/java/org/apache/carbondata/datamap/minmax/AbstractMinMaxDataMapWriter.java
 ---
@@ -0,0 +1,248 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.carbondata.datamap.minmax;
+
+import java.io.DataOutputStream;
+import java.io.IOException;
+import java.math.BigDecimal;
+import java.util.List;
+
+import org.apache.carbondata.common.logging.LogServiceFactory;
+import org.apache.carbondata.core.constants.CarbonCommonConstants;
+import org.apache.carbondata.core.datamap.Segment;
+import org.apache.carbondata.core.datamap.dev.DataMapWriter;
+import org.apache.carbondata.core.datastore.impl.FileFactory;
+import org.apache.carbondata.core.datastore.page.ColumnPage;
+import 
org.apache.carbondata.core.datastore.page.encoding.bool.BooleanConvert;
+import 
org.apache.carbondata.core.datastore.page.statistics.ColumnPageStatsCollector;
+import 
org.apache.carbondata.core.datastore.page.statistics.KeyPageStatsCollector;
+import 
org.apache.carbondata.core.datastore.page.statistics.PrimitivePageStatsCollector;
+import org.apache.carbondata.core.metadata.datatype.DataType;
+import org.apache.carbondata.core.metadata.datatype.DataTypes;
+import org.apache.carbondata.core.metadata.encoder.Encoding;
+import 
org.apache.carbondata.core.metadata.schema.table.column.CarbonColumn;
+import org.apache.carbondata.core.util.CarbonUtil;
+import org.apache.carbondata.core.util.DataTypeUtil;
+
+import org.apache.log4j.Logger;
+
+/**
+ * We will record the min & max value for each index column in each 
blocklet.
+ * Since the size of index is quite small, we will combine the index for 
all index columns
+ * in one file.
+ */
+public abstract class AbstractMinMaxDataMapWriter extends DataMapWriter {
+  private static final Logger LOGGER = LogServiceFactory.getLogService(
+  AbstractMinMaxDataMapWriter.class.getName());
+
+  private ColumnPageStatsCollector[] indexColumnMinMaxCollectors;
+  protected int currentBlockletId;
+  private String currentIndexFile;
+  private DataOutputStream currentIndexFileOutStream;
+
+  public AbstractMinMaxDataMapWriter(String tablePath, String dataMapName,
+  List indexColumns, Segment segment, String shardName) 
throws IOException {
+super(tablePath, dataMapName, indexColumns, segment, shardName);
+initStatsCollector();
+initDataMapFile();
+  }
+
+  private void initStatsCollector() {
+indexColumnMinMaxCollectors = new 
ColumnPageStatsCollector[indexColumns.size()];
+CarbonColumn indexCol;
+for (int i = 0; i < indexColumns.size(); i++) {
+  indexCol = indexColumns.get(i);
+  if (indexCol.isMeasure()
+  || (indexCol.isDimension()
+  && DataTypeUtil.isPrimitiveColumn(indexCol.getDataType())
+  && !indexCol.hasEncoding(Encoding.DICTIONARY)
+  && !indexCol.hasEncoding(Encoding.DIRECT_DICTIONARY))) {
+indexColumnMinMaxCollectors[i] = 
PrimitivePageStatsCollector.newInstance(
+indexColumns.get(i).getDataType());
+  } else {
+indexColumnMinMaxCollectors[i] = 
KeyPageStatsCollector.newInstance(DataTypes.BYTE_ARRAY);
+  }
+}
+  }
+
+  private void initDataMapFile() throws IOException {
+if (!FileFactory.isFileExist(dataMapPath) &&
+!FileFactory.mkdirs(dataMapPath, 
FileFactory.getFileType(dataMapPath))) {
+  throw new IOException("Failed to create directory " + dataMapPath);
+}
+
+try {
+  currentIndexFile = MinMaxIndexDataMap.getIndexFile(dataMapPath,
+  MinMaxIndexHolder.MINMAX_INDEX_PREFFIX + indexColumns.size());
+  FileFactory.createNewFile(currentIndexFile, 
FileFactory

[GitHub] carbondata pull request #2960: [WIP] Update the Project List

2018-11-29 Thread Indhumathi27
Github user Indhumathi27 closed the pull request at:

https://github.com/apache/carbondata/pull/2960


---


[GitHub] carbondata pull request #2960: [WIP] Update the Project List

2018-11-28 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2960

[WIP] Update the Project List

Update the Project List

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata scan

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2960.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2960


commit 20f9e53e7eb11fe7ce850f30630d77d9b377b63a
Author: Indhumathi27 
Date:   2018-11-28T10:48:13Z

[CARBONDATA-3137] Update the Project List




---


[GitHub] carbondata issue #2937: [CARBONDATA-3114]Remove Null Values for a Dictionary...

2018-11-21 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2937
  
Retest this please


---


[GitHub] carbondata pull request #2939: [CARBONDATA-3115] Fix CodeGen error in preagg...

2018-11-21 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2939

[CARBONDATA-3115] Fix CodeGen error in preaggregate table and codegen 
display issue in oldstores

**Problem:**
1. While querying a preaggregate table, codegen error is displayed.
2. In old stores, code is getting displayed while executing queries.


 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata codegen

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2939.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2939


commit b1c1cd2bb354afc8566774c6b002d511c4a99f65
Author: Indhumathi27 
Date:   2018-11-21T11:53:25Z

[CARBONDATA-3115]Fix Codegen issue




---


[GitHub] carbondata pull request #2937: [CARBONDATA-3114]Remove Null Values in all ty...

2018-11-21 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2937

[CARBONDATA-3114]Remove Null Values in all types of columns for Range 
Filters

**Problem:**
Null Values are not removed in case of RangeFilters, if column is a 
dictionary and no_inverted_index column.
**Solution:**
Remove NULL values in case of RangeFilters for columns of all types.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata timestamp_di

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2937.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2937


commit fb0912fc938ed0be3c9451ce21bde8d59505d477
Author: Indhumathi27 
Date:   2018-11-21T09:51:49Z

[CARBONDATA-3114]Remove Null Values in all types of columns for RangeFilters




---


[GitHub] carbondata issue #2932: [HOTFIX]Fix Describe Formatted Testcases

2018-11-21 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2932
  
Retest this please


---


[GitHub] carbondata issue #2932: [HOTFIX]Fix Describe Formatted Testcases

2018-11-21 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2932
  
Retest this please


---


[GitHub] carbondata issue #2932: [HOTFIX]Fix Describe Formatted Testcases

2018-11-20 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2932
  
retest this please


---


[GitHub] carbondata pull request #2932: [HOTFIX]Fix Describe Formatted Testcases

2018-11-20 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2932

[HOTFIX]Fix Describe Formatted Testcases



 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
Please provide details on 
- Whether new unit test cases have been added or why no new tests 
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance 
test report.
- Any additional information to help reviewers in testing this 
change.
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata testcasefix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2932.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2932


commit 6123f3008c62cd3f60ec391f5c16f141d9b00428
Author: Indhumathi27 
Date:   2018-11-20T09:02:00Z

Fix Describe Formatted Testcases




---


[GitHub] carbondata pull request #2928: [CARBONDATA-3106] Written_by_APPNAME not seri...

2018-11-16 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2928

[CARBONDATA-3106] Written_by_APPNAME not serialized in executor with global 
sort



**Problem:**
Written_By_APPNAME when added in carbonproperty is not serialized in 
executor with global sort

**Solution:**
Add Written_by_APPNAME in hadoop conf and in executor get it from 
configuration and add to carbonproperty


 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
Please provide details on 
- Whether new unit test cases have been added or why no new tests 
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance 
test report.
- Any additional information to help reviewers in testing this 
change.
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata global_sort

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2928.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2928


commit d956df1340e2dd6309ed688dc71b4046afac336e
Author: Indhumathi27 
Date:   2018-11-16T16:19:16Z

[CARBONDATA-3106] WrittenbyAPI not serialized in executor with globalsort




---


[GitHub] carbondata issue #2916: [CARBONDATA-3096] Wrong records size on the input me...

2018-11-15 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2916
  
retest this please


---


[GitHub] carbondata issue #2849: [CARBONDATA-2896] Added TestCases for Adaptive encod...

2018-11-12 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2849
  
Retest this please


---


[GitHub] carbondata issue #2896: [CARBONDATA-3075] Select Filter fails for Legacy sto...

2018-11-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2896
  
Retest this please


---


[GitHub] carbondata issue #2896: [CARBONDATA-3075] Select Filter fails for Legacy sto...

2018-11-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2896
  
Retest this please


---


[GitHub] carbondata issue #2896: [CARBONDATA-3075] Select Filter fails for Legacy sto...

2018-11-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2896
  
Retest this please


---


[GitHub] carbondata pull request #2896: [CARBONDATA-3075] Select Filter fails for Leg...

2018-11-05 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2896#discussion_r230731270
  
--- Diff: 
core/src/main/java/org/apache/carbondata/core/scan/executor/impl/AbstractQueryExecutor.java
 ---
@@ -230,6 +230,11 @@ protected void initQuery(QueryModel queryModel) throws 
IOException {
   updateColumns(queryModel, fileFooter.getColumnInTable(), 
blockInfo.getFilePath());
   filePathToSegmentPropertiesMap.put(blockInfo.getFilePath(), 
segmentProperties);
 }
+//In case of Legacy store, set setDirectVectorFill to false,
+// as filling vector directly is not supported for V2 store
--- End diff --

okay


---


[GitHub] carbondata pull request #2896: [WIP] Select Filter fails for Legacy store if...

2018-11-05 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2896

[WIP] Select Filter fails for Legacy store if DirectVecorFill is enabled

**Why this PR?**
When **isDirectVectorFill** is set to true, Select filter fails in Legacy 
Store throwing UnsupportedOperationException

**Solution:**
Set **isDirectVectorFill** to false for Legacy store


 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata scan_comp

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2896.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2896


commit c2af1dc8ed8ea9ec5671d3fa107007c338ecbbef
Author: Indhumathi27 
Date:   2018-11-05T10:42:24Z

[CARBONDATA-3075] Select Filter fails for Legacy store if DirectVectorFill 
is enabled




---


[GitHub] carbondata pull request #2883: [CARBONDATA-3062] Fix Compatibility issue wit...

2018-10-30 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2883

[CARBONDATA-3062] Fix Compatibility issue with cache_level as blocklet


**Why this PR for?**
In case of hybrid store we can have block as well as blocklet schema.
Scenario: 
When there is a hybrid store in which few loads are from legacy store which 
do not contain the blocklet information and hence they will be, by default have 
cache_level as BLOCK and few loads with latest store which contain the BLOCKLET 
information and have cache_level BLOCKLET. For these type of scenarios we need 
to have separate task and footer schemas. For all loads with/without blocklet 
info there will not be any additional cost of maintaining 2 variables

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata column_comp

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2883.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2883






---


[GitHub] carbondata pull request #2852: [WIP]Column Schema objects are present in Dri...

2018-10-24 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2852

[WIP]Column Schema objects are present in Driver even after dropping table

**Problem:**
Column Schema objects are present in Driver even after dropping table.

**Solution:**
After dropping table, remove entry of tableInfo from CarbonMetaDataInstance.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata memory_leak_driver

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2852.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2852


commit 182acdf3b2edd09403303fb15959f4d058e2c759
Author: Indhumathi27 
Date:   2018-10-25T04:46:32Z

Column Schema objects are present in Driver even after dropping table




---


[GitHub] carbondata pull request #2845: [WIP] Rand function issue

2018-10-23 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2845

[WIP] Rand function issue

Be sure to do all of the following checklist to help us incorporate 
your contribution quickly and easily:

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
Please provide details on 
- Whether new unit test cases have been added or why no new tests 
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance 
test report.
- Any additional information to help reviewers in testing this 
change.
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata rand

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2845.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2845


commit 5b21078335ee9cd1962b4744d04069526dce08cc
Author: Indhumathi27 
Date:   2018-10-23T05:59:51Z

Check rand()




---


[GitHub] carbondata issue #2785: [CARBONDATA-2986] Table Properties are lost when mul...

2018-10-03 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2785
  
retest this please


---


[GitHub] carbondata issue #2734: [CARBONDATA-2946] Unify conversion while writing to ...

2018-09-26 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2734
  
@xuchuanyin Please check this PR. The conversion on filter value is now 
avoided.


---


[GitHub] carbondata issue #2756: [CARBONDATA-2966]Update Documentation For Avro DataT...

2018-09-26 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2756
  
Retest this please



---


[GitHub] carbondata pull request #2756: [CARBONDATA-2966]Update Documentation For Avr...

2018-09-25 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2756#discussion_r220429435
  
--- Diff: docs/configuration-parameters.md ---
@@ -42,6 +42,7 @@ This section provides the details of all the 
configurations required for the Car
 | carbon.lock.type | LOCALLOCK | This configuration specifies the type of 
lock to be acquired during concurrent operations on table. There are following 
types of lock implementation: - LOCALLOCK: Lock is created on local file system 
as file. This lock is useful when only one spark driver (thrift server) runs on 
a machine and no other CarbonData spark application is launched concurrently. - 
HDFSLOCK: Lock is created on HDFS file system as file. This lock is useful when 
multiple CarbonData spark applications are launched and no ZooKeeper is running 
on cluster and HDFS supports file based locking. |
 | carbon.lock.path | TABLEPATH | This configuration specifies the path 
where lock files have to be created. Recommended to configure zookeeper lock 
type or configure HDFS lock path(to this property) in case of S3 file system as 
locking is not feasible on S3. |
 | carbon.unsafe.working.memory.in.mb | 512 | CarbonData supports storing 
data in off-heap memory for certain operations during data loading and 
query.This helps to avoid the Java GC and thereby improve the overall 
performance.The Minimum value recommeded is 512MB.Any value below this is reset 
to default value of 512MB.**NOTE:** The below formulas explain how to arrive at 
the off-heap size required.Memory Required For Data 
Loading:(*carbon.number.of.cores.while.loading*) * (Number of tables to 
load in parallel) * (*offheap.sort.chunk.size.inmb* + 
*carbon.blockletgroup.size.in.mb* + *carbon.blockletgroup.size.in.mb*/3.5 ). 
Memory required for Query:SPARK_EXECUTOR_INSTANCES * 
(*carbon.blockletgroup.size.in.mb* + *carbon.blockletgroup.size.in.mb* * 3.5) * 
spark.executor.cores |
+| carbon.unsafe.driver.working.memory.in.mb | 60% of JVM Heap Memory | 
CarbonData supports storing data in unsafe on-heap memory in driver for certain 
operations like insert into, query for loading datamap cache. The Minimum value 
recommended is 512MB. |
--- End diff --

Okay. I think parameter description has covered all the questions which are 
applicable


---


[GitHub] carbondata pull request #2752: [WIP]Even after carbon file is copied to targ...

2018-09-25 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2752#discussion_r220142943
  
--- Diff: 
processing/src/main/java/org/apache/carbondata/processing/store/writer/AbstractFactDataWriter.java
 ---
@@ -418,6 +423,24 @@ protected void closeExecutorService() throws 
CarbonDataWriterException {
 CarbonDataWriterException exception = null;
 try {
   listener.finish();
+  String tableName = this.model.getTableName();
+  // Delete CarbonData and CarbonIndex files after copying from temp 
location
+  if (null != carbonDataFileTempPath && null == 
model.getTableSpec().getCarbonTable()
+  .getBucketingInfo(tableName)) {
+String deleteDataAndIndexInTempPath =
--- End diff --

Changed the code fix. Delete carbon index and carbon data files in temp 
location, once it is copied from temp to target path


---


[GitHub] carbondata pull request #2756: [CARBONDATA-2966]Update Documentation For Avr...

2018-09-24 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2756

[CARBONDATA-2966]Update Documentation For Avro DataType conversion

Updated document for Avro DataType conversion to carbon

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata doc_avro

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2756.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2756


commit f30d1489dbdbe3fecd9eacb96ddff3658904b691
Author: Indhumathi27 
Date:   2018-09-24T18:04:04Z

[CARBONDATA-2966]Update Documentation For Avro DataType conversion




---


[GitHub] carbondata pull request #2752: [WIP]Even after carbon file is copied to targ...

2018-09-24 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2752

[WIP]Even after carbon file is copied to targetfolder(local/hdfs), carbon 
files is not deleted from temp directory

**Problem:**
 Even after carbon file is copied to targetfolder(local/hdfs), carbon files 
is not deleted from temp   directory.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done  
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata deletetemp

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2752.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2752


commit d47beb2990791feb923215e4ab6bd8b8ef005068
Author: Indhumathi27 
Date:   2018-09-24T06:58:47Z

Even after carbon file is copied to targetfolder(local/hdfs), carbon files 
is not deleted from temp directory




---


[GitHub] carbondata issue #2745: [CARBONDATA-2958] Compaction with CarbonProperty 'ca...

2018-09-21 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2745
  
Retest this please


---


[GitHub] carbondata pull request #2745: [CARBONDATA-2958] Compaction with CarbonPrope...

2018-09-21 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2745

[CARBONDATA-2958] Compaction with CarbonProperty 
'carbon.enable.page.level.reader.in.compaction' enabled fails as Compressor is 
null

**Problem:**
When CarbonProperty 'carbon.enable.page.level.reader.in.compaction' is 
enabled, compaction fails throwing Null Pointer Exception as compressor is Null
**Solution:**
Set compressor from pageMetaData
 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
Test case added
  
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata page_compaction

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2745.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2745


commit 1c0a6fc8435c582cefa00bcd8d6979e89f1d886d
Author: Indhumathi27 
Date:   2018-09-21T09:54:39Z

[CARBONDATA-2958] Compaction with CarbonProperty 
'carbon.enable.page.level.reader.in.compaction' enabled fails as Compressor is 
null




---


[GitHub] carbondata pull request #2727: [CARBONDATA-2940]Fix BufferUnderFlowException...

2018-09-17 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2727

[CARBONDATA-2940]Fix BufferUnderFlowException for ComplexPushDown

**Problem:**
BufferUnderFlow Exception was thrown, as buffer data was filled with wrong 
values.
**Solution:**
Check for if column is of Primitive Dimension column with 
ComplexParentOrdinal = -1 and not Complex and increment Buffer index for 
NoDictionaryKeys.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
Added testcase scenario
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata complex_bue

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2727.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2727


commit 833f4b0158be0a3a643d5b862acc41bf612a31b8
Author: Indhumathi27 
Date:   2018-09-17T07:02:49Z

[CARBONDATA-2940]Fix BufferUnderFlowException for ComplexPushDown




---


[GitHub] carbondata issue #2694: [CARBONDATA-2876]AVRO datatype support through SDK

2018-09-06 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2694
  
Retest this please


---


[GitHub] carbondata pull request #2694: [CARBONDATA-2876]AVRO datatype support throug...

2018-09-05 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2694#discussion_r215500532
  
--- Diff: 
store/sdk/src/main/java/org/apache/carbondata/sdk/file/AvroCarbonWriter.java ---
@@ -445,17 +445,13 @@ private static Field prepareFields(Schema.Field 
avroField) {
 if (logicalType instanceof LogicalTypes.Date) {
--- End diff --

Added a comment


---


[GitHub] carbondata pull request #2694: [CARBONDATA-2876]AVRO datatype support throug...

2018-09-05 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2694

[CARBONDATA-2876]AVRO datatype support through SDK

This PR supports following Avro DataTypes to carbon format through SDK. 
Avro datatypes include,

  1.  Avro Logical type TimeMillis
  2.  Avro Logical type TimeMicros

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
test cases added
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata avro_support_sdk1

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2694.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2694


commit 2738b29b4cb3439fec40b80da0ec31776c5944a6
Author: Indhumathi27 
Date:   2018-09-03T04:47:20Z

[CARBONDATA-2876]AVRO datatype support through SDK




---


[GitHub] carbondata issue #2687: [CARBONDATA-2876]Fix Avro decimal datatype with prec...

2018-09-03 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2687
  
Retest this please


---


[GitHub] carbondata pull request #2687: [CARBONDATA-2876]Fix Avro decimal datatype wi...

2018-09-03 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2687

[CARBONDATA-2876]Fix Avro decimal datatype with precision and scale

**What is PR for?**

Add precision and scale for fieldvalue for Avro Decimal logical type

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
 
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata decimal_fix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2687.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2687


commit a6e5149c12a7427a00fdc634cd0d35aca077
Author: Indhumathi27 
Date:   2018-09-03T12:05:01Z

Fix decimal type for Avro




---


[GitHub] carbondata pull request #2671: [WIP]AVRO datatype support through SDK

2018-08-29 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2671

[WIP]AVRO datatype support through SDK

This PR supports following Avro DataTypes to carbon format through SDK. 
Avro datatypes include,
1. Avro Union
2. Avro Enum
3. Avro Logical type Decimal

Please refer JIRA CARBONDATA-2876 for further detail.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
Test file has been added
  
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata avro_support_sdk

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2671.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2671


commit eae87521707fa86337a45d5677cc7d0f1f1fbfbc
Author: Indhumathi27 
Date:   2018-08-29T14:18:21Z

Support Avro datatype conversion through SDK




---


[GitHub] carbondata pull request #2621: [CARBONDATA-2840] Added SDV testcases for Com...

2018-08-08 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2621

[CARBONDATA-2840] Added SDV testcases for Complex DataType Support

**Added SDV testcases for Complex DataType Support**

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done

   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata complexsdv

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2621.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2621


commit f9bd3e83720b6657654d38d96f1b3d07f9172192
Author: Indhumathi27 
Date:   2018-08-08T10:03:36Z

[CARBONDATA-2840] Added SDV testcases for Complex DataType Support




---


[GitHub] carbondata issue #2578: Fix Dictionary_Include for ComplexDataType

2018-07-30 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2578
  
Retest this please


---


[GitHub] carbondata pull request #2578: Fix Dictionary_Include for ComplexDataType

2018-07-30 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2578

Fix Dictionary_Include for ComplexDataType

Why this PR for?
Check if a column is given as Dictionary_Include and fill cardinality for 
that column
 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata dict_fix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2578.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2578


commit bab79be62af7ffecb9dc1fcfe520fd0288c0281a
Author: Indhumathi27 
Date:   2018-07-30T08:48:44Z

Fix Dictionary_Include for ComplexDataType




---


[GitHub] carbondata issue #2573: [CARBONDATA-2794]Distinct count fails on ArrayOfStru...

2018-07-28 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2573
  
Retest this please


---


[GitHub] carbondata pull request #2573: [CARBONDATA-2794]Distinct count fails on Arra...

2018-07-27 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2573

[CARBONDATA-2794]Distinct count fails on ArrayOfStruct

This PR fixes Code Generator Error thrown when Select filter contains more 
than one count of distinct of ArrayofStruct with group by Clause

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
   Testcase added
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata codegeb

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2573.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2573


commit e9e1cd6e7fcd9603bc35a74ad7981853502fef51
Author: Indhumathi27 
Date:   2018-07-27T18:09:50Z

[CARBONDATA-2794]Distinct count fails on ArrayOfStruct




---


[GitHub] carbondata issue #2569: [CARBONDATA-2791]Fix Encoding for Double if exceeds ...

2018-07-27 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2569
  
Retest this please


---


[GitHub] carbondata pull request #2569: [CARBONDATA-2791]Fix Adaptive Encoding for Do...

2018-07-27 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2569

[CARBONDATA-2791]Fix Adaptive Encoding for Double if exceeds LONG.Max_value

**What this PR for?**

If Factor(decimalcount) * absMaxValue exceeds LONG.MAX_VALUE, then go for 
direct compression.
 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
   Testcases added
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata adapfix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2569.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2569


commit c26897b2d8d37ba4e5361da7585dfe6942c42f5a
Author: Indhumathi27 
Date:   2018-07-27T06:52:25Z

[CARBONDATA-2791]Fix Adaptive Encoding for Double if exceeds LONG.Max_value




---


[GitHub] carbondata issue #2555: [CARBONDATA-2753][Compatibility] Row count of page i...

2018-07-26 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2555
  
retest sdv please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-24 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-24 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-24 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please



---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-24 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-23 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest sdv please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-23 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-23 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata issue #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-23 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2535
  
Retest this please


---


[GitHub] carbondata pull request #2535: [CARBONDATA-2606]Fix Complex array Pushdown

2018-07-21 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2535

[CARBONDATA-2606]Fix Complex array Pushdown

**What is PR for?**
  Check for if Complex Column contains ArrayType at n levels and add parent 
to projection if contains array.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata arrayfix

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2535.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2535


commit e5b3d345d8054eaf1f326e70a2ab3453e9e1c153
Author: Indhumathi27 
Date:   2018-07-21T10:46:21Z

[CARBONDATA-2606]Fix Complex array Pushdown




---


[GitHub] carbondata issue #2532: [CARBONDATA-2759]Add Bad_Records_Options to STMPROPE...

2018-07-19 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2532
  
Retest this please


---


[GitHub] carbondata pull request #2532: [CARBONDATA-2759]Add Bad_Records_Options to S...

2018-07-19 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2532

[CARBONDATA-2759]Add Bad_Records_Options to STMPROPERTIES for Streaming 
Table

Problem:
Bad_Records_Options are not added for STMPROPERTIES for streaming table.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
  Testcase added
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata streaming

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2532.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2532


commit 1b561c4ba7a570c5c6d1503d716c71cfe474a10b
Author: Indhumathi27 
Date:   2018-07-19T14:17:59Z

[CARBONDATA-2759]Add Bad_Records_Options to STMPROPERTIES for Streaming 
Table




---


[GitHub] carbondata issue #2523: [CARBONDATA-2753] Fix Compatibility issue with Preag...

2018-07-18 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2523
  
Retest this please


---


[GitHub] carbondata pull request #2523: [CARBONDATA-2753] Fix Compatibility issue wit...

2018-07-18 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2523

[CARBONDATA-2753] Fix Compatibility issue with Preaggregate table

Problem: User creates maintable and load data and create preaggregate 
datamap on maintable using old version and loads data into maintable using new 
version and dataload fails.

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata preagg

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2523.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2523


commit 7bfd652d3c1513206b0e5d719e68caccea6c2d47
Author: Indhumathi27 
Date:   2018-07-18T15:17:06Z

[CARBONDATA-2753] Fix Compatibility issue with Preaggregate table




---


[GitHub] carbondata issue #2501: [CARBONDATA-2738]Block Preaggregate, Compaction, Dic...

2018-07-17 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2501
  
Retest this please


---


[GitHub] carbondata issue #2502: [CARBONDATA-2738]Update documentation for Complex da...

2018-07-16 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2502
  
Retest this please


---


[GitHub] carbondata issue #2501: [CARBONDATA-2738]Block Preaggregate, Dictionary Excl...

2018-07-13 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2501
  
Retest this please


---


[GitHub] carbondata issue #2502: [CARBONDATA-2738]Update documentation for Complex da...

2018-07-13 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2502
  
Retest this please


---


[GitHub] carbondata issue #2489: [CARBONDATA-2606][Complex DataType Enhancements]Fix ...

2018-07-13 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2489
  
Retest this please


---


[GitHub] carbondata pull request #2502: [CARBONDATA-2738]Update documentation for Com...

2018-07-13 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2502

[CARBONDATA-2738]Update documentation for Complex datatype

For PR-2501, Update documentation for Complex datatype

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [x] Document update required?
 Added
 - [ ] Testing done
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata complexdoc

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2502.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2502


commit f36e97077f3852be4ac8851566cd5790709f7954
Author: Indhumathi27 
Date:   2018-07-13T06:26:25Z

[CARBONDATA-2738]Update documentation for Complex datatype




---


[GitHub] carbondata pull request #2501: [CARBONDATA-2738]Block Preaggregate, Dictiona...

2018-07-12 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2501

[CARBONDATA-2738]Block Preaggregate, Dictionary Exclude/Include for child 
columns for Complex datatype

Block Preaggregate, Dictionary Exclude/Include for child columns for 
Complex datatype

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
   Test cases added
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata orderby

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2501.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2501


commit c962b84bd88a3afe695245653d7bc3f14d89c3ce
Author: Indhumathi27 
Date:   2018-07-12T15:41:11Z

[CARBONDATA-2738]Block features for complex data type




---


[GitHub] carbondata pull request #2489: [CARBONDATA-2606][Complex DataType Enhancemen...

2018-07-11 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2489

[CARBONDATA-2606][Complex DataType Enhancements]Fix Null result if First 
two Projection column have same parent and third column has different Parent 
Struct


Problem:
 When multiple columns are there,then the first child elements is only 
going to make parent Object Array. For all other cases it should be null.
 For e.g. a : . here as 'a' is the parent column and b, c, d are 
child columns during traversal when we encounter the first element in list i.e. 
column 'b','a' will be completely filled. In case when column 'c' and 'd' 
encountered then only place null in the output.
Hence, as Null is placed in the output, Select result is Null if First two 
Projection column have same parent and third column has different parent Struct 
column.
Solution: Place null in the end of output

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
   Test cases added
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata project2struct

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2489.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2489


commit be3008224f100d6e046987ab4c56792976d6c3a4
Author: Indhumathi27 
Date:   2018-07-11T10:24:55Z

[CARBONDATA-2606][Complex DataType Enhancements]Fix Null result when there 
are two Struct Column




---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-06 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest this please


---


[GitHub] carbondata pull request #2442: [CARBONDATA-2684]Code Generator Error is thro...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 closed the pull request at:

https://github.com/apache/carbondata/pull/2442


---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest sdv please 


---


[GitHub] carbondata issue #2449: [CARBONDATA-2684] [PR-2442] Distinct count fails on ...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2449
  
Retest sdv please 


---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest this please


---


[GitHub] carbondata issue #2449: [CARBONDATA-2684] [PR-2442] Distinct count fails on ...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2449
  
Retest this please


---


[GitHub] carbondata issue #2449: [CARBONDATA-2684] [PR-2442] Distinct count fails on ...

2018-07-05 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2449
  
Retest sdv please


---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-04 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest sdv please


---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-03 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest this please


---


[GitHub] carbondata issue #2439: [CARBONDATA-2606][Complex DataType Enhancements]Fixe...

2018-07-03 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2439
  
Retest this please


---


[GitHub] carbondata pull request #2442: [CARBONDATA-2684]Code Generator Error is thro...

2018-07-03 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2442

[CARBONDATA-2684]Code Generator Error is thrown when Select filter contains 
more than one count of distinct of ComplexColumn with group by Clause


This PR fixes Code Generator Error thrown when Select filter contains more 
than one count of distinct of ComplexColumn with group by Clause

 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
  Testcase added
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata complexarray

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2442.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2442


commit 492b60edbf8502c9f62f3f968926fa75f77504ed
Author: Indhumathi27 
Date:   2018-07-03T17:01:40Z

Code Generator Error is thrown when Select filter contains more than one 
count of distinct of ComplexColumn with group by Clause




---


[GitHub] carbondata pull request #2439: [CARBONDATA-2606][Complex DataType Enhancemen...

2018-07-03 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2439

 [CARBONDATA-2606][Complex DataType Enhancements]Fixed Projection Pushdown 
when Select filter contains Struct column

**Problem:**
 If Select filter contains Struct Column which is not in Projection 
list, then only null value is stored for struct column given in filter and 
select query result is null.
**Solution:** 
 Pushdown Parent column of corresponding struct type if any struct 
column is present in Filter list.
 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [x] Testing done
   Test Case Added
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata filter

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2439.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2439


commit 26df47ef07f6592919f0d9ccd5aa68703360de47
Author: Indhumathi27 
Date:   2018-07-03T11:10:55Z

Fixed Projection Pushdown when Select filter contains Struct column




---


[GitHub] carbondata issue #2421: [CARBONDATA-2606] [Complex DataType Enhancements]Fix...

2018-07-02 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2421
  
Retest this please


---


[GitHub] carbondata issue #2421: [CARBONDATA-2606] [Complex DataType Enhancements]Fix...

2018-07-02 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2421
  
Retest this please


---


[GitHub] carbondata issue #2421: [CARBONDATA-2606] [Complex DataType Enhancements]Fix...

2018-07-02 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2421
  
Retest sdv please


---


[GitHub] carbondata issue #2421: [CARBONDATA-2606] [Complex DataType Enhancements]Fix...

2018-07-02 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2421
  
Retest this please


---


[GitHub] carbondata issue #2421: [CARBONDATA-2606] [Complex DataType Enhancements]Fix...

2018-06-27 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2421
  
Retest this please


---


[GitHub] carbondata pull request #2421: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-27 Thread Indhumathi27
GitHub user Indhumathi27 opened a pull request:

https://github.com/apache/carbondata/pull/2421

[CARBONDATA-2606] [Complex DataType Enhancements]Fix for ComplexDataType 
Projection PushDown


Fix for ComplexDataType Projection PushDown when Table Schema contains 
ColumnName in UpperCase


 - [ ] Any interfaces changed?
 
 - [ ] Any backward compatibility impacted?
 
 - [ ] Document update required?

 - [ ] Testing done
Please provide details on 
- Whether new unit test cases have been added or why no new tests 
are required?
- How it is tested? Please attach test report.
- Is it a performance related change? Please attach the performance 
test report.
- Any additional information to help reviewers in testing this 
change.
   
 - [ ] For large changes, please consider breaking it into sub-tasks under 
an umbrella JIRA. 



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Indhumathi27/carbondata projectbug

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/carbondata/pull/2421.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2421


commit a043de47f6eee5e45e3c9ed84ebd53459609839f
Author: Indhumathi27 
Date:   2018-06-27T07:31:28Z

Bug Fix For ComplexDataType Projection




---


[GitHub] carbondata issue #2396: [CARBONDATA-2606] [Complex DataType Enhancements] Pr...

2018-06-25 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2396
  
retest this please


---


[GitHub] carbondata issue #2396: [CARBONDATA-2606] [Complex DataType Enhancements] Pr...

2018-06-25 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2396
  
retest this please


---


[GitHub] carbondata pull request #2396: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2396#discussion_r197678648
  
--- Diff: 
core/src/main/java/org/apache/carbondata/core/scan/collector/impl/DictionaryBasedResultCollector.java
 ---
@@ -67,17 +69,38 @@
   int noDictionaryColumnIndex;
   int complexTypeColumnIndex;
 
+  int noDictionaryComplexColumnIndex = 0;
+  int complexTypeComplexColumnIndex = 0;
+
   boolean isDimensionExists;
 
+  private int[] surrogateResult;
+  private byte[][] noDictionaryKeys;
+  private byte[][] complexTypeKeyArray;
+
   protected Map comlexDimensionInfoMap;
 
+  /**
+   * Field of this Map is the parent Column and associated child columns.
+   * Final Projection shuld be a merged list consist of only parents.
+   */
+  public Map> mergedComplexDimensionColumns;
+
+  /**
+   * Fields of this Map of Parent Ordinal with the List is the Child 
Column Dimension and
+   * the corresponding data buffer of that column.
+   */
+
+  public Map> 
mergedComplexDimensionDataMap;
--- End diff --

It is already at class level and initialised only once and reused


---


[GitHub] carbondata pull request #2396: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2396#discussion_r197678288
  
--- Diff: 
core/src/main/java/org/apache/carbondata/core/scan/collector/impl/DictionaryBasedResultCollector.java
 ---
@@ -27,6 +28,7 @@
 import 
org.apache.carbondata.core.keygenerator.directdictionary.DirectDictionaryKeyGeneratorFactory;
 import org.apache.carbondata.core.metadata.datatype.DataTypes;
 import org.apache.carbondata.core.metadata.encoder.Encoding;
+import 
org.apache.carbondata.core.metadata.schema.table.column.CarbonDimension;
--- End diff --

Dictionary will be handled with Adaptive encoding PR


---


[GitHub] carbondata pull request #2396: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2396#discussion_r197678273
  
--- Diff: 
core/src/main/java/org/apache/carbondata/core/scan/collector/impl/DictionaryBasedResultCollector.java
 ---
@@ -19,6 +19,7 @@
 import java.nio.ByteBuffer;
 import java.util.ArrayList;
 import java.util.Arrays;
+import java.util.HashMap;
--- End diff --

VectorCollector is not required for now. It will be handled in next Complex 
data type enhancement PR


---


[GitHub] carbondata issue #2396: [CARBONDATA-2606] [Complex DataType Enhancements] Pr...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on the issue:

https://github.com/apache/carbondata/pull/2396
  
retest this please


---


[GitHub] carbondata pull request #2396: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2396#discussion_r197645303
  
--- Diff: 
core/src/main/java/org/apache/carbondata/core/scan/complextypes/ArrayQueryType.java
 ---
@@ -97,4 +99,56 @@ public void 
parseBlocksAndReturnComplexColumnByteArray(DimensionRawColumnChunk[]
 return 
DataTypeUtil.getDataTypeConverter().wrapWithGenericArrayData(data);
   }
 
+  @Override public Object getDataBasedOnColumn(ByteBuffer dataBuffer, 
CarbonDimension parent,
+  CarbonDimension child) {
+int dataLength;
+if (parent.getOrdinal() < child.getOrdinal()) {
+  dataLength = parent.getNumberOfChild();
+
+  if (dataLength == -1) {
+return null;
+  }
+  Object[] data = new Object[dataLength];
+  for (int i = 0; i < dataLength; i++) {
+data[i] = children
+.getDataBasedOnColumn(dataBuffer, 
parent.getListOfChildDimensions().get(i), child);
+  }
+  return 
DataTypeUtil.getDataTypeConverter().wrapWithGenericArrayData(data);
+} else if (parent.getOrdinal() > child.getOrdinal()) {
+  return null;
+} else {
+  // dataLength = dataBuffer.getInt();
+  return DataTypeUtil.getDataTypeConverter()
+  .wrapWithGenericArrayData(getDataBasedOnDataType(dataBuffer));
+}
+  }
+
+  @Override public Object getDataBasedOnColumnList(Map childBuffer,
--- End diff --

changed


---


[GitHub] carbondata pull request #2396: [CARBONDATA-2606] [Complex DataType Enhanceme...

2018-06-24 Thread Indhumathi27
Github user Indhumathi27 commented on a diff in the pull request:

https://github.com/apache/carbondata/pull/2396#discussion_r197645280
  
--- Diff: 
integration/spark2/src/main/scala/org/apache/spark/sql/CarbonDatasourceHadoopRelation.scala
 ---
@@ -67,14 +68,68 @@ case class CarbonDatasourceHadoopRelation(
   override def schema: StructType = 
tableSchema.getOrElse(carbonRelation.schema)
 
   def buildScan(requiredColumns: Array[String],
+  projects: Seq[NamedExpression],
   filters: Array[Filter],
   partitions: Seq[PartitionSpec]): RDD[InternalRow] = {
 val filterExpression: Option[Expression] = filters.flatMap { filter =>
   CarbonFilters.createCarbonFilter(schema, filter)
 }.reduceOption(new AndExpression(_, _))
 
+var parentColumns = new ListBuffer[String]
+// In case of Struct or StructofStruct Complex type, get the project 
column for given
+// parent/child field and pushdown the corresponding project column. 
In case of Array,
+// ArrayofStruct or StructofArray, pushdown parent column
+var reqColumns = projects.map {
+  case a@Alias(s: GetStructField, name) =>
+val arrayTypeExists = s.childSchema.map(x => x.dataType)
+  .filter(dataType => dataType.isInstanceOf[ArrayType])
+if (0 == arrayTypeExists.length) {
+  val columnName = s.toString().replaceAll("#[0-9]*", "")
+  parentColumns += columnName.split("\\.")(0)
+  columnName
+}
+else {
+  None
+}
+  case a@Alias(s: GetArrayItem, name) =>
+None
+  case other => other.name.replaceAll("#[0-9]*", "")
+}
+
+var reqCols = reqColumns.filterNot(none => none.equals(None)).map(col 
=> col.toString)
+parentColumns = parentColumns.distinct
+reqCols = reqCols.distinct
+
+// if the parent column is there in the projection list then we can 
filter out all the children
+// in that projection list
+val parentColumnOnProjectionList = reqCols.filter(col => 
parentColumns.contains(col))
--- End diff --

removed


---


  1   2   >