[GitHub] carbondata pull request #1841: [CARBONDATA-2062] Configure the temp director...
Github user sraghunandan commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1841#discussion_r163167471 --- Diff: integration/spark-common/src/main/scala/org/apache/carbondata/spark/util/CommonUtil.scala --- @@ -941,4 +943,39 @@ object CommonUtil { } } + def setTempStoreLocation( + index: Int, + carbonLoadModel: CarbonLoadModel, + isCompactionFlow: Boolean, + isAltPartitionFlow: Boolean) : Unit = { +var storeLocation: String = null + +// this property is used to determine whether temp location for carbon is inside +// container temp dir or is yarn application directory. +val carbonUseLocalDir = CarbonProperties.getInstance() + .getProperty("carbon.use.local.dir", "false") + +if (carbonUseLocalDir.equalsIgnoreCase("true")) { + + val storeLocations = Util.getConfiguredLocalDirs(SparkEnv.get.conf) + if (null != storeLocations && storeLocations.nonEmpty) { +storeLocation = storeLocations(Random.nextInt(storeLocations.length)) + } + if (storeLocation == null) { +storeLocation = System.getProperty("java.io.tmpdir") + } +} else { + storeLocation = System.getProperty("java.io.tmpdir") +} +storeLocation = storeLocation + '/' + "carbon" + System.nanoTime() + '_' + index --- End diff -- handled ---
[GitHub] carbondata issue #1845: [CARBONDATA-2066] Add Local keyword when loading dat...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1845 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1817/ ---
[GitHub] carbondata pull request #1846: [CARBONDATA-2067] Fix NPE exception in Stream...
GitHub user QiangCai opened a pull request: https://github.com/apache/carbondata/pull/1846 [CARBONDATA-2067] Fix NPE exception in StreamHandoffRDD To avoid NPE of carbon table in StreamHandoffRDD, add carbon table to cache in internalCompute method - [x] Any interfaces changed? no - [x] Any backward compatibility impacted? no - [x] Document update required? no - [x] Testing done Please provide details on - Whether new unit test cases have been added or why no new tests are required? local mode doesn't have this issue - How it is tested? Please attach test report. - Is it a performance related change? Please attach the performance test report. - Any additional information to help reviewers in testing this change. - [x] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. small changes You can merge this pull request into a Git repository by running: $ git pull https://github.com/QiangCai/carbondata handoff_nullpoint Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1846.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1846 commit ca75a647434ebd190c502d8bc966257f7f86516d Author: QiangCaiDate: 2018-01-23T08:20:19Z fix NPE exception in StreamHandoffRDD ---
[GitHub] carbondata issue #1841: [CARBONDATA-2062] Configure the temp directory to be...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1841 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3040/ ---
[GitHub] carbondata issue #1845: [CARBONDATA-2066] Add Local keyword when loading dat...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1845 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3048/ ---
[GitHub] carbondata issue #1845: [CARBONDATA-2066] Add Local keyword when loading dat...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1845 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3039/ ---
[jira] [Assigned] (CARBONDATA-2067) Streaming hand off operation throw NullPointerException
[ https://issues.apache.org/jira/browse/CARBONDATA-2067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] QiangCai reassigned CARBONDATA-2067: Assignee: QiangCai > Streaming hand off operation throw NullPointerException > --- > > Key: CARBONDATA-2067 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2067 > Project: CarbonData > Issue Type: Bug >Reporter: QiangCai >Assignee: QiangCai >Priority: Major > > 18/01/23 16:01:10 ERROR CompactionResultSortProcessor: Executor task launch > worker for task 0 Compaction failed: null java.lang.NullPointerException at > org.apache.carbondata.processing.util.CarbonDataProcessorUtil.getLocalDataFolderLocation(CarbonDataProcessorUtil.java:152) > at > org.apache.carbondata.processing.merger.CompactionResultSortProcessor.initTempStoreLocation(CompactionResultSortProcessor.java:424) > at > org.apache.carbondata.processing.merger.CompactionResultSortProcessor.execute(CompactionResultSortProcessor.java:156) > at > org.apache.carbondata.streaming.StreamHandoffRDD.internalCompute(StreamHandoffRDD.scala:113) > at org.apache.carbondata.spark.rdd.CarbonRDD.compute(CarbonRDD.scala:60) at > org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323) at > org.apache.spark.rdd.RDD.iterator(RDD.scala:287) at > org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87) at > org.apache.spark.scheduler.Task.run(Task.scala:108) at > org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:338) at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (CARBONDATA-2067) Streaming hand off operation throw NullPointerException
QiangCai created CARBONDATA-2067: Summary: Streaming hand off operation throw NullPointerException Key: CARBONDATA-2067 URL: https://issues.apache.org/jira/browse/CARBONDATA-2067 Project: CarbonData Issue Type: Bug Reporter: QiangCai 18/01/23 16:01:10 ERROR CompactionResultSortProcessor: Executor task launch worker for task 0 Compaction failed: null java.lang.NullPointerException at org.apache.carbondata.processing.util.CarbonDataProcessorUtil.getLocalDataFolderLocation(CarbonDataProcessorUtil.java:152) at org.apache.carbondata.processing.merger.CompactionResultSortProcessor.initTempStoreLocation(CompactionResultSortProcessor.java:424) at org.apache.carbondata.processing.merger.CompactionResultSortProcessor.execute(CompactionResultSortProcessor.java:156) at org.apache.carbondata.streaming.StreamHandoffRDD.internalCompute(StreamHandoffRDD.scala:113) at org.apache.carbondata.spark.rdd.CarbonRDD.compute(CarbonRDD.scala:60) at org.apache.spark.rdd.RDD.computeOrReadCheckpoint(RDD.scala:323) at org.apache.spark.rdd.RDD.iterator(RDD.scala:287) at org.apache.spark.scheduler.ResultTask.runTask(ResultTask.scala:87) at org.apache.spark.scheduler.Task.run(Task.scala:108) at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:338) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1820/ ---
[GitHub] carbondata pull request #1847: [CARBONDATA-2064] Add compaction listener
GitHub user dhatchayani opened a pull request: https://github.com/apache/carbondata/pull/1847 [CARBONDATA-2064] Add compaction listener Added compaction listeners - [ ] Any interfaces changed? - [ ] Any backward compatibility impacted? - [ ] Document update required? - [x] Testing done Manual Testing - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. You can merge this pull request into a Git repository by running: $ git pull https://github.com/dhatchayani/incubator-carbondata compactionListeners Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1847.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1847 commit 4b679aae54ee8fcd87bd666d0a8641ad1b50a0e3 Author: dhatchayaniDate: 2018-01-23T09:56:26Z [CARBONDATA-2064] Add compaction listener ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1840 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3042/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1847 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3044/ ---
[GitHub] carbondata issue #1846: [CARBONDATA-2067] Fix NPE exception in StreamHandoff...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1846 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1819/ ---
[GitHub] carbondata issue #1846: [CARBONDATA-2067] Fix NPE exception in StreamHandoff...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1846 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3041/ ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user zzcclp commented on the issue: https://github.com/apache/carbondata/pull/1840 retest this please. ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1821/ ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3051/ ---
[GitHub] carbondata issue #1822: [CARBONDATA-2043] Configurable wait time for request...
Github user kushalsaha commented on the issue: https://github.com/apache/carbondata/pull/1822 retest SDV this please ---
[GitHub] carbondata issue #1831: [CARBONDATA-1993] Carbon properties default values f...
Github user kushalsaha commented on the issue: https://github.com/apache/carbondata/pull/1831 retest SDV please ---
[GitHub] carbondata issue #1831: [CARBONDATA-1993] Carbon properties default values f...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1831 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3043/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3053/ ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3052/ ---
[GitHub] carbondata issue #1841: [CARBONDATA-2062] Configure the temp directory to be...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1841 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3049/ ---
[GitHub] carbondata issue #1841: [CARBONDATA-2062] Configure the temp directory to be...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1841 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1818/ ---
[GitHub] carbondata issue #1846: [CARBONDATA-2067] Fix NPE exception in StreamHandoff...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1846 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3050/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1822/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user dhatchayani commented on the issue: https://github.com/apache/carbondata/pull/1847 retest this please ---
[GitHub] carbondata issue #1838: [CARBONDATA-2060]fix insert overwrite on partition t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1838 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3057/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1847 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3046/ ---
[GitHub] carbondata issue #1838: [CARBONDATA-2060]fix insert overwrite on partition t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1838 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1825/ ---
[GitHub] carbondata issue #1831: [CARBONDATA-1993] Carbon properties default values f...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1831 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3047/ ---
[GitHub] carbondata issue #1848: [WIP][CARBONDATA-2068][CARBONDATA-1516][Tests] Drop ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1848 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1826/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3056/ ---
[GitHub] carbondata issue #1831: [CARBONDATA-1993] Carbon properties default values f...
Github user kushalsaha commented on the issue: https://github.com/apache/carbondata/pull/1831 retest SDV please ---
[GitHub] carbondata issue #1822: [CARBONDATA-2043] Configurable wait time for request...
Github user kushalsaha commented on the issue: https://github.com/apache/carbondata/pull/1822 retest SDV please ---
[jira] [Created] (CARBONDATA-2068) Drop datamap should work for timeseries
xubo245 created CARBONDATA-2068: --- Summary: Drop datamap should work for timeseries Key: CARBONDATA-2068 URL: https://issues.apache.org/jira/browse/CARBONDATA-2068 Project: CarbonData Issue Type: Bug Components: core, spark-integration Affects Versions: 1.3.0 Reporter: xubo245 Fix For: 1.3.0 Drop datamap is not work after creating timeseries datamap for preaggregate table, but it should work. refer: https://issues.apache.org/jira/browse/CARBONDATA-1516 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user zzcclp commented on the issue: https://github.com/apache/carbondata/pull/1840 retest this please ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3054/ ---
[GitHub] carbondata issue #1840: [CARBONDATA-2054]Add an example: how to use CarbonDa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1840 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1823/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1847 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3045/ ---
[GitHub] carbondata pull request #1848: [CARBONDATA-2068][CARBONDATA-1516] Drop datam...
GitHub user xubo245 opened a pull request: https://github.com/apache/carbondata/pull/1848 [CARBONDATA-2068][CARBONDATA-1516] Drop datamap should work for timeseries preaggregate table - [ ] Any interfaces changed? No - [ ] Any backward compatibility impacted? No - [ ] Document update required? No - [ ] Testing done add test case for drop timeseries pre aggregate table - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. No You can merge this pull request into a Git repository by running: $ git pull https://github.com/xubo245/carbondata dropTimeseriesDataMap Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1848.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1848 commit f0b0d2ec406aa1b8a2bccd1e1b21435f5924fcbf Author: xubo245 <601450868@...> Date: 2018-01-23T12:05:06Z [CARBONDATA-2068] Drop datamap should work for timeseries preaggregate table ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1824/ ---
[GitHub] carbondata issue #1848: [WIP][CARBONDATA-2068][CARBONDATA-1516][Tests] Drop ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1848 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3058/ ---
[jira] [Commented] (CARBONDATA-2068) Drop datamap should work for timeseries
[ https://issues.apache.org/jira/browse/CARBONDATA-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16335758#comment-16335758 ] kumar vishal commented on CARBONDATA-2068: -- [~xubo245] can u please add your testcase for the above scenario > Drop datamap should work for timeseries > - > > Key: CARBONDATA-2068 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2068 > Project: CarbonData > Issue Type: Bug > Components: core, spark-integration >Affects Versions: 1.3.0 >Reporter: xubo245 >Priority: Major > Fix For: 1.3.0 > > Time Spent: 0.5h > Remaining Estimate: 0h > > Drop datamap is not work after creating timeseries datamap for preaggregate > table, > but it should work. > refer: > https://issues.apache.org/jira/browse/CARBONDATA-1516 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1849: [HOTFIX] process VM error for the streaming w...
GitHub user QiangCai opened a pull request: https://github.com/apache/carbondata/pull/1849 [HOTFIX] process VM error for the streaming writer process VM error for the streaming writer by using Throwable instead of Exception - [x] Any interfaces changed? - [x] Any backward compatibility impacted? - [x] Document update required? - [x] Testing done Please provide details on - Whether new unit test cases have been added or why no new tests are required? - How it is tested? Please attach test report. - Is it a performance related change? Please attach the performance test report. - Any additional information to help reviewers in testing this change. - [x] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. You can merge this pull request into a Git repository by running: $ git pull https://github.com/QiangCai/carbondata catch_jvm_error Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1849.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1849 commit f09703a5cc91c6a3e9327dd5f5f7cf0a28611c6a Author: QiangCaiDate: 2018-01-23T13:12:32Z process vw error for streaming writer ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3059/ ---
[GitHub] carbondata issue #1847: [WIP][CARBONDATA-2064] Add compaction listener
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1847 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1827/ ---
[GitHub] carbondata pull request #1693: [CARBONDATA-1909] Load is failing during inse...
Github user gvramana commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1693#discussion_r163241946 --- Diff: core/src/main/java/org/apache/carbondata/core/constants/CarbonCommonConstants.java --- @@ -864,6 +864,18 @@ public static final String CARBON_MERGE_SORT_PREFETCH = "carbon.merge.sort.prefetch"; public static final String CARBON_MERGE_SORT_PREFETCH_DEFAULT = "true"; + /** + * to determine to use the rdd persist or not for insert query. --- End diff -- Please write clear comment, when this should be used and what is the impact if not used, why default value is false? ---
[GitHub] carbondata issue #1822: [CARBONDATA-2043] Configurable wait time for request...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1822 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3048/ ---
[jira] [Created] (CARBONDATA-2070) when hive metastore is enabled, create preaggregate table on decimal column of main table is failing
Akash R Nilugal created CARBONDATA-2070: --- Summary: when hive metastore is enabled, create preaggregate table on decimal column of main table is failing Key: CARBONDATA-2070 URL: https://issues.apache.org/jira/browse/CARBONDATA-2070 Project: CarbonData Issue Type: Bug Reporter: Akash R Nilugal Assignee: Akash R Nilugal {color:#33}steps:{color} {color:#33}Enable hive metastore and run the following queries{color} {color:#33}1){color} {color:#33}CREATE TABLE uniqdata(CUST_ID int,CUST_NAME String,ACTIVE_EMUI_VERSION string,DOB timestamp,DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10),DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 int) STORED BY 'org.apache.carbondata.format'{color} {color:#33}2){color} {color:#33}insert into uniqdata select 9000,'CUST_NAME_0','ACTIVE_EMUI_VERSION_0','1970-01-01 01:00:03','1970-01-01 02:00:03',123372036854,-223372036854,12345678901.123400,22345678901.123400,11234567489.797600,-11234567489.797600,1{color} {color:#33}3){color} {color:#33}create datamap uniqdata_agg on table uniqdata using 'preaggregate' as select min(DECIMAL_COLUMN1) from uniqdata group by DECIMAL_COLUMN1{color} {color:#33}java.lang.ClassCastException: org.apache.carbondata.core.metadata.datatype.DataType cannot be cast to org.apache.carbondata.core.metadata.datatype.DecimalType at org.apache.carbondata.core.metadata.schema.table.column.ColumnSchema.write(ColumnSchema.java:478) at org.apache.carbondata.core.metadata.schema.table.TableSchema.write(TableSchema.java:215) at org.apache.carbondata.core.metadata.schema.table.DataMapSchema.write(DataMapSchema.java:99) at org.apache.carbondata.core.metadata.schema.table.TableInfo.write(TableInfo.java:245) at org.apache.carbondata.core.metadata.schema.table.TableInfo.serialize(TableInfo.java:304) at org.apache.spark.sql.CarbonDatasourceHadoopRelation.buildScan(CarbonDatasourceHadoopRelation.scala:83) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy$$anonfun$1.apply(CarbonLateDecodeStrategy.scala:63) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy$$anonfun$1.apply(CarbonLateDecodeStrategy.scala:63) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy$$anonfun$pruneFilterProject$1.apply(CarbonLateDecodeStrategy.scala:178) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy$$anonfun$pruneFilterProject$1.apply(CarbonLateDecodeStrategy.scala:177) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy.getDataSourceScan(CarbonLateDecodeStrategy.scala:366) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy.pruneFilterProjectRaw(CarbonLateDecodeStrategy.scala:299) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy.pruneFilterProject(CarbonLateDecodeStrategy.scala:172) at org.apache.spark.sql.execution.strategy.CarbonLateDecodeStrategy.apply(CarbonLateDecodeStrategy.scala:59) at org.apache.spark.sql.catalyst.planning.QueryPlanner$$anonfun$1.apply(QueryPlanner.scala:62) at org.apache.spark.sql.catalyst.planning.QueryPlanner$$anonfun$1.apply(QueryPlanner.scala:62) at scala.collection.Iterator$$anon$12.nextCur(Iterator.scala:434) at scala.collection.Iterator$$anon$12.hasNext(Iterator.scala:440) at scala.collection.Iterator$$anon$12.hasNext(Iterator.scala:439) at org.apache.spark.sql.catalyst.planning.QueryPlanner.plan(QueryPlanner.scala:92) at org.apache.spark.sql.catalyst.planning.QueryPlanner$$anonfun$2$$anonfun$apply$2.apply(QueryPlanner.scala:77) at org.apache.spark.sql.catalyst.planning.QueryPlanner$$anonfun$2$$anonfun$apply$2.apply(QueryPlanner.scala:74) at scala.collection.TraversableOnce$$anonfun$foldLeft$1.apply(TraversableOnce.scala:157) at scala.collection.TraversableOnce$$anonfun$foldLeft$1.apply(TraversableOnce.scala:157) at scala.collection.Iterator$class.foreach(Iterator.scala:893) at scala.collection.AbstractIterator.foreach(Iterator.scala:1336) at scala.collection.TraversableOnce$class.foldLeft(TraversableOnce.scala:157){color} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (CARBONDATA-2069) Data is not loaded into preaggregate table when table is created when data load is in progress for main table
Kunal Kapoor created CARBONDATA-2069: Summary: Data is not loaded into preaggregate table when table is created when data load is in progress for main table Key: CARBONDATA-2069 URL: https://issues.apache.org/jira/browse/CARBONDATA-2069 Project: CarbonData Issue Type: Bug Reporter: Kunal Kapoor Assignee: Kunal Kapoor Problem: # Load data into maintable # create datamap parallelly preaggregate table will not have any data while data load is successful for main table. This will make the pre-aggregate table inconsistent -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1850: [CARBONDATA-2069] Restrict create datamap whe...
GitHub user kunal642 opened a pull request: https://github.com/apache/carbondata/pull/1850 [CARBONDATA-2069] Restrict create datamap when load is in progress **Problem:** 1. Load data into maintable 2. create datamap parallelly preaggregate table will not have any data while data load is successful for main table. This will make the pre-aggregate table inconsistent **Solution:** Restrict creation of pre-aggregate table when load is in progress on main table Be sure to do all of the following checklist to help us incorporate your contribution quickly and easily: - [X] Any interfaces changed? - [X] Any backward compatibility impacted? - [X] Document update required? - [X] Testing done Please provide details on - Whether new unit test cases have been added or why no new tests are required? - How it is tested? Please attach test report. - Is it a performance related change? Please attach the performance test report. - Any additional information to help reviewers in testing this change. - [X] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. You can merge this pull request into a Git repository by running: $ git pull https://github.com/kunal642/carbondata restrict_create Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1850.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1850 commit 0e47c0e8841c1e0f1ec0291c18d7c7caea2f58aa Author: kunal642Date: 2018-01-23T13:22:48Z restrict create datamap when load is in progress ---
[jira] [Closed] (CARBONDATA-2068) Drop datamap should work for timeseries
[ https://issues.apache.org/jira/browse/CARBONDATA-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] xubo245 closed CARBONDATA-2068. --- Resolution: Won't Fix sql(s"drop datamap if exists agg1 on table mainTable") not support, Carbon support sql(s"drop datamap if exists agg1_year on table mainTable"). > Drop datamap should work for timeseries > - > > Key: CARBONDATA-2068 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2068 > Project: CarbonData > Issue Type: Bug > Components: core, spark-integration >Affects Versions: 1.3.0 >Reporter: xubo245 >Priority: Major > Fix For: 1.3.0 > > Time Spent: 1.5h > Remaining Estimate: 0h > > Drop datamap is not work after creating timeseries datamap for preaggregate > table, > but it should work. > refer: > https://issues.apache.org/jira/browse/CARBONDATA-1516 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #1853: [CARBONDATA-2072][TEST] Add dropTables method for op...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1853 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1841/ ---
[GitHub] carbondata issue #1850: [CARBONDATA-2069] Restrict create datamap when load ...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1850 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3065/ ---
[GitHub] carbondata issue #1850: [CARBONDATA-2069] Restrict create datamap when load ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1850 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1845/ ---
[GitHub] carbondata issue #1781: [CARBONDATA-2012] Add support to load pre-aggregate ...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1781 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3066/ ---
[GitHub] carbondata issue #1844: [CARBONDATA-2061] Check for only valid IN_PROGRESS s...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1844 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3079/ ---
[GitHub] carbondata issue #1844: [CARBONDATA-2061] Check for only valid IN_PROGRESS s...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1844 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1848/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1851 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3064/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1851 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3074/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1851 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1843/ ---
[GitHub] carbondata issue #1852: [CARBONDATA-2070]fix create preaggregate on decimal ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1852 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1844/ ---
[GitHub] carbondata issue #1781: [CARBONDATA-2012] Add support to load pre-aggregate ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1781 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1846/ ---
[GitHub] carbondata issue #1852: [CARBONDATA-2070]fix create preaggregate on decimal ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1852 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3075/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1851 retest this please ---
[GitHub] carbondata issue #1850: [CARBONDATA-2069] Restrict create datamap when load ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1850 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3076/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1851 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1847/ ---
[GitHub] carbondata issue #1844: [CARBONDATA-2061] Check for only valid IN_PROGRESS s...
Github user dhatchayani commented on the issue: https://github.com/apache/carbondata/pull/1844 retest this please ---
[GitHub] carbondata issue #1852: [CARBONDATA-2070]fix create preaggregate on decimal ...
Github user akashrn5 commented on the issue: https://github.com/apache/carbondata/pull/1852 retest this please ---
[GitHub] carbondata issue #1781: [CARBONDATA-2012] Add support to load pre-aggregate ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1781 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3077/ ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1851 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3078/ ---
[jira] [Created] (CARBONDATA-2072) Add dropTables method for optimizing drop table operation in test cases
xubo245 created CARBONDATA-2072: --- Summary: Add dropTables method for optimizing drop table operation in test cases Key: CARBONDATA-2072 URL: https://issues.apache.org/jira/browse/CARBONDATA-2072 Project: CarbonData Issue Type: Test Components: test Affects Versions: 1.3.0 Reporter: xubo245 Assignee: xubo245 Fix For: 1.3.0 There are many drop table in beforeAll or afterAll of test cases,like this: {code:java} override def afterAll { sql("drop table if exists load") sql("drop table if exists inser") sql("DROP TABLE IF EXISTS THive") sql("DROP TABLE IF EXISTS TCarbon") sql("drop table if exists TCarbonLocal") sql("drop table if exists TCarbonSource") sql("drop table if exists loadtable") sql("drop table if exists insertTable") sql("drop table if exists CarbonDest") sql("drop table if exists HiveDest") sql("drop table if exists CarbonOverwrite") sql("drop table if exists HiveOverwrite") sql("drop table if exists tcarbonsourceoverwrite") sql("drop table if exists carbon_table1") sql("drop table if exists carbon_table") sql("DROP TABLE IF EXISTS student") sql("DROP TABLE IF EXISTS uniqdata") sql("DROP TABLE IF EXISTS show_insert") sql("drop table if exists OverwriteTable_t1") sql("drop table if exists OverwriteTable_t2") } {code} in org.apache.carbondata.spark.testsuite.allqueries.InsertIntoCarbonTableTestCase It can be optimized by a public method in QueryTest -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1854: [WIP] Test insert fail issue
GitHub user ravipesala opened a pull request: https://github.com/apache/carbondata/pull/1854 [WIP] Test insert fail issue Be sure to do all of the following checklist to help us incorporate your contribution quickly and easily: - [ ] Any interfaces changed? - [ ] Any backward compatibility impacted? - [ ] Document update required? - [ ] Testing done Please provide details on - Whether new unit test cases have been added or why no new tests are required? - How it is tested? Please attach test report. - Is it a performance related change? Please attach the performance test report. - Any additional information to help reviewers in testing this change. - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. You can merge this pull request into a Git repository by running: $ git pull https://github.com/ravipesala/incubator-carbondata fix-insert-tests Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1854.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1854 commit a43caa3755435a08fd4ced23dd9e203b10417348 Author: ravipesalaDate: 2018-01-24T03:47:07Z Test insert fail issue ---
[GitHub] carbondata issue #1854: [WIP] Test insert fail issue
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1854 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1842/ ---
[GitHub] carbondata pull request #1851: [CARBONDATA-2071] Added block size to Bblockl...
Github user ravipesala commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1851#discussion_r163450767 --- Diff: core/src/main/java/org/apache/carbondata/core/indexstore/BlockletDataMapIndexStore.java --- @@ -244,4 +247,22 @@ public void clearAccessCount( cacheable.clear(); } } + + public static class BlockMetaInfo { +private String[] locationInfo; +private long size; --- End diff -- ok ---
[GitHub] carbondata issue #1853: [CARBONDATA-2072][TEST] Add dropTables method for op...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1853 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3072/ ---
[GitHub] carbondata issue #1853: [CARBONDATA-2072][TEST] Add dropTables method for op...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1853 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3062/ ---
[GitHub] carbondata pull request #1851: [CARBONDATA-2071] Added block size to Bblockl...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1851#discussion_r163449480 --- Diff: core/src/main/java/org/apache/carbondata/core/indexstore/BlockletDataMapIndexStore.java --- @@ -78,17 +78,19 @@ public BlockletDataMap get(TableBlockIndexUniqueIdentifier identifier) String segmentPath = CarbonTablePath.getSegmentPath( identifier.getAbsoluteTableIdentifier().getTablePath(), identifier.getSegmentId()); -MaplocationMap = new HashMap<>(); +Map blockMetaInfoMap = new HashMap<>(); CarbonFile carbonFile = FileFactory.getCarbonFile(segmentPath); CarbonFile[] carbonFiles = carbonFile.locationAwareListFiles(); SegmentIndexFileStore indexFileStore = new SegmentIndexFileStore(); indexFileStore.readAllIIndexOfSegment(carbonFiles); PartitionMapFileStore partitionFileStore = new PartitionMapFileStore(); partitionFileStore.readAllPartitionsOfSegment(carbonFiles, segmentPath); for (CarbonFile file : carbonFiles) { - locationMap.put(file.getAbsolutePath(), file.getLocations()); + blockMetaInfoMap + .put(file.getAbsolutePath(), new BlockMetaInfo(file.getLocations(), file.getSize())); --- End diff -- move `.put` to previous line ---
[GitHub] carbondata pull request #1851: [CARBONDATA-2071] Added block size to Bblockl...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1851#discussion_r163449593 --- Diff: core/src/main/java/org/apache/carbondata/core/indexstore/BlockletDataMapIndexStore.java --- @@ -244,4 +247,22 @@ public void clearAccessCount( cacheable.clear(); } } + + public static class BlockMetaInfo { +private String[] locationInfo; +private long size; --- End diff -- please add comment for these two variable ---
[GitHub] carbondata pull request #1851: [CARBONDATA-2071] Added block size to Bblockl...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1851#discussion_r163449568 --- Diff: core/src/main/java/org/apache/carbondata/core/indexstore/BlockletDataMapIndexStore.java --- @@ -244,4 +247,22 @@ public void clearAccessCount( cacheable.clear(); } } + + public static class BlockMetaInfo { --- End diff -- Is it only used in this class? I think it is better to move to outside if not just used in this class ---
[jira] [Commented] (CARBONDATA-2068) Drop datamap should work for timeseries
[ https://issues.apache.org/jira/browse/CARBONDATA-2068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16336735#comment-16336735 ] xubo245 commented on CARBONDATA-2068: - sql(s"drop datamap if exists agg1 on table mainTable") not support, Carbon support sql(s"drop datamap if exists agg1_year on table mainTable"). > Drop datamap should work for timeseries > - > > Key: CARBONDATA-2068 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2068 > Project: CarbonData > Issue Type: Bug > Components: core, spark-integration >Affects Versions: 1.3.0 >Reporter: xubo245 >Priority: Major > Fix For: 1.3.0 > > Time Spent: 1.5h > Remaining Estimate: 0h > > Drop datamap is not work after creating timeseries datamap for preaggregate > table, > but it should work. > refer: > https://issues.apache.org/jira/browse/CARBONDATA-1516 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1853: [CARBONDATA-2072][TEST] Add dropTables method...
GitHub user xubo245 opened a pull request: https://github.com/apache/carbondata/pull/1853 [CARBONDATA-2072][TEST] Add dropTables method for optimizing drop table method in test cases Add dropTables method for optimizing drop table method in test cases Please see jira Be sure to do all of the following checklist to help us incorporate your contribution quickly and easily: - [ ] Any interfaces changed? No - [ ] Any backward compatibility impacted? No - [ ] Document update required? No - [ ] Testing done only change test cases, no new test cases - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. No You can merge this pull request into a Git repository by running: $ git pull https://github.com/xubo245/carbondata dropTables Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1853.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1853 commit 48760ee0554d7cf3a645da0db697bdd5bee7bc47 Author: xubo245 <601450868@...> Date: 2018-01-24T02:00:44Z [CARBONDATA-2072][TEST] Add dropTables method for optimizing drop table method in test cases ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user QiangCai commented on the issue: https://github.com/apache/carbondata/pull/1851 verified on local standalone mode LGTM ---
[GitHub] carbondata issue #1851: [CARBONDATA-2071] Added block size to BblockletDataM...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1851 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3054/ ---
[GitHub] carbondata issue #1693: [CARBONDATA-1909] Load is failing during insert into...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1693 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3057/ ---
[GitHub] carbondata issue #1852: [CARBONDATA-2070]fix create preaggregate on decimal ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1852 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1840/ ---
[GitHub] carbondata issue #1838: [CARBONDATA-2060]fix insert overwrite on partition t...
Github user manishgupta88 commented on the issue: https://github.com/apache/carbondata/pull/1838 LGTM ---
[jira] [Resolved] (CARBONDATA-2060) Fix InsertOverwrite on partition table
[ https://issues.apache.org/jira/browse/CARBONDATA-2060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manish Gupta resolved CARBONDATA-2060. -- Resolution: Fixed Fix Version/s: 1.3.0 > Fix InsertOverwrite on partition table > -- > > Key: CARBONDATA-2060 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2060 > Project: CarbonData > Issue Type: Bug >Reporter: Akash R Nilugal >Assignee: Akash R Nilugal >Priority: Minor > Fix For: 1.3.0 > > Time Spent: 3h > Remaining Estimate: 0h > > {color:#33}when partition table overwrite with empty table , it is not > overwriting the partition table , and when insert overwrite is done on > dynamic partition table , overwrite was not happening.{color} > > {color:#33}sql("create table partitionLoadTable(name string, age int) > PARTITIONED BY(address string) stored by 'carbondata'"){color} > {color:#33}sql("insert into partitionLoadTable select > 'abc',4,'def'"){color} > {color:#33}sql("insert into partitionLoadTable select > 'abd',5,'xyz'"){color} > {color:#33}sql("create table noLoadTable (name string, age int, address > string) stored by 'carbondata'"){color} > {color:#33}sql("insert overwrite table partitionLoadTable select * from > noLoadTable"){color} > {color:#33}when we do select * after insert overwrite operation, ideally > it should give empty data, but it is giving all data.{color} > > {color:#33}sql("CREATE TABLE uniqdata_hive_static (CUST_ID int,CUST_NAME > String,ACTIVE_EMUI_VERSION string, DOB timestamp, DOJ timestamp, > BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 bigint,DECIMAL_COLUMN1 decimal(30,10), > DECIMAL_COLUMN2 decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double, > INTEGER_COLUMN1 int)ROW FORMAT DELIMITED FIELDS TERMINATED BY ','"){color} > {color:#33}sql("CREATE TABLE uniqdata_string_static(CUST_ID int,CUST_NAME > String,DOB timestamp,DOJ timestamp, BIGINT_COLUMN1 bigint,BIGINT_COLUMN2 > bigint,DECIMAL_COLUMN1 decimal(30,10),DECIMAL_COLUMN2 > decimal(36,10),Double_COLUMN1 double, Double_COLUMN2 double,INTEGER_COLUMN1 > int) PARTITIONED BY(ACTIVE_EMUI_VERSION string) STORED BY > 'org.apache.carbondata.format' TBLPROPERTIES ('TABLE_BLOCKSIZE'= '256 > MB')"){color} > {color:#33}sql(s"LOAD DATA INPATH '$resourcesPath/partData.csv' into > table uniqdata_string_static OPTIONS('FILEHEADER'='CUST_ID,CUST_NAME > ,ACTIVE_EMUI_VERSION,DOB,DOJ, > BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1, > Double_COLUMN2,INTEGER_COLUMN1','BAD_RECORDS_ACTION'='FORCE')"){color} > {color:#33}sql(s"LOAD DATA INPATH '$resourcesPath/partData.csv' into > table uniqdata_string_static OPTIONS('FILEHEADER'='CUST_ID,CUST_NAME > ,ACTIVE_EMUI_VERSION,DOB,DOJ, > BIGINT_COLUMN1,BIGINT_COLUMN2,DECIMAL_COLUMN1,DECIMAL_COLUMN2,Double_COLUMN1, > Double_COLUMN2,INTEGER_COLUMN1','BAD_RECORDS_ACTION'='FORCE')"){color} > {color:#33}sql("insert overwrite table uniqdata_string_static select > CUST_ID, CUST_NAME,DOB,doj, bigint_column1, bigint_column2, decimal_column1, > decimal_column2,double_column1, > double_column2,integer_column1,active_emui_version from uniqdata_hive_static > limit 10"){color} > > {color:#33}after this, select * was giving result, ideally it should give > empty result.{color} > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #1693: [CARBONDATA-1909] Load is failing during insert into...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1693 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3056/ ---
[GitHub] carbondata issue #1852: [CARBONDATA-2070]fix create preaggregate on decimal ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1852 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3071/ ---
[GitHub] carbondata issue #1854: [WIP] Test insert fail issue
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1854 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1849/ ---
[GitHub] carbondata issue #1844: [CARBONDATA-2061] Check for only valid IN_PROGRESS s...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1844 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/3081/ ---
[GitHub] carbondata issue #1841: [CARBONDATA-2062] Configure the temp directory to be...
Github user QiangCai commented on the issue: https://github.com/apache/carbondata/pull/1841 retest this please ---
[GitHub] carbondata issue #1844: [CARBONDATA-2061] Check for only valid IN_PROGRESS s...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1844 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3067/ ---
[GitHub] carbondata issue #1854: [WIP] Test insert fail issue
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1854 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/3068/ ---
[jira] [Created] (CARBONDATA-2071) Add block size to BblockletDataMap while initialising
Ravindra Pesala created CARBONDATA-2071: --- Summary: Add block size to BblockletDataMap while initialising Key: CARBONDATA-2071 URL: https://issues.apache.org/jira/browse/CARBONDATA-2071 Project: CarbonData Issue Type: Improvement Reporter: Ravindra Pesala Add block size to blocklet datamap so that features like small file merge can use them -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1852: [CARBONDATA-2070]fix create preaggregate on d...
GitHub user akashrn5 opened a pull request: https://github.com/apache/carbondata/pull/1852 [CARBONDATA-2070]fix create preaggregate on decimal column in hive metastore Problem: when hive metastore is enabled and aggregate table is tried to create on the decimal column of main table, cast exception is thrown for Decimal datatype solution:During creation of TableInfo from hivemetastore the DataMapSchemas and the columns dataTypes are not converted to the appropriate child classes. convert to actual datatype Be sure to do all of the following checklist to help us incorporate your contribution quickly and easily: - [X] Any interfaces changed? NA - [X] Any backward compatibility impacted? NA - [X] Document update required? NA - [X] Testing done Added UT Please provide details on - Whether new unit test cases have been added or why no new tests are required? - How it is tested? Please attach test report. - Is it a performance related change? Please attach the performance test report. - Any additional information to help reviewers in testing this change. - [] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA. You can merge this pull request into a Git repository by running: $ git pull https://github.com/akashrn5/incubator-carbondata hive_pre Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1852.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1852 commit d8c38d5d11fbb1f4fcfb590168b8785497ab868a Author: akashrn5Date: 2018-01-23T13:29:51Z fix create preaggregate on decimal column in hive metastore ---
[GitHub] carbondata pull request #1693: [CARBONDATA-1909] Load is failing during inse...
Github user ManoharVanam commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1693#discussion_r163250438 --- Diff: core/src/main/java/org/apache/carbondata/core/constants/CarbonCommonConstants.java --- @@ -864,6 +864,18 @@ public static final String CARBON_MERGE_SORT_PREFETCH = "carbon.merge.sort.prefetch"; public static final String CARBON_MERGE_SORT_PREFETCH_DEFAULT = "true"; + /** + * to determine to use the rdd persist or not for insert query. --- End diff -- done ---
[GitHub] carbondata issue #1848: [WIP][CARBONDATA-2068][CARBONDATA-1516][Tests] Drop ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1848 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/1829/ ---