[GitHub] carbondata issue #1162: [CARBONDATA-1295] Optimize test cases,merge three Qu...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1162 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/440/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1142: [CARBONDATA-1271] Enhanced Performance for Hive Inte...
Github user chenliang613 commented on the issue: https://github.com/apache/carbondata/pull/1142 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1296) Add cluster test cases
Liang Chen created CARBONDATA-1296: -- Summary: Add cluster test cases Key: CARBONDATA-1296 URL: https://issues.apache.org/jira/browse/CARBONDATA-1296 Project: CarbonData Issue Type: Sub-task Components: test Reporter: Liang Chen Add test cases to run on Hadoop and spark cluster environment -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1142: [CARBONDATA-1271] Enhanced Performance for Hive Inte...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1142 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3031/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/441/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1299) delete segment not work
cocopc created CARBONDATA-1299: -- Summary: delete segment not work Key: CARBONDATA-1299 URL: https://issues.apache.org/jira/browse/CARBONDATA-1299 Project: CarbonData Issue Type: Bug Components: sql Affects Versions: 1.1.0 Reporter: cocopc env: spark2.1 ,carbondata-1.1.0 sql :delete from table default.test_table where SEGMENT.ID in (0); result: {code:java} java.lang.RuntimeException: BaseSqlParser Operation not allowed: delete from(line 1, pos 0) == SQL == delete from table default.test_table where SEGMENT in (0) ^^^ CarbonSqlParser [1.8] failure: identifier matching regex (?i)SEGMENTS expected delete from table default.test_table where SEGMENT in (0) ^ {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata pull request #984: [CARBONDATA-1008] Make Carbon table schema com...
Github user anubhav100 commented on a diff in the pull request: https://github.com/apache/carbondata/pull/984#discussion_r126867371 --- Diff: integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala --- @@ -189,10 +189,52 @@ case class CreateTable(cm: TableModel) extends RunnableCommand { // Need to fill partitioner class when we support partition val tablePath = catalog.createTableFromThrift(tableInfo, dbName, tbName, null)(sqlContext) try { -sqlContext.sql( - s"""CREATE TABLE $dbName.$tbName USING carbondata""" + - s""" OPTIONS (tableName "$dbName.$tbName", tablePath "$tablePath") """) - .collect +val useCompatibleSchema = sqlContext.sparkContext.conf --- End diff -- @cenyuhai changes in this class is not required till now we are only supporting hive with spark 2.1 and hadoop 2.7 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1159: [CARBONDATA-1274] added example for update an...
Github user chenliang613 commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1159#discussion_r126856684 --- Diff: examples/spark2/src/main/scala/org/apache/carbondata/examples/DataUpdateDeleteExample.scala --- @@ -0,0 +1,183 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + *http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.carbondata.examples + +import java.io.File + +import org.apache.spark.sql.SaveMode +import org.apache.spark.sql.SparkSession + +import org.apache.carbondata.core.constants.CarbonCommonConstants +import org.apache.carbondata.core.util.CarbonProperties + +object DataUpdateDeleteExample { + + def main(args: Array[String]) { + +val rootPath = new File(this.getClass.getResource("/").getPath + + "../../../..").getCanonicalPath + +var hdfsStoreFlg = false; +if (args != null && args.size > 0) { + if ("true".equalsIgnoreCase(args(0))) { +hdfsStoreFlg = true + } +} +var storeLocation = s"$rootPath/examples/spark2/target/store" +var warehouse = s"$rootPath/examples/spark2/target/warehouse" +var metastoredb = s"$rootPath/examples/spark2/target" +var testData = s"$rootPath/examples/spark2/src/main/resources/data_update.csv" +if (hdfsStoreFlg) { + storeLocation = "hdfs://nameservice1/carbon2/data/" + warehouse = "hdfs://nameservice1/carbon2/warehouse/" + metastoredb = "hdfs://nameservice1/carbon2/carbonstore/" + testData = "hdfs://nameservice1/carbon2/data_update.csv" +} + +import org.apache.spark.sql.CarbonSession._ +val spark = SparkSession + .builder() + .master("local") + .appName("CarbonSessionExample") + .config("spark.sql.warehouse.dir", warehouse) + .config("spark.driver.host", "localhost") + .config("spark.sql.crossJoin.enabled", "true") + .getOrCreateCarbonSession(storeLocation, metastoredb) +spark.sparkContext.setLogLevel("WARN") + +// Specify date format based on raw data +CarbonProperties.getInstance() + .addProperty(CarbonCommonConstants.CARBON_DATE_FORMAT, "/MM/dd") + +import spark.implicits._ +// Drop table +spark.sql("DROP TABLE IF EXISTS update_table") +spark.sql("DROP TABLE IF EXISTS big_table") + +spark.sql(s""" + CREATE TABLE IF NOT EXISTS update_table + (ID Int, country String, + name String, phonetype String, serialname char(10), salary Int) + STORED BY 'carbondata' + """) + +spark.sql(s""" + LOAD DATA LOCAL INPATH '$testData' INTO TABLE update_table + """) + +val df = spark.sparkContext.parallelize(1 to 200) + .map(x => (x, "name" + (100 + x), "2017/07/" + (x % 20 + 1), +"china", 2 * x)) + .toDF("id", "name", "date", "country", "salary") +df.write + .format("carbondata") + .option("tableName", "big_table") + .option("tempCSV", "true") + .option("compress", "true") + .mode(SaveMode.Overwrite) + .save() + +// loop update and delete in big_table +var loopCnt = 5 +for (index <- 1 to loopCnt) { + // Update country with simple SET + var name = "name" + (120 + index) + spark.sql(s""" + UPDATE big_table SET (country) = ('india') WHERE name = '$name' + """).show() + // Query data after the above update + spark.sql(s""" + SELECT * FROM big_table WHERE name = '$name' + """).show() + + // Update date with simple SET + name = "name" + (120 + loopCnt + index) + spark.sql(s""" + UPDATE big_table SET (date) = ('2018/08/08') WHERE name = '$name' + """).show() + // Query data after the above update + spark.sql(s""" + SELECT *
[GitHub] carbondata pull request #1155: [CARBONDATA-1283] Carbon should continue with...
Github user gvramana commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1155#discussion_r126867396 --- Diff: core/src/main/java/org/apache/carbondata/core/constants/CarbonCommonConstants.java --- @@ -329,6 +329,15 @@ */ public static final String CSV_READ_BUFFER_SIZE_DEFAULT = "5"; /** + * min value for csv read buffer size + */ + public static final int CSV_READ_BUFFER_SIZE_MIN = 10240; //10 kb + /** + * max value for csv read buffer size + */ + public static final int CSV_READ_BUFFER_SIZE_MAX = 10485760; // 10 mb --- End diff -- This size can be 100mb. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1142: [CARBONDATA-1271] Enhanced Performance for Hive Inte...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1142 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/443/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1162: [CARBONDATA-1295] Optimize test cases,merge three Qu...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1162 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3028/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1159: [CARBONDATA-1274] added example for update and delet...
Github user chenliang613 commented on the issue: https://github.com/apache/carbondata/pull/1159 @mayunSaicmotor Please change PR title to : Adds example for update and delete with Spark 2.1 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Assigned] (CARBONDATA-1295) Optimize QueryTest: Merge three QueryTest class into one
[ https://issues.apache.org/jira/browse/CARBONDATA-1295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Chen reassigned CARBONDATA-1295: -- Assignee: Liang Chen > Optimize QueryTest: Merge three QueryTest class into one > > > Key: CARBONDATA-1295 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1295 > Project: CarbonData > Issue Type: Sub-task > Components: test >Reporter: Liang Chen >Assignee: Liang Chen > -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1142: [CARBONDATA-1271] Enhanced Performance for Hive Inte...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1142 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/442/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1149: [CARBONDATA-1284]Implement hive based schema storage...
Github user gvramana commented on the issue: https://github.com/apache/carbondata/pull/1149 @ravipesala Previously schema refresh is using store level timestamp file and then table schema file modification timestamp. How now refresh is detected? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1155: [CARBONDATA-1283] Carbon should continue with...
Github user gvramana commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1155#discussion_r126867170 --- Diff: core/src/main/java/org/apache/carbondata/core/util/CarbonProperties.java --- @@ -98,6 +100,119 @@ private void validateAndLoadDefaultProperties() { validateBlockletGroupSizeInMB(); validateNumberOfColumnPerIORead(); validateNumberOfRowsPerBlockletColumnPage(); +validateEnableUnsafeSort(); +validateCustomBlockDistribution(); +validateEnableVectorReader(); +validateLockType(); +validateCarbonCSVReadBufferSizeByte(); + } + + private void validateCarbonCSVReadBufferSizeByte() { +String csvReadBufferSizeStr = + carbonProperties.getProperty(CarbonCommonConstants.CSV_READ_BUFFER_SIZE); +if (null != csvReadBufferSizeStr) { + try { +int bufferSize = Integer.parseInt(csvReadBufferSizeStr); +if (bufferSize < CarbonCommonConstants.CSV_READ_BUFFER_SIZE_MIN +|| bufferSize > CarbonCommonConstants.CSV_READ_BUFFER_SIZE_MAX) { + LOGGER.warn("The value \"" + csvReadBufferSizeStr + "\" configured for key " + + CarbonCommonConstants.CSV_READ_BUFFER_SIZE + + "\" is not in range. Valid range us (byte) \"" + + CarbonCommonConstants.CSV_READ_BUFFER_SIZE_MIN + " to \"" + + CarbonCommonConstants.CSV_READ_BUFFER_SIZE_MAX + ". Using the default value \"" + + CarbonCommonConstants.CSV_READ_BUFFER_SIZE_DEFAULT); + carbonProperties.setProperty(CarbonCommonConstants.CSV_READ_BUFFER_SIZE, + CarbonCommonConstants.CSV_READ_BUFFER_SIZE_DEFAULT); +} + } catch (NumberFormatException nfe) { +LOGGER.warn("The value \"" + csvReadBufferSizeStr + "\" configured for key " ++ CarbonCommonConstants.CSV_READ_BUFFER_SIZE ++ "\" is invalid. Using the default value \"" ++ CarbonCommonConstants.CSV_READ_BUFFER_SIZE_DEFAULT); + carbonProperties.setProperty(CarbonCommonConstants.CSV_READ_BUFFER_SIZE, +CarbonCommonConstants.CSV_READ_BUFFER_SIZE_DEFAULT); + } +} + } + + private void validateLockType() { +String lockTypeConfigured = carbonProperties.getProperty(CarbonCommonConstants.LOCK_TYPE); +if (null != lockTypeConfigured) { + switch (lockTypeConfigured.toUpperCase()) { +case CarbonCommonConstants.CARBON_LOCK_TYPE_LOCAL: +case CarbonCommonConstants.CARBON_LOCK_TYPE_ZOOKEEPER: +case CarbonCommonConstants.CARBON_LOCK_TYPE_HDFS: + break; +default: + configureDefaultLockType(lockTypeConfigured); + } +} else { + configureDefaultLockType(lockTypeConfigured); +} + } + + /** + * the method decide and set the lock type based on the configured system type + * + * @param lockTypeConfigured + */ + private void configureDefaultLockType(String lockTypeConfigured) { +Configuration configuration = new Configuration(true); +String defaultFs = configuration.get("fs.defaultFS"); --- End diff -- How about the case when default.fs is hdfs but carbon store path is configured as local file path. Alluxio fs what is behaviour of locking? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1300) delete segment not work
cocopc created CARBONDATA-1300: -- Summary: delete segment not work Key: CARBONDATA-1300 URL: https://issues.apache.org/jira/browse/CARBONDATA-1300 Project: CarbonData Issue Type: Bug Components: sql Affects Versions: 1.1.0 Reporter: cocopc env: spark2.1 ,carbondata-1.1.0 sql :delete from table default.test_table where SEGMENT.ID in (0); result: {code:java} java.lang.RuntimeException: BaseSqlParser Operation not allowed: delete from(line 1, pos 0) == SQL == delete from table default.test_table where SEGMENT in (0) ^^^ CarbonSqlParser [1.8] failure: identifier matching regex (?i)SEGMENTS expected delete from table default.test_table where SEGMENT in (0) ^ {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1156: [CARBONDATA-1279] Push down for some select queries ...
Github user gvramana commented on the issue: https://github.com/apache/carbondata/pull/1156 Please change title to push down like "%xxx" , ends with. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3029/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1142: [CARBONDATA-1271] Enhanced Performance for Hive Inte...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1142 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3030/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Assigned] (CARBONDATA-1297) Consolidate same test cases for spark 1.x and spark 2.x
[ https://issues.apache.org/jira/browse/CARBONDATA-1297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Chen reassigned CARBONDATA-1297: -- Assignee: Liang Chen > Consolidate same test cases for spark 1.x and spark 2.x > --- > > Key: CARBONDATA-1297 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1297 > Project: CarbonData > Issue Type: Sub-task > Components: test >Reporter: Liang Chen >Assignee: Liang Chen > -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1115: [CARBONDATA-1247]Block pruning not working for date ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1115 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3018/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1147: [CARBONDATA-1277] Dictionary generation failure if t...
Github user gvramana commented on the issue: https://github.com/apache/carbondata/pull/1147 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1149: [CARBONDATA-1284]Implement hive based schema storage...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1149 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/431/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1149: [CARBONDATA-1284]Implement hive based schema storage...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1149 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3019/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1291) CarbonData query performace improvement when number of carbon blocks are high
kumar vishal created CARBONDATA-1291: Summary: CarbonData query performace improvement when number of carbon blocks are high Key: CARBONDATA-1291 URL: https://issues.apache.org/jira/browse/CARBONDATA-1291 Project: CarbonData Issue Type: Bug Reporter: kumar vishal Assignee: kumar vishal When number of carbon blocks are high query performance is bad -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Resolved] (CARBONDATA-1277) Dictionary generation failure if there is failure in closing output stream in HDFS
[ https://issues.apache.org/jira/browse/CARBONDATA-1277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Venkata Ramana G resolved CARBONDATA-1277. -- Resolution: Fixed > Dictionary generation failure if there is failure in closing output stream in > HDFS > -- > > Key: CARBONDATA-1277 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1277 > Project: CarbonData > Issue Type: Bug >Reporter: Manish Gupta >Assignee: Manish Gupta > Fix For: 1.2.0 > > Time Spent: 2h 40m > Remaining Estimate: 0h > > If there is any failure while closing the output stream of dictionary file in > HDFS then on next data load, update or insert into operation dictionary > generation fails. This is because we open the dictionary file in append mode > and when we try to get the output stream for that file HDFS throws an > exception that Lease is already acquired by some other client. Exception > trace as below. > java.io.IOException: Failed to APPEND_FILE > /user/hive/warehouse/carbon.store/test/t12/Metadata/010497af-1833-4804-a7ec-849ab7b9bf10.dictmeta > for DFSClient_NONMAPREDUCE_-1904161438_159 on 172.168.100.212 because lease > recovery is in progress. Try again later. > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.recoverLeaseInternal(FSNamesystem.java:2901) > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFileInternal(FSNamesystem.java:2655) > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFileInt(FSNamesystem.java:2968) > at > org.apache.hadoop.hdfs.server.namenode.FSNamesystem.appendFile(FSNamesystem.java:2937) > at > org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.append(NameNodeRpcServer.java:776) > at > org.apache.hadoop.hdfs.protocolPB.ClientNamenodeProtocolServerSideTranslatorPB.append(ClientNamenodeProtocolServerSideTranslatorPB.java:458) > at > org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$ClientNamenodeProtocol$2.callBlockingMethod(ClientNamenodeProtocolProtos.java) > at > org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata pull request #1149: [CARBONDATA-1284]Implement hive based schema ...
Github user ravipesala commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1149#discussion_r126654369 --- Diff: integration/spark2/src/main/scala/org/apache/carbondata/spark/util/CarbonSparkUtil.scala --- @@ -42,4 +44,14 @@ object CarbonSparkUtil { } CarbonMetaData(dimensionsAttr, measureAttr, carbonTable, DictionaryMap(dictionary.toMap)) } + + def createCarbonRelation(tableInfo: TableInfo, tablePath: String): CarbonRelation = { +val identifier = AbsoluteTableIdentifier fromTablePath(tablePath) --- End diff -- ok --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1155: [CARBONDATA-1283] Carbon should continue with the de...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1155 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/432/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1158: [CARBONDATA-1274] added example for update and delet...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1158 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3021/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1115: [CARBONDATA-1247]Block pruning not working for date ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1115 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/430/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1292) why carbondata not support realtime insert
cocopc created CARBONDATA-1292: -- Summary: why carbondata not support realtime insert Key: CARBONDATA-1292 URL: https://issues.apache.org/jira/browse/CARBONDATA-1292 Project: CarbonData Issue Type: Wish Components: data-load Affects Versions: NONE Reporter: cocopc Fix For: 1.2.0 Why carbondata not support realtime insert ,For example integrate with sparkstreaming. -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1155: [CARBONDATA-1283] Carbon should continue with the de...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1155 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3020/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1147: [CARBONDATA-1277] Dictionary generation failu...
Github user asfgit closed the pull request at: https://github.com/apache/carbondata/pull/1147 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1158: [CARBONDATA-1274] added example for update an...
GitHub user anubhav100 opened a pull request: https://github.com/apache/carbondata/pull/1158 [CARBONDATA-1274] added example for update and delete 1.added examples for update and delete in spark 2.1 2.Manually tested with running examples You can merge this pull request into a Git repository by running: $ git pull https://github.com/anubhav100/incubator-carbondata CARBONDATA-1274 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1158.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1158 commit 524edd23a4ab4a6a3d26eaf41817f89eaa45c0c7 Author: anubhav100Date: 2017-07-11T11:11:47Z added example for update and delete --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1158: [CARBONDATA-1274] added example for update and delet...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1158 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1158: [CARBONDATA-1274] added example for update and delet...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1158 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1158: [CARBONDATA-1274] added example for update and delet...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1158 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/433/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1149: [CARBONDATA-1284]Implement hive based schema storage...
Github user jackylk commented on the issue: https://github.com/apache/carbondata/pull/1149 LGTM Test case need to be solved in another PR --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Commented] (CARBONDATA-1290) [branch-1.1]-max() and delete problem
[ https://issues.apache.org/jira/browse/CARBONDATA-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16082104#comment-16082104 ] sehriff commented on CARBONDATA-1290: - max works right because column id is of string type,but delete problem is valid. > [branch-1.1]-max() and delete problem > - > > Key: CARBONDATA-1290 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1290 > Project: CarbonData > Issue Type: Bug >Reporter: sehriff > > 1.max function is not return the right result; > scala> cc.sql("select * from qqdata2.fullappend where > id=1999").show(false) > +++--+-+--++--++ > |id |qqnum |nick |age > |gender|auth|qunnum|mvcc| > +++--+-+--++--++ > |1999|19991999|2009-05-27|1999c1999|1 |1 > |1999dd1999|1 | > +++--+-+--++--++ > scala> cc.sql("select max(id) from qqdata2.fullappend ").show(false) > +---+ > |max(id)| > +---+ > |999| > +---+ > 2.delete error > scala> cc.sql("delete from qqdata2.fullappend where id>1 and id<10").show > 17/07/11 17:32:33 AUDIT ProjectForDeleteCommand:[Thread-1] Delete data > request has been received for qqdata2.fullappend. > [Stage 21:> (0 + 2) / > 2]17/07/11 17:32:52 WARN TaskSetManager: Lost task 1.0 in stage 21.0 (TID 40, > executor 2): java.lang.ArrayIndexOutOfBoundsException: 1 > at > org.apache.carbondata.core.mutate.CarbonUpdateUtil.getRequiredFieldFromTID(CarbonUpdateUtil.java:67) > at > org.apache.carbondata.core.mutate.CarbonUpdateUtil.getSegmentWithBlockFromTID(CarbonUpdateUtil.java:76) > at > org.apache.spark.sql.execution.command.deleteExecution$$anonfun$4.apply(IUDCommands.scala:555) > at > org.apache.spark.sql.execution.command.deleteExecution$$anonfun$4.apply(IUDCommands.scala:552) > at scala.collection.Iterator$$anon$11.next(Iterator.scala:409) > at > org.apache.spark.shuffle.sort.BypassMergeSortShuffleWriter.write(BypassMergeSortShuffleWriter.java:150) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:96) > at > org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:53) > at org.apache.spark.scheduler.Task.run(Task.scala:99) > at > org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:322) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1115: [CARBONDATA-1247]Block pruning not working for date ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1115 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/434/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1159: [CARBONDATA-1274] added example for update and delet...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1159 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1159: [CARBONDATA-1274] added example for update an...
GitHub user mayunSaicmotor opened a pull request: https://github.com/apache/carbondata/pull/1159 [CARBONDATA-1274] added example for update and delete added detailed examples for update and delete in spark 2.1 and spark1.6 You can merge this pull request into a Git repository by running: $ git pull https://github.com/mayunSaicmotor/incubator-carbondata CARBONDATA-1274-NEW Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1159.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1159 commit f93fbbfa02db6191a5f7081a33a8992598b0b77e Author: mayunDate: 2017-07-11T14:56:10Z add update and delete examples --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1159: [CARBONDATA-1274] added example for update and delet...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1159 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1290) [branch-1.1]-max() and delete problem
sehriff created CARBONDATA-1290: --- Summary: [branch-1.1]-max() and delete problem Key: CARBONDATA-1290 URL: https://issues.apache.org/jira/browse/CARBONDATA-1290 Project: CarbonData Issue Type: Bug Reporter: sehriff 1.max function is not return the right result; scala> cc.sql("select * from qqdata2.fullappend where id=1999").show(false) +++--+-+--++--++ |id |qqnum |nick |age |gender|auth|qunnum|mvcc| +++--+-+--++--++ |1999|19991999|2009-05-27|1999c1999|1 |1 |1999dd1999|1 | +++--+-+--++--++ scala> cc.sql("select max(id) from qqdata2.fullappend ").show(false) +---+ |max(id)| +---+ |999| +---+ 2.delete error scala> cc.sql("delete from qqdata2.fullappend where id>1 and id<10").show 17/07/11 17:32:33 AUDIT ProjectForDeleteCommand:[Thread-1] Delete data request has been received for qqdata2.fullappend. [Stage 21:> (0 + 2) / 2]17/07/11 17:32:52 WARN TaskSetManager: Lost task 1.0 in stage 21.0 (TID 40, executor 2): java.lang.ArrayIndexOutOfBoundsException: 1 at org.apache.carbondata.core.mutate.CarbonUpdateUtil.getRequiredFieldFromTID(CarbonUpdateUtil.java:67) at org.apache.carbondata.core.mutate.CarbonUpdateUtil.getSegmentWithBlockFromTID(CarbonUpdateUtil.java:76) at org.apache.spark.sql.execution.command.deleteExecution$$anonfun$4.apply(IUDCommands.scala:555) at org.apache.spark.sql.execution.command.deleteExecution$$anonfun$4.apply(IUDCommands.scala:552) at scala.collection.Iterator$$anon$11.next(Iterator.scala:409) at org.apache.spark.shuffle.sort.BypassMergeSortShuffleWriter.write(BypassMergeSortShuffleWriter.java:150) at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:96) at org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:53) at org.apache.spark.scheduler.Task.run(Task.scala:99) at org.apache.spark.executor.Executor$TaskRunner.run(Executor.scala:322) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) at java.lang.Thread.run(Thread.java:745) -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1115: [CARBONDATA-1247]Block pruning not working for date ...
Github user BJangir commented on the issue: https://github.com/apache/carbondata/pull/1115 please retest --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1115: [CARBONDATA-1247]Block pruning not working for date ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1115 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3022/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1159: [CARBONDATA-1274] added example for update and delet...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1159 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3023/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1159: [CARBONDATA-1274] added example for update and delet...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1159 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/435/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1161: [CARBONDATA-1293] update on carbon data failed with ...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1161 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3027/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1161: [CARBONDATA-1293] update on carbon data faile...
GitHub user mohammadshahidkhan opened a pull request: https://github.com/apache/carbondata/pull/1161 [CARBONDATA-1293] update on carbon data failed with carbon.update.persist.enable false You can merge this pull request into a Git repository by running: $ git pull https://github.com/mohammadshahidkhan/incubator-carbondata iud_ispersistenablefalse_prob Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1161.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1161 commit 03b36469383eb5060d910060677d5081d5b23c3c Author: mohammadshahidkhanDate: 2017-07-11T16:50:05Z [CARBONDATA-1293] update on carbon data failed with carbon.update.persist.enable false --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1161: [CARBONDATA-1293] update on carbon data failed with ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1161 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/437/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3026/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1161: [CARBONDATA-1293] update on carbon data failed with ...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1161 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1161: [CARBONDATA-1293] update on carbon data failed with ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1161 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3025/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/438/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #984: [CARBONDATA-1008] Make Carbon table schema compatible...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/984 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/439/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1160: [WIP] change command to update schema and dat...
GitHub user jackylk opened a pull request: https://github.com/apache/carbondata/pull/1160 [WIP] change command to update schema and data separately You can merge this pull request into a Git repository by running: $ git pull https://github.com/jackylk/incubator-carbondata metadata Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1160.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1160 commit 1f59e868d595994949ba45c641b4493a01e9ea13 Author: jackylkDate: 2017-07-11T16:40:44Z change command --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1293) update on carbon data failed with carbon.update.persist.enable false
Mohammad Shahid Khan created CARBONDATA-1293: Summary: update on carbon data failed with carbon.update.persist.enable false Key: CARBONDATA-1293 URL: https://issues.apache.org/jira/browse/CARBONDATA-1293 Project: CarbonData Issue Type: Bug Components: data-load Reporter: Mohammad Shahid Khan Priority: Critical -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1160: [WIP] change command to update schema and data separ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1160 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3024/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1160: [WIP] change command to update schema and data separ...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1160 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/436/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1146: [CARBONDATA-1286] Change Query related RDD to...
Github user jackylk closed the pull request at: https://github.com/apache/carbondata/pull/1146 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1146: [CARBONDATA-1286] Change Query related RDD to use Ta...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1146 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/425/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1146: [CARBONDATA-1286] Change Query related RDD to...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1146#discussion_r126603318 --- Diff: core/src/main/java/org/apache/carbondata/core/metadata/schema/table/WritableUtil.java --- @@ -0,0 +1,65 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under one or more + * contributor license agreements. See the NOTICE file distributed with + * this work for additional information regarding copyright ownership. + * The ASF licenses this file to You under the Apache License, Version 2.0 + * (the "License"); you may not use this file except in compliance with + * the License. You may obtain a copy of the License at + * + *http://www.apache.org/licenses/LICENSE-2.0 + * + * Unless required by applicable law or agreed to in writing, software + * distributed under the License is distributed on an "AS IS" BASIS, + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + * See the License for the specific language governing permissions and + * limitations under the License. + */ + +package org.apache.carbondata.core.metadata.schema.table; + +import java.io.DataInput; +import java.io.DataOutput; +import java.io.IOException; + +public class WritableUtil { + + public static void writeString(DataOutput out, String string) throws IOException { --- End diff -- fixed --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1146: [CARBONDATA-1286] Change Query related RDD to...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1146#discussion_r126603309 --- Diff: core/src/main/java/org/apache/carbondata/core/metadata/schema/table/TableSchema.java --- @@ -198,4 +201,28 @@ public PartitionInfo getPartitionInfo() { public void setPartitionInfo(PartitionInfo partitionInfo) { this.partitionInfo = partitionInfo; } + + @Override + public void write(DataOutput out) throws IOException { +WritableUtil.writeString(out, tableId); --- End diff -- fixed --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1146: [CARBONDATA-1286] Change Query related RDD to...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1146#discussion_r126603356 --- Diff: core/src/main/java/org/apache/carbondata/core/metadata/schema/table/TableInfo.java --- @@ -206,4 +202,68 @@ public void setStorePath(String storePath) { } return true; } + + /** + * This method will return the table size. Default table block size will be considered + * in case not specified by the user + */ + int getTableBlockSizeInMB() { +String tableBlockSize = null; +// In case of old store there will not be any map for table properties so table properties +// will be null +MaptableProperties = getFactTable().getTableProperties(); +if (null != tableProperties) { + tableBlockSize = tableProperties.get(CarbonCommonConstants.TABLE_BLOCKSIZE); +} +if (null == tableBlockSize) { + tableBlockSize = CarbonCommonConstants.BLOCK_SIZE_DEFAULT_VAL; + LOGGER.info("Table block size not specified for " + getTableUniqueName() + + ". Therefore considering the default value " + + CarbonCommonConstants.BLOCK_SIZE_DEFAULT_VAL + " MB"); +} +return Integer.parseInt(tableBlockSize); + } + + @Override + public void write(DataOutput out) throws IOException { +WritableUtil.writeString(out, databaseName); --- End diff -- fixed --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1146: [CARBONDATA-1286] Change Query related RDD to...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1146#discussion_r126603344 --- Diff: core/src/main/java/org/apache/carbondata/core/metadata/schema/table/TableInfo.java --- @@ -206,4 +202,68 @@ public void setStorePath(String storePath) { } return true; } + + /** + * This method will return the table size. Default table block size will be considered + * in case not specified by the user + */ + int getTableBlockSizeInMB() { +String tableBlockSize = null; +// In case of old store there will not be any map for table properties so table properties +// will be null +MaptableProperties = getFactTable().getTableProperties(); +if (null != tableProperties) { + tableBlockSize = tableProperties.get(CarbonCommonConstants.TABLE_BLOCKSIZE); +} +if (null == tableBlockSize) { + tableBlockSize = CarbonCommonConstants.BLOCK_SIZE_DEFAULT_VAL; + LOGGER.info("Table block size not specified for " + getTableUniqueName() + + ". Therefore considering the default value " + + CarbonCommonConstants.BLOCK_SIZE_DEFAULT_VAL + " MB"); +} +return Integer.parseInt(tableBlockSize); + } + + @Override + public void write(DataOutput out) throws IOException { +WritableUtil.writeString(out, databaseName); +WritableUtil.writeString(out, tableUniqueName); +factTable.write(out); +out.writeLong(lastUpdatedTime); +WritableUtil.writeString(out, metaDataFilepath); +WritableUtil.writeString(out, storePath); + } + + @Override + public void readFields(DataInput in) throws IOException { +this.databaseName = WritableUtil.readString(in); --- End diff -- fixed --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Commented] (CARBONDATA-1250) 13. Change default partition id from Max to 0
[ https://issues.apache.org/jira/browse/CARBONDATA-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081759#comment-16081759 ] Cao, Lionel commented on CARBONDATA-1250: - This change will be used in later alter table partition feature. > 13. Change default partition id from Max to 0 > - > > Key: CARBONDATA-1250 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1250 > Project: CarbonData > Issue Type: Sub-task > Components: data-load, sql >Reporter: Cao, Lionel >Assignee: Cao, Lionel > > Change default partition id from Max to 0 -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata issue #1102: [CARBONDATA-1098] Change page statistics use exact t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1102 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3014/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1102: [CARBONDATA-1098] Change page statistics use exact t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1102 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/427/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1117: [CARBONDATA-757] Big decimal optimization
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1117 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3015/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1146: [CARBONDATA-1286] Change Query related RDD to use Ta...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1146 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3016/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1146: [CARBONDATA-1286] Change Query related RDD to use Ta...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1146 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/428/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1146: [CARBONDATA-1286] Change Query related RDD to use Ta...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/1146 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1133: [CARBONDATA-1261] Load data sql add 'header' ...
Github user QiangCai closed the pull request at: https://github.com/apache/carbondata/pull/1133 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1149: [CARBONDATA-1284]Implement hive based schema ...
Github user QiangCai commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1149#discussion_r126610639 --- Diff: integration/spark2/src/main/scala/org/apache/carbondata/spark/util/CarbonSparkUtil.scala --- @@ -42,4 +44,14 @@ object CarbonSparkUtil { } CarbonMetaData(dimensionsAttr, measureAttr, carbonTable, DictionaryMap(dictionary.toMap)) } + + def createCarbonRelation(tableInfo: TableInfo, tablePath: String): CarbonRelation = { +val identifier = AbsoluteTableIdentifier fromTablePath(tablePath) --- End diff -- AbsoluteTableIdentifier.fromTablePath(tablePath) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1117: [CARBONDATA-757] Big decimal optimization
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1117 Build Success with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/426/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1133: [CARBONDATA-1261] Load data sql add 'header' option
Github user jackylk commented on the issue: https://github.com/apache/carbondata/pull/1133 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1144: [CARBONDATA-1273] String datatype will be no ...
Github user QiangCai closed the pull request at: https://github.com/apache/carbondata/pull/1144 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata pull request #1102: [CARBONDATA-1098] Change page statistics use ...
Github user jackylk commented on a diff in the pull request: https://github.com/apache/carbondata/pull/1102#discussion_r126616780 --- Diff: core/src/main/java/org/apache/carbondata/core/datastore/page/statistics/SimpleStatsResult.java --- @@ -15,9 +15,25 @@ * limitations under the License. */ -package org.apache.carbondata.core.datastore.page.compression; +package org.apache.carbondata.core.datastore.page.statistics; -public interface Compression { - byte[] compress(byte[] input); - byte[] decompress(byte[] input); +import java.util.BitSet; + +import org.apache.carbondata.core.metadata.datatype.DataType; + +public interface SimpleStatsResult { + + byte[] getMinAsBytes(); + + byte[] getMaxAsBytes(); + + Object getMin(); + + Object getMax(); + + BitSet getNullBits(); --- End diff -- fixed --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1117: [CARBONDATA-757] Big decimal optimization
Github user jackylk commented on the issue: https://github.com/apache/carbondata/pull/1117 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1125: [CARBONDATA-1250] Change default partition id & Add ...
Github user QiangCai commented on the issue: https://github.com/apache/carbondata/pull/1125 local verify passed for spark1.6 and spark2.1 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1144: [CARBONDATA-1273] String datatype will be no diction...
Github user jackylk commented on the issue: https://github.com/apache/carbondata/pull/1144 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1102: [CARBONDATA-1098] Change page statistics use exact t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1102 Build Failed with Spark 1.6, Please check CI http://144.76.159.231:8080/job/ApacheCarbonPRBuilder/429/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1102: [CARBONDATA-1098] Change page statistics use exact t...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/1102 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder/3017/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Created] (CARBONDATA-1294) Test Case refactor and optimization
Liang Chen created CARBONDATA-1294: -- Summary: Test Case refactor and optimization Key: CARBONDATA-1294 URL: https://issues.apache.org/jira/browse/CARBONDATA-1294 Project: CarbonData Issue Type: Improvement Components: test Reporter: Liang Chen -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Updated] (CARBONDATA-1294) Test Case refactor and optimization
[ https://issues.apache.org/jira/browse/CARBONDATA-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang Chen updated CARBONDATA-1294: --- Description: 1. Fix same test cases name issues. 2. Consolidate same test cases for spark 1.x and spark 2.x 3. Add cluster mode test cases > Test Case refactor and optimization > --- > > Key: CARBONDATA-1294 > URL: https://issues.apache.org/jira/browse/CARBONDATA-1294 > Project: CarbonData > Issue Type: Improvement > Components: test >Reporter: Liang Chen > > 1. Fix same test cases name issues. > 2. Consolidate same test cases for spark 1.x and spark 2.x > 3. Add cluster mode test cases -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Created] (CARBONDATA-1295) Optimize QueryTest: Merge three QueryTest class into one
Liang Chen created CARBONDATA-1295: -- Summary: Optimize QueryTest: Merge three QueryTest class into one Key: CARBONDATA-1295 URL: https://issues.apache.org/jira/browse/CARBONDATA-1295 Project: CarbonData Issue Type: Sub-task Components: test Reporter: Liang Chen -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Created] (CARBONDATA-1297) Consolidate same test cases for spark 1.x and spark 2.x
Liang Chen created CARBONDATA-1297: -- Summary: Consolidate same test cases for spark 1.x and spark 2.x Key: CARBONDATA-1297 URL: https://issues.apache.org/jira/browse/CARBONDATA-1297 Project: CarbonData Issue Type: Sub-task Components: test Reporter: Liang Chen -- This message was sent by Atlassian JIRA (v6.4.14#64029)
[GitHub] carbondata pull request #1162: [CARBONDATA-1295] Optimize test cases,merge t...
GitHub user chenliang613 opened a pull request: https://github.com/apache/carbondata/pull/1162 [CARBONDATA-1295] Optimize test cases,merge three QueryTest class into one Optimize test cases,merge three QueryTest class into one, and update related code. You can merge this pull request into a Git repository by running: $ git pull https://github.com/chenliang613/carbondata opti_testcase Alternatively you can review and apply these changes as the patch at: https://github.com/apache/carbondata/pull/1162.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #1162 commit b2740f1e3f3e3bbd2ce29794a8adb45c7e96a9a0 Author: chenliang613Date: 2017-07-12T02:45:22Z [CARBONDATA-1295] Optimize test cases,merge three QueryTest class into one --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[GitHub] carbondata issue #1162: [CARBONDATA-1295] Optimize test cases,merge three Qu...
Github user asfgit commented on the issue: https://github.com/apache/carbondata/pull/1162 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---