[GitHub] carbondata issue #2129: [CARBONDATA-2298][BACKPORT-1.3]Delete segment lock f...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/2129 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4238/ ---
[jira] [Created] (CARBONDATA-2303) If dataload is failed for parition table then cleanup is not working.
Rahul Kumar created CARBONDATA-2303: --- Summary: If dataload is failed for parition table then cleanup is not working. Key: CARBONDATA-2303 URL: https://issues.apache.org/jira/browse/CARBONDATA-2303 Project: CarbonData Issue Type: Bug Reporter: Rahul Kumar Test Step : 1. create table 2. load data (make sure data load is failed either manually or other) 3. clean files for table *Expected Output*: after clean files data from HDFS should be delete for segments which is Marked for delete. *Actual Output:*Alter cleanup ,data are not deleted from HDFS -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2128 Build Success with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3506/ ---
[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2128 Build Success with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4733/ ---
[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/2056 SDV Build Fail , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4239/ ---
[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...
Github user xuchuanyin commented on the issue: https://github.com/apache/carbondata/pull/2056 retest this please ---
[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2109 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4736/ ---
[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2109 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3509/ ---
[GitHub] carbondata issue #2046: [CARBONDATA-2239] Added sdv test cases for querying ...
Github user geetikagupta16 commented on the issue: https://github.com/apache/carbondata/pull/2046 retest this please ---
[GitHub] carbondata issue #2105: [CARBONDATA-2286][SDV] Added sdv test cases for stre...
Github user geetikagupta16 commented on the issue: https://github.com/apache/carbondata/pull/2105 retest this please ---
[jira] [Created] (CARBONDATA-2304) Enhance compaction performance by enabling prefetch
xuchuanyin created CARBONDATA-2304: -- Summary: Enhance compaction performance by enabling prefetch Key: CARBONDATA-2304 URL: https://issues.apache.org/jira/browse/CARBONDATA-2304 Project: CarbonData Issue Type: Improvement Components: data-load Reporter: xuchuanyin Assignee: xuchuanyin During compaction, carbondata will query on the segments and retrieve a row, then it will sort the rows and produce the final carbondata file. Currently we find the poor performance in retrieving the rows, so adding prefetch for the rows will surely improve the compaction performance. In my local tests, compacting 4 segments each with 100 thousand rows costs 30s with prefetch and 50s without prefetch. In my tests in a larger cluster, compacting 6 segments each with 18GB raw data costs 45min with prefetch and 57min without prefetch. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2056 Build Failed with Spark 2.2.1, Please check CI http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3508/ ---
[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...
Github user CarbonDataQA commented on the issue: https://github.com/apache/carbondata/pull/2056 Build Failed with Spark 2.1.0, Please check CI http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4735/ ---
[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...
Github user rahulforallp commented on the issue: https://github.com/apache/carbondata/pull/2128 retest sdv please ---
[GitHub] carbondata issue #2127: [CARBONDATA-2301][SDK] CarbonStore interface and two...
Github user ravipesala commented on the issue: https://github.com/apache/carbondata/pull/2127 SDV Build Success , Please check CI http://144.76.159.231:8080/job/ApacheSDVTests/4240/ ---
[jira] [Updated] (CARBONDATA-2303) If dataload is failed for parition table then cleanup is not working.
[ https://issues.apache.org/jira/browse/CARBONDATA-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rahul Kumar updated CARBONDATA-2303: Description: Test Step : 1. create table 2. load data (make sure data load is failed either manually or other) 3. clean files for table *Expected Output*: after clean files data from HDFS should be delete for segments which is Marked for delete. *Actual Output:* Alter cleanup ,data are not deleted from HDFS was: Test Step : 1. create table 2. load data (make sure data load is failed either manually or other) 3. clean files for table *Expected Output*: after clean files data from HDFS should be delete for segments which is Marked for delete. *Actual Output:*Alter cleanup ,data are not deleted from HDFS > If dataload is failed for parition table then cleanup is not working. > - > > Key: CARBONDATA-2303 > URL: https://issues.apache.org/jira/browse/CARBONDATA-2303 > Project: CarbonData > Issue Type: Bug >Reporter: Rahul Kumar >Priority: Major > Time Spent: 0.5h > Remaining Estimate: 0h > > Test Step : > 1. create table > 2. load data (make sure data load is failed either manually or other) > 3. clean files for table > *Expected Output*: after clean files data from HDFS should be delete for > segments which is Marked for delete. > *Actual Output:* Alter cleanup ,data are not deleted from HDFS > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...
Github user asfgit closed the pull request at: https://github.com/apache/carbondata/pull/1713 ---
[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support
Github user kunal642 commented on the issue: https://github.com/apache/carbondata/pull/2109 retest this please ---
[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...
Github user jackylk commented on the issue: https://github.com/apache/carbondata/pull/1713 LGTM ---
[jira] [Created] (CARBONDATA-2305) Memory leak when using DataFrame.coalesce
Jin Zhou created CARBONDATA-2305: Summary: Memory leak when using DataFrame.coalesce Key: CARBONDATA-2305 URL: https://issues.apache.org/jira/browse/CARBONDATA-2305 Project: CarbonData Issue Type: Bug Components: spark-integration Reporter: Jin Zhou When using DataFrame.coalesce, taskContext object holds reader’s reference until the task finished and coalesce combines a lot of CarbonSparkPartition into one task. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (CARBONDATA-2306) Memory leak when using DataFrame.coalesce
Jin Zhou created CARBONDATA-2306: Summary: Memory leak when using DataFrame.coalesce Key: CARBONDATA-2306 URL: https://issues.apache.org/jira/browse/CARBONDATA-2306 Project: CarbonData Issue Type: Bug Components: spark-integration Reporter: Jin Zhou When using DataFrame.coalesce, taskContext object holds reader’s reference until the task finished and coalesce combines a lot of CarbonSparkPartition into one task. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (CARBONDATA-2307) Memory leak when using DataFrame.coalesce
Jin Zhou created CARBONDATA-2307: Summary: Memory leak when using DataFrame.coalesce Key: CARBONDATA-2307 URL: https://issues.apache.org/jira/browse/CARBONDATA-2307 Project: CarbonData Issue Type: Bug Reporter: Jin Zhou TaskContext object holds reader’s reference until the task finished and coalesce combines a lot of CarbonSparkPartition into one task. -- This message was sent by Atlassian JIRA (v7.6.3#76005)