[GitHub] carbondata issue #2129: [CARBONDATA-2298][BACKPORT-1.3]Delete segment lock f...

2018-04-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/2129
  
SDV Build Fail, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/4238/



---


[jira] [Created] (CARBONDATA-2303) If dataload is failed for parition table then cleanup is not working.

2018-04-01 Thread Rahul Kumar (JIRA)
Rahul Kumar created CARBONDATA-2303:
---

             Summary: If dataload is failed for parition table then cleanup is not working.
                 Key: CARBONDATA-2303
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2303
             Project: CarbonData
          Issue Type: Bug
            Reporter: Rahul Kumar


Test Steps:

1. create table

2. load data (make sure the data load fails, either manually or by other means)

3. clean files for table

*Expected Output*: after clean files, data should be deleted from HDFS for
segments which are marked for delete.
*Actual Output:* After cleanup, data is not deleted from HDFS
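
The expected behaviour in the report can be written down as a small check. The sketch below is only a toy model on the local filesystem, not CarbonData code: the `Segment_<id>` directory naming, the status label, and both helper functions are assumptions made for illustration, and a real table would live on HDFS.

```python
import shutil
from pathlib import Path

# Assumed status label for a failed load; hypothetical for this sketch.
MARKED_FOR_DELETE = "Marked for Delete"


def segment_dir(table_dir, segment_id):
    """Build the (assumed) directory path of one segment."""
    return Path(table_dir) / f"Segment_{segment_id}"


def clean_files(table_dir, statuses):
    """Delete the directory of every segment marked for delete.

    `statuses` maps segment id -> status string. Returns the ids removed.
    """
    removed = []
    for segment_id, status in statuses.items():
        path = segment_dir(table_dir, segment_id)
        if status == MARKED_FOR_DELETE and path.exists():
            shutil.rmtree(path)
            removed.append(segment_id)
    return removed


def leftover_segments(table_dir, statuses):
    """Return ids that are marked for delete but still present on disk."""
    return [
        segment_id
        for segment_id, status in statuses.items()
        if status == MARKED_FOR_DELETE
        and segment_dir(table_dir, segment_id).exists()
    ]
```

In these terms, the expected output is that `leftover_segments` returns an empty list after `clean_files` has run, while the actual output described above corresponds to a non-empty list.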

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2128
  
Build Success with Spark 2.2.1, Please check CI 
http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3506/



---


[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2128
  
Build Success with Spark 2.1.0, Please check CI 
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4733/



---


[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...

2018-04-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/2056
  
SDV Build Fail, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/4239/



---


[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...

2018-04-01 Thread xuchuanyin
Github user xuchuanyin commented on the issue:

https://github.com/apache/carbondata/pull/2056
  
retest this please


---


[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2109
  
Build Failed with Spark 2.1.0, Please check CI
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4736/



---


[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2109
  
Build Failed with Spark 2.2.1, Please check CI 
http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3509/



---


[GitHub] carbondata issue #2046: [CARBONDATA-2239] Added sdv test cases for querying ...

2018-04-01 Thread geetikagupta16
Github user geetikagupta16 commented on the issue:

https://github.com/apache/carbondata/pull/2046
  
retest this please


---


[GitHub] carbondata issue #2105: [CARBONDATA-2286][SDV] Added sdv test cases for stre...

2018-04-01 Thread geetikagupta16
Github user geetikagupta16 commented on the issue:

https://github.com/apache/carbondata/pull/2105
  
retest this please


---


[jira] [Created] (CARBONDATA-2304) Enhance compaction performance by enabling prefetch

2018-04-01 Thread xuchuanyin (JIRA)
xuchuanyin created CARBONDATA-2304:
--

             Summary: Enhance compaction performance by enabling prefetch
                 Key: CARBONDATA-2304
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2304
             Project: CarbonData
          Issue Type: Improvement
          Components: data-load
            Reporter: xuchuanyin
            Assignee: xuchuanyin


During compaction, CarbonData queries the segments to retrieve rows,
then sorts the rows and produces the final carbondata file.

Currently, retrieving the rows performs poorly, so prefetching the rows
should improve compaction performance.
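
As a rough illustration of the idea (not the CarbonData implementation), prefetching can be sketched as a background thread that reads rows ahead into a bounded buffer while the consumer works on rows already fetched. The names `prefetch`, `compact`, and `buffer_size` are assumptions made for this sketch, and segments are modelled as plain iterables of rows.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the row stream


def prefetch(rows, buffer_size=1024):
    """Yield items from `rows`, reading ahead on a background thread.

    Simplifications of this sketch: errors raised while reading are not
    propagated to the consumer, and the producer stays blocked if the
    consumer stops early (it is a daemon thread, so it does not keep the
    process alive).
    """
    buf = queue.Queue(maxsize=buffer_size)  # bounded, so memory stays capped

    def producer():
        try:
            for row in rows:
                buf.put(row)  # blocks while the buffer is full
        finally:
            buf.put(_DONE)

    threading.Thread(target=producer, daemon=True).start()
    while True:
        row = buf.get()
        if row is _DONE:
            return
        yield row


def compact(segments, buffer_size=1024):
    """Toy compaction: read every row of every segment, then sort them."""
    def read_all():
        for segment in segments:
            yield from segment

    return sorted(prefetch(read_all(), buffer_size))
```

The bounded queue is the main design choice here: it lets reading overlap with downstream work without letting the reader run arbitrarily far ahead of the consumer.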

In my local tests, compacting 4 segments each with 100 thousand rows costs 30s 
with prefetch and 50s without prefetch.

In my tests in a larger cluster, compacting 6 segments each with 18GB raw data 
costs 45min with prefetch and 57min without prefetch.
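
For reference, the timings quoted above work out to a run time reduction of about 40% in the local test and about 21% on the larger cluster. The helper name `time_saved` is just for this illustration.

```python
def time_saved(with_prefetch, without_prefetch):
    """Return the fraction of run time saved by enabling prefetch."""
    return (without_prefetch - with_prefetch) / without_prefetch


local = time_saved(30, 50)    # seconds, 4 segments of 100 thousand rows
cluster = time_saved(45, 57)  # minutes, 6 segments of 18GB raw data

print(f"local: {local:.0%}, cluster: {cluster:.0%}")
# prints "local: 40%, cluster: 21%"
```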





[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2056
  
Build Failed with Spark 2.2.1, Please check CI 
http://88.99.58.216:8080/job/ApacheCarbonPRBuilder/3508/



---


[GitHub] carbondata issue #2056: [CARBONDATA-2238][DataLoad] Merge and spill in-memor...

2018-04-01 Thread CarbonDataQA
Github user CarbonDataQA commented on the issue:

https://github.com/apache/carbondata/pull/2056
  
Build Failed with Spark 2.1.0, Please check CI
http://136.243.101.176:8080/job/ApacheCarbonPRBuilder1/4735/



---


[GitHub] carbondata issue #2128: [CARBONDATA-2303] [WIP] If dataload is failed for pa...

2018-04-01 Thread rahulforallp
Github user rahulforallp commented on the issue:

https://github.com/apache/carbondata/pull/2128
  
retest sdv please


---


[GitHub] carbondata issue #2127: [CARBONDATA-2301][SDK] CarbonStore interface and two...

2018-04-01 Thread ravipesala
Github user ravipesala commented on the issue:

https://github.com/apache/carbondata/pull/2127
  
SDV Build Success, Please check CI
http://144.76.159.231:8080/job/ApacheSDVTests/4240/



---


[jira] [Updated] (CARBONDATA-2303) If dataload is failed for parition table then cleanup is not working.

2018-04-01 Thread Rahul Kumar (JIRA)

 [ https://issues.apache.org/jira/browse/CARBONDATA-2303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rahul Kumar updated CARBONDATA-2303:

Description: 
Test Steps:

1. create table

2. load data (make sure the data load fails, either manually or by other means)

3. clean files for table

*Expected Output*: after clean files, data should be deleted from HDFS for
segments which are marked for delete.
 *Actual Output:* After cleanup, data is not deleted from HDFS

 

  was:
Test Steps:

1. create table

2. load data (make sure the data load fails, either manually or by other means)

3. clean files for table

*Expected Output*: after clean files, data should be deleted from HDFS for
segments which are marked for delete.
*Actual Output:*After cleanup, data is not deleted from HDFS

 


> If dataload is failed for parition table then cleanup is not working.
> -
>
>                 Key: CARBONDATA-2303
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2303
>             Project: CarbonData
>          Issue Type: Bug
>            Reporter: Rahul Kumar
>            Priority: Major
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Test Steps:
> 1. create table
> 2. load data (make sure the data load fails, either manually or by other means)
> 3. clean files for table
> *Expected Output*: after clean files, data should be deleted from HDFS for
> segments which are marked for delete.
>  *Actual Output:* After cleanup, data is not deleted from HDFS
>  





[GitHub] carbondata pull request #1713: [CARBONDATA-1899] Optimize CarbonData concurr...

2018-04-01 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/carbondata/pull/1713


---


[GitHub] carbondata issue #2109: [CARBONDATA-2294] Partition preaggregate support

2018-04-01 Thread kunal642
Github user kunal642 commented on the issue:

https://github.com/apache/carbondata/pull/2109
  
retest this please


---


[GitHub] carbondata issue #1713: [CARBONDATA-1899] Optimize CarbonData concurrency te...

2018-04-01 Thread jackylk
Github user jackylk commented on the issue:

https://github.com/apache/carbondata/pull/1713
  
LGTM


---


[jira] [Created] (CARBONDATA-2305) Memory leak when using DataFrame.coalesce

2018-04-01 Thread Jin Zhou (JIRA)
Jin Zhou created CARBONDATA-2305:


             Summary: Memory leak when using DataFrame.coalesce
                 Key: CARBONDATA-2305
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2305
             Project: CarbonData
          Issue Type: Bug
          Components: spark-integration
            Reporter: Jin Zhou


When using DataFrame.coalesce, the TaskContext object holds a reference to
each reader until the task finishes, and coalesce combines many
CarbonSparkPartition instances into one task, so the readers accumulate in
memory.
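
The effect can be illustrated with a toy model. This is a sketch under stated assumptions, not Spark or CarbonData code: `Reader`, `TaskContext`, and `run_task` are stand-ins invented for the example, and the eager close shown is one possible mitigation rather than the fix adopted by the project.

```python
class Reader:
    """Stand-in for a per-partition reader that owns a row buffer."""

    open_count = 0  # number of readers currently holding a buffer

    def __init__(self, partition):
        self.rows = list(partition)
        Reader.open_count += 1

    def close(self):
        # Idempotent, so closing eagerly and again at task end is safe.
        if self.rows is not None:
            self.rows = None
            Reader.open_count -= 1


class TaskContext:
    """Hypothetical task context that runs callbacks when the task ends."""

    def __init__(self):
        self._listeners = []

    def add_completion_listener(self, callback):
        self._listeners.append(callback)

    def complete(self):
        for callback in self._listeners:
            callback()
        self._listeners.clear()


def run_task(partitions, close_eagerly):
    """Read all partitions in one task; return (rows, peak open readers)."""
    Reader.open_count = 0
    context = TaskContext()
    peak = 0
    rows = []
    for partition in partitions:
        reader = Reader(partition)
        # The context keeps the reader reachable until the task completes.
        context.add_completion_listener(reader.close)
        peak = max(peak, Reader.open_count)
        rows.extend(reader.rows)
        if close_eagerly:
            reader.close()  # release the buffer once the partition is read
    context.complete()
    return rows, peak
```

With four partitions coalesced into one task, `run_task(parts, close_eagerly=False)` reports a peak of four open readers, while `close_eagerly=True` keeps the peak at one.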





[jira] [Created] (CARBONDATA-2306) Memory leak when using DataFrame.coalesce

2018-04-01 Thread Jin Zhou (JIRA)
Jin Zhou created CARBONDATA-2306:


             Summary: Memory leak when using DataFrame.coalesce
                 Key: CARBONDATA-2306
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2306
             Project: CarbonData
          Issue Type: Bug
          Components: spark-integration
            Reporter: Jin Zhou


When using DataFrame.coalesce, the TaskContext object holds a reference to
each reader until the task finishes, and coalesce combines many
CarbonSparkPartition instances into one task, so the readers accumulate in
memory.





[jira] [Created] (CARBONDATA-2307) Memory leak when using DataFrame.coalesce

2018-04-01 Thread Jin Zhou (JIRA)
Jin Zhou created CARBONDATA-2307:


             Summary: Memory leak when using DataFrame.coalesce
                 Key: CARBONDATA-2307
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2307
             Project: CarbonData
          Issue Type: Bug
            Reporter: Jin Zhou


The TaskContext object holds a reference to each reader until the task
finishes, and coalesce combines many CarbonSparkPartition instances into one
task, so the readers accumulate in memory.




