[jira] [Created] (CARBONDATA-3861) Support show stage information

2020-06-21 Thread Zhi Liu (Jira)
Zhi Liu created CARBONDATA-3861:
---

 Summary: Support show stage information
 Key: CARBONDATA-3861
 URL: https://issues.apache.org/jira/browse/CARBONDATA-3861
 Project: CarbonData
  Issue Type: New Feature
Reporter: Zhi Liu


Sometimes, user need to know information about table stages, so that they can 
make stage load plan better. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [WIP] Lock and retry to read tablestatus before throwing …

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647097497


   Build Success with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3181/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [WIP] Lock and retry to read tablestatus before throwing …

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647097624


   Build Success with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1455/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3798: [WIP] Support 'show segment include stage'

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3798:
URL: https://github.com/apache/carbondata/pull/3798#issuecomment-647168625


   Build Success with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1458/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3797: [WIP] Support show segment information

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3797:
URL: https://github.com/apache/carbondata/pull/3797#issuecomment-647146761


   Build Failed  with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1457/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (CARBONDATA-3862) Insert stage performance optimazation

2020-06-21 Thread Xingjun Hao (Jira)
Xingjun Hao created CARBONDATA-3862:
---

 Summary: Insert stage performance optimazation
 Key: CARBONDATA-3862
 URL: https://issues.apache.org/jira/browse/CARBONDATA-3862
 Project: CarbonData
  Issue Type: New Feature
Reporter: Xingjun Hao


There are two major performance bottlenecks of insert stage.

1) Get LastModifyTime of stagefiles requires a lot of access to OBS
2) Parallelism is not supported

Which shall be optimazed.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [carbondata] marchpure opened a new pull request #3800: [WIP] Lock to read tablestatus

2020-06-21 Thread GitBox


marchpure opened a new pull request #3800:
URL: https://github.com/apache/carbondata/pull/3800


   Why is this PR needed?
   when storing table status file in object store, reading of table status file 
mayfail (receive EOFException or JsonSyntaxException)
   when table status file is being modifying
   we shall retry add the lock to read tablestatus before throwing EOFException 
or JsonSyntaxException
   
   What changes were proposed in this PR?
   Add lock to read tablestatus
   
   Does this PR introduce any user interface change?
   NO
   
   Is any new testcase added?
   NO
   
### Why is this PR needed?


### What changes were proposed in this PR?
   
   
### Does this PR introduce any user interface change?
- No
- Yes. (please explain the change and update document)
   
### Is any new testcase added?
- No
- Yes
   
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3798: [WIP] Support 'show segment include stage'

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3798:
URL: https://github.com/apache/carbondata/pull/3798#issuecomment-647168876


   Build Success with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3184/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3797: [WIP] Support show segment information

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3797:
URL: https://github.com/apache/carbondata/pull/3797#issuecomment-647146911


   Build Failed  with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3183/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [CARBONDATA-3859] Retry to read tablestatus before throwing EOFException or JsonSyntaxException or FileNotFoundException

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647156457


   Build Failed  with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3182/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] marchpure opened a new pull request #3799: [CARBONDATA-3862] Insert stage performance optimazation

2020-06-21 Thread GitBox


marchpure opened a new pull request #3799:
URL: https://github.com/apache/carbondata/pull/3799


### Why is this PR needed?
   There are two major performance bottlenecks of 'insert stage'.
   1) Get LastModifyTime of stagefiles requires a lot of access to OBS.
   2) Parallelism is not supported
   
### What changes were proposed in this PR?
   1) Cache the lastmodifytime info when list stage files.
   2) support insert stage in parallel. we add a tag 'loading' to the stages in 
process. different insertstage processes can load different data separately by 
choose the stages without 'loading' tag or stages loaded timeout. which avoid 
loading the same data between concurrent insertstage processes. The 'loading' 
tag is actually an empty file with '.loading' suffix filename.
   
### Does this PR introduce any user interface change?
   NO
   
### Is any new testcase added?
   YES
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] niuge01 opened a new pull request #3797: [WIP] Support show segment information

2020-06-21 Thread GitBox


niuge01 opened a new pull request #3797:
URL: https://github.com/apache/carbondata/pull/3797


### Why is this PR needed?
Sometimes, user need to know information about table stages, so that they 
can make stage load plan better. 

### What changes were proposed in this PR?
   Add "INCLUDE STAGE" keywords in "SHOW SEGMENTS" command, show stage 
information as segment information
   
### Does this PR introduce any user interface change?
- No
- Yes. (please explain the change and update document)
   
### Is any new testcase added?
- No
- Yes
   
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [CARBONDATA-3859] Retry to read tablestatus before throwing EOFException or JsonSyntaxException

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647171957


   Build Success with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3185/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [CARBONDATA-3859] Retry to read tablestatus before throwing EOFException or JsonSyntaxException

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647172066


   Build Success with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1459/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3799: [CARBONDATA-3862] Insert stage performance optimazation

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3799:
URL: https://github.com/apache/carbondata/pull/3799#issuecomment-647196597


   Build Success with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3186/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[jira] [Created] (CARBONDATA-3863) index service go back to emmbedded mode

2020-06-21 Thread Taoli (Jira)
Taoli created CARBONDATA-3863:
-

 Summary: index service go back to emmbedded mode
 Key: CARBONDATA-3863
 URL: https://issues.apache.org/jira/browse/CARBONDATA-3863
 Project: CarbonData
  Issue Type: Bug
Affects Versions: 2.0.0
Reporter: Taoli


when use index service,some way may cause the floder "/tmp/indexservertmp" get 
max-directory-item exception. in that case the index service go back to 
emmbedded mode.

the error is like above:

 

Exception occured: The directory item limit of /tmp/indexservertmp is exceeded: 
limit=1048576

items=1048576.

 

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3799: [CARBONDATA-3862] Insert stage performance optimazation

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3799:
URL: https://github.com/apache/carbondata/pull/3799#issuecomment-647197331


   Build Success with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1460/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3800: [WIP] Lock to read tablestatus

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3800:
URL: https://github.com/apache/carbondata/pull/3800#issuecomment-647203937


   Build Failed  with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1461/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3800: [WIP] Lock to read tablestatus

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3800:
URL: https://github.com/apache/carbondata/pull/3800#issuecomment-647204213


   Build Failed  with Spark 2.3.4, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbonPRBuilder2.3/3187/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] marchpure opened a new pull request #3798: [WIP] Support 'show segment include stage'

2020-06-21 Thread GitBox


marchpure opened a new pull request #3798:
URL: https://github.com/apache/carbondata/pull/3798


### Why is this PR needed?


### What changes were proposed in this PR?
   
   
### Does this PR introduce any user interface change?
- No
- Yes. (please explain the change and update document)
   
### Is any new testcase added?
- No
- Yes
   
   
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org




[GitHub] [carbondata] CarbonDataQA1 commented on pull request #3796: [CARBONDATA-3859] Retry to read tablestatus before throwing EOFException or JsonSyntaxException

2020-06-21 Thread GitBox


CarbonDataQA1 commented on pull request #3796:
URL: https://github.com/apache/carbondata/pull/3796#issuecomment-647156922


   Build Failed  with Spark 2.4.5, Please check CI 
http://121.244.95.60:12545/job/ApacheCarbon_PR_Builder_2.4.5/1456/
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org