[GitHub] incubator-carbondata pull request #226: [CARBONDATA-294]Fix timestamp data e...

2016-10-11 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/incubator-carbondata/pull/226

[jira] [Created] (CARBONDATA-306) block size info should be show in Desc Formatted and executor log

2016-10-11 Thread Jay (JIRA)
Jay created CARBONDATA-306: -- Summary: block size info should be show in Desc Formatted and executor log Key: CARBONDATA-306 URL: https://issues.apache.org/jira/browse/CARBONDATA-306 Project: CarbonData

[GitHub] incubator-carbondata pull request #230: []Add block size info in descFormatt...

2016-10-11 Thread Jay357089
GitHub user Jay357089 opened a pull request: https://github.com/apache/incubator-carbondata/pull/230 []Add block size info in descFormatted and executor log

RE: Discussion(New feature) regarding single pass data loading solution.

2016-10-11 Thread Jihong Ma
A rather straightforward option is to allow the user to supply a global dictionary generated somewhere else, or to build a separate tool just for generating as well as updating the dictionary. The normal data loading process would then encode columns with a local dictionary if a global one is not supplied. This should cover
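
To make the option in this message concrete, here is a minimal sketch of the fallback it describes: use a supplied global dictionary when there is one, otherwise build a local dictionary while encoding. The function name `encode_column` and the 1-based surrogate keys are assumptions for illustration only; this is not CarbonData's loader code.

```python
def encode_column(values, global_dictionary=None):
    """Encode a column with the supplied global dictionary, or with a
    local dictionary built on the fly when none is supplied.

    Hypothetical helper for illustration; not a CarbonData API.
    Returns (encoded_values, dictionary_used).
    """
    if global_dictionary is not None:
        # Dictionary was generated elsewhere: encoding is a plain lookup.
        return [global_dictionary[v] for v in values], global_dictionary

    # No global dictionary: assign keys in order of first appearance.
    local_dictionary = {}
    encoded = []
    for v in values:
        key = local_dictionary.setdefault(v, len(local_dictionary) + 1)
        encoded.append(key)
    return encoded, local_dictionary


# Without a supplied dictionary, keys come from a local dictionary.
local_encoded, local_dict = encode_column(["a", "b", "a"])

# With a supplied dictionary, the same call only performs lookups.
global_encoded, _ = encode_column(["b", "a"], {"a": 10, "b": 20})
```

In this sketch a value missing from a supplied dictionary raises `KeyError`; how a real loader should handle that case is left open here.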

[GitHub] incubator-carbondata pull request #200: [CARBONDATA-276]add trim option

2016-10-11 Thread eason-lyx
Github user eason-lyx commented on a diff in the pull request: https://github.com/apache/incubator-carbondata/pull/200#discussion_r82923303 --- Diff: processing/src/main/java/org/apache/carbondata/processing/csvreaderstep/UnivocityCsvParser.java --- @@ -102,8 +102,8 @@ public

[GitHub] incubator-carbondata pull request #229: [CARBONDATA-297]Added interface for ...

2016-10-11 Thread ravipesala
Github user ravipesala commented on a diff in the pull request: https://github.com/apache/incubator-carbondata/pull/229#discussion_r82830846 --- Diff: processing/src/main/java/org/apache/carbondata/processing/newflow/DataLoadProcessorStep.java --- @@ -0,0 +1,40 @@ +package

[GitHub] incubator-carbondata pull request #228: Add license header for FalseExpressi...

2016-10-11 Thread amy-309
GitHub user amy-309 opened a pull request: https://github.com/apache/incubator-carbondata/pull/228 Add license header for FalseExpression.java Add license header for FalseExpression.java You can merge this pull request into a Git repository by running: $ git pull

[GitHub] incubator-carbondata pull request #200: [CARBONDATA-276]add trim option

2016-10-11 Thread sujith71955
Github user sujith71955 commented on a diff in the pull request: https://github.com/apache/incubator-carbondata/pull/200#discussion_r82758567 --- Diff: processing/src/main/java/org/apache/carbondata/processing/csvreaderstep/UnivocityCsvParser.java --- @@ -102,8 +102,8 @@ public

Discussion(New feature) regarding single pass data loading solution.

2016-10-11 Thread Ravindra Pesala
Hi All, This discussion is regarding the single-pass data load solution. Currently, data is loaded into Carbon in two passes (jobs): 1. Generate the global dictionary using a Spark job. 2. Encode the data with dictionary values and create the carbondata files. This two-pass solution has many disadvantages like it
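
The two passes listed in this message can be sketched in a few lines of plain Python. This is only an illustration of the idea, assuming in-memory rows and 1-based surrogate keys; the function names `build_global_dictionary` and `encode_rows` are hypothetical and do not correspond to the Spark jobs CarbonData actually runs.

```python
def build_global_dictionary(rows, column):
    """Pass 1: scan every row and assign an integer key to each distinct value."""
    dictionary = {}
    for row in rows:
        value = row[column]
        if value not in dictionary:
            dictionary[value] = len(dictionary) + 1
    return dictionary


def encode_rows(rows, column, dictionary):
    """Pass 2: read the rows again and replace each value with its key."""
    return [{**row, column: dictionary[row[column]]} for row in rows]


rows = [
    {"city": "Pune", "sales": 10},
    {"city": "Delhi", "sales": 7},
    {"city": "Pune", "sales": 3},
]

city_dictionary = build_global_dictionary(rows, "city")   # first read of the input
encoded_rows = encode_rows(rows, "city", city_dictionary)  # second read of the input
```

The sketch makes one cost of this design visible: the input is read twice, once to build the dictionary and once to encode.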