Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/230#discussion_r83361557
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala
---
@@ -1422,6 +1422,7 @@
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/229#discussion_r83354231
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/newflow/AbstractDataLoadProcessorStep.java
---
@@ -73,15 +72,15 @@ public
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/229#discussion_r83354325
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/newflow/AbstractDataLoadProcessorStep.java
---
@@ -55,14 +54,14 @@ public
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/236#discussion_r83360819
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/newflow/dictionary/InMemBiDictionary.java
---
@@ -0,0 +1,85 @@
+/*
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/236#discussion_r83360359
--- Diff:
core/src/main/java/org/apache/carbondata/core/devapi/BiDictionary.java ---
@@ -0,0 +1,53 @@
+/*
+ * Licensed to the Apache
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83360131
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/mapreduce/CSVInputFormat.java
---
@@ -0,0 +1,180 @@
+/*
+ * Licensed to
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83359938
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/util/CSVInputFormatUtil.java
---
@@ -0,0 +1,57 @@
+/*
+ * Licensed to the
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83359593
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/mapreduce/CSVInputFormat.java
---
@@ -0,0 +1,180 @@
+/*
+ * Licensed to
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83360081
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/mapreduce/CSVInputFormat.java
---
@@ -0,0 +1,180 @@
+/*
+ * Licensed to
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83355842
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/io/StringArrayWritable.java
---
@@ -0,0 +1,69 @@
+/*
+ * Licensed to the
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83359637
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/mapreduce/CSVInputFormat.java
---
@@ -0,0 +1,180 @@
+/*
+ * Licensed to
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/233#discussion_r83359457
--- Diff:
hadoop/src/main/java/org/apache/carbondata/hadoop/mapreduce/CSVInputFormat.java
---
@@ -0,0 +1,180 @@
+/*
+ * Licensed to
Github user ravipesala commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/236#discussion_r83359748
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/newflow/dictionary/InMemBiDictionary.java
---
@@ -0,0 +1,85 @@
+/*
Github user ravipesala commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/236#discussion_r83359504
--- Diff:
core/src/main/java/org/apache/carbondata/core/devapi/BiDictionary.java ---
@@ -0,0 +1,53 @@
+/*
+ * Licensed to the Apache
Github user Jay357089 commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/230#discussion_r83354112
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/store/writer/AbstractFactDataWriter.java
---
@@ -252,6 +252,15 @@
GitHub user lion-x opened a pull request:
https://github.com/apache/incubator-carbondata/pull/238
[WIP] Correct Some Spelling Mistakes
# Why raise this PR?
Correct some simple spelling mistakes.
You can merge this pull request into a Git repository by running:
$ git pull
Github user Jay357089 commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/230#discussion_r83352166
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala
---
@@ -1422,6 +1422,7 @@
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/212#discussion_r83351395
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala
---
@@ -861,9 +861,11 @@
Github user ravipesala commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/212#discussion_r83350489
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/CarbonDatasourceRelation.scala
---
@@ -55,18 +55,11 @@ class CarbonSource
Github user ravipesala commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/212#discussion_r83350430
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala
---
@@ -861,9 +861,11 @@
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/230#discussion_r83349721
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/execution/command/carbonTableSchema.scala
---
@@ -1422,6 +1422,7 @@
Yes, need to solve it , the CI should support different spark version.
Regards
Liang
zhujin wrote
> One issue:
> I modified the spark.version in pom.xml,using spark1.6.2, then compliation
> failed.
>
>
> Root cause:
> There was a "unused import statement" warinng in CarbonOptimizer class
>
After rethinking at point 4 in my previous email;
It will be very expensive to rebuild and re-encode the values , so may not
be a viable option. only future loads can benefit from it. But then will
end up having some segments using global dictionary and some using local
dictionary. May be we
Hi jihong
I am not sure that users can accept to use extra tool to do this work,
because provide tool or do scan at first time per table for most of global
dict are same cost from users perspective, and maintain the dict file also
be same cost, they always expecting that system can automatically
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/212#discussion_r83336930
--- Diff:
integration/spark/src/main/scala/org/apache/spark/sql/CarbonDatasourceRelation.scala
---
@@ -55,18 +55,11 @@ class CarbonSource
the question is what would be the default implementation? Load data without
dictionary?
My thought is we can provide a tool to generate global dictionary using sample
data set, so the initial global dictionaries is available before normal data
loading. We shall be able to perform
GitHub user mohammadshahidkhan opened a pull request:
https://github.com/apache/incubator-carbondata/pull/237
[CARBONDATA-317] - CSV having only space char is throwing NullPointerâ¦
Problem: Data loading fails if csv is having only empty chars
Analysis: During data load,
Github user jackylk commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/229#discussion_r83208961
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/newflow/DataLoadProcessorStep.java
---
@@ -0,0 +1,40 @@
+package
Github user Jay357089 commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/230#discussion_r83208043
--- Diff:
processing/src/main/java/org/apache/carbondata/processing/store/writer/AbstractFactDataWriter.java
---
@@ -252,6 +252,9 @@ private
Github user Jay357089 commented on a diff in the pull request:
https://github.com/apache/incubator-carbondata/pull/223#discussion_r83205716
--- Diff: docs/DML-Operations-on-Carbon.md ---
@@ -104,8 +109,10 @@ Following are the options that can be used in load
data:
GitHub user mohammadshahidkhan opened a pull request:
https://github.com/apache/incubator-carbondata/pull/235
[CARBONDATA-316] Change BAD_RECORDS_LOGGER_ACTION to BAD_RECORDS_ACTION
**Poblem**
the name BAD_RECORDS_LOGGER_ACTION is not related to logging the bad
records, its
Mohammad Shahid Khan created CARBONDATA-316:
---
Summary: Change BAD_RECORDS_LOGGER_ACTION to BAD_RECORDS_ACTION
Key: CARBONDATA-316
URL: https://issues.apache.org/jira/browse/CARBONDATA-316
GitHub user manishgupta88 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/234
[CARBONDATA-315] Data loading fails if parsing a double value returns
infinity
Problem: Data loading fails if parsing a double value returns infinity
Analysis: During
Manish Gupta created CARBONDATA-315:
---
Summary: Data loading fails if parsing a double value returns
infinity
Key: CARBONDATA-315
URL: https://issues.apache.org/jira/browse/CARBONDATA-315
Project:
GitHub user QiangCai reopened a pull request:
https://github.com/apache/incubator-carbondata/pull/127
[CARBONDATA-213] Remove dependency: thrift complier
[CARBONDATA-213] Remove dependency: thrift complier
**analysis**
I think it unnecessary for user/developer to
Hi Jihong/Aniket,
In the current implementation of carbondata we are already handling
external dictionary while loading the data.
But here the question is what would be the default implementation? Load
data with out dictionary?
Regards,
Ravi
On 13 October 2016 at 03:50, Aniket Adnaik
Github user QiangCai closed the pull request at:
https://github.com/apache/incubator-carbondata/pull/132
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if
GitHub user QiangCai opened a pull request:
https://github.com/apache/incubator-carbondata/pull/233
[CARBONDATA-296]1.Add CSVInputFormat to read csv files.
**1 Add CSVInputFormat to read csv files**
MRv1:
38 matches
Mail list logo