Re:[DISCUSSION] Order by + Limit Optimization

2017-03-27 Thread 马云
please ignore the email. my mistake, the mail is not finished. I will sent a new mail later At 2017-03-28 13:24:34, "马云" wrote: >Hi Dev, > > >currently I have done an optimization for order by 1 dimension. >performance test as below > > >my optimization

[DISCUSSION] Order by + Limit Optimization

2017-03-27 Thread 马云
Hi Dev, currently I have done an optimization for order by 1 dimension. performance test as below my optimization solution for order by 1 dimension as below mainly leverage the dimension's order stored feature in each blocklet step1. change logical plan and push down the order by and limit

Re: carbondata find a bug

2017-03-27 Thread Liang Chen
Hi tianli First, please send mail to dev-subscr...@carbondata.incubator.apache.org for joining mailing list group. Then you can send and receive mail from dev@carbondata.incubator.apache.org. Can you raise one JIRA at https://issues.apache.org/jira/browse/CARBONDATA, and raise one pull request

pyspark carbondata

2017-03-27 Thread ????????
Use python to query carbondata through spark/sparksql?

Re: carbondata find a bug

2017-03-27 Thread QiangCai
+1 Best Regards David QiangCai -- View this message in context: http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/carbondata-find-a-bug-tp9747p9749.html Sent from the Apache CarbonData Mailing List archive mailing list archive at Nabble.com.

Re: question about dimension's sort order in blocklet level

2017-03-27 Thread Liang Chen
Hi Can you provide one table to show your info, can't see very clear? The column of high cardinality(>100) would not do dictionary. Regards Liang 2017-03-27 14:32 GMT+05:30 马云 : > Hi DEV, > > I create table according to the below SQL > > cc.sql(""" > >

carbondata find a bug

2017-03-27 Thread Tian Li 田力
hi: org.apache.carbondata.spark.rdd.CarbonGlobalDictionaryGenerateRDD find a bug: code line 378-380 if (model.isFirstLoad && model.highCardIdentifyEnable && !model.isComplexes(split.index) && model.dimensions(split.index).isColumnar) { model.dimensions(split.index).isColumnar must

Re: data not input hive

2017-03-27 Thread Sea
Now Spark persist data source table into Hive metastore in Spark SQL specific format. This is not a bug. -- Original -- From: "";<1141982...@qq.com>; Date: Mon, Mar 27, 2017 04:47 PM To: "dev"; Subject: data

Re: [apache/incubator-carbondata] [CARBONDATA-727][WIP] addhiveintegration for carbon (#672)

2017-03-27 Thread Sea
Hi, Anubhav: Do you use mysql to store the hive metadata?spark sql and hive must use the same metastore. PS: Before you query data using hive, you should alter table schema. This is the latest guide.

[jira] [Created] (CARBONDATA-827) Query statistics log format is incorrect

2017-03-27 Thread Jacky Li (JIRA)
Jacky Li created CARBONDATA-827: --- Summary: Query statistics log format is incorrect Key: CARBONDATA-827 URL: https://issues.apache.org/jira/browse/CARBONDATA-827 Project: CarbonData Issue

[DISCUSSION]: (New Feature) Streaming Ingestion into CarbonData

2017-03-27 Thread Aniket Adnaik
Hi All, I would like to open up a discussion for new feature to support streaming ingestion in CarbonData. Please refer to design document(draft) in the link below. https://drive.google.com/file/d/0B71_EuXTdDi8MlFDU2tqZU9BZ3M /view?usp=sharing Your comments/suggestions are welcome. Here

Re: data not input hive

2017-03-27 Thread Jacky Li
Hi, Carbon does not support load data using Hive yet. You can use Spark to load. Regards, Jacky > 在 2017年3月27日,下午2:17,风云际会 <1141982...@qq.com> 写道: > > spark 2.1.0 > hive 1.2.1 > Couldn't find corresponding Hive SerDe for data source provider > org.apache.spark.sql.CarbonSource. Persisting

data not input hive

2017-03-27 Thread ????????
spark 2.1.0 hive 1.2.1 Couldn't find corresponding Hive SerDe for data source provider org.apache.spark.sql.CarbonSource. Persisting data source table `default`.`carbon_table30` into Hive metastore in Spark SQL specific format, which is NOT compatible with Hive.

[jira] [Created] (CARBONDATA-825) upload or delete problem

2017-03-27 Thread sehriff (JIRA)
sehriff created CARBONDATA-825: -- Summary: upload or delete problem Key: CARBONDATA-825 URL: https://issues.apache.org/jira/browse/CARBONDATA-825 Project: CarbonData Issue Type: Bug

[jira] [Created] (CARBONDATA-826) Create carbondata-connector of presto for supporting presto query carbon data

2017-03-27 Thread Liang Chen (JIRA)
Liang Chen created CARBONDATA-826: - Summary: Create carbondata-connector of presto for supporting presto query carbon data Key: CARBONDATA-826 URL: https://issues.apache.org/jira/browse/CARBONDATA-826

Re: Re:Re:Re:Re:Re:Re: insert into carbon table failed

2017-03-27 Thread Liang Chen
Hi Please enable vector , it might help limit query. import org.apache.carbondata.core.util.CarbonProperties import org.apache.carbondata.core.constants.CarbonCommonConstants CarbonProperties.getInstance().addProperty(CarbonCommonConstants.ENABLE_VECTOR_READER, "true") Regards Liang a wrote

Re: [DISCUSSION] Initiating Apache CarbonData-1.1.0 incubating Release

2017-03-27 Thread manish gupta
+1 Regards Manish Gupta On Mon, Mar 27, 2017 at 2:41 PM, Kumar Vishal wrote: > +1 > -Regards > Kumar Vishal > > On Mar 27, 2017 09:31, "xm_zzc" <441586...@qq.com> wrote: > > > Hi, Liang: > > Thanks for your reply. > > > > > > > > -- > > View this message in

question about dimension's sort order in blocklet level

2017-03-27 Thread 马云
Hi DEV, I create table according to the below SQL cc.sql(""" CREATE TABLE IF NOT EXISTS t3 (ID Int, date Timestamp, country String, name String, phonetype String, serialname String, salary Int, name1 String, name2 String, name3 String, name4 String,

Re: [DISCUSSION] Initiating Apache CarbonData-1.1.0 incubating Release

2017-03-27 Thread Kumar Vishal
+1 -Regards Kumar Vishal On Mar 27, 2017 09:31, "xm_zzc" <441586...@qq.com> wrote: > Hi, Liang: > Thanks for your reply. > > > > -- > View this message in context: http://apache-carbondata- > mailing-list-archive.1130556.n5.nabble.com/Re-DISCUSSION- >

Re: insert into carbon table failed

2017-03-27 Thread william
I guess then word node in "Carbodata launches one job per each node to sort the data at node level and avoid shuffling" may make some confuse. I guess carbondata should launches one task per each executor . here job should be task ,node should be executor. Maybe he can try increase the number of

Carbondata resolve kettle dependencies fail

2017-03-27 Thread william
As the attachment shows: Carbondata resolve kettle dependencies fail. Can anyone know how to fix this? Also,if you use exclusion in maven to exclude the kettle, your project will fail even in compile time.

Re: [jira] [Created] (CARBONDATA-824) Null pointer Exception display to user while performance Testing

2017-03-27 Thread william
First, can you try change the create table statement ends with STORED BY 'carbondata' instead of STORED BY 'org.apache.carbondata.format'; Second , can you give some sample data instead of trying to upload 32GB CSV file.

[jira] [Created] (CARBONDATA-824) Null pointer Exception display to user while performance Testing

2017-03-27 Thread Vinod Rohilla (JIRA)
Vinod Rohilla created CARBONDATA-824: Summary: Null pointer Exception display to user while performance Testing Key: CARBONDATA-824 URL: https://issues.apache.org/jira/browse/CARBONDATA-824