Re: elastic search as kylin storage engine

zeLiu Wed, 27 Jan 2016 18:15:21 -0800

hi,shaofeng
I am trying to review the code of 2.x,and Compared with 1.x, the code of 2.x
change is very big.
this version  is just implemented functional,There are a lot of place needs
to be optimized in the performance and architecture.



Because I do not know how to modify the webapp code,So if the description of
Cube contains  "ES", then it is a ES task
build job has four step:"Create Intermediate Flat Hive Table"---->"Bulk
Index and Load Data to ES"---->"Update Cube Info"---->"Garbage Collection"
"Bulk Index and Load Data to ES" is a new mapreduce job which use to mapping
and bulk index .
the central class is "org.apache.kylin.job.hadoop.cube.BulkESMapper" .

The query is based on SQLDigest,and return a ITupleIterator. 
when the description of Cube contains  "ES" ,the "StorageEngineFactory" will
return  "org.apache.kylin.storage.elasticsearch.ElasticSearchStorageEngine".
if there are no aggregate,excute
"org.apache.kylin.storage.elasticsearch.SerializedElasticSearchTupleIterator"
,Otherwise, excute
"org.apache.kylin.storage.elasticsearch.SerializedElasticAggregationTupleIterator"

the count() function is blocked before the SQLDigest,so it does not support
count() now.

thanks!

--
View this message in context: 
http://apache-kylin.74782.x6.nabble.com/elastic-search-as-kylin-storage-engine-tp3429p3470.html
Sent from the Apache Kylin mailing list archive at Nabble.com.

Re: elastic search as kylin storage engine

Reply via email to