Licensing questions

2018-10-19 Thread Slim Bouguerra
Looking at http://druid.io/licensing, I can see: "Druid and the Druid logo are copyright Metamarkets Group Inc. Druid is a registered trademark of Metamarkets Group Inc." Is that still the case, or is the page outdated? Can someone please explain to me what was transferred to the Apache Foundation and what

Re: Question regarding distribution of GPL licensed MySQL Connector

2018-10-19 Thread Slim Bouguerra
We have to do something similar internally, since we cannot ship GPL code within our binaries. Thus we build the MySQL connector and exclude the driver from the artifacts. Then we instruct users on how to add the driver JARs. FYI, users usually have platform-dependent shared MySQL/LZO drivers, so

Re: Druid + Theta Sketches performance

2018-10-19 Thread Charles Allen
This is a good callout. Those numbers still seem very slow. One thing I'm curious about is whether you are dropping the id when you index, or whether the id is also being indexed into the Druid segments. With how Druid does indexing, it dictionary-encodes all the dimension values. So the cardinality of rows is
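As a rough illustration of the dictionary encoding mentioned in this message, the sketch below maps each distinct dimension value to an integer ID and stores the column as a list of IDs. It is a simplified toy, not Druid's actual implementation; the function name `dictionary_encode` and the sample columns are hypothetical.

```python
def dictionary_encode(values):
    """Return (dictionary, encoded): a sorted list of the distinct values,
    and the column rewritten as integer positions in that list."""
    dictionary = sorted(set(values))
    index = {value: i for i, value in enumerate(dictionary)}
    encoded = [index[value] for value in values]
    return dictionary, encoded


# Low-cardinality dimension: the dictionary stays tiny.
country = ["US", "DE", "US", "FR", "DE", "US"]
country_dict, country_ids = dictionary_encode(country)

# High-cardinality dimension (a unique id per row, made up for this example):
# the dictionary grows as large as the column itself.
row_id = [f"id-{n}" for n in range(6)]
id_dict, id_ids = dictionary_encode(row_id)

print(len(country_dict), len(id_dict))
```

Under these assumptions, a unique id kept as a dimension yields one dictionary entry per row, which is why it matters whether the id is dropped at indexing time.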

Druid Heavy Data Load perform only partial load

2018-10-19 Thread Kiran Jagtap
Hi Team, thank you so much; I appreciate your help and support. I'm facing some issues loading a large amount of data into a single-node Druid setup: the data load job succeeds, but only part of the data gets loaded. Machine config: Ubuntu Linux 16.04 LTS, 4 CPUs, 16 GB RAM, 500 GB disk space. Data CSV file size:

Druid + Theta Sketches performance

2018-10-19 Thread alex . rnv . ru
Hi Druid devs, I am testing Druid for our specific count-distinct estimation case. Data was ingested via the Hadoop indexer. When simplified, it has the following schema: timestamp | key | country | theta-sketch | event-counter. So, there are 2 dimensions, one counter metric, one theta sketch metric.