I tested moving datasketches to core and it doesn’t look like it brings 
additional dependencies:

> [INFO] --------------------< org.apache.druid:druid-core 
> >---------------------
> [INFO] Building druid-core 0.17.0-incubating-SNAPSHOT
> [INFO] --------------------------------[ jar 
> ]---------------------------------
> [INFO]
> [INFO] --- maven-dependency-plugin:3.1.1:tree (default-cli) @ druid-core ---
> [INFO] org.apache.druid:druid-core:jar:0.17.0-incubating-SNAPSHOT
> [INFO] +- com.yahoo.datasketches:sketches-core:jar:0.13.4:compile
> [INFO] +- com.yahoo.datasketches:memory:jar:0.12.2:compile
> [INFO] +- commons-io:commons-io:jar:2.6:compile

A diff of the distribution build before and after moving datasketches:

> diff -r before after | grep -v Binary
> Only in before/extensions/druid-datasketches: memory-0.12.2.jar
> Only in before/extensions/druid-datasketches: sketches-core-0.13.4.jar
> Only in before/extensions/druid-datasketches: slf4j-api-1.7.25.jar
> Only in after/lib: memory-0.12.2.jar
> Only in after/lib: sketches-core-0.13.4.jar


Thanks,
Chi


> On Oct 31, 2019, at 9:15 AM, Charles Allen <cral...@apache.org> wrote:
> 
> Any time we discuss moving things into core Druid I would love to see a
> list of dependencies that comes with it.
> 
> On Wed, Oct 30, 2019, 6:08 PM Jihoon Son <jihoon...@apache.org> wrote:
> 
>> +1 on moving too.
>> 
>> On Mon, Oct 28, 2019 at 12:46 PM Fangjin Yang <fang...@imply.io> wrote:
>> 
>>> +1 on moving datasketches to core
>>> 
>>> On Mon, Oct 28, 2019 at 12:36 PM Chi Cao Minh <chi.caom...@imply.io>
>>> wrote:
>>> 
>>>> To support range partitioning for native parallel batch indexing, I’m
>>>> considering moving DataSketches from extensions to core (see
>>>> https://github.com/apache/incubator-druid/issues/8769 <
>>>> https://github.com/apache/incubator-druid/issues/8769> for details).
>>>> Having DataSketches in core would also allow us to switch usages of
>>>> HyperLogLogCollector to the better HLL implementation available in
>>>> DataSketches. One drawback is that moving DataSketches to core will
>>>> possibly block the work to upgrade DataSketches to the latest version:
>>>> https://github.com/apache/incubator-druid/pull/8647 <
>>>> https://github.com/apache/incubator-druid/pull/8647>.
>>>> 
>>>> Any other thoughts on the pros/cons?
>>>> 
>>>> Thanks,
>>>> Chi
>>> 
>> 


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@druid.apache.org
For additional commands, e-mail: dev-h...@druid.apache.org

Reply via email to