Hey all,

Reviving this thread. Now that
https://github.com/apache/incubator-druid/pull/6742 and
https://github.com/apache/incubator-druid/pull/6862 have been released in
0.14, I'm re-proposing graduating Druid SQL from experimental status in the
next release, 0.15. I don't think we need a formal vote on this, but if
there seems to be general consensus, I'll do a PR before the next 3-monthly
0.15 code freeze (which is in about 2 weeks).

On Thu, Jan 31, 2019 at 9:20 AM Gian Merlino <g...@apache.org> wrote:

> It sounds like the general feeling is +1 on Kafka and maybe wait another
> release for SQL. I will do a PR to mark Kafka ingest as non-experimental,
> then, and on SQL we'll see whether #6742 and #6862 look solid in 0.14.
>
> On Tue, Jan 15, 2019 at 8:39 AM Gian Merlino <g...@apache.org> wrote:
>
>> Hi Mat,
>>
>> Ah, right. IMO https://github.com/apache/incubator-druid/pull/6742 is a
>> decent workaround towards making #6176 less of a problem. It would prevent
>> incorrect results from happening (the broker will not start up its http
>> server & announce itself, and so it won't get picked up by clients, if it
>> never got the initialization event). If paired with monitoring that
>> restarts unhealthy brokers, the issue should be fully worked-around in
>> practice.
>>
>> Even though there's an (imo) viable workaround, it would still be good to
>> fix the root cause of #6176. I just raised
>> https://github.com/apache/incubator-druid/pull/6862 to update Curator
>> and see if that helps -- there is a bug fixed in the latest release that
>> looks like it could cause the behavior we're seeing (
>> https://issues.apache.org/jira/browse/CURATOR-476).
>>
>> My feeling is that it's still reasonable to remove the experimental label
>> from Druid SQL in 0.14, especially since #6742 will make SQL and native
>> queries behave at parity (initialization getting missed will delay broker
>> startup for _both_ cases). So in that sense they are at least on the same
>> footing. And hopefully #6862 will fix them both, together.
>>
>> On Tue, Jan 15, 2019 at 7:56 AM Pierre-Emile Ferron <pe.fer...@gmail.com>
>> wrote:
>>
>>> A remaining issue with SQL is
>>> https://github.com/apache/incubator-druid/issues/6176
>>>
>>> We've seen it happen several times in production on 0.12, where
>>> thankfully
>>> SQL doesn't power anything critical. The current workarounds are:
>>> 1. Restart the broker. Obviously not a good solution.
>>> 2. Migrate to HTTP segment discovery. I'm fine with that, and we are
>>> actually planning to do it soon in our clusters, but I'm still concerned
>>> about other Druid users—the default setting is still ZK, which means that
>>> SQL would still have this issue by default.
>>>
>>> Before marking SQL as non-experimental, I'd suggest either fixing the
>>> root
>>> cause, or making HTTP segment discovery the default and then explicitly
>>> deprecating ZK segment discovery.
>>>
>>>
>>> On Mon, Jan 14, 2019 at 2:18 PM Gian Merlino <g...@apache.org> wrote:
>>>
>>> > I'd like to propose graduating a couple of features out of
>>> 'experimental'
>>> > status in 0.14. Both are popular features (judging by mailing list &
>>> github
>>> > issue/PR activity). Both have been around for a while and have
>>> attained a
>>> > good level of quality and stability of API & behavior. I believe
>>> removing
>>> > the 'experimental' banner from these features would more accurately
>>> reflect
>>> > reality, and be a good signal to the user community.
>>> >
>>> > 1) Kafka indexing service. First introduced in Druid 0.9.1, it went
>>> through
>>> > a major protocol change in Druid 0.12.0 that added incremental
>>> publishing,
>>> > & 'mixing' of data from different partitions. Subjectively, quality
>>> appears
>>> > to be getting more solid, based on frequency of bug reports and also
>>> based
>>> > on our own experiences running this in production. Finally- I believe
>>> it is
>>> > already much more robust than Tranquility, the only 'stable'
>>> alternative.
>>> >
>>> > 2) Druid SQL. First introduced in Druid 0.10.0. It isn't feature
>>> complete
>>> > yet (multi-value dimensions, datasketches, etc, remain unsupported)
>>> but the
>>> > API and behavior have been generally stable. No major issues around
>>> memory
>>> > / performance / etc regressions relative to native Druid queries are
>>> > outstanding. IMO, it is well on its way to becoming a first class way
>>> to
>>> > query Druid, and it is a good time to remove the 'experimental' banner.
>>> >
>>>
>>

Reply via email to