It looks like renaming the directory and moving components can be separate
steps. If there is consensus that connectors will move out, should the
directory be named misc for everything else until there is some direction
for the remaining modules?

On Fri, 18 Mar 2022 at 03:03 Jungtaek Lim <kabhwan.opensou...@gmail.com>
wrote:

> Avro reader is technically a connector. We eventually called data source
> implementation "connector" as well; the package name in the catalyst
> represents it.
>
> Docker is something I'm not sure fits with the name "external". It
> probably deserves a top level directory now, since we start to release an
> official docker image. That does not seem to be an experimental one.
>
> Except Docker, all modules in the external directory are "sort of"
> connectors. Ganglia metric sink is an exception, but it is still a kind of
> connector for Dropwizard.
> (It might be interesting to see how many users are still using kinesis-asl
> and ganglia-lgpl modules. We have had almost no updates for DStream for
> several years.)
>
> If we agree with my proposal for docker, remaining is going to be
> effectively a rename. I don't have a strong opinion, just wanted to avoid
> the external directory to become/remain miscellaneous one.
>
> On Fri, Mar 18, 2022 at 10:04 AM Sean Owen <sro...@gmail.com> wrote:
>
>> I sympathize, but might be less change to just rename the dir. There is
>> more in there like the avro reader; it's kind of miscellaneous. I think we
>> might want fewer rather than more top level dirs.
>>
>> On Thu, Mar 17, 2022 at 7:33 PM Jungtaek Lim <
>> kabhwan.opensou...@gmail.com> wrote:
>>
>>> We seem to just focus on how to avoid the conflict with the name
>>> "external" used in bazel. Since we consider the possibility of renaming,
>>> why not revisit the modules "external" contains?
>>>
>>> Looks like kinds of the modules external directory contains are 1)
>>> Docker 2) Connectors 3) Sink on Dropwizard metrics (only ganglia here, and
>>> it seems to be just that Ganglia is LGPL)
>>>
>>> Would it make sense if each kind deserves a top directory? We can
>>> probably give better generalized names, and as a side-effect we will no
>>> longer have "external".
>>>
>>> On Fri, Mar 18, 2022 at 5:45 AM Dongjoon Hyun <dongjoon.h...@gmail.com>
>>> wrote:
>>>
>>>> Thank you for posting this, Alkis.
>>>>
>>>> Before the question (1) and (2), I'm curious if the Apache Spark
>>>> community has other downstreams using Bazel.
>>>>
>>>> To All. If there are some Bazel users with Apache Spark code, could you
>>>> share your practice? If you are using renaming, what is your renamed
>>>> directory name?
>>>>
>>>> Dongjoon.
>>>>
>>>>
>>>> On Thu, Mar 17, 2022 at 11:56 AM Alkis Evlogimenos
>>>> <alkis.evlogime...@databricks.com.invalid> wrote:
>>>>
>>>>> AFAIK there is not. `external` has been baked in bazel since the
>>>>> beginning and there is no plan from bazel devs to attempt to fix this
>>>>> <https://github.com/bazelbuild/bazel/issues/4508#issuecomment-724055371>
>>>>> .
>>>>>
>>>>> On Thu, Mar 17, 2022 at 7:52 PM Sean Owen <sro...@gmail.com> wrote:
>>>>>
>>>>>> Just checking - there is no way to tell bazel to look somewhere else
>>>>>> for whatever 'external' means to it?
>>>>>> It's a kinda big ugly change but it's not a functional change. If
>>>>>> anything it might break some downstream builds that rely on the current
>>>>>> structure too. But such is life for developers? I don't have a strong
>>>>>> reason we can't.
>>>>>>
>>>>>> On Thu, Mar 17, 2022 at 1:47 PM Alkis Evlogimenos
>>>>>> <alkis.evlogime...@databricks.com.invalid> wrote:
>>>>>>
>>>>>>> Hi Spark devs.
>>>>>>>
>>>>>>> The Apache Spark repo has a top level external/ directory. This is a
>>>>>>> reserved name for the bazel build system and it causes all sorts of
>>>>>>> problems: some can be worked around and some cannot (for some details on
>>>>>>> one that cannot see
>>>>>>> https://github.com/hedronvision/bazel-compile-commands-extractor/issues/30
>>>>>>> ).
>>>>>>>
>>>>>>> Some forks of Apache Spark use bazel as a build system. It would be
>>>>>>> nice if we can make this change in Apache Spark without resorting to
>>>>>>> complex renames/merges whenever changes are pulled from upstream.
>>>>>>>
>>>>>>> As such I proposed to rename external/ directory to want to rename
>>>>>>> the external/ directory to something else [SPARK-38569
>>>>>>> <https://issues.apache.org/jira/browse/SPARK-38569>]. I also sent a
>>>>>>> tentative [PR-35874 <https://github.com/apache/spark/pull/35874>]
>>>>>>> that renames external/ to vendor/.
>>>>>>>
>>>>>>> My questions to you are:
>>>>>>> 1. Are there any objections to renaming external to X?
>>>>>>> 2. Is vendor a good new name for external?
>>>>>>>
>>>>>>> Cheers,
>>>>>>>
>>>>>>

Reply via email to