Re: [DISCUSS] The state of the project - 2017

Riccardo Tommasini Mon, 06 Nov 2017 22:24:08 -0800

Hello all,
My name is Riccardo Tommasini (Politecnico di Milano)
It’s nice to see all these activities and I’m glad that Calcite is finally 
attracting other people in academia.


On my side,
In collaboration with the university of Bolzano/Bozen, we are using Calcite for 
ontology based data access (over streams) since one year and a half.

We are planning 2 publications next spring
We are currently running the evaluations.
Honestly I cannot commit to coordinate the area, but i will share here any 
result.

Native SPARQL SUPPORT is also an idea we are considering, especially in 
relation without extension to process RDF streams (RSP-QL)

RT

On 7 Nov 2017, 05:25 +0100, Edmon Begoli <[email protected]>, wrote:
Couple of things to add:

1) Related to academic or just a general research perspective - I volunteer
to coordinate this area, hopefully together with you Michael, Jesus, and
whoever else is interested. Calcite is a great platform to enable this
(modularity, expansiveness, etc.)
Since you guys are already busy with the PMC roles, I can take a lead, and
we can rotate yearly. In terms of output, I propose that we aim for 1-2
strong papers out every year (SIGMOD, VLDB, PODS, etc.), which will
accumulate to a nice body of work. We could try to publish some of these in
collaboration with downstream projects such as Flink, etc.

2) I would also like to propose a new component to Calcite that could
evolve over the next 12-18 months focused on scientific data, starting with
support for genomic and related storage managers (TileDB, SciDB, SAMTools,
etc.), with specific aim at life and medical sciences. This is, in my
opinion, the next big frontier, in addition to the current geospatial focus.

3) I think we'll get the geospatial stuff done by the end of 2018.

Best,
Edmon

On Mon, Nov 6, 2017 at 7:46 PM, Michael Mior <[email protected]> wrote:

Jesús,

I'm happy to step in as PMC for next year if others are comfortable with
that. As far as the answers to your questions, a few thoughts below.

1) I think it's great to see continued growth in new contributors. For such
a widely used project, I've never seen new committers be onboarded so
quickly. It's great to see the scope and diversity of use cases for Calcite
expanding. Although preliminary, things like adding geospatial queries are
opening up a lot of new doors for Calcite and I'm interested to see where
this goes.

2) I'd like to see Calcite get more use in academic research. Hopefully the
paper Edmon is currently leading will contribute to that effort. I think
Calcite can make it much easier to prototype query optimizations that
poking around the internals of Postgres or MySQL. (Disclaimer: It's been a
while since I've looked at either of these projects.)

Also, it would be nice to have the CI process become more stable. Whether
this is some improvements to the current Jenkins infrastructure or the work
Christian is doing on getting things running smoothly on Travis CI.
Furthermore, I don't think the integration tests are run as often as they
should be since it can be a little onerous to set up. I've mentioned before
I think Docker could be a better fit than the current VM solution,
especially if we're able to have separate containers for each service so
they can easily be tested individually.

Although my answer to question 2 was a bit more verbose, that shouldn't be
interpreted negatively. Although I haven't been involved with Calcite for
very long, I've been impressed with the upward trajectory and I'm sure that
will continue!

Cheers,
--
Michael Mior
[email protected]

2017-11-06 12:00 GMT-05:00 Jesus Camacho Rodriguez <[email protected]>:

It has been a bit over two years since Calcite graduated to a top-level
Apache project [1]. Back then, it was decided that every year there would
be a "state of the project" discussion and a new PMC chair/VP would be
chosen [2]. The time has come :)

The adoption of Calcite has continued growing nicely during the last
year.
We continued improving the support to query all data, from
semi-structured
to streaming, including spatial/geographical/geometry data recently.
Calcite can interact with more systems than ever before and we count
already more than 12 different adapters into our codebase. In turn, the
wide adoption of Calcite is helping us to consolidate existing core code
and extend the tests coverage.

The dissemination of the project continued over the last year, with
important presence of Calcite in talks at conferences and meetups. In
addition, some members of the community are trying (for the second time
:)
) to produce a paper describing the project, its architecture, and how
other different systems are using it [3]. There were several discussions
last year about the difficulty to consume Calcite documentation; we hope
that this document would serve as an initial formal reference for the
project.

We also continued with a regular release cadence, which is representative
of the health of the project as well as useful for the rest of the
projects
that consume the Calcite bits. Last week, CALCITE-2027 [4] was logged to
drop support for Java7. I think it is a great opportunity for the project
to take another step forward, releasing Calcite 2.0 shortly after that,
and
deprecating some old APIs along the way.

We have a larger, more diverse number of committers and contributors than
last year, coming both from industry and academia. Their contributions
were
not limited to code for the project, as we had different members of the
community playing the release manager role, spending time improving the
documentation of the project, etc.
Probably we still need to improve some aspects as a community. For
instance, recently there were discussions about the participation of the
community members in one of the important tasks for the project: pull
requests reviews. This continuous engagement seems to be challenging for
a
project such as Calcite, as most of us work primarily on other projects
that "consume" Calcite and we might spend more time involved in those
projects. While this is difficult to change and I do not have any
specific
idea to improve it, it is important that we do our best to help and
ensure
that the project development does not stall.

I am not involved in the Avatica effort, but it has been great to see
Avatica continue maturing, moving into its own repository and following
with its own release cadence. Josh, Julian, if you want to add a few
lines
about the state of Avatica, that would be great.

Since we agreed to rotate the PMC chair every 12 months, I want to use
this thread to start talking about a replacement too. It has been a
privilege to be able to serve as Calcite PMC chair during last year, I
wish
I could have found more time to foster the project further: I truly
believe
in Calcite vision and its value at the core of the development of open
source data management systems and applications.
Which candidates would like to step up? In my opinion, I think Michael
Mior, if he is willing to accept, would be a great candidate. He has been
engaged with the Calcite community in different roles, writing code and
documentation, reviewing PRs, answering questions in the mailing lists,
and
acting as release manager for 1.14, among others.

Lastly, as Julian asked last year:
1) What else are we doing well in the project?
2) What are the areas where we need to do better?
Please take some time to share your thoughts about the state of the
project.

-Jesús



[1] http://calcite.apache.org/news/2015/10/22/calcite-graduates/

[2] http://mail-archives.apache.org/mod_mbox/incubator-
calcite-dev/201509.mbox/%3CCF8D6F96-706F-4502-B41D-
0689E357209D%40apache.org%3E

[3] http://issues.apache.org/jira/browse/CALCITE-2024

[4] http://issues.apache.org/jira/browse/CALCITE-2027

Re: [DISCUSS] The state of the project - 2017

Reply via email to