Could you speak a little more about the current status of the project?

>From the, now available, repository, it seems that it is in very early stages?

https://github.com/Natural-Intelligence/rainbow/graphs/contributors


On Mon, Feb 24, 2020 at 3:45 AM Aviem Zur <aviem...@apache.org> wrote:
>
> Hi all,
>
> Thanks for the feedback.
>
> 1. Indeed, the codebase is still under a private repository. We intend to
> have it ready to share publicly later this March.
> 2. The project is built in Python and Java.This is due to the fact that we
> have deep integrations with open source projects written in these languages.
> We also considered the fact that it is used by both data scientists and
> data engineers and we believe a combination of Python/Java will promote
> collaboration and contribution.
> 3. Rainbow project intends to facilitate and simplify the composition of
> complex pipelines, which are based on other open source projects.
> As such it does not compete or overlap but rather complement these projects.
> 4. Re: DLAB project - as we see it this project focuses in the research
> phase, while Rainbow's focus is in the production phase.
> Seems the 2 projects complement each other and it would be very interesting
> for us to collaborate with the DLAB team.
> 5. We will adjust the proposal to provide more details on how other Apache
> projects are used in Rainbow.
> We currently mainly use Apache Airflow in order to run pipelines defined by
> users in our APIs (YAML, with plans of UI/REST), this reduces the
> engineering requirements for transitioning data science code into
> production. We also leverage Apache Spark and Apache Hive for data
> preparation features and there are plans to integrate with Apache Karaf as
> well.
>
> Thanks,
> Aviem
>
> On Sat, Feb 22, 2020 at 4:29 AM Paul King <pa...@asert.com.au> wrote:
>
> > Indeed, it does sound interesting.
> >
> > I would find it useful if the "existing Apache projects" bit of "Rainbow is
> > in development, leveraging existing Apache projects." could be expanded in
> > any way. I know there is a list of external dependencies later but  any
> > further description of how those technologies are used would be helpful.
> >
> > Also, I'd be interested in knowing how the proposal relates to DLAB:
> > https://dlab.apache.org/
> >
> > Nice work.
> >
> > Cheers, Paul.
> >
> >
> >
> > On Sat, Feb 22, 2020 at 2:34 AM larry mccay <lmc...@apache.org> wrote:
> >
> > > This seems like an interesting proposal.
> > >
> > > Couple points/questions:
> > >
> > > * The existing source is not available for viewing as it is still in
> > > private repos?
> > > * Is it a primarily java project?
> > > * It seems the intent of Rainbow is to not compete or overlap with the
> > > Hadoop ecosystem projects but rather to provide an efficient interface
> > > above them - correct?
> > >
> > >
> > > On Fri, Feb 21, 2020 at 8:51 AM Aviem Zur <aviem...@gmail.com> wrote:
> > >
> > > > Hi,
> > > >
> > > > We would like to propose Rainbow as an Apache incubator project.
> > Rainbow
> > > is
> > > > an end-to-end platform for data engineers & scientists, allowing them
> > to
> > > > build, train and deploy machine learning models in a robust and agile
> > > way.
> > > > The project's goal is to operationalize the machine learning process,
> > > > allowing data scientists to quickly transition from a successful
> > > experiment
> > > > to an automated pipeline in production.
> > > >
> > > > The proposal can be found here:
> > > > https://cwiki.apache.org/confluence/display/INCUBATOR/Apache+Rainbow
> > > >
> > > > We would appreciate your feedback and thoughts on the proposal.
> > > >
> > > > Thanks,
> > > > Aviem
> > > >
> > >
> >



-- 
Luciano Resende
http://twitter.com/lresende1975
http://lresende.blogspot.com/

---------------------------------------------------------------------
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Reply via email to