Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Konstantin Boudnik
Thanks for the offer Furkan. While we have a pretty packed house 
already, I think we can find a spot for one more ;)


Welcome! I will update the proposal to reflect this.

--
  Cos

On 2/4/20 5:45 AM, Furkan KAMACI wrote:

Hi,

I've checked your proposal, website, and blog posts and seems promising! I
can help as a mentor if needed too.

Kind Regards,
Furkan KAMACI


On Tue, Feb 4, 2020 at 1:21 PM Paul King  wrote:


All sounds good to me.

On Tue, Feb 4, 2020 at 4:30 PM Nikita Ivanov  wrote:


Hi Paul,
I'm one of the NLPCraft project members, let me chime in here.

1. The project is very interested in native Groovy/Kotlin/Scala model
APIs. It is a bit unclear for now how much work needs to be done
specifically for these languages given that NLPCraft provides a very

plain

(and Scala-friendly) Java APIs. I think it deserves a separate deep
conversation.
2. The topic of Commons Clause (with ALV2) is very clear - if and when

the

project enters the ASF Incubator the license will change to plain Apache
License, v2.0. There's no disagreement on this among project members.
Thanks!
--
Nikita Ivanov


On Mon, Feb 3, 2020 at 9:35 PM Paul King  wrote:


Looks like you have close to a full house for mentors. I could certainly
put my hat in the ring if you need another, otherwise I will certainly

be

an interested community member. It seems like an interesting project. I
would have a particular interest in Groovy/Micronaut integration.

I did notice in the current repo, usage of the Commons Clause (with

ALV2).

The usage by Redis of something similar, albeit with notable

differences,

was controversial a little while back but seemed to die down as per
LEGAL-402. As someone who hasn't been following this lately, is that
something the project needs to manage expectations for?

Cheers, Paul.


On Tue, Feb 4, 2020 at 10:43 AM Konstantin Boudnik 
wrote:


Good time of the time to all!

I'd like to bring this new interesting project for the discussion,

comments

and feedback with the aim of starting a formal [VOTE] of its

acceptance

into
Incubator.

People behind this project aren't new to Apache: some of them were

behind

the
Apache Ignite incubation, which I consider a huge success as the

community

is
literally thriving almost 5 years after the graduation.

I have been involved a little bit with this project when it just

started

privately a few years ago. And I'd like to emphasize that the

community

however small it might look so far, has been aligned with Apache ways

of

doing
things. Nikita Ivanov (from Ignite PMC) is very instrumental in

tirelessly

helping this group to learn what it means to be a truly open source
project.

The code is already under ALv2 and is publicly available. As you will

see

it
has a lot of organic connections with the rest of Apache ecosystem and

IMO

will fit very well here and continue to grow the community.

The project's proposal is available at [1].

Thank you very much for the feedback you're willing to provide!
--
   With best regards,
 Cos

[1]

https://cwiki.apache.org/confluence/display/INCUBATOR/NLPCraftProposal


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Nikita Ivanov
Justin,
As far as "all people signing ICLA" - we can't guarantee this as I don't
think we know (or have a record of) all the people who touched the code
years back (I do know main contributors - they are in this proposal). I
think SGA would be the best option as it should make things clear.

I know this subject was a sensitive discussion when Apache Ignite was
proposed for incubation (I'm a founding member of that project and a
founder at GridGain Systems) - so I'm keenly aware of the importance of the
IP governments of the proposed codebase. Hopefully, SGA will satisfy these
requirements.

@Konstantin Boudnik , @Aaron Radzinski
 - feel free to chime in as well.

Thanks,
--
Nikita Ivanov

On Wed, Feb 5, 2020 at 3:44 PM Justin Mclean 
wrote:

> Hi,
>
> > NLPCraft came originally from DataLingvo.
>
> OK so that clarifies that there is code developed by DataLingvo in this
> code base.
>
> > During migration, some DataLingvo artefacts weren't properly
> > renamed (it should be fixed by now).
>
> I would question why they were “renamed”, while you’re not an ASF project
> yet please see [1] for future expectations. ASF policy is generally to not
> remove 3rd party headers.
>
> > Just to be clear - NLPCraft has no
> > affiliation with DataLingvo of any kind (beyond the original roots) nor
> it
> > is a legal entity; it's just a project name.
>
> You mentioned all the original developers from DataLingvo worked on
> NLPCraft. Is this still the case? Are any of them initial committers? Will
> all people who worked on the original DataLingvo code base be signing ICLAs?
>
> > If required, DataLingvo will execute SGA (as it already open-sourced it
> > with ASL 2.0 license two years ago). Please advise.
>
> I think that would be best.
>
> Thanks,
> Justin
>
> 1. https://www.apache.org/legal/src-headers.html#3party
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Justin Mclean
Hi,

> NLPCraft came originally from DataLingvo.

OK so that clarifies that there is code developed by DataLingvo in this code 
base. 

> During migration, some DataLingvo artefacts weren't properly
> renamed (it should be fixed by now).

I would question why they were “renamed”, while you’re not an ASF project yet 
please see [1] for future expectations. ASF policy is generally to not remove 
3rd party headers.

> Just to be clear - NLPCraft has no
> affiliation with DataLingvo of any kind (beyond the original roots) nor it
> is a legal entity; it's just a project name.

You mentioned all the original developers from DataLingvo worked on NLPCraft. 
Is this still the case? Are any of them initial committers? Will all people who 
worked on the original DataLingvo code base be signing ICLAs?

> If required, DataLingvo will execute SGA (as it already open-sourced it
> with ASL 2.0 license two years ago). Please advise.

I think that would be best.

Thanks,
Justin

1. https://www.apache.org/legal/src-headers.html#3party
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Nikita Ivanov
Justin, et. al.,
NLPCraft came originally from DataLingvo. DataLingvo was a legal entity
that originally developed some initial code. It has open-sourced it around
2018 along with the license change and it was developed thereafter within
NLPCraft project (including by all the original developers from
DataLingvo). During migration, some DataLingvo artefacts weren't properly
renamed (it should be fixed by now). Just to be clear - NLPCraft has no
affiliation with DataLingvo of any kind (beyond the original roots) nor it
is a legal entity; it's just a project name.

If required, DataLingvo will execute SGA (as it already open-sourced it
with ASL 2.0 license two years ago). Please advise.

Thanks!
--
Nikita Ivanov



On Wed, Feb 5, 2020 at 2:50 PM Justin Mclean 
wrote:

> Hi,
>
> > perhaps the company changed it’s name but it’s the same organization?
>
> Perhaps or perhaps not, it hard to say with the information currently at
> hand. Hopefully the proposed project can clarify.
>
> The IP provenance of the codebase needs to be clear, removing copyright
> headers without explanation IMO makes that unclear. We also need to know if
> others were involved to know who else need to sign ICLAs or if a SGA is
> needed from DataLingvo.
>
> For instance here [1] it states that DataLingvo is the maker of
> NLPCraft.org, that implies we need an SGA from them.
>
> Thanks,
> Justin
>
> 1. https://twitter.com/datalingvo?lang=en
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


[MENTORS] Report signify due Tue February 11

2020-02-05 Thread Justin Mclean
Hi,

Report are due to be signed off by the Tue February 11th.

Its good to see some projects already having sign offs including 
DolphinScheduler where 4 mentors have signed and ECharts where 3 mentors have 
signed off the report.

Other than non reporting podling, currently missing sign-offs are:
- Milagro
- Myriad
- S2graph
- Tamaya

Thank,
Justin



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Justin Mclean
Hi,

> perhaps the company changed it’s name but it’s the same organization?

Perhaps or perhaps not, it hard to say with the information currently at hand. 
Hopefully the proposed project can clarify.

The IP provenance of the codebase needs to be clear, removing copyright headers 
without explanation IMO makes that unclear. We also need to know if others were 
involved to know who else need to sign ICLAs or if a SGA is needed from 
DataLingvo.

For instance here [1] it states that DataLingvo is the maker of NLPCraft.org, 
that implies we need an SGA from them.

Thanks,
Justin

1. https://twitter.com/datalingvo?lang=en
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Dave Fisher
Hi -

A little research on Google, Quadra and LinkedIn shows that DataLingvo and 
NLPCraft seem to be the same with a name change.

Regards,
Dave

> On Feb 5, 2020, at 5:24 PM, David Jencks  wrote:
> 
> Is there perhaps a language problem? I often have trouble saying what I mean 
> in a way that can be understood… even by me, a few minutes later.  
> https://www.datalingvo.com now seems to redirect to https://nlpcraft.org, 
> perhaps the company changed it’s name but it’s the same organization?  I 
> could have written the below and meant “The original code for the proposed 
> NLPCraft project was written when the NLPCraft company was called DataLingvo”.
> 
> Hope this is not just noise…
> 
> david jencks
> 
>> On Feb 5, 2020, at 1:20 PM, Justin Mclean  wrote:
>> 
>> HI,
>> 
>>> The original code for NLPCraft was developed under DataLingvo.
>> 
>> So wouldn’t they be the copyright owner? If soyou can’t change headers or 
>> remove copyright statements without permission from them, do you have that?
>> 
>> Can you point me to the original repos? It may be that that are other people 
>> that we need ICLAs from or a software grant from DataLingvo is also required.
>> 
>> Thanks,
>> Justin
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>> 
> 
> 
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [MENTORS] Podling reports are due 5th

2020-02-05 Thread Justin Mclean
Hi,

We still missing reports from:
 - Annotator
 - PageSpeed
 - SDAP
 - Taverna

Both Annotator and Taverna have failed to report twice in a row which is 
concerning.

Thanks,
Justin
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread David Jencks
Is there perhaps a language problem? I often have trouble saying what I mean in 
a way that can be understood… even by me, a few minutes later.  
https://www.datalingvo.com now seems to redirect to https://nlpcraft.org, 
perhaps the company changed it’s name but it’s the same organization?  I could 
have written the below and meant “The original code for the proposed NLPCraft 
project was written when the NLPCraft company was called DataLingvo”.

Hope this is not just noise…

david jencks

> On Feb 5, 2020, at 1:20 PM, Justin Mclean  wrote:
> 
> HI,
> 
>> The original code for NLPCraft was developed under DataLingvo.
> 
> So wouldn’t they be the copyright owner? If soyou can’t change headers or 
> remove copyright statements without permission from them, do you have that?
> 
> Can you point me to the original repos? It may be that that are other people 
> that we need ICLAs from or a software grant from DataLingvo is also required.
> 
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
> 


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Justin Mclean
HI,

> The original code for NLPCraft was developed under DataLingvo.

So wouldn’t they be the copyright owner? If soyou can’t change headers or 
remove copyright statements without permission from them, do you have that?

Can you point me to the original repos? It may be that that are other people 
that we need ICLAs from or a software grant from DataLingvo is also required.

Thanks,
Justin
-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Nikita Ivanov
Justin - let me clarify (sorry for confusion).

The original code for NLPCraft was developed under DataLingvo. A few years
back it was decided to rename the project to NLPCraft and adopt CC w/ASL2.0
license. The codebase was migrated to a new GitHub repo along with these
changes. Apparently, not all copyright statements were caught and few
remained in the code. I've already notified Aaron Radzinski (a member of
NLPCraft project & of this proposal) - and he should comb through and clean
up copyrights (and other mentions of DataLingvo) from the code.

I hope this makes it a bit clearer.

Best,
--
Nikita Ivanov



On Wed, Feb 5, 2020 at 5:37 AM Justin Mclean 
wrote:

> Hi,
>
> Thanks for the answers.
>
> > 3. There are historical mentions of DataLingvo in the source code (here
> and
> > there) and that needs to be cleaned up (they are just an artefact of
> > migration from repo to repo).
>
>
> I'm not sure that answers my question(s), historical how?
>
> Thanks,
> Justin
>
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Justin Mclean
Hi,

Thanks for the answers.

> 3. There are historical mentions of DataLingvo in the source code (here and
> there) and that needs to be cleaned up (they are just an artefact of
> migration from repo to repo). 


I'm not sure that answers my question(s), historical how?

Thanks,
Justin



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] NLPCraft Proposal

2020-02-05 Thread Nikita Ivanov
Justin,
Let me reply on these.

1. The common clause will be removed (if and when the project enters ASF
Incubator). I mentioned that in my last reply.
2. nlpcraft.* domains will be donated to ASF during the incubation period.
3. There are historical mentions of DataLingvo in the source code (here and
there) and that needs to be cleaned up (they are just an artefact of
migration from repo to repo). All the code that will be migrated to ASF
will be cleaned up. No code grants should be necessary.
4. Agree on user list...

Thanks!
--
Nikita Ivanov



On Tue, Feb 4, 2020 at 4:01 PM Justin Mclean 
wrote:

> Hi,
>
> Great proposal and an interesting project you have there:
>
> Some comments:
> - The common clause is a big concern, it basically means this is not open
> source software. The project needs to be willing to remove it and have
> permission to do so from all contributors.
> - Is nlpcraft a company name? (Website seems to imply it might be by
> giving it a physical address) The domains nlpcraft.com nlpcraft.net and
> nlpcraft.org are currently taken, will they be donated or redirected to
> the ASF?
> - A quick search of the repro shows 3rd party content of unknown
> licensing, are you aware of this? e.g [1] Is there a relationship between
> DataLingvo and NLPCraft? If so what is it? Would a software grant from them
> be needed? I notice a number of other files are also copyright DataLingvo.
> - Generally Incubator projects don’t start with a user list, is there a
> reason why you want to start with one?
>
> Thanks,
> Justin
>
> 1. ./src/main/resources/stopwords/first_words.txt.gz
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>