Re: [DISCUSS] NLPCraft Proposal
Thanks for the offer Furkan. While we have a pretty packed house already, I think we can find a spot for one more ;) Welcome! I will update the proposal to reflect this. -- Cos On 2/4/20 5:45 AM, Furkan KAMACI wrote: Hi, I've checked your proposal, website, and blog posts and seems promising! I can help as a mentor if needed too. Kind Regards, Furkan KAMACI On Tue, Feb 4, 2020 at 1:21 PM Paul King wrote: All sounds good to me. On Tue, Feb 4, 2020 at 4:30 PM Nikita Ivanov wrote: Hi Paul, I'm one of the NLPCraft project members, let me chime in here. 1. The project is very interested in native Groovy/Kotlin/Scala model APIs. It is a bit unclear for now how much work needs to be done specifically for these languages given that NLPCraft provides a very plain (and Scala-friendly) Java APIs. I think it deserves a separate deep conversation. 2. The topic of Commons Clause (with ALV2) is very clear - if and when the project enters the ASF Incubator the license will change to plain Apache License, v2.0. There's no disagreement on this among project members. Thanks! -- Nikita Ivanov On Mon, Feb 3, 2020 at 9:35 PM Paul King wrote: Looks like you have close to a full house for mentors. I could certainly put my hat in the ring if you need another, otherwise I will certainly be an interested community member. It seems like an interesting project. I would have a particular interest in Groovy/Micronaut integration. I did notice in the current repo, usage of the Commons Clause (with ALV2). The usage by Redis of something similar, albeit with notable differences, was controversial a little while back but seemed to die down as per LEGAL-402. As someone who hasn't been following this lately, is that something the project needs to manage expectations for? Cheers, Paul. On Tue, Feb 4, 2020 at 10:43 AM Konstantin Boudnik wrote: Good time of the time to all! I'd like to bring this new interesting project for the discussion, comments and feedback with the aim of starting a formal [VOTE] of its acceptance into Incubator. People behind this project aren't new to Apache: some of them were behind the Apache Ignite incubation, which I consider a huge success as the community is literally thriving almost 5 years after the graduation. I have been involved a little bit with this project when it just started privately a few years ago. And I'd like to emphasize that the community however small it might look so far, has been aligned with Apache ways of doing things. Nikita Ivanov (from Ignite PMC) is very instrumental in tirelessly helping this group to learn what it means to be a truly open source project. The code is already under ALv2 and is publicly available. As you will see it has a lot of organic connections with the rest of Apache ecosystem and IMO will fit very well here and continue to grow the community. The project's proposal is available at [1]. Thank you very much for the feedback you're willing to provide! -- With best regards, Cos [1] https://cwiki.apache.org/confluence/display/INCUBATOR/NLPCraftProposal - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Justin, As far as "all people signing ICLA" - we can't guarantee this as I don't think we know (or have a record of) all the people who touched the code years back (I do know main contributors - they are in this proposal). I think SGA would be the best option as it should make things clear. I know this subject was a sensitive discussion when Apache Ignite was proposed for incubation (I'm a founding member of that project and a founder at GridGain Systems) - so I'm keenly aware of the importance of the IP governments of the proposed codebase. Hopefully, SGA will satisfy these requirements. @Konstantin Boudnik , @Aaron Radzinski - feel free to chime in as well. Thanks, -- Nikita Ivanov On Wed, Feb 5, 2020 at 3:44 PM Justin Mclean wrote: > Hi, > > > NLPCraft came originally from DataLingvo. > > OK so that clarifies that there is code developed by DataLingvo in this > code base. > > > During migration, some DataLingvo artefacts weren't properly > > renamed (it should be fixed by now). > > I would question why they were “renamed”, while you’re not an ASF project > yet please see [1] for future expectations. ASF policy is generally to not > remove 3rd party headers. > > > Just to be clear - NLPCraft has no > > affiliation with DataLingvo of any kind (beyond the original roots) nor > it > > is a legal entity; it's just a project name. > > You mentioned all the original developers from DataLingvo worked on > NLPCraft. Is this still the case? Are any of them initial committers? Will > all people who worked on the original DataLingvo code base be signing ICLAs? > > > If required, DataLingvo will execute SGA (as it already open-sourced it > > with ASL 2.0 license two years ago). Please advise. > > I think that would be best. > > Thanks, > Justin > > 1. https://www.apache.org/legal/src-headers.html#3party > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >
Re: [DISCUSS] NLPCraft Proposal
Hi, > NLPCraft came originally from DataLingvo. OK so that clarifies that there is code developed by DataLingvo in this code base. > During migration, some DataLingvo artefacts weren't properly > renamed (it should be fixed by now). I would question why they were “renamed”, while you’re not an ASF project yet please see [1] for future expectations. ASF policy is generally to not remove 3rd party headers. > Just to be clear - NLPCraft has no > affiliation with DataLingvo of any kind (beyond the original roots) nor it > is a legal entity; it's just a project name. You mentioned all the original developers from DataLingvo worked on NLPCraft. Is this still the case? Are any of them initial committers? Will all people who worked on the original DataLingvo code base be signing ICLAs? > If required, DataLingvo will execute SGA (as it already open-sourced it > with ASL 2.0 license two years ago). Please advise. I think that would be best. Thanks, Justin 1. https://www.apache.org/legal/src-headers.html#3party - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Justin, et. al., NLPCraft came originally from DataLingvo. DataLingvo was a legal entity that originally developed some initial code. It has open-sourced it around 2018 along with the license change and it was developed thereafter within NLPCraft project (including by all the original developers from DataLingvo). During migration, some DataLingvo artefacts weren't properly renamed (it should be fixed by now). Just to be clear - NLPCraft has no affiliation with DataLingvo of any kind (beyond the original roots) nor it is a legal entity; it's just a project name. If required, DataLingvo will execute SGA (as it already open-sourced it with ASL 2.0 license two years ago). Please advise. Thanks! -- Nikita Ivanov On Wed, Feb 5, 2020 at 2:50 PM Justin Mclean wrote: > Hi, > > > perhaps the company changed it’s name but it’s the same organization? > > Perhaps or perhaps not, it hard to say with the information currently at > hand. Hopefully the proposed project can clarify. > > The IP provenance of the codebase needs to be clear, removing copyright > headers without explanation IMO makes that unclear. We also need to know if > others were involved to know who else need to sign ICLAs or if a SGA is > needed from DataLingvo. > > For instance here [1] it states that DataLingvo is the maker of > NLPCraft.org, that implies we need an SGA from them. > > Thanks, > Justin > > 1. https://twitter.com/datalingvo?lang=en > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >
[MENTORS] Report signify due Tue February 11
Hi, Report are due to be signed off by the Tue February 11th. Its good to see some projects already having sign offs including DolphinScheduler where 4 mentors have signed and ECharts where 3 mentors have signed off the report. Other than non reporting podling, currently missing sign-offs are: - Milagro - Myriad - S2graph - Tamaya Thank, Justin - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Hi, > perhaps the company changed it’s name but it’s the same organization? Perhaps or perhaps not, it hard to say with the information currently at hand. Hopefully the proposed project can clarify. The IP provenance of the codebase needs to be clear, removing copyright headers without explanation IMO makes that unclear. We also need to know if others were involved to know who else need to sign ICLAs or if a SGA is needed from DataLingvo. For instance here [1] it states that DataLingvo is the maker of NLPCraft.org, that implies we need an SGA from them. Thanks, Justin 1. https://twitter.com/datalingvo?lang=en - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Hi - A little research on Google, Quadra and LinkedIn shows that DataLingvo and NLPCraft seem to be the same with a name change. Regards, Dave > On Feb 5, 2020, at 5:24 PM, David Jencks wrote: > > Is there perhaps a language problem? I often have trouble saying what I mean > in a way that can be understood… even by me, a few minutes later. > https://www.datalingvo.com now seems to redirect to https://nlpcraft.org, > perhaps the company changed it’s name but it’s the same organization? I > could have written the below and meant “The original code for the proposed > NLPCraft project was written when the NLPCraft company was called DataLingvo”. > > Hope this is not just noise… > > david jencks > >> On Feb 5, 2020, at 1:20 PM, Justin Mclean wrote: >> >> HI, >> >>> The original code for NLPCraft was developed under DataLingvo. >> >> So wouldn’t they be the copyright owner? If soyou can’t change headers or >> remove copyright statements without permission from them, do you have that? >> >> Can you point me to the original repos? It may be that that are other people >> that we need ICLAs from or a software grant from DataLingvo is also required. >> >> Thanks, >> Justin >> - >> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org >> For additional commands, e-mail: general-h...@incubator.apache.org >> > > > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [MENTORS] Podling reports are due 5th
Hi, We still missing reports from: - Annotator - PageSpeed - SDAP - Taverna Both Annotator and Taverna have failed to report twice in a row which is concerning. Thanks, Justin - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Is there perhaps a language problem? I often have trouble saying what I mean in a way that can be understood… even by me, a few minutes later. https://www.datalingvo.com now seems to redirect to https://nlpcraft.org, perhaps the company changed it’s name but it’s the same organization? I could have written the below and meant “The original code for the proposed NLPCraft project was written when the NLPCraft company was called DataLingvo”. Hope this is not just noise… david jencks > On Feb 5, 2020, at 1:20 PM, Justin Mclean wrote: > > HI, > >> The original code for NLPCraft was developed under DataLingvo. > > So wouldn’t they be the copyright owner? If soyou can’t change headers or > remove copyright statements without permission from them, do you have that? > > Can you point me to the original repos? It may be that that are other people > that we need ICLAs from or a software grant from DataLingvo is also required. > > Thanks, > Justin > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
HI, > The original code for NLPCraft was developed under DataLingvo. So wouldn’t they be the copyright owner? If soyou can’t change headers or remove copyright statements without permission from them, do you have that? Can you point me to the original repos? It may be that that are other people that we need ICLAs from or a software grant from DataLingvo is also required. Thanks, Justin - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Justin - let me clarify (sorry for confusion). The original code for NLPCraft was developed under DataLingvo. A few years back it was decided to rename the project to NLPCraft and adopt CC w/ASL2.0 license. The codebase was migrated to a new GitHub repo along with these changes. Apparently, not all copyright statements were caught and few remained in the code. I've already notified Aaron Radzinski (a member of NLPCraft project & of this proposal) - and he should comb through and clean up copyrights (and other mentions of DataLingvo) from the code. I hope this makes it a bit clearer. Best, -- Nikita Ivanov On Wed, Feb 5, 2020 at 5:37 AM Justin Mclean wrote: > Hi, > > Thanks for the answers. > > > 3. There are historical mentions of DataLingvo in the source code (here > and > > there) and that needs to be cleaned up (they are just an artefact of > > migration from repo to repo). > > > I'm not sure that answers my question(s), historical how? > > Thanks, > Justin > > > > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >
Re: [DISCUSS] NLPCraft Proposal
Hi, Thanks for the answers. > 3. There are historical mentions of DataLingvo in the source code (here and > there) and that needs to be cleaned up (they are just an artefact of > migration from repo to repo). I'm not sure that answers my question(s), historical how? Thanks, Justin - To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org
Re: [DISCUSS] NLPCraft Proposal
Justin, Let me reply on these. 1. The common clause will be removed (if and when the project enters ASF Incubator). I mentioned that in my last reply. 2. nlpcraft.* domains will be donated to ASF during the incubation period. 3. There are historical mentions of DataLingvo in the source code (here and there) and that needs to be cleaned up (they are just an artefact of migration from repo to repo). All the code that will be migrated to ASF will be cleaned up. No code grants should be necessary. 4. Agree on user list... Thanks! -- Nikita Ivanov On Tue, Feb 4, 2020 at 4:01 PM Justin Mclean wrote: > Hi, > > Great proposal and an interesting project you have there: > > Some comments: > - The common clause is a big concern, it basically means this is not open > source software. The project needs to be willing to remove it and have > permission to do so from all contributors. > - Is nlpcraft a company name? (Website seems to imply it might be by > giving it a physical address) The domains nlpcraft.com nlpcraft.net and > nlpcraft.org are currently taken, will they be donated or redirected to > the ASF? > - A quick search of the repro shows 3rd party content of unknown > licensing, are you aware of this? e.g [1] Is there a relationship between > DataLingvo and NLPCraft? If so what is it? Would a software grant from them > be needed? I notice a number of other files are also copyright DataLingvo. > - Generally Incubator projects don’t start with a user list, is there a > reason why you want to start with one? > > Thanks, > Justin > > 1. ./src/main/resources/stopwords/first_words.txt.gz > - > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >