This is an automated email from the ASF dual-hosted git repository.
aradzinski pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/incubator-nlpcraft.git
The following commit(s) were added to refs/heads/master by this push:
new 19debbd Further work on ASF release licensing requirements.
19debbd is described below
commit 19debbd6ab5435d1a19acf0474c31330933186ad
Author: Aaron Radzinzski <[email protected]>
AuthorDate: Wed Apr 8 20:36:39 2020 -0700
Further work on ASF release licensing requirements.
---
LICENSE | 83 -----------
NOTICE | 89 +++++++++++
LICENSE => bindist/LICENSE | 165 ++++++++++-----------
.../common/nlp/core/NCNlpPorterStemmer.scala | 2 +-
4 files changed, 172 insertions(+), 167 deletions(-)
diff --git a/LICENSE b/LICENSE
index 063fbf9..6b0b127 100644
--- a/LICENSE
+++ b/LICENSE
@@ -201,86 +201,3 @@
See the License for the specific language governing permissions and
limitations under the License.
-+=================================================================================+
-|
|
-| NLPCraft project uses or integrates with the following 3rd party software
|
-| (binary dependencies) that is licensed under non-Apache License 2.0. See the
|
-| following link on ASF 3rd party license compatibility:
|
-| https://www.apache.org/legal/resolved.html
|
-|
|
-+=================================================================================+
-
-+--=Google
Protobuf=----------------------------------------------------------------------------------+
-| Protocol buffers are a language-neutral, platform-neutral extensible
mechanism for |
-| serializing structured data.
|
-|
|
-| Maven dependency: com.google.protobuf:protobuf-java
|
-| BSD License: https://github.com/protocolbuffers/protobuf/blob/master/LICENSE
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=ANTLR=--------------------------------------------------------------------------------------------+
-| ANTLR (ANother Tool for Language Recognition) is a powerful parser generator
for |
-| reading, processing, executing, or translating structured text or binary
files. |
-| It's widely used to build languages, tools, and frameworks. From a grammar,
|
-| ANTLR generates a parser that can build and walk parse trees.
|
-|
|
-| Maven dependency: org.antlr:antlr4-runtime
|
-| Licensed under BSD license: https://www.antlr.org/license.html
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=jsoup=--------------------------------------------------------------------------------------------+
-| jsoup is a Java library for working with real-world HTML. It provides a very
convenient API for |
-| fetching URLs and extracting and manipulating data, using the best of HTML5
DOM methods |
-| and CSS selectors.
|
-|
|
-| Maven dependency: org.jsoup:jsoup
|
-| Licensed under MIT license: https://jsoup.org/license
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=Scala
ARM=----------------------------------------------------------------------------------------+
-| This project is an attempt to provide an Automatic-Resource-Management
library for the scala |
-| distribution. It is based off of code contributed to the Scalax project.
|
-|
|
-| Maven dependency: com.jsuereth:scala-arm_2.11
|
-| Licensed under MIT license: https://github.com/jsuereth/scala-arm
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=MChange
c3p0=-------------------------------------------------------------------------------------+
-| c3p0 is an easy-to-use library for making traditional JDBC drivers
"enterprise-ready" by |
-| augmenting them with functionality defined by the jdbc3 spec and the
optional extensions to jdbc2. |
-|
|
-| Maven dependency: com.mchange:c3p0
|
-| Licensed under EPL license: http://www.eclipse.org/legal/epl-v10.html
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=extJWNL=------------------------------------------------------------------------------------------+
-| extJWNL (Extended Java WordNet Library) is a Java API for creating, reading
and updating |
-| dictionaries in WordNet format.
|
-|
|
-| Maven dependency: net.sf.extjwnl
|
-| Licensed under BSD license: http://extjwnl.sourceforge.net/license.txt
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=SQL
Formatter=------------------------------------------------------------------------------------+
-| Java port of great SQL formatter
https://github.com/zeroturnaround/sql-formatter. |
-|
|
-| Maven dependency: com.github.vertical-blank:sql-formatter
|
-| Licensed under MIT license:
https://github.com/vertical-blank/sql-formatter/blob/master/LICENSE |
-+-----------------------------------------------------------------------------------------------------+
-
-+--=Stanford
CoreNLP=---------------------------------------------------------------------------------+
-| Stanford CoreNLP provides a set of human language technology tools. It can
give the base forms |
-| of words, their parts of speech, whether they are names of companies,
people, etc., normalize |
-| dates, times, and numeric quantities, mark up the structure of sentences in
terms of phrases |
-| and syntactic dependencies, indicate which noun phrases refer to the same
entities, |
-| indicate sentiment, extract particular or open-class relations between
entity mentions, |
-| get the quotes people said, etc.
|
-|
|
-| NOTE:
|
-| Stanford CoreNLP is an optional runtime library and NOT REQUIRED for
building or using |
-| NLPCraft. NLPCraft, however, provides runtime integration with this
popular library in |
-| case when the end-user can accept its license and decides to use it.
|
-|
|
-| Maven dependency: edu.stanford.nlp:stanford-corenlp
|
-| Licensed under GPLv3 license: https://stanfordnlp.github.io/CoreNLP/#license
|
-+-----------------------------------------------------------------------------------------------------+
diff --git a/NOTICE b/NOTICE
index 9eeebf1..ffe39a3 100644
--- a/NOTICE
+++ b/NOTICE
@@ -3,3 +3,92 @@ Copyright (C) 2020 The Apache Software Foundation
This product includes software developed at
The Apache Software Foundation (http://www.apache.org/).
+
+Protocol Buffers
+Copyright 2008 Google Inc.
+MIT License
+
+OpenCensus
+Copyright 2017, OpenCensus Authors
+ASLv2 License
+
+OpenZipkin
+Copyright 2015-2020 The OpenZipkin Authors
+ASLv2 License
+
+Jaeger Tracing
+Copyright (c) 2018, The Jaeger Authors
+Copyright (c) 2016, Uber Technologies Inc.
+ASLv2 License
+
+Apache Thrift
+Copyright (C) 2006 - 2019, The Apache Software Foundation
+ASLv2 License
+
+Prometheus
+Copyright 2012-2015 The Prometheus Authors
+Includes software developed at SoundCloud Ltd. (https://soundcloud.com/).
+ASLv2 License
+
+ANTLR
+Copyright (c) 2012 Terence Parr and Sam Harwell
+BSD License
+
+Apache Commons (and sub-projects)
+Copyright 2001-2020 The Apache Software Foundation
+ASLv2 License
+
+Apache HttpComponents Client
+Copyright 1999-2020 The Apache Software Foundation
+ASLv2 License
+
+Apache Ignite
+Copyright 2014-2020 The Apache Software Foundation
+ASLv2 License
+
+Apache Commons Logging
+Copyright 2003-2016 The Apache Software Foundation
+ASLv2 License
+
+Apache OpenNLP
+Copyright 2017 The Apache Software Foundation
+ASLv2 License
+
+Scala Logging
+Copyright 2014 Typesafe Inc.
+ASLv2 License
+
+Akka
+Copyright (C) 2009-2020 Lightbend Inc.
+ASLv2 License
+
+JSOUP
+Copyright (c) 2009-2020 Jonathan Hedley <https://jsoup.org/>
+MIT License
+
+Apache Tika
+Copyright 2015 The Apache Software Foundation
+Copyright 1993-2010 University Corporation for Atmospheric Research/Unidata
+Copyright 2005 Bytecode Pty Ltd.
+ASLv2 License
+
+Scala Test
+Copyright Artima, Inc. (http://www.artima.com/).
+ASLv2 License
+
+Lift
+Copyright 2006-2011 WorldWide Conferencing, LLC
+ASLv2 License
+
+c3p0
+Copyright (C) 2018 Machinery For Change, Inc.
+EPL License
+
+extJWNL
+Copyright (C) 2010-2018 Aliaksandr Autayeu
+Copyright (C) 2000-2007 the JWNL development team
+BSD License
+
+SQL Formatter
+Copyright (c) 2019 Yohei Yamana
+MIT License
diff --git a/LICENSE b/bindist/LICENSE
similarity index 59%
copy from LICENSE
copy to bindist/LICENSE
index 063fbf9..942d6f2 100644
--- a/LICENSE
+++ b/bindist/LICENSE
@@ -201,86 +201,85 @@
See the License for the specific language governing permissions and
limitations under the License.
-+=================================================================================+
-|
|
-| NLPCraft project uses or integrates with the following 3rd party software
|
-| (binary dependencies) that is licensed under non-Apache License 2.0. See the
|
-| following link on ASF 3rd party license compatibility:
|
-| https://www.apache.org/legal/resolved.html
|
-|
|
-+=================================================================================+
-
-+--=Google
Protobuf=----------------------------------------------------------------------------------+
-| Protocol buffers are a language-neutral, platform-neutral extensible
mechanism for |
-| serializing structured data.
|
-|
|
-| Maven dependency: com.google.protobuf:protobuf-java
|
-| BSD License: https://github.com/protocolbuffers/protobuf/blob/master/LICENSE
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=ANTLR=--------------------------------------------------------------------------------------------+
-| ANTLR (ANother Tool for Language Recognition) is a powerful parser generator
for |
-| reading, processing, executing, or translating structured text or binary
files. |
-| It's widely used to build languages, tools, and frameworks. From a grammar,
|
-| ANTLR generates a parser that can build and walk parse trees.
|
-|
|
-| Maven dependency: org.antlr:antlr4-runtime
|
-| Licensed under BSD license: https://www.antlr.org/license.html
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=jsoup=--------------------------------------------------------------------------------------------+
-| jsoup is a Java library for working with real-world HTML. It provides a very
convenient API for |
-| fetching URLs and extracting and manipulating data, using the best of HTML5
DOM methods |
-| and CSS selectors.
|
-|
|
-| Maven dependency: org.jsoup:jsoup
|
-| Licensed under MIT license: https://jsoup.org/license
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=Scala
ARM=----------------------------------------------------------------------------------------+
-| This project is an attempt to provide an Automatic-Resource-Management
library for the scala |
-| distribution. It is based off of code contributed to the Scalax project.
|
-|
|
-| Maven dependency: com.jsuereth:scala-arm_2.11
|
-| Licensed under MIT license: https://github.com/jsuereth/scala-arm
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=MChange
c3p0=-------------------------------------------------------------------------------------+
-| c3p0 is an easy-to-use library for making traditional JDBC drivers
"enterprise-ready" by |
-| augmenting them with functionality defined by the jdbc3 spec and the
optional extensions to jdbc2. |
-|
|
-| Maven dependency: com.mchange:c3p0
|
-| Licensed under EPL license: http://www.eclipse.org/legal/epl-v10.html
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=extJWNL=------------------------------------------------------------------------------------------+
-| extJWNL (Extended Java WordNet Library) is a Java API for creating, reading
and updating |
-| dictionaries in WordNet format.
|
-|
|
-| Maven dependency: net.sf.extjwnl
|
-| Licensed under BSD license: http://extjwnl.sourceforge.net/license.txt
|
-+-----------------------------------------------------------------------------------------------------+
-
-+--=SQL
Formatter=------------------------------------------------------------------------------------+
-| Java port of great SQL formatter
https://github.com/zeroturnaround/sql-formatter. |
-|
|
-| Maven dependency: com.github.vertical-blank:sql-formatter
|
-| Licensed under MIT license:
https://github.com/vertical-blank/sql-formatter/blob/master/LICENSE |
-+-----------------------------------------------------------------------------------------------------+
-
-+--=Stanford
CoreNLP=---------------------------------------------------------------------------------+
-| Stanford CoreNLP provides a set of human language technology tools. It can
give the base forms |
-| of words, their parts of speech, whether they are names of companies,
people, etc., normalize |
-| dates, times, and numeric quantities, mark up the structure of sentences in
terms of phrases |
-| and syntactic dependencies, indicate which noun phrases refer to the same
entities, |
-| indicate sentiment, extract particular or open-class relations between
entity mentions, |
-| get the quotes people said, etc.
|
-|
|
-| NOTE:
|
-| Stanford CoreNLP is an optional runtime library and NOT REQUIRED for
building or using |
-| NLPCraft. NLPCraft, however, provides runtime integration with this
popular library in |
-| case when the end-user can accept its license and decides to use it.
|
-|
|
-| Maven dependency: edu.stanford.nlp:stanford-corenlp
|
-| Licensed under GPLv3 license: https://stanfordnlp.github.io/CoreNLP/#license
|
-+-----------------------------------------------------------------------------------------------------+
+ Binary Distribution
+ ===================
+ This NLPCraft distribution bundles 3rd party binary dependencies that are
+ licensed as outlined below. See the following link on ASF 3rd party license
+ compatibility: https://www.apache.org/legal/resolved.html
+
+ Google Protobuf
+ ---------------
+ Protocol buffers are a language-neutral, platform-neutral extensible
mechanism for
+ serializing structured data.
+
+ Maven dependency: com.google.protobuf:protobuf-java
+ BSD License: https://github.com/protocolbuffers/protobuf/blob/master/LICENSE
+
+ ANTLR
+ -----
+ ANTLR (ANother Tool for Language Recognition) is a powerful parser
generator for
+ reading, processing, executing, or translating structured text or binary
files.
+ It's widely used to build languages, tools, and frameworks. From a grammar,
+ ANTLR generates a parser that can build and walk parse trees.
+
+ Maven dependency: org.antlr:antlr4-runtime
+ Licensed under BSD license: https://www.antlr.org/license.html
+
+ jsoup
+ -----
+ jsoup is a Java library for working with real-world HTML. It provides a
very convenient API for
+ fetching URLs and extracting and manipulating data, using the best of HTML5
DOM methods
+ and CSS selectors.
+
+ Maven dependency: org.jsoup:jsoup
+ Licensed under MIT license: https://jsoup.org/license
+
+ Scala ARM
+ ---------
+ This project is an attempt to provide an Automatic-Resource-Management
library for the scala
+ distribution. It is based off of code contributed to the Scalax project.
+
+ Maven dependency: com.jsuereth:scala-arm_2.11
+ Licensed under MIT license: https://github.com/jsuereth/scala-arm
+
+ MChange c3p0
+ ------------
+ c3p0 is an easy-to-use library for making traditional JDBC drivers
"enterprise-ready" by
+ augmenting them with functionality defined by the jdbc3 spec and the
optional extensions to jdbc2.
+
+ Maven dependency: com.mchange:c3p0
+ Licensed under EPL license: http://www.eclipse.org/legal/epl-v10.html
+
+ extJWNL
+ -------
+ extJWNL (Extended Java WordNet Library) is a Java API for creating, reading
and updating
+ dictionaries in WordNet format.
+
+ Maven dependency: net.sf.extjwnl
+ Licensed under BSD license: http://extjwnl.sourceforge.net/license.txt
+
+ SQL Formatter
+ -------------
+ Java port of great SQL formatter
https://github.com/zeroturnaround/sql-formatter.
+
+ Maven dependency: com.github.vertical-blank:sql-formatter
+ Licensed under MIT license:
https://github.com/vertical-blank/sql-formatter/blob/master/LICENSE
+
+ Stanford CoreNLP
+ ----------------
+ Stanford CoreNLP provides a set of human language technology tools. It can
give the base forms
+ of words, their parts of speech, whether they are names of companies,
people, etc., normalize
+ dates, times, and numeric quantities, mark up the structure of sentences in
terms of phrases
+ and syntactic dependencies, indicate which noun phrases refer to the same
entities,
+ indicate sentiment, extract particular or open-class relations between
entity mentions,
+ get the quotes people said, etc.
+
+ NOTE:
+ Stanford CoreNLP is an optional runtime library and NOT REQUIRED for
building or using
+ NLPCraft. NLPCraft, however, provides runtime integration with this
popular library in
+ case when the end-user can accept its license and decides to use it.
+
+ Maven dependency: edu.stanford.nlp:stanford-corenlp
+ Licensed under GPLv3 license: https://stanfordnlp.github.io/CoreNLP/#license
+
+
diff --git
a/src/main/scala/org/apache/nlpcraft/common/nlp/core/NCNlpPorterStemmer.scala
b/src/main/scala/org/apache/nlpcraft/common/nlp/core/NCNlpPorterStemmer.scala
index c27c6dd..c4ceea1 100644
---
a/src/main/scala/org/apache/nlpcraft/common/nlp/core/NCNlpPorterStemmer.scala
+++
b/src/main/scala/org/apache/nlpcraft/common/nlp/core/NCNlpPorterStemmer.scala
@@ -25,7 +25,7 @@ import scala.language.{implicitConversions, postfixOps}
* See http://snowball.tartarus.org/algorithms/porter/stemmer.html
* for description of the algorithm itself.
*
- * Implementation is based on work by Evgeny Kotelnikov
'[email protected]'
+ * Implementation is loosely based on work by Evgeny Kotelnikov
'[email protected]'
*/
object NCNlpPorterStemmer {
/**