[VOTE] Release DataSketches-pig 1.0.0-incubating-RC1

2019-10-15 Thread leerho
### NEW COMPONENT! ###

Hello IPMC and Apache DataSketches Community,

1. This is a call for vote to release Apache DataSketches-pig version:
 1.0.0-incubating-RC1

NOTE 1: This is the Pig adaptors component of the DataSketches library
that enables Apache Pig to access the datasketches-java core library.

  A. Results from the PPMC vote:

 - 1 (+1 binding) vote, 2 (+1 non-binding) votes

 - The vote thread can be found at the Permalink:

https://mail-archives.apache.org/mod_mbox/datasketches-dev/201910.mbox/<
ca+uapnschtp6bmafnb3g5nt_polx0ytzsdfzuef9gfaatau...@mail.gmail.com>

  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:

- Alexander Saydakov (+1 non-binding)
- Successfully ran example scripts on a Pig cluster: theta, tuple
(array of doubles), quantiles, kll, frequent strings

- Dave Fisher (+1 binding)
- Incubating in name
- Checksums and Signatures are good.
- RatCheck passes
- LICENSE good
- NOTICE good
- DISCLAIMER-WIP
- Builds and Tests pass
- have same questions about the pom.xml which is expected. We’ll
need to find the correct form and then document it.

- Jon Malkin (+1 non-binding)
- checksums and signature are correct
- LICENSE and NOTICE present
- DISCLAIMER-WIP
- Builds and tests pass with and without strict profile

2. Source repository:
- https://github.com/apache/incubator-datasketches-pig

Git Tag for this release:
-
https://github.com/apache/incubator-datasketches-pit/tree/1.0.0-incubating-RC1
on branch 1.0.X-incubating

Git HashId for this release starts with: 6ba0040

3. The Release Candidate / Dist (Zip Repository):
-
https://dist.apache.org/repos/dist/dev/incubator/datasketches/pig/1.0.0-incubating-RC1/

The public signing key can be found in the KEYS file:
- https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

The artifacts have been signed with --keyid-format SHORT : 8CD4A902

4. Repository: [Nexus](http://repository.apache.org) (Jar Artifacts):
-
https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-pig/1.0.0-incubating/

5. Build & Test Guide:
-
https://github.com/apache/incubator-datasketches-pig/blob/1.0.0-incubating-RC1/README.md

6. This vote is the 2nd stage vote:
   - This voting process will remain open for at least 72 hours
 and until at least 3 (+1) IPMC votes or a majority of (+1) are
acquired.

Please vote accordingly:

[ ] +1 approve
[ ] +0 no opinion
[ ] -1 disapprove with the reason

Thanks,
Lee Rhodes
lee...@apache.org


[ANNOUNCE] DataSketches Hive 1.0.0-incubating released!

2019-10-11 Thread leerho
### NEW COMPONENT! ###

Hello All,

1. The Apache DataSketches Hive 1.0.0-incubating has been released!

NOTE 1: This is the Hive adaptors component of the DataSketches library
that enables Apache Hive to access the datasketches-java core library.

Vote Thread Permalink:

https://mail-archives.apache.org/mod_mbox/incubator-general/201910.mbox/%3CCA%2BUaPnsdR2pZidr4geb-bn3VHtk6qRWOknST_WTtNitYCgRDoQ%40mail.gmail.com%3E
There were 4 binding (+1) votes: Furkan Kamaci, Dave Fisher, Kevin
Ratnasekera, Justin Mclean

2. Source repository:
- https://github.com/apache/incubator-datasketches-hive

Git Tag for this release: (first go to branch 1.0.X-incubating)
-
https://github.com/apache/incubator-datasketches-hive/tree/1.0.0-incubating

Git HashId for this release starts with: 5319e72

3. Dist/release:
https://dist.apache.org/repos/dist/release/incubator/datasketches/hive/1.0.0-incubating/

4. Nexus:
https://repository.apache.org/content/repositories/releases/org/apache/datasketches/datasketches-hive/1.0.0-incubating/

5. Download, build, test and documentation:

https://github.com/apache/incubator-datasketches-hive/blob/master/README.md


Thank you to the IPMC members and community for taking the time to review
and
provide guidance on our release!

On behalf of the Apache DataSketches Community,

Lee
lee...@apache.org


=
*Disclaimer*

Apache DataSketches (incubating) is an effort undergoing incubation at
The Apache Software Foundation (ASF), sponsored by the Apache Incubator
PMC. Incubation is required of all newly accepted projects until a
further review indicates that the infrastructure, communications, and
decision making process have stabilized in a manner consistent with
other successful ASF projects. While incubation status is not
necessarily a reflection of the completeness or stability of the code,
it does indicate that the project has yet to be fully endorsed by the
ASF.


Re: [VOTE] Release DataSketches-hive 1.0.0-incubating-RC1

2019-10-11 Thread leerho
Thank you for finding these.

There were 8 files where the Yahoo header was entered a little differently
and my RegEx did not capture them properly.  The Apache header was added
but the old Yahoo header was not deleted.  These are now fixed in master.

I have now searched all of our repos where we have performed the header
swap for anything with "Yahoo" and found a few more html links that were
not converted, but no other Yahoo license headers that were missed.

Thanks,
Lee.



On Thu, Oct 10, 2019 at 3:16 PM Justin Mclean 
wrote:

> Hi,
>
> +1 (binding)
>
> I checked:
> - incubating in name
> - signatures and hash fine
> - DISCLAIMER exists
> - LICENSE and NOTICE fine
> - file has ASF header
> - no unexpected binary files
> - can compile from source
>
> A number of file still have "Copyright 2015, Yahoo Inc.” I assume this
> needs to be removed.
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Release DataSketches-hive 1.0.0-incubating-RC1

2019-10-09 Thread leerho
Thank you for catching that. It will be fixed in master.

On Wed, Oct 9, 2019 at 10:00 AM Thomas Weise  wrote:

> Just a question from my side:
>
> Should NOTICE include "(incubating)" or is this no longer a requirement?
>
>
> https://github.com/apache/incubator-datasketches-hive/blob/1.1.X-incubating/NOTICE
>
> Thanks,
> Thomas
>
>
> On Tue, Oct 8, 2019 at 10:17 AM Dave Fisher  wrote:
>
> > Carrying over my +1 (binding)
> >
> > Regards,
> > Dave
> >
> > > On Oct 7, 2019, at 2:15 PM, leerho  wrote:
> > >
> > > ### NEW COMPONENT! ###
> > >
> > > Hello IPMC and Apache DataSketches Community,
> > >
> > > 1. This is a call for vote to release Apache DataSketches-hive version:
> > > 1.0.0-incubating-RC1
> > >
> > >NOTE 1: This is the Hive adaptors component of the DataSketches
> > library
> > > that enables Apache Hive to access the datasketches-java core library.
> > >
> > >  A. Results from the PPMC vote:
> > >
> > > - 3 (+1 binding) votes.
> > >
> > > - The vote thread can be found at the Permalink:
> > >
> > >
> >
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201910.mbox/%3cca+uapntlptpic78pyo6fs9mokafo-yhxo9s9yz3-qrkzjcc...@mail.gmail.com%3e
> > >
> > >  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:
> > >
> > >- Furkan Kamaci (+1 binding)
> > > - Incubating in name
> > > - DISCLAIMER exists
> > > - LICENSE and NOTICE are fine
> > > - No unexpected binary files
> > > - Checked PGP signatures
> > > - Checked Checksums
> > > - Code compiles and tests successfully run
> > >
> > > I have one objection:
> > > - src/main/javadoc/stylesheet.css file has missing Apache
> > > license header. (This file has been removed in master)
> > >
> > >- Dave Fisher (+1 binding)
> > >- I checked the same as Furkan.
> > >
> > >I am concerned that you were downloading from
> > repository.apache.org
> > > for many of the artifacts.
> > >This repos is used by apache projects to stage for release and
> > some
> > > organizations do get their
> > >IP addresses banned for over use of it. (we are still discussing
> > > with Dave to understand the issue here)
> > >
> > >- Liang Chen (+1 binding)
> > >- Incubating in name
> > >- DISCLAIMER exists
> > >- LICENSE and NOTICE are fine
> > >- No unexpected binary files
> > >- Checked checksums
> > >
> > > 2. Source repository:
> > >- https://github.com/apache/incubator-datasketches-hive
> > >
> > >Git Tag for this release:
> > >-
> > >
> >
> https://github.com/apache/incubator-datasketches-hive/tree/1.0.0-incubating-RC1
> > > on branch 1.0.X-incubating
> > >
> > >Git HashId for this release starts with: 5319e72
> > >
> > > 3. The Release Candidate / Zip Repository:
> > >-
> > >
> >
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/hive/1.0.0-incubating-RC1/
> > >
> > >The public signing key can be found in the KEYS file:
> > >-
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS
> > >
> > >The artifacts have been signed with --keyid-format SHORT : 8CD4A902
> > >
> > > 4. Repository: [Nexus](http://repository.apache.org) (Jar Artifacts):
> > >-
> > >
> >
> https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-hive/1.0.0-incubating/
> > >
> > > 5. Build & Test Guide:
> > >-
> > >
> >
> https://github.com/apache/incubator-datasketches-hive/blob/1.0.0-incubating-RC1/README.md
> > >
> > > 6. This vote is the 2nd stage vote:
> > >   - This voting process will remain open for at least 72 hours
> > > and until at least 3 (+1) IPMC votes or a majority of (+1) are
> > > acquired.
> > >
> > > Please vote accordingly:
> > >
> > >[ ] +1 approve
> > >[ ] +0 no opinion
> > >[ ] -1 disapprove with the reason
> > >
> > > Thanks,
> > > Lee Rhodes
> > > lee...@apache.org
> >
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>


configure GitBox output to post to our commits@ vs dev@

2019-10-08 Thread leerho
Hi,

DataSketches podling here,

How can we configure GitBox so it posts its output to our commits@ list and
not our dev@ list?

Thanks,

Lee.


[VOTE] Release DataSketches-hive 1.0.0-incubating-RC1

2019-10-07 Thread leerho
### NEW COMPONENT! ###

Hello IPMC and Apache DataSketches Community,

1. This is a call for vote to release Apache DataSketches-hive version:
 1.0.0-incubating-RC1

NOTE 1: This is the Hive adaptors component of the DataSketches library
that enables Apache Hive to access the datasketches-java core library.

  A. Results from the PPMC vote:

 - 3 (+1 binding) votes.

 - The vote thread can be found at the Permalink:

https://mail-archives.apache.org/mod_mbox/datasketches-dev/201910.mbox/%3cca+uapntlptpic78pyo6fs9mokafo-yhxo9s9yz3-qrkzjcc...@mail.gmail.com%3e

  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:

- Furkan Kamaci (+1 binding)
 - Incubating in name
 - DISCLAIMER exists
 - LICENSE and NOTICE are fine
 - No unexpected binary files
 - Checked PGP signatures
 - Checked Checksums
 - Code compiles and tests successfully run

 I have one objection:
 - src/main/javadoc/stylesheet.css file has missing Apache
license header. (This file has been removed in master)

- Dave Fisher (+1 binding)
- I checked the same as Furkan.

I am concerned that you were downloading from repository.apache.org
for many of the artifacts.
This repos is used by apache projects to stage for release and some
organizations do get their
IP addresses banned for over use of it. (we are still discussing
with Dave to understand the issue here)

- Liang Chen (+1 binding)
- Incubating in name
- DISCLAIMER exists
- LICENSE and NOTICE are fine
- No unexpected binary files
- Checked checksums

2. Source repository:
- https://github.com/apache/incubator-datasketches-hive

Git Tag for this release:
-
https://github.com/apache/incubator-datasketches-hive/tree/1.0.0-incubating-RC1
on branch 1.0.X-incubating

Git HashId for this release starts with: 5319e72

3. The Release Candidate / Zip Repository:
-
https://dist.apache.org/repos/dist/dev/incubator/datasketches/hive/1.0.0-incubating-RC1/

The public signing key can be found in the KEYS file:
- https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

The artifacts have been signed with --keyid-format SHORT : 8CD4A902

4. Repository: [Nexus](http://repository.apache.org) (Jar Artifacts):
-
https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-hive/1.0.0-incubating/

5. Build & Test Guide:
-
https://github.com/apache/incubator-datasketches-hive/blob/1.0.0-incubating-RC1/README.md

6. This vote is the 2nd stage vote:
   - This voting process will remain open for at least 72 hours
 and until at least 3 (+1) IPMC votes or a majority of (+1) are
acquired.

Please vote accordingly:

[ ] +1 approve
[ ] +0 no opinion
[ ] -1 disapprove with the reason

Thanks,
Lee Rhodes
lee...@apache.org


[ANNOUNCE] DataSketches Java 1.1.0-incubating released!

2019-10-03 Thread leerho
### BUG FIX!, Supercedes 1.0.0-incubating ###

Hello All,

1. The Apache DataSketches Java 1.1.0-incubating has been released!

NOTE 1: This is the core Java component of the DataSketches library
that includes all the sketch algorithms in production-ready packages. These
sketches can be called directly from this component or used in conjunction
with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
aggregator adaptors built into Apache Druid.

NOTE 2: This release contains some critical performance improvements
and bug fixes for Apache Druid.

Vote Thread Permalink:

https://mail-archives.apache.org/mod_mbox/incubator-general/201909.mbox/%3CCA%2BUaPnscE0H7M0WqSE3pD214VH_JrUhftV%2BxDb1f-HtjXAabBw%40mail.gmail.com%3E
There were four binding (+1) votes: Dave Fisher, Justin Mclean, Furkan
Kamaci, Kenneth Knowles

2. Source repository:
https://github.com/apache/incubator-datasketches-java

Git Tag for this release:

https://github.com/apache/incubator-datasketches-java/tree/1.1.0-incubating-RC1
 on branch 1.1.X-incubating

3. Dist/release:

https://dist.apache.org/repos/dist/release/incubator/datasketches/java/1.1.0-incubating/

4. Nexus:

https://repository.apache.org/content/repositories/releases/org/apache/datasketches/datasketches-java/1.1.0-incubating/

5. Download, build, test and documentation:

https://github.com/apache/incubator-datasketches-java/blob/master/README.md


Thank you to the IPMC members and community for taking the time to review
and
provide guidance on our release!

On behalf of the Apache DataSketches Community,

Lee
lee...@apache.org


=
*Disclaimer*

Apache DataSketches (incubating) is an effort undergoing incubation at
The Apache Software Foundation (ASF), sponsored by the Apache Incubator
PMC. Incubation is required of all newly accepted projects until a
further review indicates that the infrastructure, communications, and
decision making process have stabilized in a manner consistent with
other successful ASF projects. While incubation status is not
necessarily a reflection of the completeness or stability of the code,
it does indicate that the project has yet to be fully endorsed by the
ASF.


Re: [VOTE] Release DataSketches-java 1.1.0-incubating-RC1

2019-10-02 Thread leerho
Justin,

Thank you for your vote and comments.  Your suggestions for the NOTICE file
have already been incorporated in master going forward.

Please inspect it at
https://github.com/apache/incubator-datasketches-java/blob/master/NOTICE to
see if there are any other changes that you would like to make.

Lee.


On Tue, Oct 1, 2019 at 6:07 PM Justin Mclean 
wrote:

> Hi,
>
> +1 binding
>
> I checked:
> - incubating in name
> - signature and hashes correct
> - LICENSE is OK
> - NOTICE has minor issues
> - no unexpected binary files
> - all source file have ASF headers
> - can compile from source
>
> One minor thing with the NOTICE it’s not correct to say "2019 - Present”,
> copyright applies when a work is published and doesn’t last forever. I’d
> also (as suggested before on the last release) put the ASF copyright first.
>
> Also these two statement seem to possibly contradict each other:
> Copyright 2018 - Present, Verizon
> Copyright 2019 - Present, The Apache Software Foundation
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Release DataSketches-java 1.1.0-incubating-RC1

2019-10-01 Thread leerho
Thank Furkan, Dave and Kenn for your votes!  Now we have to wait for the
clock to run out.

This is a very important release as we have several other repos to release
that are dependent on this one!

Lee.

On Tue, Oct 1, 2019 at 4:32 PM Kenneth Knowles  wrote:

> +1 (binding)
>
> Carrying over mine as well.
>
> On Mon, Sep 30, 2019 at 10:42 AM Dave Fisher  wrote:
>
> > Hi -
> >
> > Carrying over my +1 (binding)
> >
> > Regards,
> > Dave
> >
> > > On Sep 30, 2019, at 10:37 AM, leerho  wrote:
> > >
> > > ### BUG FIX!, Supercedes 1.0.0-incubating-RC2 ###
> > >
> > > Hello IPMC and Apache DataSketches Community,
> > >
> > > 1. This is a call for vote to release Apache DataSketches-java version:
> > > 1.1.0-incubating-RC1
> > >
> > >NOTE 1: This is the core Java component of the DataSketches library
> > > that includes all the sketch algorithms in production-ready packages.
> > These
> > > sketches can be called directly from this component or used in
> > conjunction
> > > with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
> > > aggregator adaptors built into Apache Druid.
> > >
> > >NOTE 2: This release contains some critical performance improvements
> > > for Apache Druid and a bug fix. This also includes a new Tuple
> > > IntegerSketch and example code for tracking customer engagement.  See
> our
> > > web site for more details.
> > >
> > >  A. Results from the PPMC vote:
> > >
> > > - 4 (+1) votes. Three of the votes were Mentors (IPMC members,
> > > binding).
> > >
> > > - The vote thread can be found at the Permalink:
> > >
> > >
> >
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3cCA+UaPnsq1HCQKy=rwyXar51d6Lou=R8qjojCBFkOUJ0=vy9...@mail.gmail.com%3e
> > >
> > >  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:
> > >
> > >- Furkan Kamaci: (+1 binding)
> > >- Incubating in name
> > >- DISCLAIMER exists
> > >- LICENSE and NOTICE are fine
> > >- No unexpected binary files
> > >- Checked Checksums
> > >- Code compiles and tests successfully run
> > >
> > >- Dave Fisher: (+1 binding)
> > >incubating in name
> > >Signature and Checksum
> > >NOTICE - there needs to be a slight adjustment - see below:
> > >LICENSE - there could be better separation between each license
> > >DISCLAIMER-WIP
> > >Ran mvn build and tests
> > >
> > >Minor Notes:
> > >NOTICE should start like this:
> > >Apache DataSketches Java
> > >Copyright 2019 - The Apache Software Foundation
> > >
> > >Copyright 2015-2018 - Yahoo
> > >Copyright 2018 - Verizon Media
> > >(Note: has been changed for the next version)
> > >
> > >- Kenneth Knowles: (+1 binding)
> > >- Ran `mvn clean test -Pstrict` on the zip archive
> > >- DISCLAIMER-WIP
> > >- incubating in name / artifacts
> > >- LICENSE
> > >- NOTICE
> > >
> > >- Alex Saydakov: (+1)
> > >  - verified signature
> > >  - checked sha512 hash
> > >  - 'mvn install' successful
> > >  - tests in Druid succeeded
> > >
> > >  C. Additional Notes:
> > >- The LICENSE and NOTICE files should be very close to their final
> > form
> > > (with the exception of the minor reordering requested by Dave Fisher).
> > >
> > > 2. Source repository:
> > >- https://github.com/apache/incubator-datasketches-java
> > >
> > >Git Tag for this release:
> > >-
> > >
> >
> https://github.com/apache/incubator-datasketches-java/tree/1.1.0-incubating-RC1
> > > on branch 1.1.X-incubating
> > >
> > >Git HashId for this release starts with: 40c6f4f
> > >
> > > 3. The Release Candidate / Zip Repository:
> > >-
> > >
> >
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/java/1.1.0-incubating-RC1/
> > >
> > >The public signing key can be found in the KEYS file:
> > >-
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS
> > >
> > >The artifacts have been signed

[VOTE] Release DataSketches-java 1.1.0-incubating-RC1

2019-09-30 Thread leerho
### BUG FIX!, Supercedes 1.0.0-incubating-RC2 ###

Hello IPMC and Apache DataSketches Community,

1. This is a call for vote to release Apache DataSketches-java version:
 1.1.0-incubating-RC1

NOTE 1: This is the core Java component of the DataSketches library
that includes all the sketch algorithms in production-ready packages. These
sketches can be called directly from this component or used in conjunction
with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
aggregator adaptors built into Apache Druid.

NOTE 2: This release contains some critical performance improvements
for Apache Druid and a bug fix. This also includes a new Tuple
IntegerSketch and example code for tracking customer engagement.  See our
web site for more details.

  A. Results from the PPMC vote:

 - 4 (+1) votes. Three of the votes were Mentors (IPMC members,
binding).

 - The vote thread can be found at the Permalink:

https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3cCA+UaPnsq1HCQKy=rwyXar51d6Lou=R8qjojCBFkOUJ0=vy9...@mail.gmail.com%3e

  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:

- Furkan Kamaci: (+1 binding)
- Incubating in name
- DISCLAIMER exists
- LICENSE and NOTICE are fine
- No unexpected binary files
- Checked Checksums
- Code compiles and tests successfully run

- Dave Fisher: (+1 binding)
incubating in name
Signature and Checksum
NOTICE - there needs to be a slight adjustment - see below:
LICENSE - there could be better separation between each license
DISCLAIMER-WIP
Ran mvn build and tests

Minor Notes:
NOTICE should start like this:
Apache DataSketches Java
Copyright 2019 - The Apache Software Foundation

Copyright 2015-2018 - Yahoo
Copyright 2018 - Verizon Media
(Note: has been changed for the next version)

- Kenneth Knowles: (+1 binding)
- Ran `mvn clean test -Pstrict` on the zip archive
- DISCLAIMER-WIP
- incubating in name / artifacts
- LICENSE
- NOTICE

- Alex Saydakov: (+1)
  - verified signature
  - checked sha512 hash
  - 'mvn install' successful
  - tests in Druid succeeded

  C. Additional Notes:
- The LICENSE and NOTICE files should be very close to their final form
(with the exception of the minor reordering requested by Dave Fisher).

2. Source repository:
- https://github.com/apache/incubator-datasketches-java

Git Tag for this release:
-
https://github.com/apache/incubator-datasketches-java/tree/1.1.0-incubating-RC1
on branch 1.1.X-incubating

Git HashId for this release starts with: 40c6f4f

3. The Release Candidate / Zip Repository:
-
https://dist.apache.org/repos/dist/dev/incubator/datasketches/java/1.1.0-incubating-RC1/

The public signing key can be found in the KEYS file:
- https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

The artifacts have been signed with --keyid-format SHORT : 8CD4A902

4. Repository: Maven Central [Nexus](http://repository.apache.org) (Jar
Artifacts):
-
https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-java/1.1.0-incubating/

5. Build & Test Guide:
-
https://github.com/apache/incubator-datasketches-java/blob/1.1.0-incubating-RC1/README.md

8. This vote is the 2nd stage vote:
   - This voting process will remain open for at least 72 hours
 and until at least 3 (+1) IPMC votes or a majority of (+1) are
acquired.

Please vote accordingly:

[ ] +1 approve
[ ] +0 no opinion
[ ] -1 disapprove with the reason

Thanks,
Lee Rhodes
lee...@apache.org


[ANNOUNCE] DataSketches Java 1.0.0-incubating has been released!

2019-09-23 Thread leerho
### NEW COMPONENT! ###

Hello All,

1. The Apache DataSketches Java 1.0.0-incubating has been released!

NOTE 1: This is the core Java component of the DataSketches library
that includes all the sketch algorithms in production-ready packages. These
sketches can be called directly from this component or used in conjunction
with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
aggregator adaptors built into Apache Druid.

NOTE 2: This release contains some critical performance improvements
and bug fixes for Apache Druid.

NOTE 3: This release was based on Release Candidate RC2.

Vote Thread Permalink:
https://mail-archives.apache.org/mod_mbox/incubator-general/201909.mbox/

There were four (+1) votes: Dave Fisher, Kevin Ratnasekera, Justin
Mclean, Furkan Kamaci.

2. Source repository:
https://github.com/apache/incubator-datasketches-java

Git Tag for this release:

https://github.com/apache/incubator-datasketches-java/tree/1.0.0-incubating-RC2
 on branch 1.0.X-incubating

3. Download, build, test and documentation:
   Note: at the time of code-freze for this release not all of the build
and test information had been incorporated into the README file.
   This will be corrected with the next release. It can be found in the
GitHub master branch:

https://github.com/apache/incubator-datasketches-java/blob/master/README.md


Thank you to the IPMC members and community for taking the time to review
and
provide guidance on our release!

On behalf of the Apache DataSketches Community,

Lee
lee...@apache.org


=
*Disclaimer*

Apache DataSketches (incubating) is an effort undergoing incubation at
The Apache Software Foundation (ASF), sponsored by the Apache Incubator
PMC. Incubation is required of all newly accepted projects until a
further review indicates that the infrastructure, communications, and
decision making process have stabilized in a manner consistent with
other successful ASF projects. While incubation status is not
necessarily a reflection of the completeness or stability of the code,
it does indicate that the project has yet to be fully endorsed by the
ASF.


Re: [VOTE] Release DataSketches-java 1.0.0-incubating-RC2

2019-09-23 Thread leerho
Justin,

Thank you for the vote.

The code for this Release Candidate was frozen as of Tuesday, 17 September
at 17:54 GMT.  Since that time our team has engaged with you a number of
times to perfect our handling of the LICENSE and NOTICE files.   The
current version of how these two files will look in the next release can be
found in the GitHub master branch:

https://github.com/apache/incubator-datasketches-java.

If you still find that these new versions of the LICENSE and NOTICE files
do not meet your requirements, please let us know.

With respect to the 9 items mentioned above:

Items 1, 2, 3, are appropriately mentioned in the new version of the
LICENSE referenced above.  (They were mentioned in the RC2 version but in
the wrong place.)

Item 4 is mentioned under the Public Domain section of the new LICENSE.

Items 5, 7, 8, 9 are references to published research papers  (arxiv.org is
a repository of scientific research papers) There is no code in these
papers that was copied or even adapted in our product.  Referencing these
papers allows users to find the mathematical proofs of why these algorithms
work.

Item 6 is a reference to our own website and our own work.

Thank you for the time you have spent on this!

Lee.



On Sun, Sep 22, 2019 at 12:40 AM Justin Mclean 
wrote:

> Hi,
>
> +1 (binding)
>
> I checked:
> - incubating in name
> - hashed and signatures fine
> - DISCLAIMER exists (and is WIP version)
> - LICENSE and NOTICE need more work (se below)
> - No unexpected binary files
> - All source files have ASF headers
> - Can compile from source
>
> This code [1][2] inside [3]  is missing from LICENSE. ([2] is also
> mentioned in other files). This is also missing from LICENSE [4].
>
> How is [5] licensed? (it’s used by [6]). Similarly with [7][8][9] used by
> other pieces of code.
>
> NOTICE contains license information which should be placed in LICENSE. The
> ASF copyright in NOTICE should probably come first.
>
> There are also still some references to https://datasketches.github.io
> that probably need to be corrected.
>
> Thanks,
> Justin
>
> 1. https://github.com/OpenHFT/Zero-Allocation-Hashing
> 2. https://github.com/Cyan4973/xxHash
> 3. ./src/main/java/org/apache/datasketches/hash/XxHash.java
> 4. https://github.com/aappleby/smhasher/blob/master/src/MurmurHash3.cpp
> 5. https://arxiv.org/abs/1603.05346v1
> 6 ./src/main/java/org/apache/datasketches/kll/KllFloatsSketch.java
> 7 https://arxiv.org/abs/1708.06839
> 8 https://arxiv.org/abs/1306.3284
> 9 http://dblp.org/rec/html/journals/tods/AgarwalCHPWY13
>
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: IPMCs PLEASE VOTE!

2019-09-21 Thread leerho
Yes, they are all on the to: list on the message I sent.  Not sure what
else I am suppose to do.

On Sat, Sep 21, 2019 at 5:35 PM Justin Mclean 
wrote:

> Hi,
>
> > 72 hours have past and we still need one more IPMC +1 vote.
>
> Your mentors are IPMC members, is there any reason they all have not
> voted? Have you tried asking them?
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
> --
>From my cell phone.


IPMCs PLEASE VOTE!

2019-09-20 Thread leerho
Folks,

72 hours have past and we still need one more IPMC +1 vote.
The Vote Letter permalink:
https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3cCA+UaPnsrh6g_xs900jsc+oJffj6ouTWQWkG3pWyQO+=vqds...@mail.gmail.com%3e

So far, Dave Fisher and Furkan Kamaci have both voted (+1).


Lee.


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-19 Thread leerho
Like this:

https://github.com/apache/incubator-datasketches-java/blob/master/LICENSE


On Thu, Sep 19, 2019 at 4:16 PM leerho  wrote:

> Hi Justin,
>
> You might also want to consider taking the license text [2] and placing in
>> a text point and point to that from the license.
>
>
> I presume you mean "consider as an alternative".   This means we wouldn't
> need the actual BSD-2-Clause license text in our LICENSE, nor would we need
> the "Copyright / attribution" statement as those would be in the link.
>
> Correct?
>
> Lee.
>
>
>
> On Thu, Sep 19, 2019 at 3:37 PM Justin Mclean 
> wrote:
>
>> Hi,
>>
>> > In case you didn't get the attachments here are links to them:
>> >
>> > LICENSE
>> > <
>> https://github.com/apache/incubator-datasketches-java/blob/master/LICENSE
>> >
>>
>> If you look at the BSD license [1] you notice it includes a copyright
>> line I would include that in the correct place in the license, rather than
>> seperating it out. You might also want to consider taking the license text
>> [2] and placing in a text point and point to that from the license.
>>
>> > NOTICE
>> > <
>> https://github.com/apache/incubator-datasketches-java/blob/master/NOTICE>
>>
>> That looks good to me.
>>
>> Thanks,
>> Justin
>>
>>
>> 1. https://opensource.org/licenses/BSD-2-Clause
>> 2. https://github.com/Cyan4973/xxHash/blob/dev/LICENSE
>> -
>> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> For additional commands, e-mail: general-h...@incubator.apache.org
>>
>>


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-19 Thread leerho
Hi Justin,

You might also want to consider taking the license text [2] and placing in
> a text point and point to that from the license.


I presume you mean "consider as an alternative".   This means we wouldn't
need the actual BSD-2-Clause license text in our LICENSE, nor would we need
the "Copyright / attribution" statement as those would be in the link.

Correct?

Lee.



On Thu, Sep 19, 2019 at 3:37 PM Justin Mclean 
wrote:

> Hi,
>
> > In case you didn't get the attachments here are links to them:
> >
> > LICENSE
> > <
> https://github.com/apache/incubator-datasketches-java/blob/master/LICENSE>
>
> If you look at the BSD license [1] you notice it includes a copyright line
> I would include that in the correct place in the license, rather than
> seperating it out. You might also want to consider taking the license text
> [2] and placing in a text point and point to that from the license.
>
> > NOTICE
> > <
> https://github.com/apache/incubator-datasketches-java/blob/master/NOTICE>
>
> That looks good to me.
>
> Thanks,
> Justin
>
>
> 1. https://opensource.org/licenses/BSD-2-Clause
> 2. https://github.com/Cyan4973/xxHash/blob/dev/LICENSE
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-19 Thread leerho
In case you didn't get the attachments here are links to them:

LICENSE
<https://github.com/apache/incubator-datasketches-java/blob/master/LICENSE>

NOTICE
<https://github.com/apache/incubator-datasketches-java/blob/master/NOTICE>

On Thu, Sep 19, 2019 at 10:40 AM leerho  wrote:

> Hi Justin,
>
> My gmail account claims that I sent you and general a message with
> attachments,  but apparently they are being filtered out by Apache's
> mail server.
>
> Is this the case?
>
> Lee.
>
> On Wed, Sep 18, 2019 at 4:08 PM leerho  wrote:
>
>> Hi Justin,
>>
>> Thank you for your help, but we would like to be really clear on what you
>> have in mind in terms of proper content of the LICENSE and NOTICE files.
>>
>> Attached are the LICENSE and NOTICE from another one of our repos, but
>> the concept is the same as what Alex is trying to clarify:
>>
>> If these look ok to you, then we know exactly what we have to do.
>>
>> Thank you for your patience!
>>
>> Lee
>>
>>
>> On Wed, Sep 18, 2019 at 3:56 PM Alexander Saydakov <
>> sayda...@verizonmedia.com> wrote:
>>
>>> I am talking about the attribution: link to the original, link to our
>>> file
>>> or files where it is used. Currently this attribution is in the NOTICE.
>>> It
>>> is not duplicated in the LICENSE. Do you propose to move this attribution
>>> to the LICENSE or do you believe that it is not necessary?
>>> Thanks.
>>>
>>> On Wed, Sep 18, 2019 at 3:43 PM Justin Mclean 
>>> wrote:
>>>
>>> > Hi,
>>> >
>>> > > So do you propose to move it to the LICENSE or remove altogether?
>>> >
>>> > That information should already be part of the LICENSE file (for
>>> instance
>>> > as text of the MIT license will include the copyright line). There no
>>> need
>>> > for duplication, but if it is not it should be added there.
>>> >
>>> > Thanks,
>>> > Justin
>>> > -
>>> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>>> > For additional commands, e-mail: general-h...@incubator.apache.org
>>> >
>>> >
>>>
>>


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-19 Thread leerho
Hi Justin,

My gmail account claims that I sent you and general a message with
attachments,  but apparently they are being filtered out by Apache's
mail server.

Is this the case?

Lee.

On Wed, Sep 18, 2019 at 4:08 PM leerho  wrote:

> Hi Justin,
>
> Thank you for your help, but we would like to be really clear on what you
> have in mind in terms of proper content of the LICENSE and NOTICE files.
>
> Attached are the LICENSE and NOTICE from another one of our repos, but the
> concept is the same as what Alex is trying to clarify:
>
> If these look ok to you, then we know exactly what we have to do.
>
> Thank you for your patience!
>
> Lee
>
>
> On Wed, Sep 18, 2019 at 3:56 PM Alexander Saydakov <
> sayda...@verizonmedia.com> wrote:
>
>> I am talking about the attribution: link to the original, link to our file
>> or files where it is used. Currently this attribution is in the NOTICE. It
>> is not duplicated in the LICENSE. Do you propose to move this attribution
>> to the LICENSE or do you believe that it is not necessary?
>> Thanks.
>>
>> On Wed, Sep 18, 2019 at 3:43 PM Justin Mclean 
>> wrote:
>>
>> > Hi,
>> >
>> > > So do you propose to move it to the LICENSE or remove altogether?
>> >
>> > That information should already be part of the LICENSE file (for
>> instance
>> > as text of the MIT license will include the copyright line). There no
>> need
>> > for duplication, but if it is not it should be added there.
>> >
>> > Thanks,
>> > Justin
>> > -
>> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> > For additional commands, e-mail: general-h...@incubator.apache.org
>> >
>> >
>>
>


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-18 Thread leerho
Hi Justin,

Thank you for your help, but we would like to be really clear on what you
have in mind in terms of proper content of the LICENSE and NOTICE files.

Attached are the LICENSE and NOTICE from another one of our repos, but the
concept is the same as what Alex is trying to clarify:

If these look ok to you, then we know exactly what we have to do.

Thank you for your patience!

Lee


On Wed, Sep 18, 2019 at 3:56 PM Alexander Saydakov <
sayda...@verizonmedia.com> wrote:

> I am talking about the attribution: link to the original, link to our file
> or files where it is used. Currently this attribution is in the NOTICE. It
> is not duplicated in the LICENSE. Do you propose to move this attribution
> to the LICENSE or do you believe that it is not necessary?
> Thanks.
>
> On Wed, Sep 18, 2019 at 3:43 PM Justin Mclean 
> wrote:
>
> > Hi,
> >
> > > So do you propose to move it to the LICENSE or remove altogether?
> >
> > That information should already be part of the LICENSE file (for instance
> > as text of the MIT license will include the copyright line). There no
> need
> > for duplication, but if it is not it should be added there.
> >
> > Thanks,
> > Justin
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Re: [VOTE] Release DataSketches-java 1.0.0-incubating-RC2

2019-09-17 Thread leerho
Dave,

Is this what you are looking for?

Lee.

On Tue, Sep 17, 2019 at 1:05 PM Dave Fisher  wrote:

>
>
> > On Sep 17, 2019, at 12:47 PM, leerho  wrote:
> >
> > Dave,
> >
> > Thanks for your vote!
> >
> > Please elaborate on:
> >
> >> I had to “guess” at the build instruction - `mvn clean install`
> >
> >
> > What additional instructions should I add ?
>
> The README.md should explicitly mention `mvn clean install` along with
> some context and also list the resulting target jars that are created and
> what is in each. I should not have to depend on navigating an MD file.
>
> An Alternative would be a BUILD.txt or BUILD.md.
>
> >
> > Also, we have been corresponding with Justin on the structure of the
> > LICENSE and NOTICE files and will get this straightened out with the next
> > release :)
>
> Yup.
>
> Regards,
> Dave
>
> >
> > Lee.
> >
> > On Tue, Sep 17, 2019 at 11:40 AM Dave Fisher  wrote:
> >
> >> +1 (binding)
> >>
> >> - Incubating in name
> >> - DISCLAIMER exists
> >> - LICENSE and NOTICE are fine
> >> - No unexpected binary files
> >> - Checked PGP signatures
> >> - Checked Checksums
> >> - Code compiles and tests successfully run
> >>
> >> I had to “guess” at the build instruction - `mvn clean install`
> >>
> >> I suppose that Justin may have comments about the NOTICE, but I’m OK
> with
> >> it on this release.
> >>
> >> Regards,
> >> Dave
> >>
> >>
> >>> On Sep 17, 2019, at 10:54 AM, leerho  wrote:
> >>>
> >>> ### NEW COMPONENT! ###
> >>>
> >>> Hello Apache DataSketches IPMC and Community,
> >>>
> >>> 1. This is a call for vote to release Apache DataSketches-java version:
> >>> 1.0.0-incubating-RC2
> >>>
> >>>   NOTE 1: This is the core Java component of the DataSketches library
> >>> that includes all the sketch algorithms in production-ready packages.
> >> These
> >>> sketches can be called directly from this component or used in
> >> conjunction
> >>> with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
> >>> aggregator adaptors built into Apache Druid.
> >>>
> >>>   NOTE 2: This release contains some critical performance improvements
> >>> and bug fixes for Apache Druid.
> >>>
> >>>   NOTE 3: Changes from RC1: The previous RC1 had a bug in the release
> >>> script which caused the SHA512sum to fail. Some very minor updates to
> the
> >>> LICENSE, NOTICE, and pom.xml files. Otherwise no code changes.
> >>>
> >>> A. Results from the PPMC vote:
> >>>
> >>>- 3 (+1) votes. One of the votes was a Mentor.
> >>>
> >>>- The vote thread can be found at the Permalink:
> >>>
> >>>
> >>
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3CCA%2BUaPnsfS9YgXREXixxvGvKbF_aLbMTPN1hTjq1ewfa3k9JBJw%40mail.gmail.com%3E
> >>>
> >>> B. Testing/actions/votes performed by the voters in the PPMC Stage 1:
> >>>
> >>>   - Alex Saydakov: (+1)
> >>> Verified signature and checksum.
> >>> The package looks fine. Licenses are present.
> >>> Unit tests run successfully.
> >>>
> >>>   - Jon Malkin: (+1)
> >>> Verified signature and checksum
> >>> Licenses/notices present, although I still need to learn details of
> >>> what
> >>> they're supposed to include so I just checked for non-empty files
> >>> Successfully compiled and ran unit tests.
> >>>
> >>>   - Furkan Kamaci: (+1)
> >>> - Incubating in name
> >>> - DISCLAIMER exists
> >>> - LICENSE and NOTICE are fine
> >>> - No unexpected binary files
> >>> - Checked PGP signatures
> >>> - Checked Checksums
> >>> - Code compiles and tests successfully run
> >>>
> >>> Minor things:
> >>>
> >>> 1) You can update README file at GitHub both for how to compile it
> >> and
> >>> discussion & support. i.e.: https://github.com/apache/lucene-solr
> >>> 2) Seems that project fails to run tests when folder name has space
> >>> character.  (See Note below 

Re: [VOTE] Release DataSketches-java 1.0.0-incubating-RC2

2019-09-17 Thread leerho
Dave,

Thanks for your vote!

Please elaborate on:

> I had to “guess” at the build instruction - `mvn clean install`


What additional instructions should I add ?

Also, we have been corresponding with Justin on the structure of the
LICENSE and NOTICE files and will get this straightened out with the next
release :)

Lee.

On Tue, Sep 17, 2019 at 11:40 AM Dave Fisher  wrote:

> +1 (binding)
>
>  - Incubating in name
>  - DISCLAIMER exists
>  - LICENSE and NOTICE are fine
>  - No unexpected binary files
>  - Checked PGP signatures
>  - Checked Checksums
>  - Code compiles and tests successfully run
>
> I had to “guess” at the build instruction - `mvn clean install`
>
> I suppose that Justin may have comments about the NOTICE, but I’m OK with
> it on this release.
>
> Regards,
> Dave
>
>
> > On Sep 17, 2019, at 10:54 AM, leerho  wrote:
> >
> > ### NEW COMPONENT! ###
> >
> > Hello Apache DataSketches IPMC and Community,
> >
> > 1. This is a call for vote to release Apache DataSketches-java version:
> > 1.0.0-incubating-RC2
> >
> >NOTE 1: This is the core Java component of the DataSketches library
> > that includes all the sketch algorithms in production-ready packages.
> These
> > sketches can be called directly from this component or used in
> conjunction
> > with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
> > aggregator adaptors built into Apache Druid.
> >
> >NOTE 2: This release contains some critical performance improvements
> > and bug fixes for Apache Druid.
> >
> >NOTE 3: Changes from RC1: The previous RC1 had a bug in the release
> > script which caused the SHA512sum to fail. Some very minor updates to the
> > LICENSE, NOTICE, and pom.xml files. Otherwise no code changes.
> >
> >  A. Results from the PPMC vote:
> >
> > - 3 (+1) votes. One of the votes was a Mentor.
> >
> > - The vote thread can be found at the Permalink:
> >
> >
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3CCA%2BUaPnsfS9YgXREXixxvGvKbF_aLbMTPN1hTjq1ewfa3k9JBJw%40mail.gmail.com%3E
> >
> >  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:
> >
> >- Alex Saydakov: (+1)
> >  Verified signature and checksum.
> >  The package looks fine. Licenses are present.
> >  Unit tests run successfully.
> >
> >- Jon Malkin: (+1)
> >  Verified signature and checksum
> >  Licenses/notices present, although I still need to learn details of
> > what
> >  they're supposed to include so I just checked for non-empty files
> >  Successfully compiled and ran unit tests.
> >
> >- Furkan Kamaci: (+1)
> >  - Incubating in name
> >  - DISCLAIMER exists
> >  - LICENSE and NOTICE are fine
> >  - No unexpected binary files
> >  - Checked PGP signatures
> >  - Checked Checksums
> >  - Code compiles and tests successfully run
> >
> >  Minor things:
> >
> >  1) You can update README file at GitHub both for how to compile it
> and
> >  discussion & support. i.e.: https://github.com/apache/lucene-solr
> >  2) Seems that project fails to run tests when folder name has space
> >  character.  (See Note below under Build and Test Guide)
> >
> >  C. Additional Notes:
> >- The LICENSE and NOTICE files may not be in their final form. All the
> > information is present, but some of the information in the NOTICE file
> > needs to be moved to the LICENSE file. This is the result of continued
> > discussions with Justin Mclean.
> >
> > 2. The release candidate:
> >-
> >
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/java/1.0.0-incubating-RC2/
> >
> > 3. Source repository:
> >- https://github.com/apache/incubator-datasketches-java
> >
> >Git Tag for this release:
> >-
> >
> https://github.com/apache/incubator-datasketches-java/tree/1.0.0-incubating-RC2
> > on branch 1.0.X-incubating
> >
> >Git HashId for this release starts with: f8abbbe
> >
> >The artifacts have been signed with --keyid-format SHORT : 8CD4A902
> >
> > 4. Repository: dist.apache.org repository:
> >
> >The public signing key can be found in the KEYS file:
> >- https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS
> >
> > 5. Repository: Maven Central (repository.apache.org):
> >
> >The Jar artifacts can be found at
> >-
>

[VOTE] Release DataSketches-java 1.0.0-incubating-RC2

2019-09-17 Thread leerho
### NEW COMPONENT! ###

Hello Apache DataSketches IPMC and Community,

1. This is a call for vote to release Apache DataSketches-java version:
 1.0.0-incubating-RC2

NOTE 1: This is the core Java component of the DataSketches library
that includes all the sketch algorithms in production-ready packages. These
sketches can be called directly from this component or used in conjunction
with the adaptor components such as Hadoop Pig, Hadoop Hive, or the
aggregator adaptors built into Apache Druid.

NOTE 2: This release contains some critical performance improvements
and bug fixes for Apache Druid.

NOTE 3: Changes from RC1: The previous RC1 had a bug in the release
script which caused the SHA512sum to fail. Some very minor updates to the
LICENSE, NOTICE, and pom.xml files. Otherwise no code changes.

  A. Results from the PPMC vote:

 - 3 (+1) votes. One of the votes was a Mentor.

 - The vote thread can be found at the Permalink:

https://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/%3CCA%2BUaPnsfS9YgXREXixxvGvKbF_aLbMTPN1hTjq1ewfa3k9JBJw%40mail.gmail.com%3E

  B. Testing/actions/votes performed by the voters in the PPMC Stage 1:

- Alex Saydakov: (+1)
  Verified signature and checksum.
  The package looks fine. Licenses are present.
  Unit tests run successfully.

- Jon Malkin: (+1)
  Verified signature and checksum
  Licenses/notices present, although I still need to learn details of
what
  they're supposed to include so I just checked for non-empty files
  Successfully compiled and ran unit tests.

- Furkan Kamaci: (+1)
  - Incubating in name
  - DISCLAIMER exists
  - LICENSE and NOTICE are fine
  - No unexpected binary files
  - Checked PGP signatures
  - Checked Checksums
  - Code compiles and tests successfully run

  Minor things:

  1) You can update README file at GitHub both for how to compile it and
  discussion & support. i.e.: https://github.com/apache/lucene-solr
  2) Seems that project fails to run tests when folder name has space
  character.  (See Note below under Build and Test Guide)

  C. Additional Notes:
- The LICENSE and NOTICE files may not be in their final form. All the
information is present, but some of the information in the NOTICE file
needs to be moved to the LICENSE file. This is the result of continued
discussions with Justin Mclean.

2. The release candidate:
-
https://dist.apache.org/repos/dist/dev/incubator/datasketches/java/1.0.0-incubating-RC2/

3. Source repository:
- https://github.com/apache/incubator-datasketches-java

Git Tag for this release:
-
https://github.com/apache/incubator-datasketches-java/tree/1.0.0-incubating-RC2
on branch 1.0.X-incubating

Git HashId for this release starts with: f8abbbe

The artifacts have been signed with --keyid-format SHORT : 8CD4A902

4. Repository: dist.apache.org repository:

The public signing key can be found in the KEYS file:
- https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

5. Repository: Maven Central (repository.apache.org):

The Jar artifacts can be found at
-
https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-java/1.0.0-incubating/

6. Build & Test Guide: (assuming you have Maven)

The DataSketches-java component is pure Java and is structured as a
Maven project.  You must compile using JDK 8. There is one run-time
dependency of the DataSketches-memory component (recently released), and,
of course, a number of test and Maven plugin dependencies, all of which can
be resolved by Maven.

NOTE: The directory elements of the full absolute path of the target
directory for your unzipped archive must qualify as Java identifiers.  In
other words,
the directory elements must not have any space characters (or non-Java
identifier characters) in any of the path elements.
This is actually required by the Oracle Java Specification in order to
ensure location-independent access to resources:
  (
https://docs.oracle.com/javase/8/docs/technotes/guides/lang/resources.html).
This component does access resource files for testing. The next release
will have a fix that should be tolerant of spaces at least, but this note
should remain.

There are two types of tests: normal unit tests and tests run by the
strict profile.  To run normal unit tests:

  $ mvn clean test

To run the strict profile tests:

  $ mvn clean test -P strict

To install jars in your local .m2 repository:

  $ mvn clean install -DskipTests=true

7. Documentation: The documentation for the DataSketches Java component is
part of the website.

The Overview section on the website has extensive documentation on all
the sketches:
- https://datasketches.github.io

Javadocs:
- https://datasketches.github.io/api/core/snapshot/apidocs/index.html

8. This vote is the 2nd stage vote:
   - This voting 

Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-16 Thread leerho
Thank you Justin for your clarification!

Thank you Dave for your vote!

Lee.

On Mon, Sep 16, 2019 at 4:56 PM Dave Fisher  wrote:

> +1 (binding)
>
> I checked:
> - incubating in name
> - signatures and hashes file
> - DISCLAIMER exists (WIP version)
> - LICENSE present - note it is being fixed for next version
> - NOTICE present - being fixed for next release.
> - no unexpected binaries
> - All source files have ASF headers with noted exceptions.
> - Can compile on OSX (Mojave) and all tests pass (if one follows the
> directions and uses the cmake variant.
>
> Regards,
> Dave
>
> > On Sep 9, 2019, at 12:05 PM, Alexander Saydakov <
> sayda...@verizonmedia.com> wrote:
> >
> > Dear IPMC and Community,
> >
> > This is a call for vote to release Apache DataSketches-cpp version
> > 1.0.0-incubating
> >
> > This is a C++ version of the DataSketches core library with Python
> bindings.
> >
> > This is the first release of this component.
> >
> > We are not going to release any artifacts besides the source code.
> >
> > Results from the PPMC vote:
> > 3 votes were cast. All of the votes were (+1). One of the votes was from
> a
> > mentor.
> >
> http://mail-archives.apache.org/mod_mbox/datasketches-dev/201909.mbox/thread
> >
> > The release candidate:
> >
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/cpp/1.0.0-incubating-rc2/
> >
> > Source repository:
> > https://github.com/apache/incubator-datasketches-cpp
> >
> > Git tag for this release:
> >
> https://github.com/apache/incubator-datasketches-cpp/releases/tag/1.0.0-incubating-rc2
> >
> > The artifact has been signed with key:
> > pub  4096R/A8E8B4E0 2019-08-20 Alexander Saydakov (CODE SIGNING KEY) <
> > al...@apache.org>
> > Fingerprint=40A2 47F6 1E8C 6C75 678C  69C5 6848 6114 A8E8 B4E0
> >
> > The public signing key can be found in the KEYS file:
> > https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS
> >
> > Changes from the RC1:
> > - added license to make files
> > - building and running unit tests does not require pybind11 (dependency
> for
> > Python bindings)
> > - better build instructions
> >
> > This library was intended to be header-only, but this goal was not fully
> > achieved yet with CPC sketch code. This work is in progress.
> >
> > Building and running unit tests requires CppUnit.
> >
> > Installing CppUnit on OSX: brew install cppunit
> > Installing CppUnit on RHEL: yum install cppunit-devel
> >
> > There are currently two ways of building: using existing make files and
> > generating make files using cmake. Exsisting make files might not work on
> > all platforms or with all C++ compilers. Generating make files using
> cmake
> > should solve this problem, but it currently requires cmake version 3.12.0
> > or higher that might not be readily available as a package on all
> platforms.
> >
> > Installing the latest cmake on OSX: brew install cmake
> >
> > Building and running unit tests using existing make files:
> > $ make
> > $ make test
> >
> > Building and running unit tests using cmake:
> > $ mkdir build
> > $ cd build
> > $ cmake ..
> > $ make
> > $ make test
> >
> > Building Python bindings requires cmake 3.12 or higher and pybind11.
> > Please see the following document for details:
> >
> https://github.com/apache/incubator-datasketches-cpp/blob/1.0.0-incubating-rc2/python/README.md
> >
> > This is the second stage vote. The vote will remain open for at least 72
> > hours and at least 3 (+1) IPMC votes or a majority of (+1) are acquired.
> >
> > Please vote accordingly:
> > [ ] +1 approve
> > [ ] +0 no opinion
> > [ ] -1 disapprove with the reason
> >
> > Thanks,
> > Alexander Saydakov
> > al...@apache.org
>
>
> -
> To unsubscribe, e-mail: dev-unsubscr...@datasketches.apache.org
> For additional commands, e-mail: dev-h...@datasketches.apache.org
>
>


Re: [VOTE] Release Apache DataSketches-cpp (incubating) 1.0.0 [RC2]

2019-09-16 Thread leerho
Hi Justin,

We are still a bit uncertain what you mean by "license information" in the
NOTICE file.

Are you suggesting to remove the two single lines:
Copyright (C) 1998-2019 CERN for the benefit of the LHCb and ATLAS
collaborations
and
Copyright (c) 2017 Benjamin R. Jack ?

The rest of the content in this file (after the first 8 lines), is
organized by license type and the text under each license type
is to help the user locate where in our release code we leverage code of
that license type and where the original source code is.  I don't consider
this location information as "license information".  It is just helpful
location information to the relevant code.

Thanks,

Lee.


On Mon, Sep 16, 2019 at 1:29 PM Justin Mclean 
wrote:

> Hi,
>
> > I would like to clarify a few things:
> > - As I understand it is fine to deal with this going forward in the
> master,
>
> Yep that fine, you have my +1 on this release.
>
> > and we can proceed with this vote.
> > - You propose to add one line "Copyright (c) 2017 Benjamin R. Jack" right
> > after the header "MIT License" in the LICENSE file
>
> Yes, you need the full license text as given in the project, that includes
> the copyright line.
>
> > - You propose to remove something from the NOTICE file. What exactly?
>
> The information about 3rd party license information goes in the LICENSE
> file not NOTICE. Those links I provided in my previous email should help.
>
> > Headers with license types? Is this a legal concern or just to keep this
> > file as short as possible?
>
> It’s ASF policy. Yes it’s to keep the NOTICE file short as possible as it
> has an affect on downstream projects.
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


[ANNOUNCE] Apache DataSketches Memory 1.1.0-incubating has been released!

2019-08-25 Thread leerho
Hello All

1. The Apache DataSketches Memory 1.1.0-incubating has been released.

NOTE 1: This is one component of the DataSketches library that provides
high-performance access to off-heap memory for Java applications. These are
low-level functions that are used by other DataSketches components. It does
not provide any command-line interface or GUI.

NOTE 2: This release contains some critical performance improvements
for Apache Druid.

NOTE 3: This particular component has been split out from the rest of
the DataSketches code because there have been a number of requests from
platform projects that are interested in these high-performance functions
independently of the remainder of the DataSketches library.

Vote thread Permalink:

https://lists.apache.org/thread.html/7ccc79054893a47bb1b24069736a43dfe7bc1af75c2a5ac0e03bc58b@%3Cgeneral.incubator.apache.org%3E

2. Download Links:

Dist:
https://dist.apache.org/repos/dist/release/incubator/datasketches/memory/1.1.0-incubating/
Nexus:
https://repository.apache.org/content/repositories/releases/org/apache/datasketches/datasketches-memory/1.1.0-incubating/

3. Source repository:
https://github.com/apache/incubator-datasketches-memory

Git Tag for this release:

https://github.com/apache/incubator-datasketches-memory/tree/1.1.0-incubating-RC2
on branch 1.1.X-incubating

4. Build & Test Guide:

The DataSketches-memory component is pure Java and is structured as a
Maven project.  You must compile using JDK 8. There is one run-time
dependency for logging and, of course, a number of test and Maven plugin
dependencies, all of which can be resolved by Maven.

There are two types of tests: normal unit tests and tests run by the
strict profile.  To run normal unit tests:

$ mvn clean test

To run the strict profile tests:

$ mvn clean test -P strict

To install jars in your local .m2 repository:

$ mvn clean install

5. Documentation: The documentation for the DataSketches Memory component
is part of the website.

Overview documentation:
- https://datasketches.github.io/docs/Memory/MemoryPackage.html
- https://datasketches.github.io/docs/Memory/MemoryPerformance.html

Javadocs:
- https://datasketches.github.io/api/memory/snapshot/apidocs/index.html


6. DataSketches Resources:
- Issues for datasketches-memory:
https://github.com/apache/incubator-datasketches-memory/issues
- Mailing list: d...@datasketches.apache.org


Thank you to the IPMC members and community for taking the time to review
and
provide guidance on our release!

On behalf of the Apache DataSketches Community,

Lee
lee...@apache.org

=
*Disclaimer*

Apache DataSketches (incubating) is an effort undergoing incubation at
The Apache Software Foundation (ASF), sponsored by the Apache Incubator
PMC. Incubation is required of all newly accepted projects until a
further review indicates that the infrastructure, communications, and
decision making process have stabilized in a manner consistent with
other successful ASF projects. While incubation status is not
necessarily a reflection of the completeness or stability of the code,
it does indicate that the project has yet to be fully endorsed by the
ASF.


[RESULT][VOTE] Release Apache DataSketches Memory 1.1.0-incubating-RC2

2019-08-25 Thread leerho
RE TAG:
https://github.com/apache/incubator-datasketches-memory/tree/1.1.0-incubating-RC2

Hello IPMC and Community,

The vote to release the above has passed with 3 (+1) binding votes:
  Furkan Kamaci
  Justin Mclean
  Kenneth Knowles

and 2 non-binding (+1) votes:
  Roman Leventov
  William Guo

Vote thread Permalink:
https://lists.apache.org/thread.html/7ccc79054893a47bb1b24069736a43dfe7bc1af75c2a5ac0e03bc58b@%3Cgeneral.incubator.apache.org%3E

Thank you to the above IPMC members for taking the time to review and
provide guidance on our release!

We will proceed with publishing the approved artifacts and sending out
the appropriate announcements in the coming days.

On behalf of the Apache DataSketches Community,

Lee
lee...@apache.org


Re: [VOTE] Release 1.1.0-incubating-RC2

2019-08-21 Thread leerho
RE: VERIFY VOTE

Folks,

My apologies, there was an error in the Vote Letter whereby the top of the
letter clearly requests a vote on RC2, but down below, the link to the
Release Candidate and the GitHub Tag were actually pointing to RC1.
Everything else appears to be correct.

Please verify that it was RC2 that you examined and voted on.

Thank you!

Lee.



On Wed, Aug 21, 2019 at 8:09 PM Justin Mclean 
wrote:

> Hi,
>
> > &^%$%^&  You are right.  That is a terrible copy/paste error on my part.
> > However, the top of the vote letter clearly calls out RC2.  Are you
> > changing your vote to -1?  If so I will correct the vote letter and we
> will
> > have to start over.
>
> I think it’s clear enough that it was RC2 that was being released and that
> what I looked at. Perhaps just confirm that any other IPMC votes were also
> on RC2?
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] Release 1.1.0-incubating-RC2

2019-08-21 Thread leerho
&^%$%^&  You are right.  That is a terrible copy/paste error on my part.
However, the top of the vote letter clearly calls out RC2.  Are you
changing your vote to -1?  If so I will correct the vote letter and we will
have to start over.



On Wed, Aug 21, 2019 at 7:08 PM Kenneth Knowles  wrote:

> -1 because I did in fact download RC1. I had not noticed, but your call for
> this vote on general@ links to RC1 artifacts. So the archive of this
> thread
> will be confusing if some votes were cast after verifying RC1 while others
> are corrected.
>
> Kenn
>
> On Wed, Aug 21, 2019 at 5:20 PM leerho  wrote:
>
> > Just a guess, did you happen to download RC1 by mistake?
> >
> > Lee.
> >
> > On Wed, Aug 21, 2019 at 5:09 PM leerho  wrote:
> >
> > > Kenn,
> > >
> > > I am puzzled by
> > >
> > > I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is
> looking
> > >> for a .git directory.
> > >
> > >
> > > The git-commit-id plugin is in a separate "nexus-jars" profile which
> must
> > > be called from the command line.
> > > it is specifically placed in the separate profile so that it will not
> be
> > > called from the main Maven build lifecycle.
> > >
> > > If you are running just "mvn test" or even "mvn install" that plugin
> > > should not be called.
> > >
> > > I don't have that problem.
> > >
> > > Lee.
> > >
> > >
> > >
> > > On Wed, Aug 21, 2019 at 11:57 AM Kenneth Knowles 
> > wrote:
> > >
> > >> +1
> > >>
> > >> Triple-checked: LICENSE, DISCLAIMER, license headers, mvn test
> > >>
> > >> I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is
> > looking
> > >> for a .git directory.
> > >>
> > >> On Tue, Aug 20, 2019 at 9:04 PM Justin Mclean <
> jus...@classsoftware.com
> > >
> > >> wrote:
> > >>
> > >> > Hi,
> > >> >
> > >> > +1 (binding)
> > >> >
> > >> > I checked:
> > >> > - incubating in name
> > >> > - signatures and hashes fine
> > >> > - DISCLAIMER exists and uses the WIP text. you also might want to
> fill
> > >> in
> > >> > #Podling-Name#
> > >> > - LICENSE is OK. Do you know what is missing?
> > >> > - NOTICE is fine
> > >> > - NO binary files in release
> > >> > - All source files have ASF header
> > >> > - Can compile from source
> > >> >
> > >> > There’s some very minor issues:
> > >> > - I assume that some of the files may have an incorrect header?
> > >> > [1][2][3][4][6][7][8]. 3rd party headers should probably not be
> > replaced
> > >> > with ASF ones [9] unless they have been extensively modified. This
> > shod
> > >> be
> > >> > mentioned in the DISCLAIMER.
> > >> > - In LICENSE there is probably no need to mention the java files
> that
> > >> use
> > >> > the Gettysburg address.
> > >> > - "lee...@users.noreply.github.com” is probably not the best email
> to
> > >> > sign the release with, please use an apache one in future.
> > >> > - It’s nicer if the source unzips into a directory
> > >> >
> > >> > Thanks,
> > >> > Justin
> > >> >
> > >> > 1.
> src/main/java/org/apache/datasketches/memory/AccessByteBuffer.java
> > >> > 2. src/main/java/org/apache/datasketches/memory/XxHash64.java
> > >> > 3. src/test/java/org/apache/datasketches/memory/XxHash64Test.java
> > >> > 4.
> > src/test/java/org/apache/datasketches/memory/XxHash64LoopingTest.java
> > >> > 5. src/main/java/org/apache/datasketches/memory/Utf8.java
> > >> > 6. src/test/java/org/apache/datasketches/memory/Utf8Test.java
> > >> > 7
> > src/test/java/org/apache/datasketches/memory/IsValidUtf8TestUtil.java
> > >> > 8 src/main/java/org/apache/datasketches/memory/XxHash64.java
> > >> > 9 https://www.apache.org/legal/src-headers.html#3party
> > >> >
> -
> > >> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > >> > For additional commands, e-mail: general-h...@incubator.apache.org
> > >> >
> > >> >
> > >>
> > >
> >
>


Re: [VOTE] Release 1.1.0-incubating-RC2

2019-08-21 Thread leerho
Just a guess, did you happen to download RC1 by mistake?

Lee.

On Wed, Aug 21, 2019 at 5:09 PM leerho  wrote:

> Kenn,
>
> I am puzzled by
>
> I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is looking
>> for a .git directory.
>
>
> The git-commit-id plugin is in a separate "nexus-jars" profile which must
> be called from the command line.
> it is specifically placed in the separate profile so that it will not be
> called from the main Maven build lifecycle.
>
> If you are running just "mvn test" or even "mvn install" that plugin
> should not be called.
>
> I don't have that problem.
>
> Lee.
>
>
>
> On Wed, Aug 21, 2019 at 11:57 AM Kenneth Knowles  wrote:
>
>> +1
>>
>> Triple-checked: LICENSE, DISCLAIMER, license headers, mvn test
>>
>> I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is looking
>> for a .git directory.
>>
>> On Tue, Aug 20, 2019 at 9:04 PM Justin Mclean 
>> wrote:
>>
>> > Hi,
>> >
>> > +1 (binding)
>> >
>> > I checked:
>> > - incubating in name
>> > - signatures and hashes fine
>> > - DISCLAIMER exists and uses the WIP text. you also might want to fill
>> in
>> > #Podling-Name#
>> > - LICENSE is OK. Do you know what is missing?
>> > - NOTICE is fine
>> > - NO binary files in release
>> > - All source files have ASF header
>> > - Can compile from source
>> >
>> > There’s some very minor issues:
>> > - I assume that some of the files may have an incorrect header?
>> > [1][2][3][4][6][7][8]. 3rd party headers should probably not be replaced
>> > with ASF ones [9] unless they have been extensively modified. This shod
>> be
>> > mentioned in the DISCLAIMER.
>> > - In LICENSE there is probably no need to mention the java files that
>> use
>> > the Gettysburg address.
>> > - "lee...@users.noreply.github.com” is probably not the best email to
>> > sign the release with, please use an apache one in future.
>> > - It’s nicer if the source unzips into a directory
>> >
>> > Thanks,
>> > Justin
>> >
>> > 1. src/main/java/org/apache/datasketches/memory/AccessByteBuffer.java
>> > 2. src/main/java/org/apache/datasketches/memory/XxHash64.java
>> > 3. src/test/java/org/apache/datasketches/memory/XxHash64Test.java
>> > 4. src/test/java/org/apache/datasketches/memory/XxHash64LoopingTest.java
>> > 5. src/main/java/org/apache/datasketches/memory/Utf8.java
>> > 6. src/test/java/org/apache/datasketches/memory/Utf8Test.java
>> > 7 src/test/java/org/apache/datasketches/memory/IsValidUtf8TestUtil.java
>> > 8 src/main/java/org/apache/datasketches/memory/XxHash64.java
>> > 9 https://www.apache.org/legal/src-headers.html#3party
>> > -
>> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
>> > For additional commands, e-mail: general-h...@incubator.apache.org
>> >
>> >
>>
>


Re: [VOTE] Release 1.1.0-incubating-RC2

2019-08-21 Thread leerho
Kenn,

I am puzzled by

I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is looking
> for a .git directory.


The git-commit-id plugin is in a separate "nexus-jars" profile which must
be called from the command line.
it is specifically placed in the separate profile so that it will not be
called from the main Maven build lifecycle.

If you are running just "mvn test" or even "mvn install" that plugin should
not be called.

I don't have that problem.

Lee.



On Wed, Aug 21, 2019 at 11:57 AM Kenneth Knowles  wrote:

> +1
>
> Triple-checked: LICENSE, DISCLAIMER, license headers, mvn test
>
> I did have to run `mvn -Dmaven.gitcommitid.skip test` since it is looking
> for a .git directory.
>
> On Tue, Aug 20, 2019 at 9:04 PM Justin Mclean 
> wrote:
>
> > Hi,
> >
> > +1 (binding)
> >
> > I checked:
> > - incubating in name
> > - signatures and hashes fine
> > - DISCLAIMER exists and uses the WIP text. you also might want to fill in
> > #Podling-Name#
> > - LICENSE is OK. Do you know what is missing?
> > - NOTICE is fine
> > - NO binary files in release
> > - All source files have ASF header
> > - Can compile from source
> >
> > There’s some very minor issues:
> > - I assume that some of the files may have an incorrect header?
> > [1][2][3][4][6][7][8]. 3rd party headers should probably not be replaced
> > with ASF ones [9] unless they have been extensively modified. This shod
> be
> > mentioned in the DISCLAIMER.
> > - In LICENSE there is probably no need to mention the java files that use
> > the Gettysburg address.
> > - "lee...@users.noreply.github.com” is probably not the best email to
> > sign the release with, please use an apache one in future.
> > - It’s nicer if the source unzips into a directory
> >
> > Thanks,
> > Justin
> >
> > 1. src/main/java/org/apache/datasketches/memory/AccessByteBuffer.java
> > 2. src/main/java/org/apache/datasketches/memory/XxHash64.java
> > 3. src/test/java/org/apache/datasketches/memory/XxHash64Test.java
> > 4. src/test/java/org/apache/datasketches/memory/XxHash64LoopingTest.java
> > 5. src/main/java/org/apache/datasketches/memory/Utf8.java
> > 6. src/test/java/org/apache/datasketches/memory/Utf8Test.java
> > 7 src/test/java/org/apache/datasketches/memory/IsValidUtf8TestUtil.java
> > 8 src/main/java/org/apache/datasketches/memory/XxHash64.java
> > 9 https://www.apache.org/legal/src-headers.html#3party
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>


Re: [VOTE] Release 1.1.0-incubating-RC2

2019-08-21 Thread leerho
Justin,
Thank you for your detailed response!  This is very helpful.

I will address each one of your license issues:

1. src/main/java/org/apache/datasketches/memory/AccessByteBuffer.java
There is only one method (6 lines long) in this entire file that was
"adapted" from the the original source file and is properly attributed  in
the Javadoc for the method and referenced in the LICENSE file.   Even this
method is modified quite a bit from the original.  Putting their license
header at the top of the file would be disregarding our original code in
the rest of the file.
My recommendation is to leave it as it is.  But if you have a suggestion as
to how this should be handled better, please let me know.

2. src/main/java/org/apache/datasketches/memory/XxHash64.java
About half of the original was copied (in form) and extensively modified.
Considerable functionality was removed and replaced with completely new
functionality.  The source is attributed at the top of the file and in the
LICENSE file.  Because of the extensive modifications, and the presence of
our original code, putting their license header at the top of the file
would be disregarding our original code in the rest of the file.  My
recommendation is to leave it as it is.  But please advise.

3. src/test/java/org/apache/datasketches/memory/XxHash64Test.java
Only one method in this file (~15 lines) is adapted from the original
source and every one of the lines had to be modified to suit our
environment.
The source is attributed at the top of the file and in the LICENSE file.
Because of the extensive modifications, and the presence of our original
code, putting their license header at the top of the file would be
disregarding our original code in the rest of the file.  My recommendation
is to leave it as it is.  But please advise.

4. src/test/java/org/apache/datasketches/memory/XxHash64LoopingTest.java
This file contains one method (~7 lines), which is heavily adapted from the
original source, and about 1000 static data values that are used in a
bit-for-bit compatibility test to make sure our implementation of this hash
function produces the exact same hashes bit-for-bit!  Even so, this file is
not an exact copy of the original.   The source is attributed at the top of
the file and in the LICENSE file. Perhaps it could be argued that because
the majority of this file contains a copy of their test data, Their
copyright header should be used.  Even that would disregard the
modifications we did make.  My recommendation is to leave it as it is.  But
please advise.

5. src/main/java/org/apache/datasketches/memory/Utf8.java
This file adapts portions of the original with extensive modifications.
Method names are changed, looping methods are different, etc.  Some of the
same code comments are copied over, but additional code comments were
added.  Again, bit-for-bit compatibility is critical.  The source is
attributed at the top of the file and in the LICENSE file.  My
recommendation is to leave it as it is.  But please advise.

6. src/test/java/org/apache/datasketches/memory/Utf8Test.java
This file adapts portions of the original with modifications. Some of the
same code comments are copied over, but additional code comments were
added.  It could be argued that the original license header could be placed
at the top.  The source is attributed at the top of the file and in the
LICENSE file.  My recommendation is to leave it as it is.  But please
advise.


7 src/test/java/org/apache/datasketches/memory/IsValidUtf8TestUtil.java
This is a vastly stripped down version of the original.   It could be
argued that the original license header could be placed at the top.  The
source is attributed at the top of the file and in the LICENSE file.  My
recommendation is to leave it as it is.  But please advise.

8 src/main/java/org/apache/datasketches/memory/XxHash64.java  Duplicate of
#2.

Of all of these, perhaps #4, #6, #7 (perhaps #5) could have the source
license header at the top.  Even so, that would be incredibly conservative
and generous.

Lee.



On Tue, Aug 20, 2019 at 9:04 PM Justin Mclean 
wrote:

> Hi,
>
> +1 (binding)
>
> I checked:
> - incubating in name
> - signatures and hashes fine
> - DISCLAIMER exists and uses the WIP text. you also might want to fill in
> #Podling-Name#
> - LICENSE is OK. Do you know what is missing?
> - NOTICE is fine
> - NO binary files in release
> - All source files have ASF header
> - Can compile from source
>
> There’s some very minor issues:
> - I assume that some of the files may have an incorrect header?
> [1][2][3][4][6][7][8]. 3rd party headers should probably not be replaced
> with ASF ones [9] unless they have been extensively modified. This shod be
> mentioned in the DISCLAIMER.
> - In LICENSE there is probably no need to mention the java files that use
> the Gettysburg address.
> - "lee...@users.noreply.github.com” is probably not the best email to
> sign the release with, please use an apache one in future.
> - It’s 

[VOTE] Release 1.1.0-incubating-RC2

2019-08-19 Thread leerho
Hello Apache DataSketches IPMC and Community,

1. This is a call for vote to release Apache DataSketches-memory version:
 1.1.0-incubating-RC2

NOTE 1: This is one component of the DataSketches library that provides
high-performance access
to off-heap memory for Java applications. These are low-level functions
that are used by other
DataSketches components. It does not provide any command-line interface or
GUI.

NOTE 2: This release contains some critical performance improvements for
Apache Druid.

NOTE 3: This particular component has been split out from the rest of the
DataSketches
code because there have been a number of requests from platform projects
that are interested in
these high-performance functions independently of the remainder of the
DataSketches library.

NOTE 4: This RC2 fixes some problems in the structure of the POM file of
RC1. There are no other changes.

  A. Results from the PPMC vote (open for 140 hours (including the weekend))

- 3 votes were cast. All of the votes were (+1). Two of the votes were
from Mentors.

- The vote thread can be found at:
https://mail-archives.apache.org/mod_mbox/datasketches-dev/201907.mbox/browser


  B. Testing/actions/votes performed by the voters:

- Gian Merlino: (+1)
- .asc and .sha512 look good.
- git.properties looks good.
- Ran "mvn package" without issue.
- Source tarball contents match the tag.

Gian made several suggestions for improvements for next release:
  - have the zip unpack in its own directory not the current
directory
  - DISCLAMER
- should be renamed DISCLAMER-WIP
- links to
http://incubator.apache.org/project/#Podling-Name#.html, but it should be
  https://incubator.apache.org/projects/datasketches.html.
(fill in project name,
  'projects' instead of 'project', http would work but https is
nicer.)
  - Vote procedure:
 - Clarify sentence to require not just 3 (+1) votes, but a
majority of (+1s) vs (-1s).
 - When cancelling a vote use subject [CANCEL][VOTE] ...

- Liang Chen (+1 binding)
  - no comments

- Furkan Kamaci (+1 binding)
  - Incubating in name
  - LICENSE is fine
  - NOTICE is fine
  - No unexpected binary files
  - Code compiles and tests successfully run
  - One objection: DISCLAIMER file should be renamed to DISCLAIMER-WIP
and link
at content should point to datasketches.

2. The release candidate:
https://dist.apache.org/repos/dist/dev/incubator/datasketches/memory/1.1.0-incubating-RC1/apache-datasketches-memory-1.1.0-incubating-src.zip

3. Source repository:
https://github.com/apache/incubator-datasketches-memory

Git Tag for this release:
https://github.com/apache/incubator-datasketches-memory/tree/1.1.0-incubating-RC1
on branch 1.1.X-incubating

Git HashId for this release starts with: 32de195

The artifacts have been signed with --keyid-format SHORT : 8CD4A902

4. Repository: dist.apache.org repository:

The public signing key can be found in the KEYS file:
https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

5. Repository: Maven Central (repository.apache.org):

The Jar artifacts can be found at
https://repository.apache.org/content/groups/staging/org/apache/datasketches/datasketches-memory/1.1.0-incubating/

6. Build & Test Guide:

The DataSketches-memory component is pure Java and is structured as a Maven
project.  You must compile using JDK 8.
There is one run-time dependency for logging and, of course, a number of
test and Maven plugin dependencies,
all of which can be resolved by Maven.

There are two types of tests: normal unit tests and tests run by the strict
profile.  To run normal unit tests:

  $ mvn clean test

To run the strict profile tests:

  $ mvn clean test -P strict

To install jars in your local .m2 repository:

  $ mvn clean install

7. Documentation: The documentation for the DataSketches Memory component
is part of the website.

Overview documentation:
- https://datasketches.github.io/docs/Memory/MemoryPackage.html
- https://datasketches.github.io/docs/Memory/MemoryPerformance.html

Javadocs:
- https://datasketches.github.io/api/memory/snapshot/apidocs/index.html

8. This vote is the 2nd stage vote:
   - This voting process will remain open for at least 72 hours
 and until at least 3 (+1) IPMC votes or a majority of (+1) are
acquired.

Please vote accordingly:

[ ] +1 approve
[ ] +0 no opinion
[ ] -1 disapprove with the reason

Thanks,
Lee Rhodes
lee...@apache.org


Re: Showcase your project at ApacheCON at a Podling's Shark Tank

2019-08-16 Thread leerho
I am a committer on DataSketches, a relatively new Podling and would like
to present at the Shark Tank at ACNA19, if possible.

I tried the above links but none seem to work  :(

Lee Rhodes
lee...@apache.org

On Wed, Aug 14, 2019 at 1:45 PM Roman Shaposhnik  wrote:

> Hi Podlings!
>
> in less than a month we're going to have our first
> ApacheCON this year -- the one in Las Vegas. In
> about two month there will be one more in Berlin.
>
> These are not your regular ApacheCONs -- these are
> 20th Anniversary of ASF ApacehCONs! In other words,
> these are not to be missed!
>
> And even if your talk didn't get accepted -- you still
> get an opportunity to highlight your project to, what's
> likely going to be the biggest audience attending.
>
> Here's how: if you (or any community member who's
> passionate about your project) are going to be at either
> of those ApacheCONs consider signing up for
> Podling's Shark Tank
> events:
> https://www.apachecon.com/acna19/s/#/scheduledEvent/1038
> https://aceu19.apachecon.com/session/podlings-shark-tank
>
> Each project presenting will get ~10 min for the pitch and ~5 min
> of panel grilling them on all sorts of things. Kind of like this ;-)
>  https://www.youtube.com/watch?v=wmenN7NEdBc
>
> You've got nothing to lose (in fact, the opposite: you're likely to get
> a prize!) and you will get a chance to receive feedback that might
> actually help you grow your community and ultimately graduate to the
> TLP status. And! Given our awesome panel of judges:
>  * Myrle Krantz
>  * Justin Mclean
>  * Craig Russel
>  * Shane Curcuru
> We guarantee this to be a fun and useful event for your community!
>
> We will be tracking signups over here:
>  https://wiki.apache.org/apachecon/ACNA19PodlingSharkTank
>  https://wiki.apache.org/apachecon/ACEU19PodlingSharkTank
> but for now:
>
> SIMPLY REPLY TO THIS EMAIL if you're interested.
>
> It is first come, first serve -- so don't delay -- sign up today!
>
> Thanks,
> Roman.
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
I ran "mvn apache-rat:check" // didn't find anything:

[*INFO*] *--- *apache-rat-plugin:0.12:check *(default-cli)* @
datasketches-memory* ---*

[*INFO*] Enabled default license matchers.

[*INFO*] Will parse SCM ignores for exclusions...

[*INFO*] Parsing exclusions from
/Users/lrhodes/dev/git/Apache/datasketches-memory/.gitignore

[*INFO*] Finished adding exclusions from SCM ignore files.

[*INFO*] 86 implicit excludes (use -debug for more details).

[*INFO*] Exclude: **/test/resources/**/*.txt

[*INFO*] Exclude: LICENSE

[*INFO*] Exclude: NOTICE

[*INFO*] Exclude: DISCLAIMER

[*INFO*] 99 resources included (use -debug for more details)

[*INFO*] Rat check: Summary over all files. Unapproved: 0, unknown: 0,
generated: 0, approved: 99 licenses.



On Thu, Aug 1, 2019 at 10:06 PM leerho  wrote:

> Found the WIP Disclaimer.
>
> On Thu, Aug 1, 2019 at 9:58 PM leerho  wrote:
>
>> Justin,
>> Thank you so much for your response!
>> ( I feel like I'm in a vacuum at times ... so your help is much
>> appreciated!! )
>>
>> I will do a thorough license search.
>> I will try to locate the WIP Disclaimer ... do you have a link?
>>
>> Right now I am struggling with pom issue related to the
>> maven-remote-resources-plugin.
>> It is creating errors no matter what I do, so I must be doing something
>> wrong:
>>
>>- If I don't have it in pom at all I get the error: "missing required
>>source folder: 'target/maven-shared-archive-resources'
>>- If I include the plugin in the pom project build as:
>>
>> 
>> org.apache.maven.plugins
>> maven-remote-resources-plugin
>>
>> ${maven-remote-resources-plugin.version}
>> 
>> 
>> 
>> process
>> 
>> 
>> 
>> Apache DataSketches Memory
>> (incubating)
>> 
>> 
>>
>> org.apache:apache-jar-resource-bundle:1.5-SNAPSHOT
>>
>> org.apache:apache-incubator-disclaimer-resource-bundle:1.2-SNAPSHOT
>> 
>> 
>> 
>> 
>> 
>>
>>
>> I get the error: maven-remote-resources-plugin:[1.0,) or one of its
>> dependencies could not be resolved: Failure to find
>> org.apache.maven.plugins:maven-remote-resources-plugin:jar:[1.0,) in
>> https://repo.maven.apache.org/maven2 was cached in the local repository,
>> resolution will not be reattempted until the update interval of central has
>> elapsed or updates are forced.
>>
>> Yet I know I have the above versions in my local .m2 !
>>
>> Changing the versions to 1.4 and 1.1 doesn't help -- still can't find it.
>>
>> Running mvn process-resources doesn't help.
>>
>> Running maven project update from eclipse doesn't help.
>>
>> Now if I place the plug-in into its own profile like this ( found this in
>> a blog ) :
>> 
>> 
>> m2e
>> 
>>   
>> m2e.version
>>   
>> 
>> 
>> 
>>   
>> 
>>   org.eclipse.m2e
>>   lifecycle-mapping
>>   ${lifecycle-mapping.version}
>>   
>> 
>>   
>> 
>> 
>>
>> org.apache.maven.plugins
>>
>> maven-remote-resources-plugin
>>
>> ${maven-remote-resources-plugin.version}
>> 
>> process
>> 
>> 
>>   
>> 
>>   
>> 
>>   
>> 
>>   
>> 
>>   
>> 
>> 
>> 
>>
>> Then if I do a "mvn clean" I am back to the error: "missing required
>> source folder: 'target/maven-shared-archive-resources"
>>
>

Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
Found the WIP Disclaimer.

On Thu, Aug 1, 2019 at 9:58 PM leerho  wrote:

> Justin,
> Thank you so much for your response!
> ( I feel like I'm in a vacuum at times ... so your help is much
> appreciated!! )
>
> I will do a thorough license search.
> I will try to locate the WIP Disclaimer ... do you have a link?
>
> Right now I am struggling with pom issue related to the
> maven-remote-resources-plugin.
> It is creating errors no matter what I do, so I must be doing something
> wrong:
>
>- If I don't have it in pom at all I get the error: "missing required
>source folder: 'target/maven-shared-archive-resources'
>- If I include the plugin in the pom project build as:
>
> 
> org.apache.maven.plugins
> maven-remote-resources-plugin
> ${maven-remote-resources-plugin.version}
> 
> 
> 
> process
> 
> 
> 
> Apache DataSketches Memory
> (incubating)
> 
> 
>
> org.apache:apache-jar-resource-bundle:1.5-SNAPSHOT
>
> org.apache:apache-incubator-disclaimer-resource-bundle:1.2-SNAPSHOT
> 
> 
> 
> 
> 
>
>
> I get the error: maven-remote-resources-plugin:[1.0,) or one of its
> dependencies could not be resolved: Failure to find
> org.apache.maven.plugins:maven-remote-resources-plugin:jar:[1.0,) in
> https://repo.maven.apache.org/maven2 was cached in the local repository,
> resolution will not be reattempted until the update interval of central has
> elapsed or updates are forced.
>
> Yet I know I have the above versions in my local .m2 !
>
> Changing the versions to 1.4 and 1.1 doesn't help -- still can't find it.
>
> Running mvn process-resources doesn't help.
>
> Running maven project update from eclipse doesn't help.
>
> Now if I place the plug-in into its own profile like this ( found this in
> a blog ) :
> 
> 
> m2e
> 
>   
> m2e.version
>   
> 
> 
> 
>   
> 
>   org.eclipse.m2e
>   lifecycle-mapping
>   ${lifecycle-mapping.version}
>   
> 
>   
> 
> 
>
> org.apache.maven.plugins
>
> maven-remote-resources-plugin
>
> ${maven-remote-resources-plugin.version}
> 
> process
> 
> 
>   
> 
>   
> 
>   
> 
>   
> 
>   
> 
> 
> 
>
> Then if I do a "mvn clean" I am back to the error: "missing required
> source folder: 'target/maven-shared-archive-resources"
>
> Then if I do a "mvn clean process-resources" the errors go away! And I do
> get the  "target/maven-shared-archive-resources" folder with the right
> stuff.
>
> Studying other project is even more confusing: Apache Pinot and Apache
> Druid have the plugin under project build.  (how do they get it to work?)
>
> Apache Beam doesn't specify it at all (they use the
> maven-resources-plugin).
>
> Apache Commons-CSV doesn't specify the plugin at all.
>
> I am not confident that my placing it in its own profile is the right
> thing to do, but it is the only thing that seems to work right now, but no
> other project I have found does it this way.
>
> Advice please.
>
> Lee.
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
>
> On Thu, Aug 1, 2019 at 7:26 PM Justin Mclean 
> wrote:
>
>> Hi,
>>
>> > I have fixed the LICENSE file as you requested.
>>
>> I hope you noticed that a couple of the files included multiple,
>> differently licensed, pieces of 3rd party code. Each license needs to be
>> listed and checked if it is compatible with the Apache License. If Apache
>> LIcense you need to check if they have a NOT

Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
Justin,
Thank you so much for your response!
( I feel like I'm in a vacuum at times ... so your help is much
appreciated!! )

I will do a thorough license search.
I will try to locate the WIP Disclaimer ... do you have a link?

Right now I am struggling with pom issue related to the
maven-remote-resources-plugin.
It is creating errors no matter what I do, so I must be doing something
wrong:

   - If I don't have it in pom at all I get the error: "missing required
   source folder: 'target/maven-shared-archive-resources'
   - If I include the plugin in the pom project build as:


org.apache.maven.plugins
maven-remote-resources-plugin
${maven-remote-resources-plugin.version}



process



Apache DataSketches Memory
(incubating)



org.apache:apache-jar-resource-bundle:1.5-SNAPSHOT

org.apache:apache-incubator-disclaimer-resource-bundle:1.2-SNAPSHOT







I get the error: maven-remote-resources-plugin:[1.0,) or one of its
dependencies could not be resolved: Failure to find
org.apache.maven.plugins:maven-remote-resources-plugin:jar:[1.0,) in
https://repo.maven.apache.org/maven2 was cached in the local repository,
resolution will not be reattempted until the update interval of central has
elapsed or updates are forced.

Yet I know I have the above versions in my local .m2 !

Changing the versions to 1.4 and 1.1 doesn't help -- still can't find it.

Running mvn process-resources doesn't help.

Running maven project update from eclipse doesn't help.

Now if I place the plug-in into its own profile like this ( found this in a
blog ) :


m2e

  
m2e.version
  



  

  org.eclipse.m2e
  lifecycle-mapping
  ${lifecycle-mapping.version}
  

  



org.apache.maven.plugins

maven-remote-resources-plugin

${maven-remote-resources-plugin.version}

process


  

  

  

  

  




Then if I do a "mvn clean" I am back to the error: "missing required source
folder: 'target/maven-shared-archive-resources"

Then if I do a "mvn clean process-resources" the errors go away! And I do
get the  "target/maven-shared-archive-resources" folder with the right
stuff.

Studying other project is even more confusing: Apache Pinot and Apache
Druid have the plugin under project build.  (how do they get it to work?)

Apache Beam doesn't specify it at all (they use the maven-resources-plugin).

Apache Commons-CSV doesn't specify the plugin at all.

I am not confident that my placing it in its own profile is the right thing
to do, but it is the only thing that seems to work right now, but no other
project I have found does it this way.

Advice please.

Lee.



















On Thu, Aug 1, 2019 at 7:26 PM Justin Mclean 
wrote:

> Hi,
>
> > I have fixed the LICENSE file as you requested.
>
> I hope you noticed that a couple of the files included multiple,
> differently licensed, pieces of 3rd party code. Each license needs to be
> listed and checked if it is compatible with the Apache License. If Apache
> LIcense you need to check if they have a NOTICE file and copy it’s contents
> into your NOTICE file.
>
> > Is it OK if these fixes go into the next release?
>
> Sure, as always you can make a release if you get 3 or more +1 votes, and
> more +1 than -1 votes. You have no -1 votes, and I believe more than 3 +1
> votes form IPMC members, 72 hours have passed so you are all good to go.
>
> Not however the recent discussion around a work in progress disclaimer,
> you might want to move to using that rather than the standard disclaimer.
>
> You might want to make sure there are not other issues, I didn’t do a full
> check of what was in the release. I’d suggest running rat on it (which
> found some issues) and search for copyright statements or URLs that
> reference GitHub or other repos.
>
> Thanks,
> Justin
>
>
> -
> To unsubscribe, e-mail: 

Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
Justin,

Thank you for finding these.
I have fixed the LICENSE file as you requested.
I have deleted the stylesheet.css as it is no longer needed.
The Gettysburg Address is public domain and indicated so in the License
file.
All the other files have corrected notices and are also listed in the
LICENSE file.

These are the only files that I am aware of that had these issues.

The README file has a "compile with Java 1.8." notice.

Is it OK if these fixes go into the next release?
That is, may we continue with the current RC2 release we just voted on?

Lee.



On Thu, Aug 1, 2019 at 3:45 PM Justin Mclean 
wrote:

> Hi,
>
> Can you please fix up your LICENSE file, the source release contains a
> number of 3rd party content (not al of which is under an ALv2 license)  but
> none it it is mentioned in LICENSE. You are also not binding by the terms
> of some of those licenses (i.e. they require you to include the license
> text).
>
> For example [1][2][3][4][5][6][7][8][9], there may be other files.  [4]
> may incorrectly have an ASF header on the file.
>
> Thanks,
> Justin
>
> 1. ./src/test/java/org/apache/datasketches/memory/IsValidUtf8TestUtil.java
> 2. ./src/test/java/org/apache/datasketches/memory/Utf8Test.java
> 3. ./src/main/java/org/apache/datasketches/memory/Utf8.java
> 4. ./src/main/javadoc/stylesheet.css
> 5. ./src/test/resources/GettysburgAddress.txt
> 6. ./src/test/java/org/apache/datasketches/memory/XxHash64Test.java
> 7. ./src/test/java/org/apache/datasketches/memory/XxHash64LoopingTest.java
> 8. ./src/main/java/org/apache/datasketches/memory/XxHash64.java
> 9 ./src/main/java/org/apache/datasketches/memory/AccessByteBuffer.java
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
Ted,

I upgraded Maven to 3.6.1 -- now I see the warning above.
I think I understand, after some research..., the new Maven doesn't like
the ... style of specifying what Maven to
use, it want us to use the Maven enforcer plugin instead.  Jeez, the error
message could have just said that.

On Thu, Aug 1, 2019 at 1:01 PM leerho  wrote:

> Thank you  Furkan KAMACI, Greg Stein, and Ted Dunning.  You all responded
> within a one hour window!
>
> Ted,
>
> [*WARNING*] The
>> project org.apache.datasketches:datasketches-memory:jar:1.0.0-incubating
>> uses
>> prerequisites which is only intended for maven-plugin projects but not
>> for non maven-plugin projects.
>> For such purposes you should use the maven-enforcer-plugin. See
>> https://maven.apache.org/enforcer/enforcer-rules/requireMavenVersion.html
>
>
> I have never seen this warning.
>   Which prerequisite is it complaining about?  What exactly is the
> conflict?
>   The link points to a Require Maven Version rule.
>   I am using Maven 3.3.9.
>   The pom specifies a prerequisite of Maven 3.0.4, which I assume means a
> minimum of 3.0.4.  Is this not an enforcement?
>   It appears you are using Maven 3.6.1.
>   Is it a problem to be running a newer version of maven than 3.0.4 ?
>   Do we need to "enforce" the Maven version as long as it is > 3.0.4 ?
>   How can I reproduce this warning?  (run a new version of Maven ?? )
>
> The maven-compiler-plugin is configured to use java version 1.8 for both
> source and target.
> Apparently, you are saying that this configuration is not being enforced
> when you ran  mvn clean test!
>
> Clearly, if this is the case, we need the enforcer plugin for the JVM
> version!
>
> This is an eye-opener for me. Thank you for catching this!
>
> Lee.
>
>
>
>
>
>
>
>
>
>
>
>
> On Thu, Aug 1, 2019 at 11:54 AM Ted Dunning  wrote:
>
>> +1 (binding)
>>
>> As a side comment, I know that you said it in your second email, but the
>> restriction to *exactly* java 1.8 isn't clear from the README. Also, it is
>> easy to *think* you have changed which version of java you are using, but
>> not actually have done so. For an example of how this illusion can play
>> out, see below.
>>
>> It would be good if you could actually detect the version mismatch and
>> fail
>> early with a coherent message. This can be done with the enforcer plugin
>> (see
>> https://maven.apache.org/enforcer/enforcer-rules/requireJavaVersion.html
>> )
>>
>> $ java -version
>>
>> openjdk version "1.8.0_222"
>>
>> OpenJDK Runtime Environment (AdoptOpenJDK)(build 1.8.0_222-b10)
>>
>> OpenJDK 64-Bit Server VM (AdoptOpenJDK)(build 25.222-b10, mixed mode)
>>
>>
>> $ javac -version
>>
>> javac 1.8.0_222
>>
>>
>> $ mvn clean test
>>
>> [*INFO*] Scanning for projects...
>>
>> [*WARNING*] The project
>> org.apache.datasketches:datasketches-memory:jar:1.0.0-incubating uses
>> prerequisites which is only intended for maven-plugin projects but not for
>> non maven-plugin projects. For such purposes you should use the
>> maven-enforcer-plugin. See
>> https://maven.apache.org/enforcer/enforcer-rules/requireMavenVersion.html
>>
>> [*INFO*]
>>
>> ...
>>
>> [*INFO*] -
>>
>> [*WARNING*] COMPILATION WARNING :
>>
>> [*INFO*] -
>>
>> [*WARNING*]
>>
>> /Users/tdunning/Apache/incubator-datasketches-memory/src/main/java/org/apache/datasketches/memory/UnsafeUtil.java:[25,16]
>> sun.misc.Unsafe is internal proprietary API and may be removed in a future
>> release
>>
>> [*WARNING*]
>>
>> /Users/tdunning/Apache/incubator-datasketches-memory/src/main/java/org/apache/datasketches/memory/AllocateDirectMap.java:[37,18]
>> sun.nio.ch.FileChannelImpl is internal proprietary API and may be removed
>> in a future release
>>
>> ...
>>
>> [*INFO*] 8 warnings
>>
>> [*INFO*] -
>>
>> [*INFO*] -
>>
>> [*ERROR*] COMPILATION ERROR :
>>
>> [*INFO*] -
>>
>> [*ERROR*]
>>
>> /Users/tdunning/Apache/incubator-datasketches-memory/src/main/java/org/apache/datasketches/memory/AllocateDirect.java:[27,16]
>> cannot find symbol
>>
>>   symbol:   class Cleane

Re: [VOTE] DataSketches-memory RC2

2019-08-01 Thread leerho
gt; *Apache Maven 3.6.1 (d66c9c0b3152b2e69ee9bac180bb8fcc8e6af555;
> 2019-04-04T12:00:29-07:00)*
>
> Maven home: /usr/local/Cellar/maven/3.6.1/libexec
>
> Java version: 12.0.2, vendor: AdoptOpenJDK, runtime:
> /Library/Java/JavaVirtualMachines/adoptopenjdk-12.0.2.jdk/Contents/Home
>
> Default locale: en_US, platform encoding: UTF-8
>
> OS name: "mac os x", version: "10.14.6", arch: "x86_64", family: "mac"
>
>
>
> On Thu, Aug 1, 2019 at 10:59 AM Greg Stein  wrote:
>
> > +1 (binding)
> >
> >
> > On Mon, Jul 29, 2019 at 12:09 AM leerho  wrote:
> >
> > > Hello Apache general@incubator community.
> > >
> > > 1. This is a call for vote to release Apache DataSketches-memory
> version:
> > >  1.0.0-incubating-RC2
> > >
> > > NOTE 1: This is one component of the DataSketches library which needs
> to
> > be
> > > released first as other repositories have a dependency on this one.
> Once
> > > this is released, the other components of the library will be able to
> be
> > > released.
> > >
> > >
> > > 2. Results from the PPMC vote (open for more than 102 hours not
> counting
> > > the weekend)
> > >
> > >   - 5 votes were cast. All of the votes were (+1). Two of the votes
> were
> > > from Mentors.
> > >
> > >   - The vote thread can be found at:
> > >
> > >
> >
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201907.mbox/browser
> > >
> > >
> > > 3. Testing/actions/votes performed by the voters:
> > >
> > >   - Lee Rhodes: (+1)
> > > - All of the code has been properly refactored with
> > > "org.apache.datasketches...".
> > > - All source files have the proper Apache license and have been
> > checked
> > > with the Maven Rat Plugin.
> > > - The code passes all tests with a coverage of > 98%.
> > > - Checkstyle: passes with no warnings.  This uses the configuration
> > > MemoryCheckstyle.xml located in the /tools/ directory.
> > > - SpotBugs: passes with no bugs found: This uses the configuration
> > > FindBugsExcludeFilter.xml located in the /tools/directory.
> > > - mvn clean javadoc:javadoc produces no javadoc errors.  The
> javadocs
> > > will be found under /target/site/
> > > - mvn versions:display-plugin-updates.  This has 2 warnings, but
> the
> > > versions are inherited from the super-pom.
> > >- [WARNING] The following plugins do not have their version
> > > specified:
> > >- [WARNING] maven-assembly-plugin . (from
> > > super-pom) 2.6
> > >- [WARNING] maven-compiler-plugin . (from
> > > super-pom) 3.5.
> > >Note: In the next release we can add the version numbers to the
> > > local pom to eliminate this warning.
> > > - The assembly file signatures and checksums have been verified.
> > >
> > >   - Alex Saydakov: (+1)
> > > - mvn package
> > > - mvn test
> > >
> > >   - Jon Malkin: (+1)
> > > - mvn test
> > > - mvn install
> > >
> > >   - Furkan Kamaci (+1 IPMC Member)
> > > - incubating in name
> > > - Disclamer exists
> > > - License is fine
> > > - Notice is fine
> > > - no unexpected binary files
> > > - code compiles and tests successfully run
> > >
> > >   - Kenneth Knowles (+1 IPMC Member)
> > > Double checked:
> > >   - DISCLAIMER, LICENSE and NOTICE
> > >   - mvn install succeeded
> > >
> > >
> > > 4. Source repository:
> > > https://github.com/apache/incubator-datasketches-memory
> > >
> > >   - Git Tag for this release: 1.0.0-incubating-RC2
> > >
> > >
> > >
> >
> https://github.com/apache/incubator-datasketches-memory/tree/1.0.0-incubating-RC2
> > >
> > >   - Git HashId for this release starts with: ec8f16e
> > >
> > >
> > > 5. DIST/DEV: The release candidate assembly:
> > >
> > >   -
> > >
> > >
> >
> https://dist.apache.org/repos/dist/dev/incubator/datasketches/memory/1.0.0-incubating-RC2/apache-datasketches-memory-1.0.0-incubating-src.zip
> > >
> > >   - The assembly file has been signed with --keyid-format SHORT :
> > 8CD4A902
> > >
> > >   - The public signing key can be found in the KEYS file:
> > > https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS
> > >
> > >   - Upon acceptance, the above assembly and signatures will be deployed
> > > into the official Apache release repository:
> > >
> > >
> >
> https://dist.apache.org/repos/dist/release/incubator/datasketches/memory/
> > >
> > >
> > > 6. NEXUS: The Jar and pom attributes have been deployed to Nexus
> Staging
> > > Repository "orgapachedatasketches-1000", which can be examined from the
> > > Nexus UI.
> > >
> > >- Upon acceptance, the staging repository holding the artifacts will
> > be
> > > closed and then the artifacts will be released.
> > >
> > >
> > > 7. Note that Mentors k...@apache.org and furkankam...@gmail.com as
> well
> > as
> > > pa...@asert.com.au have made a number of valuable suggestions on
> > improving
> > > the process that are recorded in a separate thread.  None of the
> > > suggestions impact the voting for this release. This has been valuable
> > > learning for us and we will be implementing these suggestions in the
> next
> > > release.
> > >
> > > Lee
> > > lee...@apache.org
> > >
> >
>


[VOTE-REMINDER]

2019-08-01 Thread leerho
Dear Mentors and incubator community.

It has been 73+ hours since our Vote letter was issued to general@incubator
and not one IPMC member has voted or even commented.

Please we need to move forward, so we would appreciate your vote.

lee...@apache.org


Re: [VOTE] DataSketches-memory RC2

2019-07-29 Thread leerho
Additional information accidentally omitted from the vote letter:

8. Build Guide:

The DataSketches-memory component is pure Java and is structured as a Maven
project.  You must compile using JDK 8.  There is one run-time dependency
and, of course, a number of test and Maven plugin dependencies, all of
which can be resolved by Maven.

This component is a set of low-level library functions that is used by the
other DataSketches components.  It has no user interface so the only thing
you can do is compile and run the unit tests.
There are two types of tests: normal unit tests and tests run by the strict
profile.  To run normal unit tests:

  $ mvn clean test

To run the strict profile tests:

  $ mvn clean test -P strict

Note also that when running the test suite, you might get the following
message:

  SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder".
  SLF4J: Defaulting to no-operation (NOP) logger implementation
  SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further
details.

This is normal.  It just indicates that in your environment, you do not
have a logger in your class-path so the default logger of a no-op is used
instead.

To install jars in your local .m2 repository:

  $ mvn clean install

If you are interested in the actual functions of the Memory, you will need
to write some code that calls the Memory functions.

9. The documentation for the DataSketches Memory component is part of the
website.

Overview documentation:
- https://datasketches.github.io/docs/Memory/MemoryPackage.html
- https://datasketches.github.io/docs/Memory/MemoryPerformance.html

Javadocs:
https://datasketches.github.io/api/memory/snapshot/apidocs/index.html

This Memory component has been split out from the rest of the DataSketches
code because there have been a number of requests from folks that are
interested in these high-performance functions independently of the
remainder of the DataSketches library.

Please vote accordingly:

[ ] +1 approve
[ ] +0 no opinion
[ ] -1 disapprove with the reason

Thanks,
Lee
lee...@apache.org

On Sun, Jul 28, 2019 at 10:09 PM leerho  wrote:

> Hello Apache general@incubator community.
>
> 1. This is a call for vote to release Apache DataSketches-memory version:
>  1.0.0-incubating-RC2
>
> NOTE 1: This is one component of the DataSketches library which needs to
> be released first as other repositories have a dependency on this one. Once
> this is released, the other components of the library will be able to be
> released.
>
>
> 2. Results from the PPMC vote (open for more than 102 hours not counting
> the weekend)
>
>   - 5 votes were cast. All of the votes were (+1). Two of the votes were
> from Mentors.
>
>   - The vote thread can be found at:
> https://mail-archives.apache.org/mod_mbox/datasketches-dev/201907.mbox/browser
>
>
> 3. Testing/actions/votes performed by the voters:
>
>   - Lee Rhodes: (+1)
> - All of the code has been properly refactored with
> "org.apache.datasketches...".
> - All source files have the proper Apache license and have been
> checked with the Maven Rat Plugin.
> - The code passes all tests with a coverage of > 98%.
> - Checkstyle: passes with no warnings.  This uses the configuration
> MemoryCheckstyle.xml located in the /tools/ directory.
> - SpotBugs: passes with no bugs found: This uses the configuration
> FindBugsExcludeFilter.xml located in the /tools/directory.
> - mvn clean javadoc:javadoc produces no javadoc errors.  The javadocs
> will be found under /target/site/
> - mvn versions:display-plugin-updates.  This has 2 warnings, but the
> versions are inherited from the super-pom.
>- [WARNING] The following plugins do not have their version
> specified:
>- [WARNING] maven-assembly-plugin . (from
> super-pom) 2.6
>- [WARNING] maven-compiler-plugin . (from
> super-pom) 3.5.
>Note: In the next release we can add the version numbers to the
> local pom to eliminate this warning.
> - The assembly file signatures and checksums have been verified.
>
>   - Alex Saydakov: (+1)
> - mvn package
> - mvn test
>
>   - Jon Malkin: (+1)
> - mvn test
> - mvn install
>
>   - Furkan Kamaci (+1 IPMC Member)
> - incubating in name
> - Disclamer exists
> - License is fine
> - Notice is fine
> - no unexpected binary files
> - code compiles and tests successfully run
>
>   - Kenneth Knowles (+1 IPMC Member)
> Double checked:
>   - DISCLAIMER, LICENSE and NOTICE
>   - mvn install succeeded
>
>
> 4. Source repository:
> https://github.com/apache/incubator-datasketches-memory
>
>   - Git Tag for this release: 1.0.0-incubating-RC2
>
> https://githu

[VOTE] DataSketches-memory RC2

2019-07-28 Thread leerho
Hello Apache general@incubator community.

1. This is a call for vote to release Apache DataSketches-memory version:
 1.0.0-incubating-RC2

NOTE 1: This is one component of the DataSketches library which needs to be
released first as other repositories have a dependency on this one. Once
this is released, the other components of the library will be able to be
released.


2. Results from the PPMC vote (open for more than 102 hours not counting
the weekend)

  - 5 votes were cast. All of the votes were (+1). Two of the votes were
from Mentors.

  - The vote thread can be found at:
https://mail-archives.apache.org/mod_mbox/datasketches-dev/201907.mbox/browser


3. Testing/actions/votes performed by the voters:

  - Lee Rhodes: (+1)
- All of the code has been properly refactored with
"org.apache.datasketches...".
- All source files have the proper Apache license and have been checked
with the Maven Rat Plugin.
- The code passes all tests with a coverage of > 98%.
- Checkstyle: passes with no warnings.  This uses the configuration
MemoryCheckstyle.xml located in the /tools/ directory.
- SpotBugs: passes with no bugs found: This uses the configuration
FindBugsExcludeFilter.xml located in the /tools/directory.
- mvn clean javadoc:javadoc produces no javadoc errors.  The javadocs
will be found under /target/site/
- mvn versions:display-plugin-updates.  This has 2 warnings, but the
versions are inherited from the super-pom.
   - [WARNING] The following plugins do not have their version
specified:
   - [WARNING] maven-assembly-plugin . (from
super-pom) 2.6
   - [WARNING] maven-compiler-plugin . (from
super-pom) 3.5.
   Note: In the next release we can add the version numbers to the
local pom to eliminate this warning.
- The assembly file signatures and checksums have been verified.

  - Alex Saydakov: (+1)
- mvn package
- mvn test

  - Jon Malkin: (+1)
- mvn test
- mvn install

  - Furkan Kamaci (+1 IPMC Member)
- incubating in name
- Disclamer exists
- License is fine
- Notice is fine
- no unexpected binary files
- code compiles and tests successfully run

  - Kenneth Knowles (+1 IPMC Member)
Double checked:
  - DISCLAIMER, LICENSE and NOTICE
  - mvn install succeeded


4. Source repository:
https://github.com/apache/incubator-datasketches-memory

  - Git Tag for this release: 1.0.0-incubating-RC2

https://github.com/apache/incubator-datasketches-memory/tree/1.0.0-incubating-RC2

  - Git HashId for this release starts with: ec8f16e


5. DIST/DEV: The release candidate assembly:

  -
https://dist.apache.org/repos/dist/dev/incubator/datasketches/memory/1.0.0-incubating-RC2/apache-datasketches-memory-1.0.0-incubating-src.zip

  - The assembly file has been signed with --keyid-format SHORT : 8CD4A902

  - The public signing key can be found in the KEYS file:
https://dist.apache.org/repos/dist/dev/incubator/datasketches/KEYS

  - Upon acceptance, the above assembly and signatures will be deployed
into the official Apache release repository:

https://dist.apache.org/repos/dist/release/incubator/datasketches/memory/


6. NEXUS: The Jar and pom attributes have been deployed to Nexus Staging
Repository "orgapachedatasketches-1000", which can be examined from the
Nexus UI.

   - Upon acceptance, the staging repository holding the artifacts will be
closed and then the artifacts will be released.


7. Note that Mentors k...@apache.org and furkankam...@gmail.com as well as
pa...@asert.com.au have made a number of valuable suggestions on improving
the process that are recorded in a separate thread.  None of the
suggestions impact the voting for this release. This has been valuable
learning for us and we will be implementing these suggestions in the next
release.

Lee
lee...@apache.org


Re: DataSketches needs some help

2019-07-24 Thread leerho
Paul,

Thank you for volunteering!  Your help is greatly appreciated!

We have a Release Candidate 2 that needs approval.  If you could look at
that we would be wonderful.

Lee Rhodes
lee...@apache.org


On Wed, Jul 24, 2019 at 4:25 AM Paul King  wrote:

> I have subscribed to their dev list. I'll try to help out a bit informally
> if I can. If needed I might be able to assist with Mentoring but I don't
> have a lot of cycles free at the moment.
>
> Cheers, Paul.
>
> On Wed, Jul 24, 2019 at 10:57 AM Justin Mclean 
> wrote:
>
> > Hi,
> >
> > DataSketches seems to be having some issues getting started and needs
> some
> > extra mentor help.  Is anyone willing to help out by mentoring the
> project?
> >
> > DataSketches [1] is an open source, high-performance library of
> stochastic
> > streaming algorithms commonly called "sketches" in the data sciences.
> > Sketches are small, stateful programs that process massive data as a
> stream
> > and can provide approximate answers, with mathematical guarantees, to
> > computationally difficult queries orders-of-magnitude faster than
> > traditional, exact methods.
> >
> > Thanks,
> > Justin
> >
> > 1. http://datasketches.apache.org
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>
-- 
>From my cell phone.


Re: What is repo.maven.a.o for?

2019-07-08 Thread leerho
Thanks!

On Mon, Jul 8, 2019 at 1:42 PM Matt Sicker  wrote:

> Whenever you make a release from a staging repo, it's published to
> that repo you linked (as well as Maven Central).
>
> That repo should reflect either
> https://repository.apache.org/content/repositories/releases/ or
> https://repository.apache.org/content/groups/public/
>
> On Mon, 8 Jul 2019 at 14:44, leerho  wrote:
> >
> > So far, I think I understand the Nexus repositories (snapshots, staging
> and
> > releases), and the dist.a.o and archive.a.o repositories.
> >
> > But what is https://repo.maven.apache.org/maven2/org/apache repository
> for
> > and how does it get updated?
>
>
>
> --
> Matt Sicker 
>
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


What is repo.maven.a.o for?

2019-07-08 Thread leerho
So far, I think I understand the Nexus repositories (snapshots, staging and
releases), and the dist.a.o and archive.a.o repositories.

But what is https://repo.maven.apache.org/maven2/org/apache repository for
and how does it get updated?


Question on artifact signatures

2019-07-08 Thread leerho
In the Nexus releases repo (
https://repository.apache.org/content/repositories/releases/org)

I notice that some projects (like Arrow) have attached a .asc, .md5, and
.sha1 signatures to each artifact.

While other projects (like Beam) have additional signatures (.asc.md5,
.asc.sha1) for each .asc file.

What is the recommended practice?


Re: No access to dist.apache.org

2019-06-21 Thread leerho
1) I asked INFRA how to setup the proper structures in dist.a.o.
INFRA tells me that IPMC members create the proper folders in dist/dev and
dist/release directories and once created the PPMC should have access.

2) There is also no "Staging Profile" in Apache Nexus Repository for
datasketches.  When does that get set up?



I have been held up for a full week now in making any progress in migrating
to ASF.  I need to create the first Release Candidate artifact so that it
can be referenced as a dependency by our other repositories.  I keep
running into roadblocks where the basic infrastructure has not be set up
that would allow me to make progress.

I would be really grateful if someone could help guide me through this
first "release" process steps.

Specifically:
a) Review our POM file for correct setup for a release to Apache (at least
a release candidate)
b) Guide me through the required Maven or command-line commands to get this
first release created.

Thank you!







On Fri, Jun 21, 2019 at 11:08 AM leerho  wrote:

> 1) I am trying to create our first podling release candidate and need
> access to dist.apache.org to record PGP keys, etc.  There is no listing
> for our podling project datasketches and I have no write access to that
> site.  How can I make progress?
>
> 2) I would like to create a temporary landing page for our website at
> datasketches.apache.org.
> How do I do that?
>
> Lee.
>


No access to dist.apache.org

2019-06-21 Thread leerho
1) I am trying to create our first podling release candidate and need
access to dist.apache.org to record PGP keys, etc.  There is no listing for
our podling project datasketches and I have no write access to that site.
How can I make progress?

2) I would like to create a temporary landing page for our website at
datasketches.apache.org.
How do I do that?

Lee.


How to move code and docs from GitHub to GitHub

2019-05-10 Thread leerho
I am in the process of trying to move code from our current GitHub
repositories to the newly assigned Apache incubator GitHub repositories.  I
could use some advice on the best way to do this.

So far, I have used the command

$ > git push --all --tags --repo=g...@github.com:apache/incubator-.git

This pushes code and tags from my laptop *clone* of our current repo to the
apache repo.  But it does not transfer important *release documentation*,
which is a feature of the GitHub repo website.

Is there a way to effectively copy everything (code, tags, documentation,
etc) directly from our current origin GitHub repo to the Apache repo?
Without having to use my laptop clone in the middle? AND without wiping out
the current origin GitHub repo?

Otherwise, I am having to manually copy and paste all the associated
release documentation!

Any help would be appreciated!

Thanks


Re: Name Search for DataSketches

2019-04-10 Thread leerho
Awesome, thanks!

On Wed, Apr 10, 2019 at 3:30 PM Kenneth Knowles  wrote:

> I have filed the first two steps in the parent ticket here:
> https://issues.apache.org/jira/browse/INFRA-18195
>
> On Tue, Apr 9, 2019 at 8:31 AM Dave Fisher  wrote:
>
>> Hi -
>>
>> > On Apr 9, 2019, at 8:26 AM, leerho  wrote:
>> >
>> > Folks,
>> >   • I notice that DataSketches name search has been up for a week
>> with no comments.  Is it time to get it approved?
>> Give the podling name search some more time.
>> >   • I have a separate question about Initial Committers and ICLA:
>> We have some committers that are already committers on other Apache
>> projects,  do they need to sign another ICLA for this project?
>> No, they do not need to sign another ICLA.
>>
>> As soon as the project LDAP is created they can be added to the PPMC.
>>
>> Here is a checklist:
>>
>> • LDAP
>> • DNS
>> • Mailing lists created
>> • Mailing list signups
>> • Roster complete
>> • ICLA for initial committers w/o apache accounts
>> • Accounts requested for new apache committers
>> • All initial committers added to the PPMC
>> • Code repository ready
>> • Website
>> • Wiki
>> • Issue Tracker
>>
>> Regards,
>> Dave
>
>


Name Search for DataSketches

2019-04-09 Thread leerho
Folks,

   - I notice that DataSketches name search
    has been
   up for a week with no comments.  Is it time to get it approved?
   - I have a separate question about Initial Committers and ICLA:  We have
   some committers that are already committers on other Apache projects,  do
   they need to sign another ICLA for this project?


Re: [PROPOSAL] Apache DataSketches

2019-03-25 Thread leerho
I went ahead and performed the following searches based on the list someone
else provided.  Perhaps you can use this?

Note: the term "sketch" commonly refers to an artistic visualization or
drawing.
The use of the term "sketch" in the study of algorithms refer to a synopsis
of some larger set of data where the synopsis is approximate, simplified
(not all the detail), and can be executed quickly.  These properties are
shared with artistic sketches, but there the similarity ends. DataSketches
have nothing to do with visualization at all.

Search results.

https://github.com/search?o=desc=datasketches
returned links are indirect references to our site. or a reference to site
about data art.

https://opensource.google.com/projects/search?q=datasketches
No hits

https://sourceforge.net/directory/os:mac/?q=datasketches
No hits

https://www.openhub.net/p?ref=homepage=datasketches
No hits

https://www.trademarkia.com
No hits: "data sketch", "data sketches", "data-sketch", "data-sketches",
"datasketch", or "datasketches".

https://trademarks.justia.com/search?q=datasketches
No hits: "data sketch", "data sketches", "data-sketch", "data-sketches",
"datasketch", or "datasketches".

http://tmsearch.uspto.gov/
No hits: "data sketch", "data sketches", "data-sketch", "data-sketches",
"datasketch", or "datasketches".

https://www.google.com/search?q=datasketches=datasketches
About 37,600 results most all are indirect references to our site or to
sites about artistic visual renderings of data. Searching for
"datasketches" (with quotes) is a much smaller set (6800) that mostly refer
to our software.

https://en.wikipedia.org/wiki/datasketches
q: "datasketches": No hits
q: "data sketches" One hit: the common data science use of the pair of
words referring to sketching algorithms: "The different techniques can be
classified according to the data sketches they store."

https://stackoverflow.com/search?q=datasketches
2 hits that refer back to our software (Druid-datasketches is our software)
q:data sketches

https://www.linkedin.com/company/datasketches/about/
No hits

https://en.oxforddictionaries.com/search?filter=dictionary=datasketches
No hits

On Mon, Mar 25, 2019 at 1:36 PM Kenneth Knowles  wrote:

> The vote is passed to accept into the incubator. Since there is a cost to
> changing the name once infrastructure is set up, I suggest doing the name
> search immediately. There seemed to be some consensus to try to keep the
> DataSketches name. If there are no objections, I will file a
> PODLINGNAMESEARCH for this.
>
> Kenn
>
> On Tue, Feb 26, 2019 at 3:58 PM Liang Chen 
> wrote:
>
> > Hi Justin
> >
> > You are right, should be "Liang Chen", already updated it.
> >
> > Justin, could you please help to check my right to create new proposal on
> > incubator wiki at :
> > https://wiki.apache.org/incubator/ProjectProposals
> >
> > Regards
> > Liang
> >
> > Justin Mclean wrote
> > > Hi,
> > >
> > >> Currently only IPMC members can be official mentors, of the 3 people
> > >> listed here I believe only Jean-Baptiste Onofré is an IPMC member.
> > >
> > > Sorry, my apologies, Liang Chen is also an IPMC member, (Chen Liang,
> and
> > > presumedly a different person, is a committer but not an IPMC member)
> but
> > > I cannot find Gil Yehuda, do you mind provide a link to the roster for
> > > them or their Apache id?
> > >
> > > Thanks,
> > > Justin
> > > -
> > > To unsubscribe, e-mail:
> >
> > > general-unsubscribe@.apache
> >
> > > For additional commands, e-mail:
> >
> > > general-help@.apache
> >
> >
> >
> >
> >
> > --
> > Sent from: http://apache-incubator-general.996316.n3.nabble.com/
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>


Write access to Incubator Wiki

2019-03-04 Thread leerho
Hi,

May I get write access to wiki? My wiki name is *Lee Rhodes*

Project Proposal: Apache DataSketches

Kind Regards,
Lee Rhodes

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: [DISCUSS] Move Apache DataSketches Proposal Draft to Wiki

2019-03-01 Thread leerho
I can read but I cannot make a few formatting changes.  Also, in the sorted
list of Project Proposals the entry for DataSketchesProposal is
misspelled.

On Fri, Mar 1, 2019 at 7:37 PM Kenneth Knowles  wrote:

> Sounds good to me. I read it one more time. My comments should not be seen
> as blocking comments, but just commentary and curiosity.
>
> Do you have a Wiki account?
> https://wiki.apache.org/incubator/ProjectProposals
>
> Kenn
>
> On Fri, Mar 1, 2019 at 4:58 PM lee...@gmail.com  wrote:
>
> > This draft proposal has been available as a Google Doc (
> > https://s.apache.org/datasketches-proposal-draft) since Monday, Feb 25th
> > and has been stable with no new comments since Wednesday, Feb 27th.
> >
> > I would like to propose we move the document to the Wiki and hold a vote
> > soon.
> >
> > If there are no objections, then I will need help moving it to the Wiki,
> > since I don't have permission to do that.
> >
> > Thanks,
> >
> > Lee.
> >
> >
> >
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
>


[DISCUSS] Move Apache DataSketches Proposal Draft to Wiki

2019-03-01 Thread leerho
This draft proposal has been available as a Google Doc 
(https://s.apache.org/datasketches-proposal-draft) since Monday, Feb 25th and 
has been stable with no new comments since Wednesday, Feb 27th. 

I would like to propose we move the document to the Wiki and hold a vote soon.

If there are no objections, then I will need help moving it to the Wiki, since 
I don't have permission to do that.

Thanks,

Lee.



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
As primary author, can I be given the ability to directly edit? 
On 2019/02/26 05:37:22, Kenneth Knowles  wrote: 
> It isn't too much work, so I've done it:
> https://s.apache.org/datasketches-proposal-draft
> 
> Kenn
> 
> On Mon, Feb 25, 2019 at 9:31 PM leerho  wrote:
> 
> > Yes, I thought of that.  But it’s not like I’m being overwhelmed with
> > requests to comment ... so far it has been only 3 or 4, and the requested
> > changes have been minor.  I’m assuming that if there are no more
> > substantive changes after this week that the document would be moved to the
> > wiki archive, where, I presume, changes could still be made.
> >
> > I want to do the right thing here, so if you feel that the document would
> > get much better feedback on an unrestricted gDoc site, I will set it up.
> >
> >
> >
> > On Mon, Feb 25, 2019 at 8:32 PM Jim Apple 
> > wrote:
> >
> > > You could use a Google account that is not under Yahoo’s control, then
> > let
> > > anyone in the world add a comment, maybe.
> > >
> > > On Mon, Feb 25, 2019 at 3:26 PM leerho  wrote:
> > >
> > > > Ken,
> > > > Yahoo does not allow me to create a shared link outside our company,
> > > except
> > > > to individual email addresses.  So attempting to share it to the email
> > > > general@incubator.apache.org may not work.  Nonetheless, several
> > > > individuals were able to request access using their individual email
> > > > accounts and I was able to add them.  I will try to add you using
> > > > k...@apache.org, but if that doesn't work, I may need a gmail or
> > > > equivalent
> > > > account for you.
> > > >
> > > > Lee.
> > > >
> > > >
> > > > On Mon, Feb 25, 2019 at 2:59 PM Kenneth Knowles 
> > wrote:
> > > >
> > > > > I could not access that document. I suggest you need to turn on link
> > > > > sharing.
> > > > >
> > > > > Kenn
> > > > >
> > > > > On Mon, Feb 25, 2019 at 12:00 PM lee...@gmail.com 
> > > > > wrote:
> > > > >
> > > > > > Try this link:
> > > > > >
> > > > >
> > > >
> > >
> > https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing
> > > > > >
> > > > > >
> > > > > > On 2019/02/25 05:55:50, leerho  wrote:
> > > > > > > Yes I will try that tomorrow.
> > > > > > >
> > > > > > > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles  > >
> > > > > wrote:
> > > > > > >
> > > > > > > > Can you share the Google doc with the proposal? Per Ted's
> > advice,
> > > > we
> > > > > > can
> > > > > > > > iterate quickly there and move it to the wiki when it becomes a
> > > bit
> > > > > > more
> > > > > > > > stable.
> > > > > > > >
> > > > > > > > Kenn
> > > > > > > >
> > > > > > > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com <
> > > > lee...@gmail.com>
> > > > > > > > wrote:
> > > > > > > >
> > > > > > > > > Thanks for the offer.  i am a neophyte at this process and
> > > email
> > > > > > app!   I
> > > > > > > > > could use a lot of help getting this off the ground!  Also,
> > I'm
> > > > not
> > > > > > sure
> > > > > > > > > that Mr. Chen and Mr. Onofré have fully accepted taking this
> > on
> > > > :)
> > > > > > > > >
> > > > > > > > > Lee.
> > > > > > > > >
> > > > > > > > > On 2019/02/23 06:03:58, Kenneth Knowles 
> > > wrote:
> > > > > > > > > > Nice.
> > > > > > > > > >
> > > > > > > > > > I would very much like to help mentor this project, though
> > > you
> > > > > > already
> > > > > > > > > have
> > > > > > > > > > a couple good ones.
> > > > > > > > > >
> > > > > > > > > > I concur with in

Re: DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
Thank you!


On Mon, Feb 25, 2019 at 9:37 PM Kenneth Knowles  wrote:

> It isn't too much work, so I've done it:
> https://s.apache.org/datasketches-proposal-draft
>
> Kenn
>
> On Mon, Feb 25, 2019 at 9:31 PM leerho  wrote:
>
> > Yes, I thought of that.  But it’s not like I’m being overwhelmed with
> > requests to comment ... so far it has been only 3 or 4, and the requested
> > changes have been minor.  I’m assuming that if there are no more
> > substantive changes after this week that the document would be moved to
> the
> > wiki archive, where, I presume, changes could still be made.
> >
> > I want to do the right thing here, so if you feel that the document would
> > get much better feedback on an unrestricted gDoc site, I will set it up.
> >
> >
> >
> > On Mon, Feb 25, 2019 at 8:32 PM Jim Apple 
> > wrote:
> >
> > > You could use a Google account that is not under Yahoo’s control, then
> > let
> > > anyone in the world add a comment, maybe.
> > >
> > > On Mon, Feb 25, 2019 at 3:26 PM leerho  wrote:
> > >
> > > > Ken,
> > > > Yahoo does not allow me to create a shared link outside our company,
> > > except
> > > > to individual email addresses.  So attempting to share it to the
> email
> > > > general@incubator.apache.org may not work.  Nonetheless, several
> > > > individuals were able to request access using their individual email
> > > > accounts and I was able to add them.  I will try to add you using
> > > > k...@apache.org, but if that doesn't work, I may need a gmail or
> > > > equivalent
> > > > account for you.
> > > >
> > > > Lee.
> > > >
> > > >
> > > > On Mon, Feb 25, 2019 at 2:59 PM Kenneth Knowles 
> > wrote:
> > > >
> > > > > I could not access that document. I suggest you need to turn on
> link
> > > > > sharing.
> > > > >
> > > > > Kenn
> > > > >
> > > > > On Mon, Feb 25, 2019 at 12:00 PM lee...@gmail.com <
> lee...@gmail.com>
> > > > > wrote:
> > > > >
> > > > > > Try this link:
> > > > > >
> > > > >
> > > >
> > >
> >
> https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing
> > > > > >
> > > > > >
> > > > > > On 2019/02/25 05:55:50, leerho  wrote:
> > > > > > > Yes I will try that tomorrow.
> > > > > > >
> > > > > > > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles <
> k...@apache.org
> > >
> > > > > wrote:
> > > > > > >
> > > > > > > > Can you share the Google doc with the proposal? Per Ted's
> > advice,
> > > > we
> > > > > > can
> > > > > > > > iterate quickly there and move it to the wiki when it
> becomes a
> > > bit
> > > > > > more
> > > > > > > > stable.
> > > > > > > >
> > > > > > > > Kenn
> > > > > > > >
> > > > > > > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com <
> > > > lee...@gmail.com>
> > > > > > > > wrote:
> > > > > > > >
> > > > > > > > > Thanks for the offer.  i am a neophyte at this process and
> > > email
> > > > > > app!   I
> > > > > > > > > could use a lot of help getting this off the ground!  Also,
> > I'm
> > > > not
> > > > > > sure
> > > > > > > > > that Mr. Chen and Mr. Onofré have fully accepted taking
> this
> > on
> > > > :)
> > > > > > > > >
> > > > > > > > > Lee.
> > > > > > > > >
> > > > > > > > > On 2019/02/23 06:03:58, Kenneth Knowles 
> > > wrote:
> > > > > > > > > > Nice.
> > > > > > > > > >
> > > > > > > > > > I would very much like to help mentor this project,
> though
> > > you
> > > > > > already
> > > > > > > > > have
> > > > > > > > > > a couple good ones.
> > > > > > > > > >
> > >

Re: DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
Yes, I thought of that.  But it’s not like I’m being overwhelmed with
requests to comment ... so far it has been only 3 or 4, and the requested
changes have been minor.  I’m assuming that if there are no more
substantive changes after this week that the document would be moved to the
wiki archive, where, I presume, changes could still be made.

I want to do the right thing here, so if you feel that the document would
get much better feedback on an unrestricted gDoc site, I will set it up.



On Mon, Feb 25, 2019 at 8:32 PM Jim Apple 
wrote:

> You could use a Google account that is not under Yahoo’s control, then let
> anyone in the world add a comment, maybe.
>
> On Mon, Feb 25, 2019 at 3:26 PM leerho  wrote:
>
> > Ken,
> > Yahoo does not allow me to create a shared link outside our company,
> except
> > to individual email addresses.  So attempting to share it to the email
> > general@incubator.apache.org may not work.  Nonetheless, several
> > individuals were able to request access using their individual email
> > accounts and I was able to add them.  I will try to add you using
> > k...@apache.org, but if that doesn't work, I may need a gmail or
> > equivalent
> > account for you.
> >
> > Lee.
> >
> >
> > On Mon, Feb 25, 2019 at 2:59 PM Kenneth Knowles  wrote:
> >
> > > I could not access that document. I suggest you need to turn on link
> > > sharing.
> > >
> > > Kenn
> > >
> > > On Mon, Feb 25, 2019 at 12:00 PM lee...@gmail.com 
> > > wrote:
> > >
> > > > Try this link:
> > > >
> > >
> >
> https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing
> > > >
> > > >
> > > > On 2019/02/25 05:55:50, leerho  wrote:
> > > > > Yes I will try that tomorrow.
> > > > >
> > > > > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles 
> > > wrote:
> > > > >
> > > > > > Can you share the Google doc with the proposal? Per Ted's advice,
> > we
> > > > can
> > > > > > iterate quickly there and move it to the wiki when it becomes a
> bit
> > > > more
> > > > > > stable.
> > > > > >
> > > > > > Kenn
> > > > > >
> > > > > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com <
> > lee...@gmail.com>
> > > > > > wrote:
> > > > > >
> > > > > > > Thanks for the offer.  i am a neophyte at this process and
> email
> > > > app!   I
> > > > > > > could use a lot of help getting this off the ground!  Also, I'm
> > not
> > > > sure
> > > > > > > that Mr. Chen and Mr. Onofré have fully accepted taking this on
> > :)
> > > > > > >
> > > > > > > Lee.
> > > > > > >
> > > > > > > On 2019/02/23 06:03:58, Kenneth Knowles 
> wrote:
> > > > > > > > Nice.
> > > > > > > >
> > > > > > > > I would very much like to help mentor this project, though
> you
> > > > already
> > > > > > > have
> > > > > > > > a couple good ones.
> > > > > > > >
> > > > > > > > I concur with incubator as sponsoring entity.
> > > > > > > >
> > > > > > > > Kenn (VP Apache Beam)
> > > > > > > >
> > > > > > > > On Fri, Feb 22, 2019 at 9:45 PM leerho 
> > wrote:
> > > > > > > >
> > > > > > > > > I didn't realize that this mail list does not accept PDF
> > files,
> > > > > > > apparently
> > > > > > > > > only text.  So let me try one more time ... :)  Please let
> me
> > > > know if
> > > > > > > > > this works!
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > = Apache DataSketches Proposal[1] =
> > > > > > > > >
> > > > > > > > > == Abstract ==
> > > > > > > > >
> > > > > > > > > DataSketches.GitHub.io is an open source, high-performance
> > > > library
> > > > > > of
> > > > > > > > > stochastic streaming algorithms commonly called "ske

Re: DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
Adding individuals seems to be working.  I was able to add k...@apache.org
successfully, and he successfully added a comment.  Casual readers can get
the gist from the text version I inserted into this thread.  Those that
wish to make comments make a request via the link and I will add them.

Will this work for now?

On Mon, Feb 25, 2019 at 3:30 PM Luciano Resende 
wrote:

> Should we move the proposal to the incubator wiki then?
>
> On Mon, Feb 25, 2019 at 15:26 leerho  wrote:
>
> > Ken,
> > Yahoo does not allow me to create a shared link outside our company,
> except
> > to individual email addresses.  So attempting to share it to the email
> > general@incubator.apache.org may not work.  Nonetheless, several
> > individuals were able to request access using their individual email
> > accounts and I was able to add them.  I will try to add you using
> > k...@apache.org, but if that doesn't work, I may need a gmail or
> > equivalent
> > account for you.
> >
> > Lee.
> >
> >
> > On Mon, Feb 25, 2019 at 2:59 PM Kenneth Knowles  wrote:
> >
> > > I could not access that document. I suggest you need to turn on link
> > > sharing.
> > >
> > > Kenn
> > >
> > > On Mon, Feb 25, 2019 at 12:00 PM lee...@gmail.com 
> > > wrote:
> > >
> > > > Try this link:
> > > >
> > >
> >
> https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing
> > > >
> > > >
> > > > On 2019/02/25 05:55:50, leerho  wrote:
> > > > > Yes I will try that tomorrow.
> > > > >
> > > > > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles 
> > > wrote:
> > > > >
> > > > > > Can you share the Google doc with the proposal? Per Ted's advice,
> > we
> > > > can
> > > > > > iterate quickly there and move it to the wiki when it becomes a
> bit
> > > > more
> > > > > > stable.
> > > > > >
> > > > > > Kenn
> > > > > >
> > > > > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com <
> > lee...@gmail.com>
> > > > > > wrote:
> > > > > >
> > > > > > > Thanks for the offer.  i am a neophyte at this process and
> email
> > > > app!   I
> > > > > > > could use a lot of help getting this off the ground!  Also, I'm
> > not
> > > > sure
> > > > > > > that Mr. Chen and Mr. Onofré have fully accepted taking this on
> > :)
> > > > > > >
> > > > > > > Lee.
> > > > > > >
> > > > > > > On 2019/02/23 06:03:58, Kenneth Knowles 
> wrote:
> > > > > > > > Nice.
> > > > > > > >
> > > > > > > > I would very much like to help mentor this project, though
> you
> > > > already
> > > > > > > have
> > > > > > > > a couple good ones.
> > > > > > > >
> > > > > > > > I concur with incubator as sponsoring entity.
> > > > > > > >
> > > > > > > > Kenn (VP Apache Beam)
> > > > > > > >
> > > > > > > > On Fri, Feb 22, 2019 at 9:45 PM leerho 
> > wrote:
> > > > > > > >
> > > > > > > > > I didn't realize that this mail list does not accept PDF
> > files,
> > > > > > > apparently
> > > > > > > > > only text.  So let me try one more time ... :)  Please let
> me
> > > > know if
> > > > > > > > > this works!
> > > > > > > > >
> > > > > > > > >
> > > > > > > > > = Apache DataSketches Proposal[1] =
> > > > > > > > >
> > > > > > > > > == Abstract ==
> > > > > > > > >
> > > > > > > > > DataSketches.GitHub.io is an open source, high-performance
> > > > library
> > > > > > of
> > > > > > > > > stochastic streaming algorithms commonly called "sketches"
> in
> > > the
> > > > > > data
> > > > > > > > > sciences. Sketches are small, stateful programs that
> process
> > > > massive
> > > > > > > data
> &g

Re: DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
Ken,
Yahoo does not allow me to create a shared link outside our company, except
to individual email addresses.  So attempting to share it to the email
general@incubator.apache.org may not work.  Nonetheless, several
individuals were able to request access using their individual email
accounts and I was able to add them.  I will try to add you using
k...@apache.org, but if that doesn't work, I may need a gmail or equivalent
account for you.

Lee.


On Mon, Feb 25, 2019 at 2:59 PM Kenneth Knowles  wrote:

> I could not access that document. I suggest you need to turn on link
> sharing.
>
> Kenn
>
> On Mon, Feb 25, 2019 at 12:00 PM lee...@gmail.com 
> wrote:
>
> > Try this link:
> >
> https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing
> >
> >
> > On 2019/02/25 05:55:50, leerho  wrote:
> > > Yes I will try that tomorrow.
> > >
> > > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles 
> wrote:
> > >
> > > > Can you share the Google doc with the proposal? Per Ted's advice, we
> > can
> > > > iterate quickly there and move it to the wiki when it becomes a bit
> > more
> > > > stable.
> > > >
> > > > Kenn
> > > >
> > > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com 
> > > > wrote:
> > > >
> > > > > Thanks for the offer.  i am a neophyte at this process and email
> > app!   I
> > > > > could use a lot of help getting this off the ground!  Also, I'm not
> > sure
> > > > > that Mr. Chen and Mr. Onofré have fully accepted taking this on :)
> > > > >
> > > > > Lee.
> > > > >
> > > > > On 2019/02/23 06:03:58, Kenneth Knowles  wrote:
> > > > > > Nice.
> > > > > >
> > > > > > I would very much like to help mentor this project, though you
> > already
> > > > > have
> > > > > > a couple good ones.
> > > > > >
> > > > > > I concur with incubator as sponsoring entity.
> > > > > >
> > > > > > Kenn (VP Apache Beam)
> > > > > >
> > > > > > On Fri, Feb 22, 2019 at 9:45 PM leerho  wrote:
> > > > > >
> > > > > > > I didn't realize that this mail list does not accept PDF files,
> > > > > apparently
> > > > > > > only text.  So let me try one more time ... :)  Please let me
> > know if
> > > > > > > this works!
> > > > > > >
> > > > > > >
> > > > > > > = Apache DataSketches Proposal[1] =
> > > > > > >
> > > > > > > == Abstract ==
> > > > > > >
> > > > > > > DataSketches.GitHub.io is an open source, high-performance
> > library
> > > > of
> > > > > > > stochastic streaming algorithms commonly called "sketches" in
> the
> > > > data
> > > > > > > sciences. Sketches are small, stateful programs that process
> > massive
> > > > > data
> > > > > > > as a stream and can provide approximate answers, with
> > mathematical
> > > > > > > guarantees, to computationally difficult queries
> > orders-of-magnitude
> > > > > faster
> > > > > > > than traditional, exact methods.
> > > > > > >
> > > > > > > This proposal is to move DataSketches to the Apache Software
> > > > > > > Foundation(ASF) transferring ownership of its copyright
> > intellectual
> > > > > > > property to the ASF.  Thereafter, DataSketches would be
> > officially
> > > > > known as
> > > > > > > Apache DataSketches and its evolution and governance would come
> > under
> > > > > the
> > > > > > > rules and guidance of the ASF.
> > > > > > >
> > > > > > > == Introduction ==
> > > > > > >
> > > > > > > The DataSketches library contains carefully crafted
> > implementations
> > > > of
> > > > > > > sketch algorithms that meet rigorous standards of quality and
> > > > > performance
> > > > > > > and provide capabilities required for large-scale production
> > systems
> > > > > that
> > > > > > >

DataSketches Proposal WRT Commons-Math

2019-02-25 Thread leerho
As your suggestion may come up again, I have addressed it by adding a small 
section to the proposal itself (in the Google Doc).

On 2019/02/25 17:36:31, Ted Dunning  wrote: 
> There is also the general question of whether it is better to be a
> top-level project or to become a contribution to commons math.
> 
> 
> 
> On Sun, Feb 24, 2019 at 10:56 PM leerho  wrote:
> 
> > Yes I will try that tomorrow.
> >
> > On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles  wrote:
> >
> > > Can you share the Google doc with the proposal? Per Ted's advice, we can
> > > iterate quickly there and move it to the wiki when it becomes a bit more
> > > stable.
> > >
> > > Kenn
> > >
> > > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com 
> > > wrote:
> > >
> > > > Thanks for the offer.  i am a neophyte at this process and email app!
> >  I
> > > > could use a lot of help getting this off the ground!  Also, I'm not
> > sure
> > > > that Mr. Chen and Mr. Onofré have fully accepted taking this on :)
> > > >
> > > > Lee.
> > > >
> > > > On 2019/02/23 06:03:58, Kenneth Knowles  wrote:
> > > > > Nice.
> > > > >
> > > > > I would very much like to help mentor this project, though you
> > already
> > > > have
> > > > > a couple good ones.
> > > > >
> > > > > I concur with incubator as sponsoring entity.
> > > > >
> > > > > Kenn (VP Apache Beam)
> > > > >
> > > > > On Fri, Feb 22, 2019 at 9:45 PM leerho  wrote:
> > > > >
> > > > > > I didn't realize that this mail list does not accept PDF files,
> > > > apparently
> > > > > > only text.  So let me try one more time ... :)  Please let me know
> > if
> > > > > > this works!
> > > > > >
> > > > > >
> > > > > > = Apache DataSketches Proposal[1] =
> > > > > >
> > > > > > == Abstract ==
> > > > > >
> > > > > > DataSketches.GitHub.io is an open source, high-performance library
> > > of
> > > > > > stochastic streaming algorithms commonly called "sketches" in the
> > > data
> > > > > > sciences. Sketches are small, stateful programs that process
> > massive
> > > > data
> > > > > > as a stream and can provide approximate answers, with mathematical
> > > > > > guarantees, to computationally difficult queries
> > orders-of-magnitude
> > > > faster
> > > > > > than traditional, exact methods.
> > > > > >
> > > > > > This proposal is to move DataSketches to the Apache Software
> > > > > > Foundation(ASF) transferring ownership of its copyright
> > intellectual
> > > > > > property to the ASF.  Thereafter, DataSketches would be officially
> > > > known as
> > > > > > Apache DataSketches and its evolution and governance would come
> > under
> > > > the
> > > > > > rules and guidance of the ASF.
> > > > > >
> > > > > > == Introduction ==
> > > > > >
> > > > > > The DataSketches library contains carefully crafted implementations
> > > of
> > > > > > sketch algorithms that meet rigorous standards of quality and
> > > > performance
> > > > > > and provide capabilities required for large-scale production
> > systems
> > > > that
> > > > > > must process and analyze massive data. The DataSketches core
> > > > repository is
> > > > > > written in Java with a parallel core repository written in C++ that
> > > > > > includes Python wrappers. The DataSketches library also includes
> > > > special
> > > > > > repositories for extending the core library for Apache Hive and
> > > Apache
> > > > Pig.
> > > > > > The sketches developed in the different languages share a common
> > > binary
> > > > > > storage format so that sketches created and stored in Java, for
> > > > example,
> > > > > > can be fully used in C++, and visa versa.  Because the stored
> > sketch
> > > > > > "images" are just a "blob" of bytes (similar to picture images),
> > they

DataSketches Proposal - Google Docs Link

2019-02-25 Thread leerho
Try this link: 
https://docs.google.com/document/d/19JKevzFQNcaLA51LFLUlP1hzdFDW7oDJrJO8N6weDv8/edit?usp=sharing


On 2019/02/25 05:55:50, leerho  wrote: 
> Yes I will try that tomorrow.
> 
> On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles  wrote:
> 
> > Can you share the Google doc with the proposal? Per Ted's advice, we can
> > iterate quickly there and move it to the wiki when it becomes a bit more
> > stable.
> >
> > Kenn
> >
> > On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com 
> > wrote:
> >
> > > Thanks for the offer.  i am a neophyte at this process and email app!   I
> > > could use a lot of help getting this off the ground!  Also, I'm not sure
> > > that Mr. Chen and Mr. Onofré have fully accepted taking this on :)
> > >
> > > Lee.
> > >
> > > On 2019/02/23 06:03:58, Kenneth Knowles  wrote:
> > > > Nice.
> > > >
> > > > I would very much like to help mentor this project, though you already
> > > have
> > > > a couple good ones.
> > > >
> > > > I concur with incubator as sponsoring entity.
> > > >
> > > > Kenn (VP Apache Beam)
> > > >
> > > > On Fri, Feb 22, 2019 at 9:45 PM leerho  wrote:
> > > >
> > > > > I didn't realize that this mail list does not accept PDF files,
> > > apparently
> > > > > only text.  So let me try one more time ... :)  Please let me know if
> > > > > this works!
> > > > >
> > > > >
> > > > > = Apache DataSketches Proposal[1] =
> > > > >
> > > > > == Abstract ==
> > > > >
> > > > > DataSketches.GitHub.io is an open source, high-performance library
> > of
> > > > > stochastic streaming algorithms commonly called "sketches" in the
> > data
> > > > > sciences. Sketches are small, stateful programs that process massive
> > > data
> > > > > as a stream and can provide approximate answers, with mathematical
> > > > > guarantees, to computationally difficult queries orders-of-magnitude
> > > faster
> > > > > than traditional, exact methods.
> > > > >
> > > > > This proposal is to move DataSketches to the Apache Software
> > > > > Foundation(ASF) transferring ownership of its copyright intellectual
> > > > > property to the ASF.  Thereafter, DataSketches would be officially
> > > known as
> > > > > Apache DataSketches and its evolution and governance would come under
> > > the
> > > > > rules and guidance of the ASF.
> > > > >
> > > > > == Introduction ==
> > > > >
> > > > > The DataSketches library contains carefully crafted implementations
> > of
> > > > > sketch algorithms that meet rigorous standards of quality and
> > > performance
> > > > > and provide capabilities required for large-scale production systems
> > > that
> > > > > must process and analyze massive data. The DataSketches core
> > > repository is
> > > > > written in Java with a parallel core repository written in C++ that
> > > > > includes Python wrappers. The DataSketches library also includes
> > > special
> > > > > repositories for extending the core library for Apache Hive and
> > Apache
> > > Pig.
> > > > > The sketches developed in the different languages share a common
> > binary
> > > > > storage format so that sketches created and stored in Java, for
> > > example,
> > > > > can be fully used in C++, and visa versa.  Because the stored sketch
> > > > > "images" are just a "blob" of bytes (similar to picture images), they
> > > can
> > > > > be shared across many different systems, languages and platforms.
> > > > >
> > > > > The DataSketches documentation website,
> > https://datasketches.github.io
> > > ,
> > > > > includes general tutorials, a comprehensive research section with
> > > > > references to relevant academic papers, extensive examples for using
> > > the
> > > > > core library directly as well as examples for accessing the library
> > in
> > > > > Hive, Pig, and Apache Spark.
> > > > >
> > > > > The DataSketches library also includes a characterization repository
> > > 

Re: DataSketches Proposal

2019-02-24 Thread leerho
Yes I will try that tomorrow.

On Sun, Feb 24, 2019 at 7:34 PM Kenneth Knowles  wrote:

> Can you share the Google doc with the proposal? Per Ted's advice, we can
> iterate quickly there and move it to the wiki when it becomes a bit more
> stable.
>
> Kenn
>
> On Fri, Feb 22, 2019 at 10:21 PM lee...@gmail.com 
> wrote:
>
> > Thanks for the offer.  i am a neophyte at this process and email app!   I
> > could use a lot of help getting this off the ground!  Also, I'm not sure
> > that Mr. Chen and Mr. Onofré have fully accepted taking this on :)
> >
> > Lee.
> >
> > On 2019/02/23 06:03:58, Kenneth Knowles  wrote:
> > > Nice.
> > >
> > > I would very much like to help mentor this project, though you already
> > have
> > > a couple good ones.
> > >
> > > I concur with incubator as sponsoring entity.
> > >
> > > Kenn (VP Apache Beam)
> > >
> > > On Fri, Feb 22, 2019 at 9:45 PM leerho  wrote:
> > >
> > > > I didn't realize that this mail list does not accept PDF files,
> > apparently
> > > > only text.  So let me try one more time ... :)  Please let me know if
> > > > this works!
> > > >
> > > >
> > > > = Apache DataSketches Proposal[1] =
> > > >
> > > > == Abstract ==
> > > >
> > > > DataSketches.GitHub.io is an open source, high-performance library
> of
> > > > stochastic streaming algorithms commonly called "sketches" in the
> data
> > > > sciences. Sketches are small, stateful programs that process massive
> > data
> > > > as a stream and can provide approximate answers, with mathematical
> > > > guarantees, to computationally difficult queries orders-of-magnitude
> > faster
> > > > than traditional, exact methods.
> > > >
> > > > This proposal is to move DataSketches to the Apache Software
> > > > Foundation(ASF) transferring ownership of its copyright intellectual
> > > > property to the ASF.  Thereafter, DataSketches would be officially
> > known as
> > > > Apache DataSketches and its evolution and governance would come under
> > the
> > > > rules and guidance of the ASF.
> > > >
> > > > == Introduction ==
> > > >
> > > > The DataSketches library contains carefully crafted implementations
> of
> > > > sketch algorithms that meet rigorous standards of quality and
> > performance
> > > > and provide capabilities required for large-scale production systems
> > that
> > > > must process and analyze massive data. The DataSketches core
> > repository is
> > > > written in Java with a parallel core repository written in C++ that
> > > > includes Python wrappers. The DataSketches library also includes
> > special
> > > > repositories for extending the core library for Apache Hive and
> Apache
> > Pig.
> > > > The sketches developed in the different languages share a common
> binary
> > > > storage format so that sketches created and stored in Java, for
> > example,
> > > > can be fully used in C++, and visa versa.  Because the stored sketch
> > > > "images" are just a "blob" of bytes (similar to picture images), they
> > can
> > > > be shared across many different systems, languages and platforms.
> > > >
> > > > The DataSketches documentation website,
> https://datasketches.github.io
> > ,
> > > > includes general tutorials, a comprehensive research section with
> > > > references to relevant academic papers, extensive examples for using
> > the
> > > > core library directly as well as examples for accessing the library
> in
> > > > Hive, Pig, and Apache Spark.
> > > >
> > > > The DataSketches library also includes a characterization repository
> > for
> > > > long running test programs that are used for studying accuracy and
> > > > performance of these sketches over wide ranges of input variables.
> The
> > data
> > > > produced by these programs is used for generating the many
> performance
> > > > plots contained in the documentation website and for academic
> > > > publications.
> > > >
> > > > The code repositories used for production are versioned and published
> > to
> > > > Maven Central on periodic intervals as the library evolves.
> > > >
> >

Re: [PROPOSAL] Apache DataSketches

2019-02-23 Thread leerho



On 2019/02/23 18:54:57, leerho  wrote: 
> Forgive me I am a newbie, but there has got to be a better way to post a
> document that everyone can see and allow it to be updated without having to
> resend it as raw text.  I have an easier to read version of the proposal as
> a Google doc where I could post the link, but I sense that that is a no-no
> in this community.  Any suggestions?
> 
> Lee.
> 
> On Sat, Feb 23, 2019 at 4:07 AM Justin Mclean 
> wrote:
> 
> > Hi,
> >
> > > Currently only IPMC members can be official mentors, of the 3 people
> > listed here I believe only Jean-Baptiste Onofré is an IPMC member.
> >
> > Sorry, my apologies, Liang Chen is also an IPMC member, (Chen Liang, and
> > presumedly a different person, is a committer but not an IPMC member) but I
> > cannot find Gil Yehuda, do you mind provide a link to the roster for them
> > or their Apache id?
> >
> > Thanks,
> > Justin
> > -
> > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > For additional commands, e-mail: general-h...@incubator.apache.org
> >
> >
> I got it.  It is the Incubator Wiki.  Now trying to get this proposal posted 
> there. 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Add to Wiki ContributorsGroup

2019-02-23 Thread leerho
Lee Rhodes


Re: [PROPOSAL] Apache DataSketches

2019-02-23 Thread leerho
Forgive me I am a newbie, but there has got to be a better way to post a
document that everyone can see and allow it to be updated without having to
resend it as raw text.  I have an easier to read version of the proposal as
a Google doc where I could post the link, but I sense that that is a no-no
in this community.  Any suggestions?

Lee.

On Sat, Feb 23, 2019 at 4:07 AM Justin Mclean 
wrote:

> Hi,
>
> > Currently only IPMC members can be official mentors, of the 3 people
> listed here I believe only Jean-Baptiste Onofré is an IPMC member.
>
> Sorry, my apologies, Liang Chen is also an IPMC member, (Chen Liang, and
> presumedly a different person, is a committer but not an IPMC member) but I
> cannot find Gil Yehuda, do you mind provide a link to the roster for them
> or their Apache id?
>
> Thanks,
> Justin
> -
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>


[PROPOSAL] Apache DataSketches

2019-02-23 Thread leerho


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

DataSketches Proposal

2019-02-22 Thread leerho
Thanks for the offer.  i am a neophyte at this process and email app!   I could 
use a lot of help getting this off the ground!  Also, I'm not sure that Mr. 
Chen and Mr. Onofré have fully accepted taking this on :)

Lee.

On 2019/02/23 06:03:58, Kenneth Knowles  wrote: 
> Nice.
> 
> I would very much like to help mentor this project, though you already have
> a couple good ones.
> 
> I concur with incubator as sponsoring entity.
> 
> Kenn (VP Apache Beam)
> 
> On Fri, Feb 22, 2019 at 9:45 PM leerho  wrote:
> 
> > I didn't realize that this mail list does not accept PDF files, apparently
> > only text.  So let me try one more time ... :)  Please let me know if
> > this works!
> >
> >
> > = Apache DataSketches Proposal[1] =
> >
> > == Abstract ==
> >
> > DataSketches.GitHub.io is an open source, high-performance library of
> > stochastic streaming algorithms commonly called "sketches" in the data
> > sciences. Sketches are small, stateful programs that process massive data
> > as a stream and can provide approximate answers, with mathematical
> > guarantees, to computationally difficult queries orders-of-magnitude faster
> > than traditional, exact methods.
> >
> > This proposal is to move DataSketches to the Apache Software
> > Foundation(ASF) transferring ownership of its copyright intellectual
> > property to the ASF.  Thereafter, DataSketches would be officially known as
> > Apache DataSketches and its evolution and governance would come under the
> > rules and guidance of the ASF.
> >
> > == Introduction ==
> >
> > The DataSketches library contains carefully crafted implementations of
> > sketch algorithms that meet rigorous standards of quality and performance
> > and provide capabilities required for large-scale production systems that
> > must process and analyze massive data. The DataSketches core repository is
> > written in Java with a parallel core repository written in C++ that
> > includes Python wrappers. The DataSketches library also includes special
> > repositories for extending the core library for Apache Hive and Apache Pig.
> > The sketches developed in the different languages share a common binary
> > storage format so that sketches created and stored in Java, for example,
> > can be fully used in C++, and visa versa.  Because the stored sketch
> > "images" are just a "blob" of bytes (similar to picture images), they can
> > be shared across many different systems, languages and platforms.
> >
> > The DataSketches documentation website, https://datasketches.github.io ,
> > includes general tutorials, a comprehensive research section with
> > references to relevant academic papers, extensive examples for using the
> > core library directly as well as examples for accessing the library in
> > Hive, Pig, and Apache Spark.
> >
> > The DataSketches library also includes a characterization repository for
> > long running test programs that are used for studying accuracy and
> > performance of these sketches over wide ranges of input variables. The data
> > produced by these programs is used for generating the many performance
> > plots contained in the documentation website and for academic
> > publications.
> >
> > The code repositories used for production are versioned and published to
> > Maven Central on periodic intervals as the library evolves.
> >
> > The DataSketches library also includes several experimental repositories
> > for use-cases outside the large-scale systems environments, such as
> > sketches for mobile, IoT devices (Android), command-line access of the
> > sketch library, and an experimental repository for vector-based sketches
> > that performs approximate Singular Value Decomposition (SVD) analysis that
> > could potentially be used in Machine Learning (ML) applications.
> >
> > == Background ==
> >
> > The DataSketches library was started in 2012 as internal Yahoo project to
> > dramatically reduce time and resources required for distinct (unique)
> > counting.  An extensive search on the Internet at the time yielded a number
> > of theoretical papers on stochastic streaming algorithms with pseudocode
> > examples, but we did not find any usable open-source code of the quality we
> > felt we needed for our internal production systems.  So we started a small
> > project (one person) to develop our own sketches working directly from
> > published theoretical papers.
> >
> > The DataSketches library was designed from the start with the objective of
> > making these alg

Re: [PROPOSAL] Apache DataSketches

2019-02-22 Thread leerho
use to characterize the accuracy and speed performance of the sketches in
the library and is constantly being updated. Examples of the job command
files used for various tests can be found in the src/main/resources
directory. Some of these tests can run for hours depending on its
configuration.
   * experimental: This repository is an experimental staging area for code
that will eventually end up in another repository. This code is not
versioned and not registered with Maven Central.
   * sketches-misc: Demos and other code not related to production
deployment

* C++ and Python
   * sketches-core-cpp: This is the C++/Python companion to the Java
sketches-core. These implementations are binary compatible with their
counterparts in Java. In other words, a sketch created and stored in C++
can be opened and read in Java and visa-versa. This site also has our
Python adaptors that basically wrap the C++ implementations, making the
high performance C++ implementations available from Python.
   * sketches-postgres: This site provides the postgres-specific adaptors
that wrap the C++ implementations making them available to the Postgres
database users.
   * characterization-cpp: This is the C++/Python companion to the Java
characterization repository.
   * experimental-cpp: This repository is an experimental staging area for
C++ code that will eventually end up in another repository.

* Command-Line Tools
   * sketches-cmd
   * homebrew-sketches
   * homebrew-sketches-cmd

These projects have always been Apache 2.0 licensed. We intend to bundle
all of these repositories since they are all complementary and should be
maintained in one project. Prior to our submission, we will combine all of
these projects into a new git repository.

== Source and Intellectual Property Submission Plan ==

Contributors to the DataSketches project have also signed the Yahoo
Individual Contributor License Agreement (https://yahoocla.herokuapp.com/
in order to contribute to the project.

With respect to trademark rights, Yahoo does not hold a trademark on the
phrase “DataSketches.” Based on feedback and guidance we receive during the
incubation process, we are open to renaming the project if necessary for
trademark or other concerns, but we would prefer not to have to do that.

== External Dependencies ==

All external dependencies are licensed under an Apache 2.0 or
Apache-compatible license. As we grow the DataSketches community we will
configure our build process to require and validate all contributions and
dependencies are licensed under the Apache 2.0 license or are under an
Apache-compatible license.

== Required Resources ==

=== Mailing Lists ===

We currently use a mix of mailing lists. We will migrate our existing
mailing lists to the following:

* d...@datasketches.incubator.apache.org

* u...@datasketches.incubator.apache.org

* priv...@datasketches.incubator.apache.org

* comm...@datasketches.incubator.apache.org

=== Source Control ===

The DataSketches team currently uses Git and would like to continue to do
so. We request a Git repository for DataSketches with mirroring to GitHub
enabled similar the following:

* https://github.com/apache/incubator-datasketches.git

=== Issue Tracking ===

We request the creation of an Apache-hosted JIRA. The DataSketches project
is currently using the public GitHub issue tracker and the public Google
Groups forum/sketches-user for issue tracking and discussions. We will
migrate and combine from these two sources to the Apache JIRA.

Proposed Jira ID: DATASKETCHES

== Initial Committers ==

The following list of individuals have been extremely active in our
community and should have write (commit) permissions to the repository.

* Eshcar Hillel  [eshcar at verizonmedia dot com]

* Kevin Lang[langk at verizonmedia dot com]

* Roman Leventov  [roman.leventov at c.metamarkets dot com]

* Edo Liberty   [libertye at amazon dot com]

* Jon Malkin[jmalkin at verizonmedia dot com]

* Lee Rhodes      [lrhodes at verizonmedia dot com] & [leerho
at gmail dot com]

* Alexander Saydakov [saydakov at verizonmedia dot com]

* Justin Thaler [justin.thaler at georgetown dot edu]

== Affiliations ==

The initial committers are from four organizations: Yahoo, Amazon,
Georgetown University, and Metamarkets/Snap.

=== Champion ===
(Recommended to me: )

Liang Chen, Vice President of Apache CarbonData, [chenliang613 at apache
dot org]
Jean-Baptiste Onofré,[[jb at nanthrax dot net]

=== Nominated Mentors ===
(Recommended to me: )

Liang Chen, Vice President of Apache CarbonData, [chenliang613 at apache
dot org]
Jean-Baptiste Onofré, jb at nanthrax dot net
Gil Yehuda, gyehuda at verizonmedia dot com

=== Sponsoring Entity ===

* The Apache Incubator This is our 1st choice 

* Apache Druid. The incubating Apache Druid project might also be a logical
sponsor. However, DataSketches ha

Re: [PROPOSAL] Apache DataSketches

2019-02-22 Thread leerho
I'll try again ... :)

On Fri, Feb 22, 2019 at 8:00 PM Ted Dunning  wrote:

> It didn't make it again
>
> On Fri, Feb 22, 2019, 8:35 PM leerho  wrote:
>
> > I'm not sure the attached document made it through.
> >
> > On Fri, Feb 22, 2019 at 7:28 PM leerho  wrote:
> >
> > >
> > >
> >
>

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org

Re: [PROPOSAL] Apache DataSketches

2019-02-22 Thread leerho
I'm not sure the attached document made it through.

On Fri, Feb 22, 2019 at 7:28 PM leerho  wrote:

>
>