Re: [Draft REPORT] Apache Parquet - January 2019

2019-01-07 Thread Uwe L. Korn
+1

Uwe

On Mon, Jan 7, 2019, at 9:14 PM, Ryan Blue wrote:
> +1
> 
> On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem
>  wrote:
> 
> > ## Description:
> > Parquet is a standard and interoperable columnar file format
> > for efficient analytics. Parquet has 3 sub-projects:
> > - parquet-format: format reference doc along with thrift based metadata
> > definition (used by both sub-projects bellow)
> > - parquet-mr: java apis and implementation of the format along with
> > integrations to various projects (thrift, pig, protobuf, avro, ...)
> > - parquet-cpp: C++ apis and implementation of the format along with Python
> > bindings and arrow integration.
> >
> > ## Issues:
> >  No issue at this time
> >
> > ## Activity:
> > Current activity around:
> >
> >- encryption
> >- Page indexing
> >- cutting a new release
> >- improvement on parquet-proto
> >
> >
> > ## Health report:
> > The discussion volume on the mailing lists is stable.
> > Tickets get created and closed at a reasonable pace.
> >
> > ## PMC changes:
> >
> >  - Currently 24 PMC members.
> >  - No new PMC members added in the last 3 months
> >  - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018
> >
> > ## Committer base changes:
> >
> >  - Currently 31 committers.
> >  - No new committers added in the last 3 months
> >  - Last committer addition was Benoit Hanotte at Mon May 28 2018
> >
> > ## Releases:
> >
> >  - Last release was Format 2.6.0 on Mon Oct 01 2018
> >
> > ## Mailing list activity:
> >
> >  - dev@parquet.apache.org:
> > - 216 subscribers (up 2 in the last 3 months):
> > - 529 emails sent to list (757 in previous quarter)
> >
> >
> > ## JIRA activity:
> >
> >  - 49 JIRA tickets created in the last 3 months
> >  - 65 JIRA tickets closed/resolved in the last 3 months
> >
> 
> 
> -- 
> Ryan Blue
> Software Engineer
> Netflix


Re: [Draft REPORT] Apache Parquet - January 2019

2019-01-07 Thread Uwe L. Korn
+1

Uwe

On Mon, Jan 7, 2019, at 9:14 PM, Ryan Blue wrote:
> +1
> 
> On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem
>  wrote:
> 
> > ## Description:
> > Parquet is a standard and interoperable columnar file format
> > for efficient analytics. Parquet has 3 sub-projects:
> > - parquet-format: format reference doc along with thrift based metadata
> > definition (used by both sub-projects bellow)
> > - parquet-mr: java apis and implementation of the format along with
> > integrations to various projects (thrift, pig, protobuf, avro, ...)
> > - parquet-cpp: C++ apis and implementation of the format along with Python
> > bindings and arrow integration.
> >
> > ## Issues:
> >  No issue at this time
> >
> > ## Activity:
> > Current activity around:
> >
> >- encryption
> >- Page indexing
> >- cutting a new release
> >- improvement on parquet-proto
> >
> >
> > ## Health report:
> > The discussion volume on the mailing lists is stable.
> > Tickets get created and closed at a reasonable pace.
> >
> > ## PMC changes:
> >
> >  - Currently 24 PMC members.
> >  - No new PMC members added in the last 3 months
> >  - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018
> >
> > ## Committer base changes:
> >
> >  - Currently 31 committers.
> >  - No new committers added in the last 3 months
> >  - Last committer addition was Benoit Hanotte at Mon May 28 2018
> >
> > ## Releases:
> >
> >  - Last release was Format 2.6.0 on Mon Oct 01 2018
> >
> > ## Mailing list activity:
> >
> >  - dev@parquet.apache.org:
> > - 216 subscribers (up 2 in the last 3 months):
> > - 529 emails sent to list (757 in previous quarter)
> >
> >
> > ## JIRA activity:
> >
> >  - 49 JIRA tickets created in the last 3 months
> >  - 65 JIRA tickets closed/resolved in the last 3 months
> >
> 
> 
> -- 
> Ryan Blue
> Software Engineer
> Netflix


Re: [Draft REPORT] Apache Parquet - January 2019

2019-01-07 Thread Ryan Blue
+1

On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem
 wrote:

> ## Description:
> Parquet is a standard and interoperable columnar file format
> for efficient analytics. Parquet has 3 sub-projects:
> - parquet-format: format reference doc along with thrift based metadata
> definition (used by both sub-projects bellow)
> - parquet-mr: java apis and implementation of the format along with
> integrations to various projects (thrift, pig, protobuf, avro, ...)
> - parquet-cpp: C++ apis and implementation of the format along with Python
> bindings and arrow integration.
>
> ## Issues:
>  No issue at this time
>
> ## Activity:
> Current activity around:
>
>- encryption
>- Page indexing
>- cutting a new release
>- improvement on parquet-proto
>
>
> ## Health report:
> The discussion volume on the mailing lists is stable.
> Tickets get created and closed at a reasonable pace.
>
> ## PMC changes:
>
>  - Currently 24 PMC members.
>  - No new PMC members added in the last 3 months
>  - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018
>
> ## Committer base changes:
>
>  - Currently 31 committers.
>  - No new committers added in the last 3 months
>  - Last committer addition was Benoit Hanotte at Mon May 28 2018
>
> ## Releases:
>
>  - Last release was Format 2.6.0 on Mon Oct 01 2018
>
> ## Mailing list activity:
>
>  - dev@parquet.apache.org:
> - 216 subscribers (up 2 in the last 3 months):
> - 529 emails sent to list (757 in previous quarter)
>
>
> ## JIRA activity:
>
>  - 49 JIRA tickets created in the last 3 months
>  - 65 JIRA tickets closed/resolved in the last 3 months
>


-- 
Ryan Blue
Software Engineer
Netflix


Re: [Draft report] Apache Parquet

2016-10-13 Thread Jake Farrell
+1

-Jake

On Wed, Oct 12, 2016 at 8:43 PM, Julien Le Dem  wrote:

> Report from the Apache Parquet committee [Julien Le Dem]
>
> ## Description:
> Parquet is a standard and interoperable columnar file format for
> efficient analytics.
>
> ## Issues:
> there are no issues requiring board attention at this time
>
> ## Activity:
> The community has been converging toward a 1.9 release. The vote will start
> in the coming days. Discussion about better encoding and vectorization apis
> are ongoing.
> The parquet-cpp repo has reached a stable state and should release soon.
> Integration with arrow-cpp is now in the parquet-cpp repo.
>
> ## Health report:
> The PMC and committer list are growing. Discussion is happening on the
> mailing list, JIRA and regular hangout sync up. Notes are sent to the
> mailing list.
>
> ## PMC changes:
>
>  - Currently 22 PMC members.
>  - Wes McKinney was added to the PMC on Thu Sep 01 2016
>
> ## Committer base changes:
>
>  - Currently 25 committers.
>  - Uwe Korn was added as a committer on Sun Sep 04 2016
>
> ## Releases:
>
>  - Last release was Format 2.3.1 on Thu Dec 17 2015
>
> ## Mailing list activity:
>
>  - Activity on the mailing list is still relatively the same
>  - JIRAS are resolved about at the same pace they are opened.
>
>  - dev@parquet.apache.org:
> - 172 subscribers (up 9 in the last 3 months):
> - 486 emails sent to list (394 in previous quarter)
>
>
> ## JIRA activity:
>
>  - 85 JIRA tickets created in the last 3 months
>  - 74 JIRA tickets closed/resolved in the last 3 months
>
> --
> Julien
>


Re: [Draft report] Apache Parquet

2016-10-13 Thread Wes McKinney
+1

On Thu, Oct 13, 2016 at 11:15 AM, Ryan Blue  wrote:
> +1
>
> On Wed, Oct 12, 2016 at 11:40 PM, Uwe Korn  wrote:
>
>> +1
>>
>>
>>
>> On 13.10.16 02:43, Julien Le Dem wrote:
>>
>>> Report from the Apache Parquet committee [Julien Le Dem]
>>>
>>> ## Description:
>>> Parquet is a standard and interoperable columnar file format for
>>> efficient analytics.
>>>
>>> ## Issues:
>>> there are no issues requiring board attention at this time
>>>
>>> ## Activity:
>>> The community has been converging toward a 1.9 release. The vote will
>>> start
>>> in the coming days. Discussion about better encoding and vectorization
>>> apis
>>> are ongoing.
>>> The parquet-cpp repo has reached a stable state and should release soon.
>>> Integration with arrow-cpp is now in the parquet-cpp repo.
>>>
>>> ## Health report:
>>> The PMC and committer list are growing. Discussion is happening on the
>>> mailing list, JIRA and regular hangout sync up. Notes are sent to the
>>> mailing list.
>>>
>>> ## PMC changes:
>>>
>>>   - Currently 22 PMC members.
>>>   - Wes McKinney was added to the PMC on Thu Sep 01 2016
>>>
>>> ## Committer base changes:
>>>
>>>   - Currently 25 committers.
>>>   - Uwe Korn was added as a committer on Sun Sep 04 2016
>>>
>>> ## Releases:
>>>
>>>   - Last release was Format 2.3.1 on Thu Dec 17 2015
>>>
>>> ## Mailing list activity:
>>>
>>>   - Activity on the mailing list is still relatively the same
>>>   - JIRAS are resolved about at the same pace they are opened.
>>>
>>>   - dev@parquet.apache.org:
>>>  - 172 subscribers (up 9 in the last 3 months):
>>>  - 486 emails sent to list (394 in previous quarter)
>>>
>>>
>>> ## JIRA activity:
>>>
>>>   - 85 JIRA tickets created in the last 3 months
>>>   - 74 JIRA tickets closed/resolved in the last 3 months
>>>
>>>
>>
>
>
> --
> Ryan Blue
> Software Engineer
> Netflix


Re: [Draft report] Apache Parquet

2016-10-13 Thread Ryan Blue
+1

On Wed, Oct 12, 2016 at 11:40 PM, Uwe Korn  wrote:

> +1
>
>
>
> On 13.10.16 02:43, Julien Le Dem wrote:
>
>> Report from the Apache Parquet committee [Julien Le Dem]
>>
>> ## Description:
>> Parquet is a standard and interoperable columnar file format for
>> efficient analytics.
>>
>> ## Issues:
>> there are no issues requiring board attention at this time
>>
>> ## Activity:
>> The community has been converging toward a 1.9 release. The vote will
>> start
>> in the coming days. Discussion about better encoding and vectorization
>> apis
>> are ongoing.
>> The parquet-cpp repo has reached a stable state and should release soon.
>> Integration with arrow-cpp is now in the parquet-cpp repo.
>>
>> ## Health report:
>> The PMC and committer list are growing. Discussion is happening on the
>> mailing list, JIRA and regular hangout sync up. Notes are sent to the
>> mailing list.
>>
>> ## PMC changes:
>>
>>   - Currently 22 PMC members.
>>   - Wes McKinney was added to the PMC on Thu Sep 01 2016
>>
>> ## Committer base changes:
>>
>>   - Currently 25 committers.
>>   - Uwe Korn was added as a committer on Sun Sep 04 2016
>>
>> ## Releases:
>>
>>   - Last release was Format 2.3.1 on Thu Dec 17 2015
>>
>> ## Mailing list activity:
>>
>>   - Activity on the mailing list is still relatively the same
>>   - JIRAS are resolved about at the same pace they are opened.
>>
>>   - dev@parquet.apache.org:
>>  - 172 subscribers (up 9 in the last 3 months):
>>  - 486 emails sent to list (394 in previous quarter)
>>
>>
>> ## JIRA activity:
>>
>>   - 85 JIRA tickets created in the last 3 months
>>   - 74 JIRA tickets closed/resolved in the last 3 months
>>
>>
>


-- 
Ryan Blue
Software Engineer
Netflix


Re: [Draft report] Apache Parquet

2016-10-13 Thread Uwe Korn

+1


On 13.10.16 02:43, Julien Le Dem wrote:

Report from the Apache Parquet committee [Julien Le Dem]

## Description:
Parquet is a standard and interoperable columnar file format for
efficient analytics.

## Issues:
there are no issues requiring board attention at this time

## Activity:
The community has been converging toward a 1.9 release. The vote will start
in the coming days. Discussion about better encoding and vectorization apis
are ongoing.
The parquet-cpp repo has reached a stable state and should release soon.
Integration with arrow-cpp is now in the parquet-cpp repo.

## Health report:
The PMC and committer list are growing. Discussion is happening on the
mailing list, JIRA and regular hangout sync up. Notes are sent to the
mailing list.

## PMC changes:

  - Currently 22 PMC members.
  - Wes McKinney was added to the PMC on Thu Sep 01 2016

## Committer base changes:

  - Currently 25 committers.
  - Uwe Korn was added as a committer on Sun Sep 04 2016

## Releases:

  - Last release was Format 2.3.1 on Thu Dec 17 2015

## Mailing list activity:

  - Activity on the mailing list is still relatively the same
  - JIRAS are resolved about at the same pace they are opened.

  - dev@parquet.apache.org:
 - 172 subscribers (up 9 in the last 3 months):
 - 486 emails sent to list (394 in previous quarter)


## JIRA activity:

  - 85 JIRA tickets created in the last 3 months
  - 74 JIRA tickets closed/resolved in the last 3 months