Re: [Draft REPORT] Apache Parquet - January 2019
+1 Uwe On Mon, Jan 7, 2019, at 9:14 PM, Ryan Blue wrote: > +1 > > On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem > wrote: > > > ## Description: > > Parquet is a standard and interoperable columnar file format > > for efficient analytics. Parquet has 3 sub-projects: > > - parquet-format: format reference doc along with thrift based metadata > > definition (used by both sub-projects bellow) > > - parquet-mr: java apis and implementation of the format along with > > integrations to various projects (thrift, pig, protobuf, avro, ...) > > - parquet-cpp: C++ apis and implementation of the format along with Python > > bindings and arrow integration. > > > > ## Issues: > > No issue at this time > > > > ## Activity: > > Current activity around: > > > >- encryption > >- Page indexing > >- cutting a new release > >- improvement on parquet-proto > > > > > > ## Health report: > > The discussion volume on the mailing lists is stable. > > Tickets get created and closed at a reasonable pace. > > > > ## PMC changes: > > > > - Currently 24 PMC members. > > - No new PMC members added in the last 3 months > > - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018 > > > > ## Committer base changes: > > > > - Currently 31 committers. > > - No new committers added in the last 3 months > > - Last committer addition was Benoit Hanotte at Mon May 28 2018 > > > > ## Releases: > > > > - Last release was Format 2.6.0 on Mon Oct 01 2018 > > > > ## Mailing list activity: > > > > - dev@parquet.apache.org: > > - 216 subscribers (up 2 in the last 3 months): > > - 529 emails sent to list (757 in previous quarter) > > > > > > ## JIRA activity: > > > > - 49 JIRA tickets created in the last 3 months > > - 65 JIRA tickets closed/resolved in the last 3 months > > > > > -- > Ryan Blue > Software Engineer > Netflix
Re: [Draft REPORT] Apache Parquet - January 2019
+1 Uwe On Mon, Jan 7, 2019, at 9:14 PM, Ryan Blue wrote: > +1 > > On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem > wrote: > > > ## Description: > > Parquet is a standard and interoperable columnar file format > > for efficient analytics. Parquet has 3 sub-projects: > > - parquet-format: format reference doc along with thrift based metadata > > definition (used by both sub-projects bellow) > > - parquet-mr: java apis and implementation of the format along with > > integrations to various projects (thrift, pig, protobuf, avro, ...) > > - parquet-cpp: C++ apis and implementation of the format along with Python > > bindings and arrow integration. > > > > ## Issues: > > No issue at this time > > > > ## Activity: > > Current activity around: > > > >- encryption > >- Page indexing > >- cutting a new release > >- improvement on parquet-proto > > > > > > ## Health report: > > The discussion volume on the mailing lists is stable. > > Tickets get created and closed at a reasonable pace. > > > > ## PMC changes: > > > > - Currently 24 PMC members. > > - No new PMC members added in the last 3 months > > - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018 > > > > ## Committer base changes: > > > > - Currently 31 committers. > > - No new committers added in the last 3 months > > - Last committer addition was Benoit Hanotte at Mon May 28 2018 > > > > ## Releases: > > > > - Last release was Format 2.6.0 on Mon Oct 01 2018 > > > > ## Mailing list activity: > > > > - dev@parquet.apache.org: > > - 216 subscribers (up 2 in the last 3 months): > > - 529 emails sent to list (757 in previous quarter) > > > > > > ## JIRA activity: > > > > - 49 JIRA tickets created in the last 3 months > > - 65 JIRA tickets closed/resolved in the last 3 months > > > > > -- > Ryan Blue > Software Engineer > Netflix
Re: [Draft REPORT] Apache Parquet - January 2019
+1 On Mon, Jan 7, 2019 at 11:39 AM Julien Le Dem wrote: > ## Description: > Parquet is a standard and interoperable columnar file format > for efficient analytics. Parquet has 3 sub-projects: > - parquet-format: format reference doc along with thrift based metadata > definition (used by both sub-projects bellow) > - parquet-mr: java apis and implementation of the format along with > integrations to various projects (thrift, pig, protobuf, avro, ...) > - parquet-cpp: C++ apis and implementation of the format along with Python > bindings and arrow integration. > > ## Issues: > No issue at this time > > ## Activity: > Current activity around: > >- encryption >- Page indexing >- cutting a new release >- improvement on parquet-proto > > > ## Health report: > The discussion volume on the mailing lists is stable. > Tickets get created and closed at a reasonable pace. > > ## PMC changes: > > - Currently 24 PMC members. > - No new PMC members added in the last 3 months > - Last PMC addition was Zoltan Ivanfi on Sun Apr 15 2018 > > ## Committer base changes: > > - Currently 31 committers. > - No new committers added in the last 3 months > - Last committer addition was Benoit Hanotte at Mon May 28 2018 > > ## Releases: > > - Last release was Format 2.6.0 on Mon Oct 01 2018 > > ## Mailing list activity: > > - dev@parquet.apache.org: > - 216 subscribers (up 2 in the last 3 months): > - 529 emails sent to list (757 in previous quarter) > > > ## JIRA activity: > > - 49 JIRA tickets created in the last 3 months > - 65 JIRA tickets closed/resolved in the last 3 months > -- Ryan Blue Software Engineer Netflix
Re: [Draft report] Apache Parquet
+1 -Jake On Wed, Oct 12, 2016 at 8:43 PM, Julien Le Demwrote: > Report from the Apache Parquet committee [Julien Le Dem] > > ## Description: > Parquet is a standard and interoperable columnar file format for > efficient analytics. > > ## Issues: > there are no issues requiring board attention at this time > > ## Activity: > The community has been converging toward a 1.9 release. The vote will start > in the coming days. Discussion about better encoding and vectorization apis > are ongoing. > The parquet-cpp repo has reached a stable state and should release soon. > Integration with arrow-cpp is now in the parquet-cpp repo. > > ## Health report: > The PMC and committer list are growing. Discussion is happening on the > mailing list, JIRA and regular hangout sync up. Notes are sent to the > mailing list. > > ## PMC changes: > > - Currently 22 PMC members. > - Wes McKinney was added to the PMC on Thu Sep 01 2016 > > ## Committer base changes: > > - Currently 25 committers. > - Uwe Korn was added as a committer on Sun Sep 04 2016 > > ## Releases: > > - Last release was Format 2.3.1 on Thu Dec 17 2015 > > ## Mailing list activity: > > - Activity on the mailing list is still relatively the same > - JIRAS are resolved about at the same pace they are opened. > > - dev@parquet.apache.org: > - 172 subscribers (up 9 in the last 3 months): > - 486 emails sent to list (394 in previous quarter) > > > ## JIRA activity: > > - 85 JIRA tickets created in the last 3 months > - 74 JIRA tickets closed/resolved in the last 3 months > > -- > Julien >
Re: [Draft report] Apache Parquet
+1 On Thu, Oct 13, 2016 at 11:15 AM, Ryan Bluewrote: > +1 > > On Wed, Oct 12, 2016 at 11:40 PM, Uwe Korn wrote: > >> +1 >> >> >> >> On 13.10.16 02:43, Julien Le Dem wrote: >> >>> Report from the Apache Parquet committee [Julien Le Dem] >>> >>> ## Description: >>> Parquet is a standard and interoperable columnar file format for >>> efficient analytics. >>> >>> ## Issues: >>> there are no issues requiring board attention at this time >>> >>> ## Activity: >>> The community has been converging toward a 1.9 release. The vote will >>> start >>> in the coming days. Discussion about better encoding and vectorization >>> apis >>> are ongoing. >>> The parquet-cpp repo has reached a stable state and should release soon. >>> Integration with arrow-cpp is now in the parquet-cpp repo. >>> >>> ## Health report: >>> The PMC and committer list are growing. Discussion is happening on the >>> mailing list, JIRA and regular hangout sync up. Notes are sent to the >>> mailing list. >>> >>> ## PMC changes: >>> >>> - Currently 22 PMC members. >>> - Wes McKinney was added to the PMC on Thu Sep 01 2016 >>> >>> ## Committer base changes: >>> >>> - Currently 25 committers. >>> - Uwe Korn was added as a committer on Sun Sep 04 2016 >>> >>> ## Releases: >>> >>> - Last release was Format 2.3.1 on Thu Dec 17 2015 >>> >>> ## Mailing list activity: >>> >>> - Activity on the mailing list is still relatively the same >>> - JIRAS are resolved about at the same pace they are opened. >>> >>> - dev@parquet.apache.org: >>> - 172 subscribers (up 9 in the last 3 months): >>> - 486 emails sent to list (394 in previous quarter) >>> >>> >>> ## JIRA activity: >>> >>> - 85 JIRA tickets created in the last 3 months >>> - 74 JIRA tickets closed/resolved in the last 3 months >>> >>> >> > > > -- > Ryan Blue > Software Engineer > Netflix
Re: [Draft report] Apache Parquet
+1 On Wed, Oct 12, 2016 at 11:40 PM, Uwe Kornwrote: > +1 > > > > On 13.10.16 02:43, Julien Le Dem wrote: > >> Report from the Apache Parquet committee [Julien Le Dem] >> >> ## Description: >> Parquet is a standard and interoperable columnar file format for >> efficient analytics. >> >> ## Issues: >> there are no issues requiring board attention at this time >> >> ## Activity: >> The community has been converging toward a 1.9 release. The vote will >> start >> in the coming days. Discussion about better encoding and vectorization >> apis >> are ongoing. >> The parquet-cpp repo has reached a stable state and should release soon. >> Integration with arrow-cpp is now in the parquet-cpp repo. >> >> ## Health report: >> The PMC and committer list are growing. Discussion is happening on the >> mailing list, JIRA and regular hangout sync up. Notes are sent to the >> mailing list. >> >> ## PMC changes: >> >> - Currently 22 PMC members. >> - Wes McKinney was added to the PMC on Thu Sep 01 2016 >> >> ## Committer base changes: >> >> - Currently 25 committers. >> - Uwe Korn was added as a committer on Sun Sep 04 2016 >> >> ## Releases: >> >> - Last release was Format 2.3.1 on Thu Dec 17 2015 >> >> ## Mailing list activity: >> >> - Activity on the mailing list is still relatively the same >> - JIRAS are resolved about at the same pace they are opened. >> >> - dev@parquet.apache.org: >> - 172 subscribers (up 9 in the last 3 months): >> - 486 emails sent to list (394 in previous quarter) >> >> >> ## JIRA activity: >> >> - 85 JIRA tickets created in the last 3 months >> - 74 JIRA tickets closed/resolved in the last 3 months >> >> > -- Ryan Blue Software Engineer Netflix
Re: [Draft report] Apache Parquet
+1 On 13.10.16 02:43, Julien Le Dem wrote: Report from the Apache Parquet committee [Julien Le Dem] ## Description: Parquet is a standard and interoperable columnar file format for efficient analytics. ## Issues: there are no issues requiring board attention at this time ## Activity: The community has been converging toward a 1.9 release. The vote will start in the coming days. Discussion about better encoding and vectorization apis are ongoing. The parquet-cpp repo has reached a stable state and should release soon. Integration with arrow-cpp is now in the parquet-cpp repo. ## Health report: The PMC and committer list are growing. Discussion is happening on the mailing list, JIRA and regular hangout sync up. Notes are sent to the mailing list. ## PMC changes: - Currently 22 PMC members. - Wes McKinney was added to the PMC on Thu Sep 01 2016 ## Committer base changes: - Currently 25 committers. - Uwe Korn was added as a committer on Sun Sep 04 2016 ## Releases: - Last release was Format 2.3.1 on Thu Dec 17 2015 ## Mailing list activity: - Activity on the mailing list is still relatively the same - JIRAS are resolved about at the same pace they are opened. - dev@parquet.apache.org: - 172 subscribers (up 9 in the last 3 months): - 486 emails sent to list (394 in previous quarter) ## JIRA activity: - 85 JIRA tickets created in the last 3 months - 74 JIRA tickets closed/resolved in the last 3 months