RE: Date and time for next parquet sync

2019-01-17 Thread Santlal J Gupta
Hi team,

This email id is not available from this Friday onwards. Please add my another 
mail id(santlal561...@gmail.com) in parquet sync meeting.

From my mail id(santlal561...@gmail.com), I already sent a meeting request.

Thanks
Santlal J Gupta

-Original Message-
From: Santlal J Gupta 
Sent: Friday, September 29, 2017 10:19 AM
To: dev@parquet.apache.org
Subject: RE: Date and time for next parquet sync

Yes I want to join.

-Original Message-
From: Lars Volker [mailto:l...@cloudera.com] 
Sent: Thursday, September 28, 2017 8:40 PM
To: dev@parquet.apache.org
Subject: Date and time for next parquet sync

I sent out an meeting request for the next Parquet sync on Wednesday, October 
11th at 9am PST. Please reply to this email if you'd like to join and found 
yourself not on the invite yet.


Re: Date and time for next Parquet sync

2018-09-18 Thread Nandor Kollar
Hi All,

Since it sees that apart from you several other community members
can't attend the meeting tomorrow, would anyone mind if we'd
reschedule it for next Tuesday at the same time?

Thanks,
Nandor

On Tue, Sep 18, 2018 at 9:51 AM, Zoltan Ivanfi  wrote:
> Hi,
>
> It seems that I won't be able to attend after all, sorry for the late
> decline.
>
> Zoltan
>
> On Mon, Sep 10, 2018 at 7:21 PM Ryan Blue  wrote:
>>
>> Sorry, looks like I was wrong on the dates. Thanks, Nandor.
>>
>> On Mon, Sep 10, 2018 at 5:15 AM Nandor Kollar 
>> wrote:
>>
>> > Ryan, I was aware of Strata, actually I wanted to schedule it to 18th
>> > September, but forgot to change 'next week' in the email. So in fact I
>> > already pushed it out one week, sorry for the confusion.
>> >
>> > Gidon, 19th is fine for me, if there's no objection against it, then
>> > we can have it then!
>> >
>> > Thanks,
>> > Nandor
>> >
>> > On Fri, Sep 7, 2018 at 9:21 PM, Ryan Blue 
>> > wrote:
>> > > We may want to push this out another week because it also conflicts
>> > > with
>> > > Strata NY. I think a few of us will be travelling Tuesday and both
>> > > Julien
>> > > and I have talks on Wednesday.
>> > >
>> > > On Fri, Sep 7, 2018 at 6:24 AM Gidon Gershinsky 
>> > wrote:
>> > >
>> > >> Hi Nandor,
>> > >>
>> > >> Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another
>> > week.
>> > >> Sept 18 is the Yom Kippur eve - this basically means I won't have a
>> > >> technical ability to join a call.
>> > >>
>> > >> Regarding the Google doc vs reviewed PR + .md file - it indeed
>> > >> becomes
>> > >> difficult and unneccesary to maintain two
>> > >> versions of the same documentation. Following you last mail, there
>> > >> was a
>> > >> high volume of review
>> > >> activity at the google doc, but now the spike is winding down, I'll
>> > >> be
>> > >> removing the duplicate part from the google doc
>> > >> (keeping the samples), with new comments to go to PRs (md and code).
>> > I'll
>> > >> send a detailed mail early next week.
>> > >>
>> > >>
>> > >> Cheers, Gidon.
>> > >>
>> > >> On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar
>> > > > >> >
>> > >> wrote:
>> > >>
>> > >> > Hi All,
>> > >> >
>> > >> > I'd like propose to have a Parquet Sync next week Tuesday
>> > >> > (September
>> > >> > 18th) at 6pm CEST / 9 am PST.
>> > >> >
>> > >> > Some of the topics which would be nice to discuss:
>> > >> > - review column indexes (PRs and feature branch)
>> > >> > - move Java code from format to mr (PR #517)
>> > >> > - Bloom filter spec
>> > >> > - columnar encryption spec (and general question, where to track
>> > >> > specs, Google doc vs reviewed PR + .md file)
>> > >> > - Refactor modules to use the new logical type API (PR under
>> > >> > review)
>> > >> > - new format release scope (nano precision timestamp, bloom filer?,
>> > >> > columnar encryption?)
>> > >> >
>> > >> > I'll send the meeting invite shortly. Feel free to propose other
>> > >> > time
>> > >> > slot if it is not suitable for you, and bring any additional topic
>> > >> > you'd like to discuss.
>> > >> >
>> > >> > Regards,
>> > >> > Nandor
>> > >> >
>> > >>
>> > >
>> > >
>> > > --
>> > > Ryan Blue
>> > > Software Engineer
>> > > Netflix
>> >
>>
>>
>> --
>> Ryan Blue
>> Software Engineer
>> Netflix


Re: Date and time for next Parquet sync

2018-09-18 Thread Zoltan Ivanfi
Hi,

It seems that I won't be able to attend after all, sorry for the late
decline.

Zoltan

On Mon, Sep 10, 2018 at 7:21 PM Ryan Blue  wrote:

> Sorry, looks like I was wrong on the dates. Thanks, Nandor.
>
> On Mon, Sep 10, 2018 at 5:15 AM Nandor Kollar 
> wrote:
>
> > Ryan, I was aware of Strata, actually I wanted to schedule it to 18th
> > September, but forgot to change 'next week' in the email. So in fact I
> > already pushed it out one week, sorry for the confusion.
> >
> > Gidon, 19th is fine for me, if there's no objection against it, then
> > we can have it then!
> >
> > Thanks,
> > Nandor
> >
> > On Fri, Sep 7, 2018 at 9:21 PM, Ryan Blue 
> > wrote:
> > > We may want to push this out another week because it also conflicts
> with
> > > Strata NY. I think a few of us will be travelling Tuesday and both
> Julien
> > > and I have talks on Wednesday.
> > >
> > > On Fri, Sep 7, 2018 at 6:24 AM Gidon Gershinsky 
> > wrote:
> > >
> > >> Hi Nandor,
> > >>
> > >> Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another
> > week.
> > >> Sept 18 is the Yom Kippur eve - this basically means I won't have a
> > >> technical ability to join a call.
> > >>
> > >> Regarding the Google doc vs reviewed PR + .md file - it indeed becomes
> > >> difficult and unneccesary to maintain two
> > >> versions of the same documentation. Following you last mail, there
> was a
> > >> high volume of review
> > >> activity at the google doc, but now the spike is winding down, I'll be
> > >> removing the duplicate part from the google doc
> > >> (keeping the samples), with new comments to go to PRs (md and code).
> > I'll
> > >> send a detailed mail early next week.
> > >>
> > >>
> > >> Cheers, Gidon.
> > >>
> > >> On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar
> >  > >> >
> > >> wrote:
> > >>
> > >> > Hi All,
> > >> >
> > >> > I'd like propose to have a Parquet Sync next week Tuesday (September
> > >> > 18th) at 6pm CEST / 9 am PST.
> > >> >
> > >> > Some of the topics which would be nice to discuss:
> > >> > - review column indexes (PRs and feature branch)
> > >> > - move Java code from format to mr (PR #517)
> > >> > - Bloom filter spec
> > >> > - columnar encryption spec (and general question, where to track
> > >> > specs, Google doc vs reviewed PR + .md file)
> > >> > - Refactor modules to use the new logical type API (PR under review)
> > >> > - new format release scope (nano precision timestamp, bloom filer?,
> > >> > columnar encryption?)
> > >> >
> > >> > I'll send the meeting invite shortly. Feel free to propose other
> time
> > >> > slot if it is not suitable for you, and bring any additional topic
> > >> > you'd like to discuss.
> > >> >
> > >> > Regards,
> > >> > Nandor
> > >> >
> > >>
> > >
> > >
> > > --
> > > Ryan Blue
> > > Software Engineer
> > > Netflix
> >
>
>
> --
> Ryan Blue
> Software Engineer
> Netflix
>


Re: Date and time for next Parquet sync

2018-09-10 Thread Ryan Blue
Sorry, looks like I was wrong on the dates. Thanks, Nandor.

On Mon, Sep 10, 2018 at 5:15 AM Nandor Kollar  wrote:

> Ryan, I was aware of Strata, actually I wanted to schedule it to 18th
> September, but forgot to change 'next week' in the email. So in fact I
> already pushed it out one week, sorry for the confusion.
>
> Gidon, 19th is fine for me, if there's no objection against it, then
> we can have it then!
>
> Thanks,
> Nandor
>
> On Fri, Sep 7, 2018 at 9:21 PM, Ryan Blue 
> wrote:
> > We may want to push this out another week because it also conflicts with
> > Strata NY. I think a few of us will be travelling Tuesday and both Julien
> > and I have talks on Wednesday.
> >
> > On Fri, Sep 7, 2018 at 6:24 AM Gidon Gershinsky 
> wrote:
> >
> >> Hi Nandor,
> >>
> >> Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another
> week.
> >> Sept 18 is the Yom Kippur eve - this basically means I won't have a
> >> technical ability to join a call.
> >>
> >> Regarding the Google doc vs reviewed PR + .md file - it indeed becomes
> >> difficult and unneccesary to maintain two
> >> versions of the same documentation. Following you last mail, there was a
> >> high volume of review
> >> activity at the google doc, but now the spike is winding down, I'll be
> >> removing the duplicate part from the google doc
> >> (keeping the samples), with new comments to go to PRs (md and code).
> I'll
> >> send a detailed mail early next week.
> >>
> >>
> >> Cheers, Gidon.
> >>
> >> On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar
>  >> >
> >> wrote:
> >>
> >> > Hi All,
> >> >
> >> > I'd like propose to have a Parquet Sync next week Tuesday (September
> >> > 18th) at 6pm CEST / 9 am PST.
> >> >
> >> > Some of the topics which would be nice to discuss:
> >> > - review column indexes (PRs and feature branch)
> >> > - move Java code from format to mr (PR #517)
> >> > - Bloom filter spec
> >> > - columnar encryption spec (and general question, where to track
> >> > specs, Google doc vs reviewed PR + .md file)
> >> > - Refactor modules to use the new logical type API (PR under review)
> >> > - new format release scope (nano precision timestamp, bloom filer?,
> >> > columnar encryption?)
> >> >
> >> > I'll send the meeting invite shortly. Feel free to propose other time
> >> > slot if it is not suitable for you, and bring any additional topic
> >> > you'd like to discuss.
> >> >
> >> > Regards,
> >> > Nandor
> >> >
> >>
> >
> >
> > --
> > Ryan Blue
> > Software Engineer
> > Netflix
>


-- 
Ryan Blue
Software Engineer
Netflix


Re: Date and time for next Parquet sync

2018-09-10 Thread Nandor Kollar
Ryan, I was aware of Strata, actually I wanted to schedule it to 18th
September, but forgot to change 'next week' in the email. So in fact I
already pushed it out one week, sorry for the confusion.

Gidon, 19th is fine for me, if there's no objection against it, then
we can have it then!

Thanks,
Nandor

On Fri, Sep 7, 2018 at 9:21 PM, Ryan Blue  wrote:
> We may want to push this out another week because it also conflicts with
> Strata NY. I think a few of us will be travelling Tuesday and both Julien
> and I have talks on Wednesday.
>
> On Fri, Sep 7, 2018 at 6:24 AM Gidon Gershinsky  wrote:
>
>> Hi Nandor,
>>
>> Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another week.
>> Sept 18 is the Yom Kippur eve - this basically means I won't have a
>> technical ability to join a call.
>>
>> Regarding the Google doc vs reviewed PR + .md file - it indeed becomes
>> difficult and unneccesary to maintain two
>> versions of the same documentation. Following you last mail, there was a
>> high volume of review
>> activity at the google doc, but now the spike is winding down, I'll be
>> removing the duplicate part from the google doc
>> (keeping the samples), with new comments to go to PRs (md and code). I'll
>> send a detailed mail early next week.
>>
>>
>> Cheers, Gidon.
>>
>> On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar > >
>> wrote:
>>
>> > Hi All,
>> >
>> > I'd like propose to have a Parquet Sync next week Tuesday (September
>> > 18th) at 6pm CEST / 9 am PST.
>> >
>> > Some of the topics which would be nice to discuss:
>> > - review column indexes (PRs and feature branch)
>> > - move Java code from format to mr (PR #517)
>> > - Bloom filter spec
>> > - columnar encryption spec (and general question, where to track
>> > specs, Google doc vs reviewed PR + .md file)
>> > - Refactor modules to use the new logical type API (PR under review)
>> > - new format release scope (nano precision timestamp, bloom filer?,
>> > columnar encryption?)
>> >
>> > I'll send the meeting invite shortly. Feel free to propose other time
>> > slot if it is not suitable for you, and bring any additional topic
>> > you'd like to discuss.
>> >
>> > Regards,
>> > Nandor
>> >
>>
>
>
> --
> Ryan Blue
> Software Engineer
> Netflix


Re: Date and time for next Parquet sync

2018-09-09 Thread Gidon Gershinsky
Thanks to the dozens of folks who have found time to read the design
googledoc since the last Parquet sync.

Now that the traffic peak at the doc is over, I'll be handling the overlap
with the new Encryption.md file. It is becoming difficult and unnecessary
to maintain two versions in parallel, therefore the overlapping part will
be removed from the googledoc. The Encryption.md
 (formatted here
)
and the current Thrift file

together provide a technically accurate, down to a single byte, description
of the encryption format and the writer/reader protocol. You can leave new
comments at the document pull request.

Old comments are still available at the google doc, press the comments
button for the Dec'17 to Aug'18 comment history. Also, you can read the
review comments at pull requests, merged (94
, 103
, 104
 in parquet-format, 463
, 464
 in parquet-cpp) and open (
95 *, 471
, 472
 in parquet-mr and 475
 in parquet-cpp).

Besides comment history, the google doc will keep the API description
("Usage samples" section). The sample code is in Java, but the same API is
available in the C++ Parquet version (thanks Tham Ha for the hard work on
this!).

Cheers, Gidon.



On Wed, Aug 29, 2018 at 12:41 PM Nandor Kollar 
wrote:

> Hi all,
>
> Yesterday we talked about the status of the columnar encryption, and
> agreed that before anything related to it gets released, we need a
> reviewed spec. Actually Gidon already opened PR for this:
> https://github.com/apache/parquet-format/pull/101, it is based on the
> design doc (
> https://docs.google.com/document/d/1T89G7xR0zHFV1f2pjTO28jtfVm8qoNVGEJQ70Rsk-bY/edit
> )
> written by him. Julien, Ryan what do you think - is there anything
> else needed?
>
> Regards,
> Nandor
>
> On Tue, Aug 28, 2018 at 7:16 PM, Julien Le Dem
>  wrote:
> > Notes:
> > Anna (Cloudera): Bloom filter update, Iceberg
> > Gabor, Nandor (Cloudera):
> >
> >- Value skipping implementation to be reviewed. Move Java code from
> >parquet-format to parquet-mr. PR ready
> >- How can users of Parquet handle timestamps and TZs. Allow for
> writing
> >timestamp in java. Refactor original type logic to more flexible new
> >original type api.
> >- Column indexes and alignment of pages
> >- Limiting the number of records in a page to avoid skewed splits when
> >compression is really good.
> >
> > Ryan (Netflix): Iceberg stuff back to Parquet: expression library for
> push
> > down. Dictionary and stats based row group filtering.
> > JunJie (Intel): Bloom filter. Need more reviews. Have a vote on the
> design
> > and add it to parquet-format.
> > Julien (Wework): Encryption.
> >
> >
> >- Bloom Filter:
> >https://issues.apache.org/jira/projects/PARQUET/issues/PARQUET-41
> ><
> https://issues.apache.org/jira/projects/PARQUET/issues/PARQUET-41?filter=allopenissues
> >
> >-
> >   - Committed utility class to parquet-cpp
> >   - Uploaded the benchmark result.
> >   - Ready to add into the spec.
> >   - Submit a PR for the parquet reader spec.
> >   - *Action*: review parquet java utility class.
> >   https://github.com/apache/parquet-mr/pull/425
> >   - Encryption:
> >-
> >   - Nandor, Gabor reviewing.
> >   - Apis to allow pluggable key management.
> >   - Need to have a proper review of the spec.
> >   - Need more testing
> >   - Column indices:
> >-
> >   - PR to be reviewed: https://github.com/apache/parquet-mr/pull/514
> >   - Ryan: to review features branch
> >   - Moving java code from parquet-format to parquet-mr:
> >-
> >   - Action: review. https://github.com/apache/parquet-mr/pull/517
> >   - Gets the thrift file from the parquet-format released artifact.
> >   - Maximum number of records per page:
> >-
> >   - We should add a property with a maximum number of records per
> page
> >   and per row group.
> >   - Need to benchmark to figure out a good default. 10K?
> >   - Iceberg:
> >-
> >   - Some of the iceberg code should be in Parquet:
> >   -
> >  - Rewrote record reconstruction stack
> >  -
> > - Reuses page reader and decoder
> > - Then does a triple iterator that return an entire column
> in a
> > file (iterator of triples)
> >  

Re: Date and time for next Parquet sync

2018-09-07 Thread Ryan Blue
We may want to push this out another week because it also conflicts with
Strata NY. I think a few of us will be travelling Tuesday and both Julien
and I have talks on Wednesday.

On Fri, Sep 7, 2018 at 6:24 AM Gidon Gershinsky  wrote:

> Hi Nandor,
>
> Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another week.
> Sept 18 is the Yom Kippur eve - this basically means I won't have a
> technical ability to join a call.
>
> Regarding the Google doc vs reviewed PR + .md file - it indeed becomes
> difficult and unneccesary to maintain two
> versions of the same documentation. Following you last mail, there was a
> high volume of review
> activity at the google doc, but now the spike is winding down, I'll be
> removing the duplicate part from the google doc
> (keeping the samples), with new comments to go to PRs (md and code). I'll
> send a detailed mail early next week.
>
>
> Cheers, Gidon.
>
> On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar  >
> wrote:
>
> > Hi All,
> >
> > I'd like propose to have a Parquet Sync next week Tuesday (September
> > 18th) at 6pm CEST / 9 am PST.
> >
> > Some of the topics which would be nice to discuss:
> > - review column indexes (PRs and feature branch)
> > - move Java code from format to mr (PR #517)
> > - Bloom filter spec
> > - columnar encryption spec (and general question, where to track
> > specs, Google doc vs reviewed PR + .md file)
> > - Refactor modules to use the new logical type API (PR under review)
> > - new format release scope (nano precision timestamp, bloom filer?,
> > columnar encryption?)
> >
> > I'll send the meeting invite shortly. Feel free to propose other time
> > slot if it is not suitable for you, and bring any additional topic
> > you'd like to discuss.
> >
> > Regards,
> > Nandor
> >
>


-- 
Ryan Blue
Software Engineer
Netflix


Re: Date and time for next Parquet sync

2018-09-07 Thread Gidon Gershinsky
Hi Nandor,

Can we make it Wed this time, Sept 19? Or any of Tue/Wed on another week.
Sept 18 is the Yom Kippur eve - this basically means I won't have a
technical ability to join a call.

Regarding the Google doc vs reviewed PR + .md file - it indeed becomes
difficult and unneccesary to maintain two
versions of the same documentation. Following you last mail, there was a
high volume of review
activity at the google doc, but now the spike is winding down, I'll be
removing the duplicate part from the google doc
(keeping the samples), with new comments to go to PRs (md and code). I'll
send a detailed mail early next week.


Cheers, Gidon.

On Fri, Sep 7, 2018 at 3:42 PM Nandor Kollar 
wrote:

> Hi All,
>
> I'd like propose to have a Parquet Sync next week Tuesday (September
> 18th) at 6pm CEST / 9 am PST.
>
> Some of the topics which would be nice to discuss:
> - review column indexes (PRs and feature branch)
> - move Java code from format to mr (PR #517)
> - Bloom filter spec
> - columnar encryption spec (and general question, where to track
> specs, Google doc vs reviewed PR + .md file)
> - Refactor modules to use the new logical type API (PR under review)
> - new format release scope (nano precision timestamp, bloom filer?,
> columnar encryption?)
>
> I'll send the meeting invite shortly. Feel free to propose other time
> slot if it is not suitable for you, and bring any additional topic
> you'd like to discuss.
>
> Regards,
> Nandor
>


Date and time for next Parquet sync

2018-09-07 Thread Nandor Kollar
Hi All,

I'd like propose to have a Parquet Sync next week Tuesday (September
18th) at 6pm CEST / 9 am PST.

Some of the topics which would be nice to discuss:
- review column indexes (PRs and feature branch)
- move Java code from format to mr (PR #517)
- Bloom filter spec
- columnar encryption spec (and general question, where to track
specs, Google doc vs reviewed PR + .md file)
- Refactor modules to use the new logical type API (PR under review)
- new format release scope (nano precision timestamp, bloom filer?,
columnar encryption?)

I'll send the meeting invite shortly. Feel free to propose other time
slot if it is not suitable for you, and bring any additional topic
you'd like to discuss.

Regards,
Nandor


Re: Date and time for next Parquet sync

2018-08-29 Thread Nandor Kollar
Hi all,

Yesterday we talked about the status of the columnar encryption, and
agreed that before anything related to it gets released, we need a
reviewed spec. Actually Gidon already opened PR for this:
https://github.com/apache/parquet-format/pull/101, it is based on the
design doc 
(https://docs.google.com/document/d/1T89G7xR0zHFV1f2pjTO28jtfVm8qoNVGEJQ70Rsk-bY/edit)
written by him. Julien, Ryan what do you think - is there anything
else needed?

Regards,
Nandor

On Tue, Aug 28, 2018 at 7:16 PM, Julien Le Dem
 wrote:
> Notes:
> Anna (Cloudera): Bloom filter update, Iceberg
> Gabor, Nandor (Cloudera):
>
>- Value skipping implementation to be reviewed. Move Java code from
>parquet-format to parquet-mr. PR ready
>- How can users of Parquet handle timestamps and TZs. Allow for writing
>timestamp in java. Refactor original type logic to more flexible new
>original type api.
>- Column indexes and alignment of pages
>- Limiting the number of records in a page to avoid skewed splits when
>compression is really good.
>
> Ryan (Netflix): Iceberg stuff back to Parquet: expression library for push
> down. Dictionary and stats based row group filtering.
> JunJie (Intel): Bloom filter. Need more reviews. Have a vote on the design
> and add it to parquet-format.
> Julien (Wework): Encryption.
>
>
>- Bloom Filter:
>https://issues.apache.org/jira/projects/PARQUET/issues/PARQUET-41
>
> 
>-
>   - Committed utility class to parquet-cpp
>   - Uploaded the benchmark result.
>   - Ready to add into the spec.
>   - Submit a PR for the parquet reader spec.
>   - *Action*: review parquet java utility class.
>   https://github.com/apache/parquet-mr/pull/425
>   - Encryption:
>-
>   - Nandor, Gabor reviewing.
>   - Apis to allow pluggable key management.
>   - Need to have a proper review of the spec.
>   - Need more testing
>   - Column indices:
>-
>   - PR to be reviewed: https://github.com/apache/parquet-mr/pull/514
>   - Ryan: to review features branch
>   - Moving java code from parquet-format to parquet-mr:
>-
>   - Action: review. https://github.com/apache/parquet-mr/pull/517
>   - Gets the thrift file from the parquet-format released artifact.
>   - Maximum number of records per page:
>-
>   - We should add a property with a maximum number of records per page
>   and per row group.
>   - Need to benchmark to figure out a good default. 10K?
>   - Iceberg:
>-
>   - Some of the iceberg code should be in Parquet:
>   -
>  - Rewrote record reconstruction stack
>  -
> - Reuses page reader and decoder
> - Then does a triple iterator that return an entire column in a
> file (iterator of triples)
> - Record reconstruction class that handles everything that the
> current one does but with {list, map} factories
> -
>- 20% faster to write, 5% faster to read
>- Easier to write object mappers
> - Helps with page level skipping.
> - High level abstractions in the iceberg library:
>  -
> - Take an expression and simplify it (not, ...) to run on
> metadata
> - Take a complex expression and split the part on the
> partition/min/max and the remaining part.
>
>
>
>
>
>
> On Mon, Aug 27, 2018 at 4:56 AM, Nandor Kollar > wrote:
>
>> Yes, CEST.
>>
>> On Mon, Aug 27, 2018 at 1:01 PM, Uwe L. Korn  wrote:
>> > Hello Nador,
>> >
>> > probably I can make this time. Just a timezone question: Is it 6pm CET
>> or 6pm CEST? I guess the latter.
>> >
>> > See http://timesched.pocoo.org/?date=2018-08-28=central-
>> europe-standard-time!,pacific-standard-time=1080,1140
>> >
>> > Uwe
>> >
>> > On Mon, Aug 27, 2018, at 12:20 PM, Nandor Kollar wrote:
>> >> Hi All,
>> >>
>> >> As discussed on last Parquet sync, I propose to have an other meeting
>> >> on August 28th, at 6pm CET / 9 am PST to discuss those topic which we
>> >> didn't have time on the sync at August 15th, and of course any new
>> >> topic too.
>> >>
>> >> Sorry for the late notice, feel free to propose other time slot if is
>> >> is not suitable for you! Calendar entry to follow.
>> >>
>> >> Regards,
>> >> Nandor
>>


Re: Date and time for next Parquet sync

2018-08-28 Thread Julien Le Dem
Notes:
Anna (Cloudera): Bloom filter update, Iceberg
Gabor, Nandor (Cloudera):

   - Value skipping implementation to be reviewed. Move Java code from
   parquet-format to parquet-mr. PR ready
   - How can users of Parquet handle timestamps and TZs. Allow for writing
   timestamp in java. Refactor original type logic to more flexible new
   original type api.
   - Column indexes and alignment of pages
   - Limiting the number of records in a page to avoid skewed splits when
   compression is really good.

Ryan (Netflix): Iceberg stuff back to Parquet: expression library for push
down. Dictionary and stats based row group filtering.
JunJie (Intel): Bloom filter. Need more reviews. Have a vote on the design
and add it to parquet-format.
Julien (Wework): Encryption.


   - Bloom Filter:
   https://issues.apache.org/jira/projects/PARQUET/issues/PARQUET-41
   

   -
  - Committed utility class to parquet-cpp
  - Uploaded the benchmark result.
  - Ready to add into the spec.
  - Submit a PR for the parquet reader spec.
  - *Action*: review parquet java utility class.
  https://github.com/apache/parquet-mr/pull/425
  - Encryption:
   -
  - Nandor, Gabor reviewing.
  - Apis to allow pluggable key management.
  - Need to have a proper review of the spec.
  - Need more testing
  - Column indices:
   -
  - PR to be reviewed: https://github.com/apache/parquet-mr/pull/514
  - Ryan: to review features branch
  - Moving java code from parquet-format to parquet-mr:
   -
  - Action: review. https://github.com/apache/parquet-mr/pull/517
  - Gets the thrift file from the parquet-format released artifact.
  - Maximum number of records per page:
   -
  - We should add a property with a maximum number of records per page
  and per row group.
  - Need to benchmark to figure out a good default. 10K?
  - Iceberg:
   -
  - Some of the iceberg code should be in Parquet:
  -
 - Rewrote record reconstruction stack
 -
- Reuses page reader and decoder
- Then does a triple iterator that return an entire column in a
file (iterator of triples)
- Record reconstruction class that handles everything that the
current one does but with {list, map} factories
-
   - 20% faster to write, 5% faster to read
   - Easier to write object mappers
- Helps with page level skipping.
- High level abstractions in the iceberg library:
 -
- Take an expression and simplify it (not, ...) to run on
metadata
- Take a complex expression and split the part on the
partition/min/max and the remaining part.






On Mon, Aug 27, 2018 at 4:56 AM, Nandor Kollar  wrote:

> Yes, CEST.
>
> On Mon, Aug 27, 2018 at 1:01 PM, Uwe L. Korn  wrote:
> > Hello Nador,
> >
> > probably I can make this time. Just a timezone question: Is it 6pm CET
> or 6pm CEST? I guess the latter.
> >
> > See http://timesched.pocoo.org/?date=2018-08-28=central-
> europe-standard-time!,pacific-standard-time=1080,1140
> >
> > Uwe
> >
> > On Mon, Aug 27, 2018, at 12:20 PM, Nandor Kollar wrote:
> >> Hi All,
> >>
> >> As discussed on last Parquet sync, I propose to have an other meeting
> >> on August 28th, at 6pm CET / 9 am PST to discuss those topic which we
> >> didn't have time on the sync at August 15th, and of course any new
> >> topic too.
> >>
> >> Sorry for the late notice, feel free to propose other time slot if is
> >> is not suitable for you! Calendar entry to follow.
> >>
> >> Regards,
> >> Nandor
>


Re: Date and time for next Parquet sync

2018-08-27 Thread Nandor Kollar
Yes, CEST.

On Mon, Aug 27, 2018 at 1:01 PM, Uwe L. Korn  wrote:
> Hello Nador,
>
> probably I can make this time. Just a timezone question: Is it 6pm CET or 6pm 
> CEST? I guess the latter.
>
> See 
> http://timesched.pocoo.org/?date=2018-08-28=central-europe-standard-time!,pacific-standard-time=1080,1140
>
> Uwe
>
> On Mon, Aug 27, 2018, at 12:20 PM, Nandor Kollar wrote:
>> Hi All,
>>
>> As discussed on last Parquet sync, I propose to have an other meeting
>> on August 28th, at 6pm CET / 9 am PST to discuss those topic which we
>> didn't have time on the sync at August 15th, and of course any new
>> topic too.
>>
>> Sorry for the late notice, feel free to propose other time slot if is
>> is not suitable for you! Calendar entry to follow.
>>
>> Regards,
>> Nandor


Re: Date and time for next Parquet sync

2018-08-27 Thread Uwe L. Korn
Hello Nador,

probably I can make this time. Just a timezone question: Is it 6pm CET or 6pm 
CEST? I guess the latter. 

See 
http://timesched.pocoo.org/?date=2018-08-28=central-europe-standard-time!,pacific-standard-time=1080,1140

Uwe

On Mon, Aug 27, 2018, at 12:20 PM, Nandor Kollar wrote:
> Hi All,
> 
> As discussed on last Parquet sync, I propose to have an other meeting
> on August 28th, at 6pm CET / 9 am PST to discuss those topic which we
> didn't have time on the sync at August 15th, and of course any new
> topic too.
> 
> Sorry for the late notice, feel free to propose other time slot if is
> is not suitable for you! Calendar entry to follow.
> 
> Regards,
> Nandor


Date and time for next Parquet sync

2018-08-27 Thread Nandor Kollar
Hi All,

As discussed on last Parquet sync, I propose to have an other meeting
on August 28th, at 6pm CET / 9 am PST to discuss those topic which we
didn't have time on the sync at August 15th, and of course any new
topic too.

Sorry for the late notice, feel free to propose other time slot if is
is not suitable for you! Calendar entry to follow.

Regards,
Nandor


Re: Date and time for next Parquet sync

2018-08-12 Thread Uwe L. Korn
As the meeting falls into my summer vacation I cannot participate but will try 
to join again if there is a meeting two weeks later.

Uwe

> Am 08.08.2018 um 16:43 schrieb Nandor Kollar :
> 
> Hi All,
> 
> It has been a while since we had a Parquet sync, therefore I'd like to
> propose to have one next week on August 15th, at 6pm CET / 9 am PST.
> 
> I'll send a meeting invite with the details soon, let me know if this time
> is not suitable for you!
> 
> Since the last sync there are couple of topics to discuss, like:
> - Status of Parquet encryption
> - Release a new minor version, scope of the new release
> - Bloom filters
> - Move Java specific code from parquet-format to parquet-mr
> - parquet.thrift usage best practices in different language bindings (Java,
> C++, Python, Rust)
> - LZ4 incompatibility
> 
> The agenda is open for suggestions.
> 
> Regards,
> Nandor



Date and time for next Parquet sync

2018-08-08 Thread Nandor Kollar
Hi All,

It has been a while since we had a Parquet sync, therefore I'd like to
propose to have one next week on August 15th, at 6pm CET / 9 am PST.

I'll send a meeting invite with the details soon, let me know if this time
is not suitable for you!

Since the last sync there are couple of topics to discuss, like:
- Status of Parquet encryption
- Release a new minor version, scope of the new release
- Bloom filters
- Move Java specific code from parquet-format to parquet-mr
- parquet.thrift usage best practices in different language bindings (Java,
C++, Python, Rust)
- LZ4 incompatibility

The agenda is open for suggestions.

Regards,
Nandor


Re: Date and Time for next Parquet sync

2018-02-09 Thread Julien Le Dem
If you have received an invitation for next Wednesday, please disregard it
for now.
I was just adding people to the list of reminders.
I'll move it to whenever is the conclusion of this thread.
I have a conflict on Tuesday though.
I am available on Wednesday.

On Wed, Feb 7, 2018 at 11:29 PM, Gabor Szadovszky <
gabor.szadovs...@cloudera.com> wrote:

> Hi All,
>
> I would vote on Tuesday but don’t have any problem with skipping this one
> if Wednesday fits more for others.
>
> Cheers,
> Gabor
>
> > On 7 Feb 2018, at 19:00, Lars Volker  wrote:
> >
> > Hi All,
> >
> > I propose to have the next regular Parquet sync next week, either on
> > Tuesday or Wednesday at 9am PST / 6pm CET.
> >
> > The last one was on a Tuesday so this one would default to Wednesday.
> Let's
> > have a quick vote here by replying to this email with your day of choice.
> > Feel free to propose any other time if neither of these work for you.
> >
> > Cheers, Lars
>
>


Re: Date and Time for next Parquet sync

2018-02-07 Thread Gabor Szadovszky
Hi All,

I would vote on Tuesday but don’t have any problem with skipping this one if 
Wednesday fits more for others.

Cheers,
Gabor

> On 7 Feb 2018, at 19:00, Lars Volker  wrote:
> 
> Hi All,
> 
> I propose to have the next regular Parquet sync next week, either on
> Tuesday or Wednesday at 9am PST / 6pm CET.
> 
> The last one was on a Tuesday so this one would default to Wednesday. Let's
> have a quick vote here by replying to this email with your day of choice.
> Feel free to propose any other time if neither of these work for you.
> 
> Cheers, Lars



Date and Time for next Parquet sync

2018-02-07 Thread Lars Volker
Hi All,

I propose to have the next regular Parquet sync next week, either on
Tuesday or Wednesday at 9am PST / 6pm CET.

The last one was on a Tuesday so this one would default to Wednesday. Let's
have a quick vote here by replying to this email with your day of choice.
Feel free to propose any other time if neither of these work for you.

Cheers, Lars


Re: Date and time for next parquet sync

2018-01-29 Thread Lars Volker
Thanks all who replied, I sent an invite for Tuesday. Cheers, Lars

On Mon, Jan 29, 2018 at 10:56 AM, Marcel Kornacker 
wrote:

> +1 for Tuesday
>
> On Mon, Jan 29, 2018 at 4:03 AM, Uwe L. Korn  wrote:
> > +1, Tuesday to Thursday are ok for me but I would prefer Tuesday this
> week.
> >
> > Uwe
> >
> > On Mon, Jan 29, 2018, at 12:54 PM, Zoltan Ivanfi wrote:
> >> +1 for Tuesday, this week I can't attend on Wednesday.
> >>
> >> Zoltan
> >>
> >> On Mon, Jan 29, 2018 at 7:29 AM Lars Volker  wrote:
> >>
> >> > I'm good with either day. Does anyone prefer Wednesday over Tuesday?
> >> >
> >> > On Tue, Jan 23, 2018 at 11:27 PM, Gabor Szadovszky <
> >> > gabor.szadovs...@cloudera.com> wrote:
> >> >
> >> > > Hi All,
> >> > >
> >> > > As usual, I’m the one who complains…
> >> > > Tuesday/Thursday would be better for me. If one of these days is
> suitable
> >> > > for everyone I would be happy to participate. If not, I’m fine with
> going
> >> > > to the next meeting instead.
> >> > >
> >> > > Cheers,
> >> > > Gabor
> >> > >
> >> > > > On 24 Jan 2018, at 00:56, Lars Volker  wrote:
> >> > > >
> >> > > > Hi All,
> >> > > >
> >> > > > After chatting with Julien I'd like to propose to do the next
> regular
> >> > > > Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm
> CET, 9am
> >> > > > PST). This will get us back to alternating weeks with the arrow
> sync.
> >> > If
> >> > > > that doesn't work for you, please let me know.
> >> > > >
> >> > > > Cheers, Lars
> >> > >
> >> > >
> >> >
>


Re: Date and time for next parquet sync

2018-01-29 Thread Marcel Kornacker
+1 for Tuesday

On Mon, Jan 29, 2018 at 4:03 AM, Uwe L. Korn  wrote:
> +1, Tuesday to Thursday are ok for me but I would prefer Tuesday this week.
>
> Uwe
>
> On Mon, Jan 29, 2018, at 12:54 PM, Zoltan Ivanfi wrote:
>> +1 for Tuesday, this week I can't attend on Wednesday.
>>
>> Zoltan
>>
>> On Mon, Jan 29, 2018 at 7:29 AM Lars Volker  wrote:
>>
>> > I'm good with either day. Does anyone prefer Wednesday over Tuesday?
>> >
>> > On Tue, Jan 23, 2018 at 11:27 PM, Gabor Szadovszky <
>> > gabor.szadovs...@cloudera.com> wrote:
>> >
>> > > Hi All,
>> > >
>> > > As usual, I’m the one who complains…
>> > > Tuesday/Thursday would be better for me. If one of these days is suitable
>> > > for everyone I would be happy to participate. If not, I’m fine with going
>> > > to the next meeting instead.
>> > >
>> > > Cheers,
>> > > Gabor
>> > >
>> > > > On 24 Jan 2018, at 00:56, Lars Volker  wrote:
>> > > >
>> > > > Hi All,
>> > > >
>> > > > After chatting with Julien I'd like to propose to do the next regular
>> > > > Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
>> > > > PST). This will get us back to alternating weeks with the arrow sync.
>> > If
>> > > > that doesn't work for you, please let me know.
>> > > >
>> > > > Cheers, Lars
>> > >
>> > >
>> >


Re: Date and time for next parquet sync

2018-01-29 Thread Uwe L. Korn
+1, Tuesday to Thursday are ok for me but I would prefer Tuesday this week.

Uwe

On Mon, Jan 29, 2018, at 12:54 PM, Zoltan Ivanfi wrote:
> +1 for Tuesday, this week I can't attend on Wednesday.
> 
> Zoltan
> 
> On Mon, Jan 29, 2018 at 7:29 AM Lars Volker  wrote:
> 
> > I'm good with either day. Does anyone prefer Wednesday over Tuesday?
> >
> > On Tue, Jan 23, 2018 at 11:27 PM, Gabor Szadovszky <
> > gabor.szadovs...@cloudera.com> wrote:
> >
> > > Hi All,
> > >
> > > As usual, I’m the one who complains…
> > > Tuesday/Thursday would be better for me. If one of these days is suitable
> > > for everyone I would be happy to participate. If not, I’m fine with going
> > > to the next meeting instead.
> > >
> > > Cheers,
> > > Gabor
> > >
> > > > On 24 Jan 2018, at 00:56, Lars Volker  wrote:
> > > >
> > > > Hi All,
> > > >
> > > > After chatting with Julien I'd like to propose to do the next regular
> > > > Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
> > > > PST). This will get us back to alternating weeks with the arrow sync.
> > If
> > > > that doesn't work for you, please let me know.
> > > >
> > > > Cheers, Lars
> > >
> > >
> >


Re: Date and time for next parquet sync

2018-01-29 Thread Zoltan Ivanfi
+1 for Tuesday, this week I can't attend on Wednesday.

Zoltan

On Mon, Jan 29, 2018 at 7:29 AM Lars Volker  wrote:

> I'm good with either day. Does anyone prefer Wednesday over Tuesday?
>
> On Tue, Jan 23, 2018 at 11:27 PM, Gabor Szadovszky <
> gabor.szadovs...@cloudera.com> wrote:
>
> > Hi All,
> >
> > As usual, I’m the one who complains…
> > Tuesday/Thursday would be better for me. If one of these days is suitable
> > for everyone I would be happy to participate. If not, I’m fine with going
> > to the next meeting instead.
> >
> > Cheers,
> > Gabor
> >
> > > On 24 Jan 2018, at 00:56, Lars Volker  wrote:
> > >
> > > Hi All,
> > >
> > > After chatting with Julien I'd like to propose to do the next regular
> > > Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
> > > PST). This will get us back to alternating weeks with the arrow sync.
> If
> > > that doesn't work for you, please let me know.
> > >
> > > Cheers, Lars
> >
> >
>


Re: Date and time for next parquet sync

2018-01-28 Thread Lars Volker
I'm good with either day. Does anyone prefer Wednesday over Tuesday?

On Tue, Jan 23, 2018 at 11:27 PM, Gabor Szadovszky <
gabor.szadovs...@cloudera.com> wrote:

> Hi All,
>
> As usual, I’m the one who complains…
> Tuesday/Thursday would be better for me. If one of these days is suitable
> for everyone I would be happy to participate. If not, I’m fine with going
> to the next meeting instead.
>
> Cheers,
> Gabor
>
> > On 24 Jan 2018, at 00:56, Lars Volker  wrote:
> >
> > Hi All,
> >
> > After chatting with Julien I'd like to propose to do the next regular
> > Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
> > PST). This will get us back to alternating weeks with the arrow sync. If
> > that doesn't work for you, please let me know.
> >
> > Cheers, Lars
>
>


Re: Date and time for next parquet sync

2018-01-23 Thread Gabor Szadovszky
Hi All,

As usual, I’m the one who complains…
Tuesday/Thursday would be better for me. If one of these days is suitable for 
everyone I would be happy to participate. If not, I’m fine with going to the 
next meeting instead.

Cheers,
Gabor

> On 24 Jan 2018, at 00:56, Lars Volker  wrote:
> 
> Hi All,
> 
> After chatting with Julien I'd like to propose to do the next regular
> Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
> PST). This will get us back to alternating weeks with the arrow sync. If
> that doesn't work for you, please let me know.
> 
> Cheers, Lars



Date and time for next parquet sync

2018-01-23 Thread Lars Volker
Hi All,

After chatting with Julien I'd like to propose to do the next regular
Parquet sync on next Wednesday, January 31st, at 5pm GMT (6pm CET, 9am
PST). This will get us back to alternating weeks with the arrow sync. If
that doesn't work for you, please let me know.

Cheers, Lars


Re: Date and time for next parquet sync

2017-09-29 Thread Lars Volker
I added you to the invite.

On Thu, Sep 28, 2017 at 9:48 PM, Santlal J Gupta <
santlal.gu...@bitwiseglobal.com> wrote:

> Yes I want to join.
>
> -Original Message-
> From: Lars Volker [mailto:l...@cloudera.com]
> Sent: Thursday, September 28, 2017 8:40 PM
> To: dev@parquet.apache.org
> Subject: Date and time for next parquet sync
>
> I sent out an meeting request for the next Parquet sync on Wednesday,
> October 11th at 9am PST. Please reply to this email if you'd like to join
> and found yourself not on the invite yet.
>


RE: Date and time for next parquet sync

2017-09-28 Thread Santlal J Gupta
Yes I want to join.

-Original Message-
From: Lars Volker [mailto:l...@cloudera.com] 
Sent: Thursday, September 28, 2017 8:40 PM
To: dev@parquet.apache.org
Subject: Date and time for next parquet sync

I sent out an meeting request for the next Parquet sync on Wednesday, October 
11th at 9am PST. Please reply to this email if you'd like to join and found 
yourself not on the invite yet.


Date and time for next parquet sync

2017-09-28 Thread Lars Volker
I sent out an meeting request for the next Parquet sync on Wednesday,
October 11th at 9am PST. Please reply to this email if you'd like to join
and found yourself not on the invite yet.


Re: Date and time for next Parquet Sync

2017-09-13 Thread Julien Le Dem
Notes:
Parquet Sync Sept 13 2017:

Lars (Impala Cloudera - CA): want feedback on Puja’s pull request for page
index
Anna (Cloudera - Hungary)
Jim (Cloudera - CA): Bloom Filters
Ryan (Netflix - CA): parquet-cli zstd/lz4 to try out. Parquet format
release, logical type PR.
Junjie (Intel - Shanghai): Bloom filter status
Bikramjeet (Cloudera Impala - CA): clarify specification for column stats
and type for min/max storage
Wes (Twosigma - NY): C++
Julien (CA): patch release of parquet-mr

TZs: GMT-8, GMT-5, GMT+1, GMT+8
Time: 9am (SF), 12am (NY), 6pm (Budapest), 1am (Shanghai) !

 - Bloom Filter:
- Junjie submitted pull request for parquet-format and parquet-mr. bloom
filter utility + tests.
- https://github.com/apache/parquet-format/pull/62/files
- not to be merged right away but feedback
- https://github.com/apache/parquet-mr/pull/425/files
- to move to package protected or tests to start incremental merge
without making it public
- Need review: Ryan, Julien, Jim
- compatibility, integration tests?
- old compatibility test repo:
https://github.com/Parquet/parquet-compatibility
- Arrow integration tests:
https://github.com/apache/arrow/tree/master/integration
- Action: Anna, Lars to follow up with Cloudera

Build: travis-ci broken with latest linux thrift-7 incompatibility
 - parquet-mr should move to thrift-9: PARQUET-1103
 - pin thrift to fixed version in build like in parquet-format.

 - Page Index: https://github.com/apache/parquet-format/pull/63
   - Action review by end of next week: Julien, Ryan, Marcel
   - TODO (Lars?): move design doc to markdown in parquet-format
   - should add (brief) comments in thrift definition (clarify in review)

 - zstd/lz4:
   - Ryan has e version of parquet-cli working with zstd, lz4 and brotli
for experimentation
   - building with zstd backported was difficult. (provides hadoop jar)
   - anyone interested in running their own tests?
   - Lars to check at Cloudera.
   - Ryan to send out on the list
   - Wes built benchmarking fixtures in Cpp. todo write tests.
   - use some shareable dataset for validation (NY Taxi dataset?).

 - Logical type PR: https://github.com/apache/parquet-format/pull/51
- TODO: feedback
- reviewers: Julien

 - clarification of min max storage:
   -
https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift#L215
   - format of min and max values is the same as defined by the type.

- making releases:
  - want a parquet-format release for:
- logical types (not merged yet)
- page indexes (not merged yet)
- sort order (merged)
  - we won’t block on bloom filter. We can make another release as soon as
it is ready.
  - Ryan to run the parquet-format release.
  - need volunteer for parquet-mr release.



On Wed, Sep 13, 2017 at 8:58 AM, Julien Le Dem 
wrote:

> The Parquet sync is starting now at:
> https://meet.google.com/ent-mvhf-twr
>
> On Tue, Sep 12, 2017 at 8:55 PM, Julien Le Dem 
> wrote:
>
>> +1
>>
>> On Mon, Sep 11, 2017 at 8:36 PM, Lars Volker  wrote:
>>
>>> There were no objections so I sent out a meeting invite to everyone who
>>> was
>>> on the last invite. If you'd like to participate, too, please reply to
>>> this
>>> email.
>>>
>>> Cheers, Lars
>>>
>>> On Mon, Sep 11, 2017 at 11:06 AM, Ryan Blue 
>>> wrote:
>>>
>>> > That works for me.
>>> >
>>> > On Mon, Sep 11, 2017 at 7:55 AM, Lars Volker  wrote:
>>> >
>>> > > Hi All,
>>> > >
>>> > > I'd like to propose to have the next Parquet Sync on Wednesday, Sep
>>> 13th,
>>> > > at 9am PST. Possible topics would be the pull request to add a page
>>> index
>>> > > to the format, ongoing work on bloom filters.
>>> > >
>>> > > If Wednesday does not work for you, please propose another date and
>>> time.
>>> > > Otherwise I'll send out a MR later today.
>>> > >
>>> > > Cheers, Lars
>>> > >
>>> >
>>> >
>>> >
>>> > --
>>> > Ryan Blue
>>> > Software Engineer
>>> > Netflix
>>> >
>>>
>>
>>
>


Re: Date and time for next Parquet Sync

2017-09-13 Thread Julien Le Dem
The Parquet sync is starting now at:
https://meet.google.com/ent-mvhf-twr

On Tue, Sep 12, 2017 at 8:55 PM, Julien Le Dem 
wrote:

> +1
>
> On Mon, Sep 11, 2017 at 8:36 PM, Lars Volker  wrote:
>
>> There were no objections so I sent out a meeting invite to everyone who
>> was
>> on the last invite. If you'd like to participate, too, please reply to
>> this
>> email.
>>
>> Cheers, Lars
>>
>> On Mon, Sep 11, 2017 at 11:06 AM, Ryan Blue 
>> wrote:
>>
>> > That works for me.
>> >
>> > On Mon, Sep 11, 2017 at 7:55 AM, Lars Volker  wrote:
>> >
>> > > Hi All,
>> > >
>> > > I'd like to propose to have the next Parquet Sync on Wednesday, Sep
>> 13th,
>> > > at 9am PST. Possible topics would be the pull request to add a page
>> index
>> > > to the format, ongoing work on bloom filters.
>> > >
>> > > If Wednesday does not work for you, please propose another date and
>> time.
>> > > Otherwise I'll send out a MR later today.
>> > >
>> > > Cheers, Lars
>> > >
>> >
>> >
>> >
>> > --
>> > Ryan Blue
>> > Software Engineer
>> > Netflix
>> >
>>
>
>


Re: Date and time for next Parquet Sync

2017-09-12 Thread Julien Le Dem
+1

On Mon, Sep 11, 2017 at 8:36 PM, Lars Volker  wrote:

> There were no objections so I sent out a meeting invite to everyone who was
> on the last invite. If you'd like to participate, too, please reply to this
> email.
>
> Cheers, Lars
>
> On Mon, Sep 11, 2017 at 11:06 AM, Ryan Blue 
> wrote:
>
> > That works for me.
> >
> > On Mon, Sep 11, 2017 at 7:55 AM, Lars Volker  wrote:
> >
> > > Hi All,
> > >
> > > I'd like to propose to have the next Parquet Sync on Wednesday, Sep
> 13th,
> > > at 9am PST. Possible topics would be the pull request to add a page
> index
> > > to the format, ongoing work on bloom filters.
> > >
> > > If Wednesday does not work for you, please propose another date and
> time.
> > > Otherwise I'll send out a MR later today.
> > >
> > > Cheers, Lars
> > >
> >
> >
> >
> > --
> > Ryan Blue
> > Software Engineer
> > Netflix
> >
>


Re: Date and time for next Parquet Sync

2017-09-11 Thread Ryan Blue
That works for me.

On Mon, Sep 11, 2017 at 7:55 AM, Lars Volker  wrote:

> Hi All,
>
> I'd like to propose to have the next Parquet Sync on Wednesday, Sep 13th,
> at 9am PST. Possible topics would be the pull request to add a page index
> to the format, ongoing work on bloom filters.
>
> If Wednesday does not work for you, please propose another date and time.
> Otherwise I'll send out a MR later today.
>
> Cheers, Lars
>



-- 
Ryan Blue
Software Engineer
Netflix