Re: Request for PR Review

2018-01-30 Thread Pradeep Gollakota
Gentle thread bump.

On Thu, Jan 18, 2018 at 4:03 PM, Pradeep Gollakota <pradeep...@gmail.com> wrote:
> Hi All,
>
> Can one of you review my PR at https://github.com/apache/parquet-mr/pull/447 please?
>
> Thanks,
> Pradeep

Request for PR Review

2018-01-18 Thread Pradeep Gollakota
Hi All, Can one of you review my PR at https://github.com/apache/parquet-mr/pull/447 please? Thanks, Pradeep

Re: Missing min/max statistics in file footer

2017-02-10 Thread Pradeep Gollakota
ri, Feb 10, 2017 at 10:17 AM, Lars Volker <l...@cloudera.com> wrote: > Can you check the value of ParquetMetaData.created_by? Once you have that, > you should see if it gets filtered by the code in CorruptStatistics.java. > > On Fri, Feb 10, 2017 at 7:11 PM, Pradeep Gollak

Re: Missing min/max statistics in file footer

2017-02-10 Thread Pradeep Gollakota
statistics are not written to the footer? If you > used parquet-mr, they may be there but be ignored. > > Cheers, Lars > > On Fri, Feb 10, 2017 at 5:31 PM, Pradeep Gollakota <pradeep...@gmail.com> > wrote: > > > Bumping the thread to see if I get any responses.

Re: Missing min/max statistics in file footer

2017-02-10 Thread Pradeep Gollakota
Bumping the thread to see if I get any responses. On Wed, Feb 8, 2017 at 6:49 PM, Pradeep Gollakota <pradeep...@gmail.com> wrote: > Hi folks, > > I generated a bunch of parquet files using spark and > ParquetThriftOutputFormat. The thrift model has a column called "device

Missing min/max statistics in file footer

2017-02-08 Thread Pradeep Gollakota
Hi folks, I generated a bunch of parquet files using spark and ParquetThriftOutputFormat. The thrift model has a column called "deviceId" which is a string column. It also has a "timestamp" column of int64. After the files have been generated, I inspected the file footers and noticed that only

Re: Unable to compile thrift

2017-02-08 Thread Pradeep Gollakota
Volker <l...@cloudera.com> wrote: > I remember trying to compile with the latest version of thrift shipped in > Ubuntu 14.04 a few weeks back and got the same error. Using 0.7 worked > though. Sadly I don't know why it fails on a Mac. > > On Feb 8, 2017 21:18, "P

Re: Unable to compile thrift

2017-02-08 Thread Pradeep Gollakota
0 -- let us know if you have issues with > these > > Thanks > Wes > > On Wed, Feb 8, 2017 at 2:19 PM, Pradeep Gollakota <pradeep...@gmail.com> > wrote: > > Hi folks, > > > > I'm trying to build parquet from source. However, the instructions

[jira] [Created] (PARQUET-869) Min/Max record counts for block size checks are not configurable

2017-02-07 Thread Pradeep Gollakota (JIRA)
Pradeep Gollakota created PARQUET-869:
Summary: Min/Max record counts for block size checks are not configurable
Key: PARQUET-869
URL: https://issues.apache.org/jira/browse/PARQUET-869
Project