So I guess its just calculating the distance between the offsets. For now we might just make that part of our "catalogue" step. If I wanted to add it to statistics is there somewhere you can point me to where that would be added?
Felipe ᐧ On Wed, Jun 7, 2017 at 1:33 PM, Michael Howard <[email protected]> wrote: > > Could this be a candidate to add to the Statistics? > > Agreed ... this would be good info to have. > > On Wed, Jun 7, 2017 at 2:25 PM, Lars Volker <[email protected]> wrote: > > > Could this be a candidate to add to the Statistics? > > > > On Wed, Jun 7, 2017 at 11:18 AM, Deepak Majeti <[email protected]> > > wrote: > > > > > The parquet metadata does not have such information. > > > > > > > > > On Wed, Jun 7, 2017 at 1:08 PM, Felipe Aramburu <[email protected]> > > > wrote: > > > > > > > Is there any metadata available on the maximum length of an element > > > > BYTE_ARRAY in a row group. > > > > > > > > So for example if I have a column which is of type BYTE_ARRAY Logical > > > type > > > > UTF8 and I want to know what the longest possible element in the row > > > group > > > > is. > > > > > > > > I am looking for a method to do this which does NOT require having to > > go > > > > through the data itself. So I am asking if this metadata is stored > > > > anywhere. > > > > > > > > Felipe > > > > > > > > > > > > > > > > -- > > > regards, > > > Deepak Majeti > > > > > >
