Hi Renato,
I would suggest reading the Arrow specification [1] which explains how
nesting is handled.

-Micah

[1] https://arrow.apache.org/docs/format/Columnar.html

On Thu, Nov 12, 2020 at 6:40 AM Renato Marroquín Mogrovejo <
renatoj.marroq...@gmail.com> wrote:

> Hi Micah,
>
> Thanks for the answer! yeah basically that was my question, I was not
> sure about the full extent of support for nested data, but it seems
> that both (parquet binding and arrow format) do reading/writing nested
> objects.
>
> Just a couple of follow up questions:
> - If Arrow/Feather file format do not use Parquet's FSM for
> reading/writing arbitrarily nested objects, what was the approach
> taken? is there a PR or JIRA issue I could look at?
> - Could you please point me to some docs (or source code) where I can
> read more about how reading/writing arbitrarily nested objects is
> done?
>
> Thanks again for your help!
> Best,
>
> Renato M.
>
> El mié., 11 nov. 2020 a las 21:23, Micah Kornfield
> (<emkornfi...@gmail.com>) escribió:
> >
> > Hi Renato,
> > I'm not clear if you are asking if the Arrow/Feather file format support
> > this or if Arrow's parquet binding support it.
> >
> > Regardless, both formats as of 2.0.0 now support arbitrarily nested data
> > (there were some bugs discovered after the 2.0.0 release, and I just
> > started a discussion on dev@ to see if we are going to do a patch
> release.
> >
> > Thanks,
> > Micah
> >
> > On Wed, Nov 11, 2020 at 1:21 AM Renato Marroquín Mogrovejo <
> > renatoj.marroq...@gmail.com> wrote:
> >
> > > Hi Arrow experts,
> > >
> > > I am trying to find out if Arrow supports reading/writing arbitrary
> > > nested objects similarly to what Parquet supports with its FSM.
> > > I came across this PR https://github.com/apache/arrow/pull/4066 which
> > > aimed to implement the Parquet specific approach (the FSM) but it was
> > > declined because as far as I understand it was too Parquet specific.
> > > Then, there is https://issues.apache.org/jira/browse/ARROW-1644 which
> > > tracks the tasks related to writing nested objects, but I couldn't
> > > make sense of it all.
> > >
> > > Basically my question is: Does arrow provide the functionality of
> > > reading/writing arbitrarily nested objects (similar to Parquet's FSM
> > > implementation)? Could you please point me to some documentation or
> > > code for this?
> > >
> > > Thanks in advance!
> > >
> > >
> > > Best,
> > >
> > > Renato M.
> > >
>

Reply via email to