Hi,
I'd like to propose retiring the parquet-format repo and moving the code
into
parquet-mr. Having the splits repos causes unnecessary complexity and
doesn't
seem to offer much benefit. For example:
1. Making changes that require format changes and implementation is
split. Things
go out of sync.
2. More release version/release process management
3. More things to do and understand getting started
I don't recall why it was originally split; probably an artifact of how it
was born. If
this makes sense, we can consider merging parquet-cpp as well.
The specific proposal is to add a commit to parquet-format to indicate it
is moved
and merged into parquet-mr and move the current parquet-format files into
parquet-mr.
The next release of parquet-mr would release both, with the same version.
Thoughts?
Nong