+1, for the new repo.

The name sounds fine, but good if we have scope of having “hadoop-”
prefix, we have that for almost all of the subprojects/modules

Can hadoop-shims or hadoop-shims-api or something on similar lines work?

-Ayush

> On 01-Jun-2023, at 1:18 AM, Steve Loughran <ste...@cloudera.com.invalid> 
> wrote:
>
> I want to create a new repository to put a shim library to allow previous

> releases to access the more recent hadoop filesystem APIs -currently the
> open source implementations of parquet, ORC can't use vectored io, in
> particular, even though we can in Cloudera. Providing a shim opens them up
> to all *and* gets the APIs more broadly stressed/tested.
> This needs to be in its own repository, not just for rapid initial release,
> but because it is designed to be built as old a version of hadoop we can
> reasonably support, which IMO means hadoop 3.1.0+. I know parquet still
> wants to build against 2.8.x, but to claim support for hadoop 2 means
> "build and test on java7", which is unrealistic in 2023.
>
> Initial WiP implementation, which works with 3.1.0 and tests against others
> https://github.com/steveloughran/fs-api-shim
> the complexity is about testing this -I have contract tests which then need
> to be executed on every supported hadoop release, which will need a
> separate module for each one.
>
> I can create the repo easily enough, just would like approval. And is the
> name OK?
>
> steve

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to