Thanks Dongjoon/Gang!

@Gang - I have mostly coded in Java, but I can still look into the changes that are coming. I do follow the discussion on the Parquet ML too :)
I was a bit hesitant in making unnecessary noise on ML, hence tried to find a way on my own but realized it won't work out.

I scanned the JIRA when I started and created this list (some of it is obsolete):

ORC TODO list
https://issues.apache.org/jira/browse/ORC-521 - Update metadata tools to print encryption information
https://issues.apache.org/jira/browse/ORC-468 - Fix incorrect documentation for nanoseconds stream encoding
https://issues.apache.org/jira/browse/ORC-247 - Need a way for ConvertTool to rename fields on conversion to Orc
https://issues.apache.org/jira/browse/ORC-274 - Remove the columnNames field from Reader.Options
https://issues.apache.org/jira/browse/ORC-524 - Java reader should read test orc files in example dir. and compare it with expected dir

ORC without Hadoop
- https://github.com/apache/orc/pull/641
- https://github.com/apache/orc/pull/189

Next steps for me would be to review the 2.0 milestone and pick up some work.

@Dongjoon - Thanks for your support and encouragement.

Thanks
Ash

On Mon, 20 Nov 2023 at 22:05, Gang Wu <ust...@gmail.com> wrote:

> Hi Ash,
>
> Thanks for your interest and contribution! From my perspective, these
> issues created for 2.0.0 are worth doing as the next step:
> - https://github.com/apache/orc/issues/1543
> - https://github.com/apache/orc/issues/1499
>
> If you are familiar with C++, you can also help review the PRs for C++
> column encryption in the coming months. Some people have already
> implemented it and are planning to contribute to the community after
> some legal process from their employer.
>
> Best,
> Gang
>
> On Tue, Nov 21, 2023 at 2:00 PM Dongjoon Hyun <dongjoon.h...@gmail.com> wrote:
>
> > Thank you for sending a message. :)
> >
> > Apache ORC 2.0.0 Milestone can help. In addition, you can take over some
> > long-standing pending GitHub PRs.
> >
> > - https://github.com/apache/orc/milestone/20
> >
> > Apache ORC JIRA also has more open issues.
> >
> > Note that there are duplicates because notable JIRA issues are registered
> > to Milestone to give more visibility.
> >
> > Thanks,
> > Dongjoon.
> >
> > On Mon, Nov 20, 2023 at 9:35 PM mystic lama <mysticlama...@gmail.com> wrote:
> >
> > > Hello Devs,
> > >
> > > To set the context, I would like to get more involved with the ORC project.
> > > While I do not directly work on ORC in my current role, I have a strong
> > > personal interest in data formats.
> > >
> > > Are there work items or wishlists for the project that needs to be worked upon?
> > > More than happy to maintain the work items.
> > >
> > > Given ORC is very stable, but worth asking :)
> > >
> > > Thanks
> > > Ash