I'd suggest starting with the talks on https://orc.apache.org/talks . The talk about ORC File & vectorization is one of the early ones. If you want a paper, I'd look at "Major Technical Advancements in Apache Hive" at https://web.archive.org/web/20210123181838/http://web.cse.ohio-state.edu/hpcs/WWW/HTML/publications/papers/TR-14-2.pdf .
.. Owen On Sat, Jan 27, 2024 at 8:12 PM Xin Zhao <xzhaot...@gmail.com> wrote: > Hi all! > > > > I am new to Apache ORC community and I really love this format in my > project. > > > > Now I hope to learn it more, especially in some academic paper, but I > cannot find any paper introducing this format design. Is there any > suggestion? > > > > I’d really appreciate you for any key word, thank you very much! > > > > Leon >