Hello Gang, As you probably know, I've been working on a series of small performance improvements in ORC-core and I just wanted to highlight the progress thus far.
Far from "exacting," but I ran the ORC benchmark test for generating (writing) NYC taxi data. I compared the current main branch with the 1.6 branch. The tests ran for roughly the same amount of time and I observed a performance improvement of roughly 25% for this particular workload. I've attached the images. Normally, I would do this work as simply a hobby, in the same way that many people enjoy the mental stimulation of Sudoku, but now that businesses are paying per unit of compute, time is literally money. I hope this saves you some time. And money. Thanks for all the reviews; you made this possible. More PRs to come.
