pitrou commented on pull request #7143: URL: https://github.com/apache/arrow/pull/7143#issuecomment-638388036
> If this is AdvanceTillWord, this hurts performance. I don't think compiler-specific micro-optimization is a good practice. Generally, the compiler should make a better job if you expose all the code, rather than hide it. > Is this mostly InvertRemainingBits? could you provide some more feedback on the algorithm structure you were thinking of? It is. Basically, you could have a running word mask (or byte mask) of the remaining bits in the current word (or byte). > I would expect word level handling to be much faster for cases this is intended for (when there are actual runs). In practice, once integrated into a system you are likely correct though. Yes, I think you're gonna hit diminishing returns when handling more than one byte at a time. Byte level handling would also alleviate the endianness concerns. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
