jorisvandenbossche commented on issue #40128: URL: https://github.com/apache/arrow/issues/40128#issuecomment-1953969098
Agreed that we should be consistent for empty arrays. Haven't tested this, but it might be caused by shortcutting calling the actual kernel for empty input here: https://github.com/apache/arrow/blob/4dc3d04ae84d97d02443c0cef555a46535925c2b/cpp/src/arrow/compute/exec.cc#L781-L795 This creates a generic array of length 0 of the output type, but so in case of a dictionary output type, it will never have an actual dictionary attached to that return value. We might want to generally skip that fast-path in case `output_type_` is a dictionary type? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
