jimj edited a comment on issue #10057: URL: https://github.com/apache/druid/issues/10057#issuecomment-1039529443
I see the same broken behavior in Druid 0.21.1 today for `hashed` partioning. When I specify `targetRowsPerSegment` it appears to be completely ignored and i get 1 giant segment containing about 50million rows (my entire dataset for the segment). If I use the same ingestion spec but instead use `index_hadoop` instead of `index_parallel` I get 1 segment with ~10 partitions all hovering near 5 million rows a piece. I have not been able to duplicate this regression using the test input originally provided in this ticket, however. Anonymizing my dataset will be challenging unfortunately. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
