[GitHub] [druid] jimj edited a comment on issue #10057: index_parallel with single_dim partitionSpec type generating just one file/segment

GitBox Mon, 14 Feb 2022 14:34:20 -0800


jimj edited a comment on issue #10057:
URL: https://github.com/apache/druid/issues/10057#issuecomment-1039529443



   I see the same broken behavior in Druid 0.21.1 today for `hashed` 
partitioning.  When I specify `targetRowsPerSegment`, it appears to be 
completely ignored and I get 1 giant segment containing about 50 million rows 
(my entire dataset for the segment).  If I use the same ingestion spec with 
`index_hadoop` instead of `index_parallel`, I get 1 segment with ~10 partitions, 
all hovering near 5 million rows apiece.
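
   For reference, here is a rough sketch of the arithmetic behind that expectation. The actual ingestion spec is not shown in this comment, so everything below is an assumption: the target of 5,000,000 rows is inferred from the `index_hadoop` result, the field names follow the native batch `tuningConfig` as I understand it from the Druid docs, and `expected_partitions` is a hypothetical helper, not a Druid API.

```python
import json
import math


def expected_partitions(total_rows: int, target_rows_per_segment: int) -> int:
    """Hypothetical helper: partition count implied by a row target."""
    return max(1, math.ceil(total_rows / target_rows_per_segment))


# Assumed tuningConfig fragment; the values are illustrative, not the
# spec actually used in this report.
tuning_config = {
    "type": "index_parallel",
    # As I understand the docs, hashed partitioning needs perfect rollup.
    "forceGuaranteedRollup": True,
    "partitionsSpec": {
        "type": "hashed",
        "targetRowsPerSegment": 5_000_000,  # assumed value
    },
}

total_rows = 50_000_000  # approximate row count reported above
target = tuning_config["partitionsSpec"]["targetRowsPerSegment"]

print(json.dumps(tuning_config, indent=2))
print(expected_partitions(total_rows, target))  # prints 10
```

   With those assumed numbers, about 10 partitions would be expected, which is roughly what `index_hadoop` produces and not what `index_parallel` produces here.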
   
   However, I have not been able to reproduce this regression using the test 
input originally provided in this ticket.  Unfortunately, anonymizing my 
dataset will be challenging.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



