smitajoshi12 opened a new pull request, #6535: URL: https://github.com/apache/ozone/pull/6535
## What changes were proposed in this pull request? When the number of keys/volume/bucket are huge, the current disk usage UI doesnt make much sense. This pull request introduces enhancements to the Recon disk usage endpoint to significantly improve usability and performance when dealing with large datasets: Top Entities Focus: The endpoint has been updated to efficiently sort and display only the top entities by size. This targeted approach helps users easily identify the most significant space consumers, addressing the impracticality of visualizing thousands of records in a single view. Efficient Sorting with Parallel Streams: To manage and sort vast numbers of records effectively, we've implemented parallel stream processing. Key advantages of using parallel streams include :- Better Utilization of Multi-core Processors: Enables concurrent sorting operations across multiple cores, drastically cutting down processing times for large datasets. Optimized for Large Datasets: The parallelism overhead is more efficiently distributed over a large number of elements, making it particularly suited for our use case. ## What is the link to the Apache JIRA https://issues.apache.org/jira/browse/HDDS-9626 ## How was this patch tested? Manually -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
