kyoty opened a new pull request #32224:
URL: https://github.com/apache/spark/pull/32224


   
   ### What changes were proposed in this pull request?
   Columns such as 'On Heap Memory Usage', 'Off Heap Memory Usage', and 'Disk Usage' 
in the `Data Distribution` table of the storage page should be sorted in 
numerical order instead of lexicographical order.
   
   The symptom is very similar to #32190, but the cause is completely 
different, so a new PR was submitted.
   
   
   ### Why are the changes needed?
   Bug fix: the sorting behavior should be consistent across columns.
   
   When the storage page is initialized, it infers the type of each column in 
the table to decide which sorting function (such as `sort_alpha`, `sort_numeric`, 
`sort_mmdd`, etc.) to use. 
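   
   To illustrate how this kind of inference can pick an alphabetical sort for 
a size column, here is a simplified sketch. It is not the actual sorting 
library code: the function name `guessSortType` and both regular expressions 
are assumptions made for illustration only.

```javascript
// Hypothetical, simplified type inference based on a sample cell's text.
// Returns the name of the sort function that would be chosen.
function guessSortType(cellText) {
  const text = cellText.trim();
  // Plain numbers, optionally with separators or a percent sign.
  if (/^-?[\d,.]+%?$/.test(text)) return "sort_numeric";
  // Dates such as 04/18/2021 (assumed pattern).
  if (/^\d{1,2}[\/.-]\d{1,2}[\/.-]\d{2,4}$/.test(text)) return "sort_mmdd";
  // Anything else, including "2.0 KiB", falls back to alphabetical sorting.
  return "sort_alpha";
}

console.log(guessSortType("42"));      // numeric-looking cell
console.log(guessSortType("2.0 KiB")); // value with a storage unit
```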
   
   **Columns that contain a storage unit are inferred to be sorted with 
`sort_alpha`, but the data (numerical value + unit) should be converted into 
the actual number of bytes and then sorted.**
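   
   A minimal sketch of the conversion described above, not the code in this 
PR. The names `UNITS`, `sizeToBytes`, and `compareSizes` are hypothetical, and 
the sketch assumes each cell starts with a number followed by a 
binary-prefixed unit such as `KiB` or `MiB`.

```javascript
// Assumed unit list; each step is a factor of 1024.
const UNITS = ["B", "KiB", "MiB", "GiB", "TiB", "PiB", "EiB"];

// Convert text such as "2.0 KiB" into a number of bytes (NaN if unparseable).
function sizeToBytes(text) {
  const match = /^\s*([\d.]+)\s*([A-Za-z]+)/.exec(text);
  if (!match) return NaN;
  const exponent = UNITS.indexOf(match[2]);
  if (exponent < 0) return NaN;
  return parseFloat(match[1]) * Math.pow(1024, exponent);
}

// Comparator that orders cells by their byte value.
function compareSizes(a, b) {
  return sizeToBytes(a) - sizeToBytes(b);
}

const cells = ["10.0 KiB", "2.0 MiB", "512.0 B", "2.0 KiB"];
const lexicographic = [...cells].sort();          // string order
const numeric = [...cells].sort(compareSizes);    // byte order

console.log(lexicographic); // "10.0 KiB" sorts before "2.0 KiB"
console.log(numeric);       // smallest size first
```

   With string ordering, `10.0 KiB` is placed before `2.0 KiB` because `'1'` 
compares lower than `'2'`; comparing byte counts avoids this.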
   
   ### Does this PR introduce _any_ user-facing change?
   NO
   
   ### How was this patch tested?
   Only JS was modified. The change was tested manually and works as expected.
   
   before:
   ![data distribution table in storage tab sorted incorrectly](https://user-images.githubusercontent.com/52202080/115149934-d55c4800-a098-11eb-8b9f-18b25e09f374.png)
   
   after:
   ![after modification](https://user-images.githubusercontent.com/52202080/115150481-2d944980-a09b-11eb-8d46-13fd90fe6140.png)
   
   




