thomasrebele opened a new issue, #700:
URL: https://github.com/apache/datasketches-java/issues/700

   Is there a function similar to 
[QuantilesFloatsAPI#getRank](https://github.com/apache/datasketches-java/blob/37edd8c9898f8f073aaafad7097cb0949eb84c85/src/main/java/org/apache/datasketches/kll/KllFloatsSketch.java#L230),
 but which interpolates the results between the quantiles stored in the sketch? 
I looked around a bit, but didn't find that functionality. It could be added to 
[FloatsSketchSortedView](https://github.com/apache/datasketches-java/blob/37edd8c9898f8f073aaafad7097cb0949eb84c85/src/main/java/org/apache/datasketches/quantilescommon/FloatsSketchSortedView.java#L34).
   
   Background scenario: The Hive project uses KllFloatSketch for column 
histograms to estimate the selectivity of predicates. For range predicates, 
e.g., `80<x<90`,  it may happen that both range boundaries fall within the same 
bucket of the histogram, e.g., the nearest quantiles could be 50 and 100. In 
that case, the ranks for 80 and 90 (basically all values from 50 (exclusive) to 
100 (inclusive)) are the same as the rank of 100. See 
[HIVE-29365](https://issues.apache.org/jira/browse/HIVE-29365).


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to