tisonkun commented on issue #720: URL: https://github.com/apache/datasketches-java/issues/720#issuecomment-3864061134
Got it. Kudos to Kevin for his brilliant `CpcSketch`. We have three encoding tables here. The decoding tables are derived from encoding tables and the deriving logics are available. * LENGTH_LIMITED_UNARY_ENCODING_TABLE65 - having some comments, but doesn't tell how it is computed * COLUMN_PERMUTATIONS_FOR_ENCODING - having some comments, generated by "generatePermutationsForSLIDING.ml". We may reverse engineer from the comments to the original program * ENCODING_TABLES_FOR_HIGH_ENTROPY_BYTE - having some comments, generated by "columnProbabilities.ml" and "generateHuffmanCodes.ml". It is "23 length-limited Huffman codes". Not sure how much details are included in the "Back to the Future" paper. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
