Hi team,

This is a newbie question.
One of my friend in Taiwan is using Spark to write DataSketches to
Postgres. When it comes to estimation he got the data corruption error, and
then realize that the summary written in Postgres should be base64 encoded
to comply with the format.

https://github.com/apache/incubator-datasketches-postgresql/blob/3b553ef4dc7d2c988c41ab56695c5b082d3ce308/src/common.c#L37-L60

He found the other Postgres implementation of HLL does not do base64
though[1].

I just want to learn that what are the considerations for doing base64? Is
it a convention that should be easy to inference or we should document it?

Evans

[1]
https://github.com/citusdata/postgresql-hll?fbclid=IwAR3GP2xgdCOsESuKRsqU4mJ7oeE7p-CPGrgeVUODRwVVShiOGBETfz5A4T8

Reply via email to