ArnavBalyan commented on code in PR #532:
URL: https://github.com/apache/parquet-format/pull/532#discussion_r2554027728


##########
proposals/README.md:
##########
@@ -58,8 +58,7 @@ Once the implementation phase is finished, we can include the contribution in th
 
 | ID  | Description  | Status  |
 |-----|--------------|---------|
-| [github issue] | adding this new encoding | POC |
-| [github issue] | add Variant type | Implementation |
+| [Issue #531](https://github.com/apache/parquet-format/issues/531) | FSST support for Parquet format | Implementation |

Review Comment:
   Hi @emkornfield, I'll raise a draft PR for this. We would not need shared 
dictionaries, since the symbol table empirically works best at the per-page 
level and is only a few hundred bytes. The existing decoder holds the symbol 
table as a prefix, similar to the DELTA_BINARY_PACKED encoder. We can discuss 
more on the PR, thanks.
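
As a rough illustration of the "symbol table as a page prefix" idea, here is a minimal sketch. It is not the layout from the PR: the byte layout (one count byte, then length-prefixed symbols, then the code stream) and the names `write_page` and `read_page` are assumptions made up for this example. The only FSST properties relied on are that symbols are 1 to 8 bytes long and that code 255 escapes a literal byte. String boundaries within the page are left out for brevity.

```python
ESCAPE = 255  # FSST reserves code 255: the next byte is a literal, not a code


def write_page(symbols, codes):
    """Serialize a hypothetical page: symbol table prefix, then the code stream.

    Assumed layout (illustration only, not the PR's format):
      [symbol count: 1 byte] ([symbol length: 1 byte][symbol bytes])* [codes...]
    """
    if len(symbols) > 255:
        raise ValueError("FSST uses at most 255 symbols")
    out = bytearray([len(symbols)])
    for sym in symbols:
        if not 1 <= len(sym) <= 8:
            raise ValueError("FSST symbols are 1 to 8 bytes long")
        out.append(len(sym))
        out += sym
    out += codes
    return bytes(out)


def read_page(page):
    """Read the per-page symbol table prefix, then decode the remaining codes."""
    count = page[0]
    pos = 1
    symbols = []
    for _ in range(count):
        length = page[pos]
        symbols.append(page[pos + 1 : pos + 1 + length])
        pos += 1 + length

    decoded = bytearray()
    while pos < len(page):
        code = page[pos]
        if code == ESCAPE:
            decoded.append(page[pos + 1])  # literal byte follows the escape
            pos += 2
        else:
            decoded += symbols[code]  # expand the code to its symbol
            pos += 1
    return bytes(decoded)


# Usage: three symbols plus one escaped literal byte.
symbols = [b"http://", b"www.", b".com"]
codes = bytes([0, 1, ESCAPE, ord("a"), 2])
page = write_page(symbols, codes)
print(read_page(page))
```

Because the table travels with each page in this sketch, a reader needs no state beyond the page itself, which is the property that makes shared dictionaries unnecessary.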



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]