I would like to produce Parquet files from a "plain java command line
program" *without any framework like Spark or Hadoop* *or serialization
tools like proto buffers* - i.e. produce parquet from mainly arrays of
primitive java types with as good compression ratio and performance as I
can manage.

I have played around with a very nice library called Parquet-Carpet
<https://github.com/jerolba/parquet-carpet> that makes it very simple to
produce Parquet without frameworks in Java (that builds on the Apache
Parquet libraries) but it has some limitations that makes me curious to
also investigate if I could use the Apache Parquet Java libraries directly
but am having a hard time finding resources explaining how to get started.

Are there any "HelloParquet" examples that one could start from etc?

Reply via email to