On Fri, 25 Mar 2022 14:00:11 -0400
Vinoo Ganesh <[email protected]> wrote:
> Hi Antoine,
> Thanks for the feedback - I definitely agree. In rebuilding the website,
> I kept the content/taglines from the initial version, just to preserve
> parity. This tagline is something that certainly can and should be updated.
>
> I actually personally like this description of Parquet from Databricks' glossary:
>
> > Apache Parquet is an open source, column-oriented data file format
> > designed for efficient data storage and retrieval. It provides efficient
> > data compression and encoding schemes with enhanced performance to handle
> > complex data in bulk.
>
> What are your thoughts on migrating to this?

That's definitely much better, IMHO! Also, it would probably be useful to
add another sentence about the availability of implementations (so that
people know they may handle it from Java, C++, Python etc.).

Regards

Antoine.

>
> Thanks,
> Vinoo Ganesh | [email protected]
>
> <[email protected]>
>
> On Fri, Mar 25, 2022 at 1:48 PM Antoine Pitrou <[email protected]> wrote:
>
> > Hello,
> >
> > Just for the record, I find the introductory sentence a bit weird:
> > """
> > Apache Parquet is a columnar storage format available to any project in
> > the Hadoop ecosystem
> > """
> >
> > That is certainly true, but many people not using the Hadoop ecosystem
> > also read and write Parquet files, so that description seems to
> > downplay the advantages of Parquet. No intrinsic quality (such as
> > compactness) is mentioned, and Parquet almost looks like a legacy format
> > of some kind.
> >
> > Regards
> >
> > Antoine.
> >
> > On Fri, 25 Mar 2022 11:09:03 -0400
> > Vinoo Ganesh <[email protected]> wrote:
> > > Hi All,
> > > I'm excited to announce the launch of the new Parquet website -
> > > https://parquet.apache.org/. The new website uses Hugo <https://gohugo.io/>
> > > and is backed by the Docsy <https://www.docsy.dev/> theme.
> > >
> > > The new website simplifies both the documentation process, with support for
> > > creating PRs to update/modify the documentation directly from the website,
> > > as well as the release documentation process, where each release is a new
> > > blog post.
> > >
> > > Documentation for the development/release process of the website can be
> > > found here:
> > > <https://github.com/apache/parquet-site/tree/production#website-development-and-deployment>.
> > >
> > > Thanks to Xinli for his help getting this over the finish line.
> > >
> > > Please let me know if you have any feedback or feature requests.
> > >
> > > Thanks,
> > > Vinoo Ganesh | [email protected]
> > >
> > > <[email protected]>
