crepererum commented on code in PR #59:
URL: https://github.com/apache/parquet-site/pull/59#discussion_r1599844055

##########
content/en/docs/Overview/_index.md:
##########
@@ -6,4 +6,7 @@ description: >
   All about Parquet.
 ---
 
-Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model or programming language.
+Apache Parquet is an open source, column-oriented data file format designed for efficient data storage and retrieval.
+It provides efficient data compression and encoding schemes with enhanced performance to handle complex data in bulk.
+Parquet is available in multiple languages including Java, C++, and Python.

Review Comment:
   I think mentioning implementations (both end-user software and libraries) is valuable, but it shouldn't be part of the elevator pitch. Other formats usually handle this with a dedicated sub-section or page, e.g.:

   - https://jpeg.org/jpegxl/software.html (the list format is good, the fact that there's only a single implementation is not)
   - https://paseto.io/
   - https://autocrypt.org/dev-status.html

   This would also allow listing multiple implementations for a single language, which can sometimes be valuable (e.g. if you have a backwards-compatible, conservative variant and a fancy new one).

-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org