[PR] PARQUET-2470: Update website with larger ecosystem emphasis [parquet-site]

via GitHub Mon, 13 May 2024 13:17:17 -0700


alamb opened a new pull request, #59:
URL: https://github.com/apache/parquet-site/pull/59

# Rationale
As described on https://issues.apache.org/jira/browse/PARQUET-2470,
Parquet's role in the analytics ecosystem is substantial.

However, https://parquet.apache.org/ currently emphasis Parquet's role in
the Hadoop ecosystem. I think this causes confusion in several ways:

1. It implies that parquet is only focused on Hadoop, when I think it is a
critical technology across other ecosystems that are unrelated to hadoop (e.g.
Apache Iceberg, Delta Lake, etc)
2. It may further the perception that the Apache Parquet project only
focuses on / cares about Hadoop / Java implementation

# Chanages
Update the home page content to mirror the Apache Project Description
https://projects.apache.org/project.html?parquet (which does not mention Hadoop
specifically)

> Apache Parquet is an open source, column-oriented data file format
designed for efficient data storage and retrieval. It provides efficient data
compression and encoding schemes with enhanced performance to handle complex
data in bulk. Parquet is available in multiple languages including Java, C++,
and Python.

## Before this PR

![Screenshot 2024-05-13 at 4 13 31
PM](https://github.com/apache/parquet-site/assets/490673/86a76878-f304-4d43-8156-a3555ccebfbc)

## After the PR

![Screenshot 2024-05-13 at 4 15 17
PM](https://github.com/apache/parquet-site/assets/490673/7479dd8f-3054-410e-9c14-4a8d2a0dccaa)

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[PR] PARQUET-2470: Update website with larger ecosystem emphasis [parquet-site]

Reply via email to