alamb opened a new pull request, #59:
URL: https://github.com/apache/parquet-site/pull/59

   # Rationale
   As described on https://issues.apache.org/jira/browse/PARQUET-2470, 
Parquet's role in the analytics ecosystem is substantial. 
   
   However, https://parquet.apache.org/ currently emphasis Parquet's role in 
the Hadoop ecosystem. I think this causes confusion in several ways:
   
   1. It implies that parquet is only focused on Hadoop, when I think it is a 
critical technology across other ecosystems that are unrelated to hadoop (e.g. 
Apache Iceberg, Delta Lake, etc)
   2. It may further the perception that the Apache Parquet project only 
focuses on / cares about Hadoop / Java implementation
   
   # Chanages
   Update the home page content to mirror the Apache Project Description 
https://projects.apache.org/project.html?parquet (which does not mention Hadoop 
specifically)
   
   > Apache Parquet is an open source, column-oriented data file format 
designed for efficient data storage and retrieval. It provides efficient data 
compression and encoding schemes with enhanced performance to handle complex 
data in bulk. Parquet is available in multiple languages including Java, C++, 
and Python.
   
   
   
   ## Before this PR
   
   ![Screenshot 2024-05-13 at 4 13 31 
PM](https://github.com/apache/parquet-site/assets/490673/86a76878-f304-4d43-8156-a3555ccebfbc)
   
   
   ## After the PR
   
   ![Screenshot 2024-05-13 at 4 15 17 
PM](https://github.com/apache/parquet-site/assets/490673/7479dd8f-3054-410e-9c14-4a8d2a0dccaa)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@parquet.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to