This is an automated email from the ASF dual-hosted git repository.
gangwu pushed a commit to branch production
in repository https://gitbox.apache.org/repos/asf/parquet-site.git
The following commit(s) were added to refs/heads/production by this push:
new 5f690a3 PARQUET-2470: Update website with larger ecosystem emphasis
(#59)
5f690a3 is described below
commit 5f690a3d0b89d8680a3f610e9241b040f7e95a58
Author: Andrew Lamb <[email protected]>
AuthorDate: Thu May 16 10:20:10 2024 -0400
PARQUET-2470: Update website with larger ecosystem emphasis (#59)
Co-authored-by: Ed Seidl <[email protected]>
---
content/en/_index.md | 5 ++++-
content/en/docs/Overview/_index.md | 6 +++---
static/doap_Parquet.rdf | 2 +-
3 files changed, 8 insertions(+), 5 deletions(-)
diff --git a/content/en/_index.md b/content/en/_index.md
index fddd023..1644d4c 100644
--- a/content/en/_index.md
+++ b/content/en/_index.md
@@ -9,7 +9,10 @@ title: Parquet
<a class="btn btn-lg btn-secondary me-3 mb-4" href="/blog/">
Download <i class="fab fa-github ms-2 "></i>
</a>
-<p class="lead mt-5">Apache Parquet is a columnar storage format available to
any project in the Hadoop ecosystem, regardless of the choice of data
processing framework, data model or programming language.</p>
+<p class="lead mt-5">
+Apache Parquet is an open source, column-oriented data file format designed
for efficient data storage and retrieval.
+It provides high performance compression and encoding schemes to handle
complex data in bulk and is supported in many programming language and
analytics tools.
+</p>
{{< blocks/link-down color="info" >}}
{{< /blocks/cover >}}
diff --git a/content/en/docs/Overview/_index.md
b/content/en/docs/Overview/_index.md
index 58e9e1d..a8bddb8 100644
--- a/content/en/docs/Overview/_index.md
+++ b/content/en/docs/Overview/_index.md
@@ -6,11 +6,11 @@ description: >
All about Parquet.
---
-Apache Parquet is a columnar storage format available to any project in the
Hadoop ecosystem, regardless of the choice of data processing framework, data
model or programming language.
+Apache Parquet is an open source, column-oriented data file format designed
for efficient data storage and retrieval.
+It provides high performance compression and encoding schemes to handle
complex data in bulk and is supported in many programming language and
analytics tools.
This documentation contains information about both the
[parquet-mr](https://github.com/apache/parquet-mr) and
[parquet-format](https://github.com/apache/parquet-format) repositories.
-
### parquet-format
The parquet-format repository hosts the official specification of the Apache
Parquet file format, defining how data is structured and stored. This
specification, along with Thrift metadata definitions and other crucial
components, is essential for developers to effectively read and write Parquet
files. The parquet-format project specifically contains the format
specifications needed to understand and properly utilize Parquet files.
@@ -43,4 +43,4 @@ Here is a non-exhaustive list of Parquet implementations:
* [cuDF](https://github.com/rapidsai/cudf)
* [Apache Impala](https://github.com/apache/impala)
* [DuckDB](https://github.com/duckdb/duckdb)
-* [fastparquet, a Python implementation of the Apache Parquet
format](https://github.com/dask/fastparquet)
\ No newline at end of file
+* [fastparquet, a Python implementation of the Apache Parquet
format](https://github.com/dask/fastparquet)
diff --git a/static/doap_Parquet.rdf b/static/doap_Parquet.rdf
index ca14f2c..939f29c 100644
--- a/static/doap_Parquet.rdf
+++ b/static/doap_Parquet.rdf
@@ -28,7 +28,7 @@
<homepage rdf:resource="http://parquet.apache.org" />
<asfext:pmc rdf:resource="http://parquet.apache.org" />
<shortdesc>Apache Parquet is a general-purpose columnar storage
format.</shortdesc>
- <description>Apache Parquet is an open source, column-oriented data file
format designed for efficient data storage and retrieval. It provides efficient
data compression and encoding schemes with enhanced performance to handle
complex data in bulk. Parquet is available in multiple languages including
Java, C++, and Python.</description>
+ <description>Apache Parquet is an open source, column-oriented data file
format designed for efficient data storage and retrieval. It provides high
performance compression and encoding schemes to handle complex data in bulk and
is supported in many programming language and analytics tools.</description>
<bug-database rdf:resource="https://issues.apache.org/jira/browse/PARQUET"
/>
<mailing-list rdf:resource="https://parquet.apache.org/community/" />
<download-page
rdf:resource="https://parquet.apache.org/blog/2023/05/18/1.13.1/" />