This is an automated email from the ASF dual-hosted git repository.
gangwu pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-java.git
The following commit(s) were added to refs/heads/master by this push:
new 6809a1859 PARQUET-2478: Update README with link to parquet website (#1355)
6809a1859 is described below
commit 6809a18597e56709696af58313343aeea93f0b99
Author: Andrew Lamb <[email protected]>
AuthorDate: Tue May 21 21:21:17 2024 -0400
PARQUET-2478: Update README with link to parquet website (#1355)
Update the introductory content to reduce confusion about parquet in
general.
---
README.md | 16 ++++++++++++----
1 file changed, 12 insertions(+), 4 deletions(-)
diff --git a/README.md b/README.md
index fa947935b..74987cb94 100644
--- a/README.md
+++ b/README.md
@@ -20,11 +20,19 @@
Parquet MR [](https://github.com/apache/parquet-mr/actions)
======
-Parquet-MR contains the java implementation of the [Parquet format](https://github.com/apache/parquet-format).
-Parquet is a columnar storage format for Hadoop; it provides efficient storage and encoding of data.
-Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
+This repository contains a Java implementation of [Apache Parquet](https://parquet.apache.org/)
+
+Apache Parquet is an open source, column-oriented data file format
+designed for efficient data storage and retrieval. It provides high
+performance compression and encoding schemes to handle complex data in
+bulk and is supported in many programming languages and analytics
+tools.
-You can find some details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
+The [parquet-format](https://github.com/apache/parquet-format)
+repository contains the file format specification.
+
+Parquet uses the [record shredding and assembly algorithm](https://github.com/julienledem/redelm/wiki/The-striping-and-assembly-algorithms-from-the-Dremel-paper) described in the Dremel paper to represent nested structures.
+You can find additional details about the format and intended use cases in our [Hadoop Summit 2013 presentation](http://www.slideshare.net/julienledem/parquet-hadoop-summit-2013)
## Building