This is an automated email from the ASF dual-hosted git repository.

wesm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/parquet-testing.git


The following commit(s) were added to refs/heads/master by this push:
     new d914f9d  PARQUET-1878: Fix legacy LZ4-compressed file (#14)
d914f9d is described below

commit d914f9d289488c7db1759d7a88a4a1b8f062c7dd
Author: Antoine Pitrou <[email protected]>
AuthorDate: Tue Sep 22 16:44:12 2020 +0200

    PARQUET-1878: Fix legacy LZ4-compressed file (#14)
    
    A Snappy-compressed file was mistakingly uploaded.
    Replace it with a "raw" LZ4-compressed file (generated by Arrow 0.17.0).
    This is the LZ4 block format without the custom Hadoop header.
    
    ```
    Version: 1.0
    Created By: parquet-cpp version 1.5.1-SNAPSHOT
    Total rows: 4
    Number of RowGroups: 1
    Number of Real Columns: 3
    Number of Columns: 3
    Number of Selected Columns: 3
    Column 0: c0 (INT64)
    Column 1: c1 (BYTE_ARRAY)
    Column 2: v11 (DOUBLE)
    --- Row Group: 0 ---
    --- Total Bytes: 281 ---
    --- Rows: 4 ---
    Column 0
      Values: 4, Null Values: 0, Distinct Values: 0
      Max: 1593604801, Min: 1593604800
      Compression: LZ4_RAW, Encodings: PLAIN_DICTIONARY PLAIN RLE
      Uncompressed Size: 100, Compressed Size: 103
    Column 1
      Values: 4, Null Values: 0, Distinct Values: 0
      Max: def, Min: abc
      Compression: LZ4_RAW, Encodings: PLAIN_DICTIONARY PLAIN RLE
      Uncompressed Size: 68, Compressed Size: 70
    Column 2
      Values: 4, Null Values: 0, Distinct Values: 0
      Max: 42.125, Min: 7.7
      Compression: LZ4_RAW, Encodings: PLAIN_DICTIONARY PLAIN RLE
      Uncompressed Size: 109, Compressed Size: 108
    --- Values ---
    c0                            |c1                            |v11           
                |
    1593604800                    |abc                           |42.000000     
                |
    1593604800                    |def                           |7.700000      
                |
    1593604801                    |abc                           |42.125000     
                |
    1593604801                    |def                           |7.700000      
                |
    ```
---
 data/non_hadoop_lz4_compressed.parquet | Bin 2795 -> 1228 bytes
 1 file changed, 0 insertions(+), 0 deletions(-)

diff --git a/data/non_hadoop_lz4_compressed.parquet 
b/data/non_hadoop_lz4_compressed.parquet
index 9ec286f..cfbdc7e 100644
Binary files a/data/non_hadoop_lz4_compressed.parquet and 
b/data/non_hadoop_lz4_compressed.parquet differ

Reply via email to