[ 
https://issues.apache.org/jira/browse/PARQUET-2333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17753911#comment-17753911
 ] 

ASF GitHub Bot commented on PARQUET-2333:
-----------------------------------------

sekikn commented on code in PR #1131:
URL: https://github.com/apache/parquet-mr/pull/1131#discussion_r1293013732


##########
parquet-cli/pom.xml:
##########
@@ -110,6 +110,12 @@
       <artifactId>avro</artifactId>
       <version>${avro.version}</version>
     </dependency>
+    <dependency>
+      <groupId>org.tukaani</groupId>

Review Comment:
   I don't think there is a license issue here, since the [Java 
implementation](https://tukaani.org/xz/java.html) of XZ is in the [public 
domain](https://github.com/tukaani-project/xz-java/blob/v1.9/COPYING), which is 
classified as [Category 
A](https://www.apache.org/legal/resolved.html#handling-public-domain-licensed-works).
 It also doesn't appear to have any dependencies of its own.
   
   As precedent, Druid, HBase and Spark already declare it as a dependency:
   https://github.com/apache/druid/blob/druid-27.0.0/pom.xml#L527-L531
   https://github.com/apache/hbase/blob/rel/2.5.5/pom.xml#L1470-L1474
   https://github.com/apache/spark/blob/v3.4.1/pom.xml#L1489-L1493





> Support bzip2 and xz compressions in the to-avro subcommand
> -----------------------------------------------------------
>
>                 Key: PARQUET-2333
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2333
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-cli
>            Reporter: Kengo Seki
>            Assignee: Kengo Seki
>            Priority: Major
>
> parquet-cli's {{to-avro}} subcommand currently supports only gzip, snappy and 
> zstd as compression codecs, but Avro itself also supports bzip2 and xz.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
