[ 
https://issues.apache.org/jira/browse/SPARK-8118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

KaiXinXIaoLei reopened SPARK-8118:
----------------------------------

I  find parquet log is still  noisy log output. eg:
{noformat}
vm3:/opt/apache/hadoop/logs/userlogs/application_1464609606092_0008 # head 
container_1464609606092_0008_01_000002/stdout 
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.codec.CodecConfig: 
Compression: GZIP
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Parquet block size to 134217728
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Parquet page size to 1048576
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Parquet dictionary page size to 1048576
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Dictionary is on
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Validation is off
May 31, 2016 3:18:11 PM INFO: org.apache.parquet.hadoop.ParquetOutputFormat: 
Writer version is: PARQUET_1_0
May 31, 2016 3:18:13 PM INFO: 
org.apache.parquet.hadoop.InternalParquetRecordWriter: Flushing mem columnStore 
to file. allocated memory: 2,882,551
{noformat}

> Turn off noisy log output produced by Parquet 1.7.0
> ---------------------------------------------------
>
>                 Key: SPARK-8118
>                 URL: https://issues.apache.org/jira/browse/SPARK-8118
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 1.4.1, 1.5.0
>            Reporter: Cheng Lian
>            Assignee: Cheng Lian
>            Priority: Minor
>             Fix For: 1.5.0
>
>
> Parquet 1.7.0 renames package name to "org.apache.parquet", need to adjust 
> {{ParquetRelation.enableLogForwarding}} accordingly to avoid noisy log output.
> A better approach than simply muting these log lines is to redirect Parquet 
> logs via SLF4J, so that we can handle them consistently. In general these 
> logs are very useful. Esp. when used to diagnosing Parquet memory issue and 
> filter push-down.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to