This is an automated email from the ASF dual-hosted git repository.

dongjoon pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/master by this push:
     new 51973aa4c7b7 [SPARK-45423][SQL] Lower `ParquetWriteSupport` log level to debug
51973aa4c7b7 is described below

commit 51973aa4c7b7d6b1b0b98b943f4cf78e27475519
Author: Dongjoon Hyun <[email protected]>
AuthorDate: Thu Oct 5 10:21:53 2023 -0700

    [SPARK-45423][SQL] Lower `ParquetWriteSupport` log level to debug
    
    ### What changes were proposed in this pull request?
    
    This PR lowers the `ParquetWriteSupport` log level from INFO to DEBUG.
    
    ### Why are the changes needed?
    
    Currently, `ParquetWriteSupport` is too verbose at INFO level because it dumps the Parquet file schema for each file. Since this is the only log statement in `ParquetWriteSupport`, users who want to debug jobs can still see it by enabling DEBUG for this class in `log4j2.properties`.
     ```
    23/10/05 16:29:43 INFO ParquetOutputFormat: ParquetRecordWriter [block size: 134217728b, row group padding size: 8388608b, validating: false]
    23/10/05 16:29:43 INFO ParquetWriteSupport: Initialized Parquet WriteSupport with Catalyst schema:
    {
      "type" : "struct",
      "fields" : [ {
        "name" : "id",
        "type" : "long",
        "nullable" : false,
        "metadata" : { }
      } ]
    }
    and corresponding Parquet message type:
    message spark_schema {
      required int64 id;
    }
    
    23/10/05 16:29:43 INFO MagicCommitTracker: ...
    ```
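    
    For readers who still want the schema dump after this change, a `log4j2.properties` fragment along the following lines should restore it. This is a minimal sketch, not part of the patch: the logger id `parquet_write_support` is an arbitrary label chosen for illustration, and the logger name assumes the class lives in the package implied by the file path in the diff.
    
    ```
    # Assumption: logger name derived from the source path of ParquetWriteSupport.scala.
    # "parquet_write_support" is an arbitrary identifier; any unique id works.
    logger.parquet_write_support.name = org.apache.spark.sql.execution.datasources.parquet.ParquetWriteSupport
    logger.parquet_write_support.level = debug
    ```
    
    Scoping the setting to this one logger keeps the rest of the job at its existing log level, so only the per-file schema message comes back.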
    
    ### Does this PR introduce _any_ user-facing change?
    
    No.
    
    ### How was this patch tested?
    
    Manual tests.
    
    ### Was this patch authored or co-authored using generative AI tooling?
    
    No.
    
    Closes #43230 from dongjoon-hyun/SPARK-45423.
    
    Authored-by: Dongjoon Hyun <[email protected]>
    Signed-off-by: Dongjoon Hyun <[email protected]>
---
 .../spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala   | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala b/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
index f6248d43c48e..9535bbd585bc 100644
--- a/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
+++ b/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetWriteSupport.scala
@@ -132,7 +132,7 @@ class ParquetWriteSupport extends WriteSupport[InternalRow] with Logging {
       }
     }
 
-    logInfo(
+    logDebug(
       s"""Initialized Parquet WriteSupport with Catalyst schema:
          |${schema.prettyJson}
          |and corresponding Parquet message type:

