AngersZhuuuu commented on pull request #32018:
URL: https://github.com/apache/spark/pull/32018#issuecomment-811719759
What confused me is that
```
spark-sql> CREATE TABLE t(i STRING, c string) USING PARQUET PARTITIONED BY
(c);
Time taken: 2.12 seconds
spark-sql> INSERT OVERWRITE t PARTITION (c=null) VALUES ('1');
Time taken: 4.984 seconds
spark-sql> desc formatted t partition(c=null);
i string NULL
c string NULL
# Partition Information
# col_name data_type comment
c string NULL
# Detailed Partition Information
Database default
Table t
Partition Values [c=null]
Location hdfs://tl0/user/hive/warehouse/t/c=null
Serde Library
org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe
InputFormat org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
OutputFormat org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat
Storage Properties [path=hdfs://tl0/user/hive/warehouse/t,
serialization.format=1]
Partition Parameters {rawDataSize=-1, numFiles=1,
transient_lastDdlTime=1617244501, totalSize=396, COLUMN_STATS_ACCURATE=false,
numRows=-1}
Created Time Thu Apr 01 10:35:01 SGT 2021
Last Access UNKNOWN
Partition Statistics 396 bytes
# Storage Information
Location hdfs://tl0/user/hive/warehouse/t
Serde Library
org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe
InputFormat org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat
OutputFormat org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat
Time taken: 0.135 seconds, Fetched 25 row(s)
```
The path can be c=null, for current code, which case the path will be null
and which case it can be `__HIVE_DEFAULT_PARTITION__`?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]