Kevin-Li-2025 opened a new pull request, #23226:
URL: https://github.com/apache/datafusion/pull/23226

   ## Which issue does this PR close?
   
   - Closes #19650.
   
   ## Rationale for this change
   
   Hive-style partition values can contain percent-encoded characters in 
object-store paths, such as `%2F` for `/` or `%20` for a space. 
`parse_partitions_for_path` currently returns those encoded bytes literally, so 
listing tables expose `foo%2Fbar` instead of `foo/bar`.
   
   ## What changes are included in this PR?
   
   - Percent-decode extracted partition values in `parse_partitions_for_path`.
   - Return `Cow<str>` from the parser so unchanged values keep the borrowed 
fast path and decoded values can be owned only when needed.
   - Fall back to the original raw partition value if percent decoding does not 
produce valid UTF-8, rather than dropping the file from listing results.
   - Add helper-level and `PartitionedFile` conversion tests for decoded 
partition values.
   
   ## Are these changes tested?
   
   - `cargo fmt --all --check`
   - `cargo test -p datafusion-catalog-listing`
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to