[ 
https://issues.apache.org/jira/browse/CALCITE-5820?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17741795#comment-17741795
 ] 

Runkang He commented on CALCITE-5820:
-------------------------------------

[~julianhyde] The reason is that {{URL.getPath()}} does not decode %20 to a 
space, when converting URL to File's pathname arg, File does not recongnize 
undecoded %20 characters and causes file not found exception.

I've found an equivalent method to replace URL.getPath, use URI.getRawPath 
instead. And the behaviour is same as Hive's implementation, including won't 
decode %20 to a space.

> Add PARSE_URL function (enabled in Hive and Spark library)
> ----------------------------------------------------------
>
>                 Key: CALCITE-5820
>                 URL: https://issues.apache.org/jira/browse/CALCITE-5820
>             Project: Calcite
>          Issue Type: New Feature
>          Components: core
>    Affects Versions: 1.34.0
>            Reporter: Runkang He
>            Assignee: Runkang He
>            Priority: Major
>              Labels: pull-request-available
>
> Add PARSE_URL (enabled in Hive and Spark library):
> PARSE_URL: Returns the specified part from the URL. Valid values for 
> partToExtract include HOST, PATH, QUERY, REF, PROTOCOL, AUTHORITY, FILE, and 
> USERINFO.
> For example, parse_url('http://facebook.com/path1/p.php?k1=v1&k2=v2#Ref1', 
> 'HOST') returns 'facebook.com'.
> See more details in 
> [Hive|https://cwiki.apache.org/confluence/display/hive/languagemanual+udf#LanguageManualUDF-StringFunctions]
>  and 
> [Spark|https://spark.apache.org/docs/latest/api/sql/index.html#parse_url] doc.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to