[GitHub] [iceberg] rdblue commented on a change in pull request #4244: Core - Add in RESTUtil class for encoding namespace as URL variable

GitBox Mon, 07 Mar 2022 13:47:32 -0800


rdblue commented on a change in pull request #4244:
URL: https://github.com/apache/iceberg/pull/4244#discussion_r821130586




##########
File path: core/src/main/java/org/apache/iceberg/rest/RESTUtil.java
##########
@@ -0,0 +1,122 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *   http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing,
+ * software distributed under the License is distributed on an
+ * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+ * KIND, either express or implied.  See the License for the
+ * specific language governing permissions and limitations
+ * under the License.
+ */
+
+package org.apache.iceberg.rest;
+
+import java.io.UncheckedIOException;
+import java.io.UnsupportedEncodingException;
+import java.net.URLDecoder;
+import java.net.URLEncoder;
+import java.nio.charset.StandardCharsets;
+import java.util.Map;
+import org.apache.iceberg.catalog.Namespace;
+import org.apache.iceberg.relocated.com.google.common.base.Joiner;
+import org.apache.iceberg.relocated.com.google.common.base.Preconditions;
+import org.apache.iceberg.relocated.com.google.common.base.Splitter;
+import org.apache.iceberg.relocated.com.google.common.collect.Maps;
+import org.apache.iceberg.relocated.com.google.common.collect.Streams;
+
+class RESTUtil {
+  private static final Joiner NULL_JOINER = Joiner.on('\u0000');
+  private static final Splitter NULL_SPLITTER = Splitter.on('\u0000');
+
+  private RESTUtil() {
+  }
+
+  public static String stripTrailingSlash(String path) {
+    if (path == null) {
+      return null;
+    }
+
+    return path.endsWith("/") ? path.substring(0, path.length() - 1) : path;
+  }
+
+  /**
+   * Takes in a map, and returns a copy with entries with keys beginning with 
the designated prefix
+   * with the prefix removed. Any entries whose keys don't begin with the 
prefix are not returned.
+   * <p>
+   * This can be used to get a subset of the configuration related to the REST 
catalog, such as all
+   * properties from a prefix of `spark.sql.catalog.my_catalog.rest.` to get 
REST catalog specific properties
+   * from the spark configuration.
+   */
+  public static Map<String, String> filterByPrefix(Map<String, String> 
properties, String prefix) {
+    Preconditions.checkNotNull(properties, "Invalid properties map: null");
+    Map<String, String> result = Maps.newHashMap();
+    properties.forEach((key, value) -> {
+      if (key != null && key.startsWith(prefix)) {
+        result.put(key.substring(prefix.length()), value);
+      }
+    });
+
+    return result;
+  }
+
+  /**
+   * Returns a String representation of a namespace that is suitable for use 
in a URL / URI.
+   * <p>
+   * This function needs to be called when a namespace is used as a path 
variable (or query parameter etc.),
+   * to format the namespace per the spec.
+   * <p>
+   * {@link #urlDecode} should be used to parse the namespace from a URL 
parameter.
+   * @param ns namespace to encode
+   * @return UTF-8 encoded string representing the namespace, suitable for use 
as a URL parameter
+   */
+  public static String urlEncode(Namespace ns) {
+    Preconditions.checkArgument(ns != null, "Invalid namespace: null");
+    String[] levels = ns.levels();
+    String[] encodedLevels = new String[levels.length];
+
+    for (int i = 0; i < levels.length; i++) {
+      try {
+        encodedLevels[i] = URLEncoder.encode(levels[i], 
StandardCharsets.UTF_8.name());
+      } catch (UnsupportedEncodingException e) {
+        throw new UncheckedIOException(
+            String.format("Failed to URL encode namespace '%s' as UTF-8 
encoding is not supported", ns), e);
+      }
+    }
+
+    return NULL_JOINER.join(encodedLevels);
+  }
+
+  /**
+   * Takes in a string representation of a namespace as used for a URL 
parameter
+   * and returns the corresponding namespace.
+   * <p>
+   * See also {@link #urlEncode} for generating correctly formatted URLs.
+   */
+  public static Namespace urlDecode(String encodedNs) {
+    Preconditions.checkArgument(encodedNs != null, "Invalid namespace: null");
+    Iterable<String> encodedLevels = NULL_SPLITTER.split(encodedNs);
+    String[] decodedLevels =

Review comment:
       Nit: It seems strange that the method above allocates a string and uses 
a for loop, while this one uses a stream and converts using `toArray`. I 
usually like to keep methods that work basically the same way consistent so 
they're easy to understand if you know what is happening in one. I'd probably 
reformat like the `urlEncode` method.




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [iceberg] rdblue commented on a change in pull request #4244: Core - Add in RESTUtil class for encoding namespace as URL variable

Reply via email to