[GitHub] [hudi] nsivabalan commented on a change in pull request #3768: [HUDI-2494] Fixing glob pattern to skip all hoodie meta paths

GitBox Mon, 11 Oct 2021 16:04:39 -0700


nsivabalan commented on a change in pull request #3768:
URL: https://github.com/apache/hudi/pull/3768#discussion_r726622650




##########
File path: 
hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala
##########
@@ -64,8 +64,24 @@ object HoodieSparkUtils extends SparkAdapterSupport {
    * [[org.apache.spark.deploy.SparkHadoopUtil]] becomes private since Spark 
3.0.0 and hence we had to copy it locally.
    */
   def globPath(fs: FileSystem, pattern: Path): Seq[Path] = {
-    Option(fs.globStatus(pattern)).map { statuses =>
-      statuses.map(_.getPath.makeQualified(fs.getUri, 
fs.getWorkingDirectory)).toSeq
+    // find base path to assist in skipping meta paths

Review comment:
       not sure if it is required. I did not see a reason why do we need to 
include meta paths in any of our flows. thats why left it as is. Let me know if 
there is a need for one. 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [hudi] nsivabalan commented on a change in pull request #3768: [HUDI-2494] Fixing glob pattern to skip all hoodie meta paths

Reply via email to