nsivabalan commented on a change in pull request #3768:
URL: https://github.com/apache/hudi/pull/3768#discussion_r726622650
##########
File path:
hudi-client/hudi-spark-client/src/main/scala/org/apache/hudi/HoodieSparkUtils.scala
##########
@@ -64,8 +64,24 @@ object HoodieSparkUtils extends SparkAdapterSupport {
* [[org.apache.spark.deploy.SparkHadoopUtil]] becomes private since Spark 3.0.0 and hence we had to copy it locally.
*/
def globPath(fs: FileSystem, pattern: Path): Seq[Path] = {
- Option(fs.globStatus(pattern)).map { statuses =>
- statuses.map(_.getPath.makeQualified(fs.getUri, fs.getWorkingDirectory)).toSeq
+ // find base path to assist in skipping meta paths
Review comment:
I'm not sure it is required. I did not see any reason to include meta paths
in any of our flows, which is why I left it as is. Let me know if there is a
need for it.