amaranathv commented on issue #741: java.lang.NoClassDefFoundError: com/uber/hoodie/exception/HoodieException URL: https://github.com/apache/incubator-hudi/issues/741#issuecomment-503164955 I am getting invalid data path now. InValid data paths=[file:///mapr/user/avenka23/delta-streamer/output5/2018/08/31/fe957979-67f8-4b68-a371-6c789e81339c-0_0-21-23_20190618093523.parquet] Is there any property needs to be set or environment variables? I am trying to mimic the docker demo using deltastreamer command line to ingest data from json into hoodie data set. $ spark-submit --master yarn --class com.uber.hoodie.utilities.deltastreamer.HoodieDeltaStreamer `ls /mapr/user/avenka23/hoodie/incubator-hudi/packaging/hoodie-utilities-bundle/target/hoodie-utilities-bundle*-SNAPSHOT.jar` --props file:///mapr/user/avenka23/delta-streamer/config/dfs-source.properties --schemaprovider-class com.uber.hoodie.utilities.schema.FilebasedSchemaProvider --source-class com.uber.hoodie.utilities.sources.JsonDFSSource --source-ordering-field ts --target-base-path file:///mapr/user/avenka23/delta-streamer/output5 --target-table stock_ticks_cow --storage-type COPY_ON_WRITE 19/06/18 09:35:00 WARN Client: Neither spark.yarn.jars nor spark.yarn.archive is set, falling back to uploading libraries under SPARK_HOME. ERROR StatusLogger No log4j2 configuration file found. Using default configuration: logging only errors to the console. 19/06/18 09:35:19 WARN SparkContext: Using an existing SparkContext; some configuration may not take effect. 19/06/18 09:35:28 WARN HoodieTable: InValid data paths=[file:///mapr/user/avenka23/delta-streamer/output5/2018/08/31/fe957979-67f8-4b68-a371-6c789e81339c-0_0-21-23_20190618093523.parquet]
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
