Angel Conde created HUDI-3994:
----------------------------------
Summary: HoodieDeltaStreamer - Spark master shouldn't have a
default
Key: HUDI-3994
URL: https://issues.apache.org/jira/browse/HUDI-3994
Project: Apache Hudi
Issue Type: Improvement
Components: deltastreamer
Reporter: Angel Conde
Fix For: 0.11.0
When trying to run HoodieDeltaStreamer on AWS Glue I found that the Spark
master has no option to inherit from the environment as it defaults to
{{{}local[2]{}}}. In these kind of Serverless environments where you do not
have access to the master this configuration should be inherited
This can be seen on line 329 on
[HoodieDeltaStreamer|https://github.com/apache/hudi/blob/master/hudi-utilities/src/main/java/org/apache/hudi/utilities/deltastreamer/HoodieDeltaStreamer.java].
{{public String sparkMaster = "local[2]";}}
This should be changed for supporting this kind of scenarios, a
JavaSparkContext option where no Spark master is defined should be there.
*Expected behavior*
The Spark master shouldn't have a default as there are some environments
(usually serverless such as AWS Glue) where it will be inherited.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)