Yes, it's what I saw.

For the heavy dependency, that's why I defined "optional": the user has to specify it and use a specific configuration (knowing what he does).

For the OOZIE, I agree but again it requires a workflow.xml for Oozie. My plan is to avoid for the users to provide a workflow.xml, and instead be able to use the process configuration to define the job to run (directly MapReduce, spark, etc).

Regards
JB

On 02/03/2015 06:30 PM, Srikanth Sundarrajan wrote:
Yes support for Spark was specifically added through 
https://issues.apache.org/jira/browse/OOZIE-1983, to allow users of Oozie or 
Falcon to run Spark jobs. Moving the retention job to Spark would create a 
heavy dependency on spark within Falcon.

With https://issues.apache.org/jira/browse/FALCON-965, it should be possible to 
create an alternate implementation of eviction.

Regards
Srikanth Sundarrajan

Date: Tue, 3 Feb 2015 15:51:38 +0100
From: [email protected]
To: [email protected]
Subject: Falcon with Spark ?

Hi all,

I'm working (and finally resuming my work ;)) on some Falcon features:
- Update and improvements on the ActiveMQ broker
- Complete CDC support of diff/gap storage
- Support of more workflow entities (mapreduce directly instead of Oozie
workflow.xml, etc)

For the workflow entities, I would like to evaluate the "direct" support
of Spark.
Generally speaking, I wonder if "oppositionally", we couldn't leverage
Spark for some internal Falcon processes (like eviction, etc).

WDYT ?

Regards
JB
--
Jean-Baptiste Onofré
[email protected]
http://blog.nanthrax.net
Talend - http://www.talend.com
                                        


--
Jean-Baptiste Onofré
[email protected]
http://blog.nanthrax.net
Talend - http://www.talend.com

Reply via email to