From what I've seen so far (as a user):

1. No periodic job support. 

Are you differentiating between exploratory / presentation work vs production 
pipelines? In my group's work, we would just use cron

2. No built-in authentication -- I created a JIRA for that.

There is a thread about it though. I've had some initial success with nginx as 
a reverse proxy for authentication. But we are running a separate Zeppelin 
instance under each user's account.

3. Zeppelin supports PySpark

> On Apr 1, 2015, at 6:43 PM, Guilherme Silveira <[email protected]> 
> wrote:
> 
> Hi folks, 
> 
> 
> I saw this project and it really excited me.
> I am considering using it, but I have some philosophical questions...
> 
> For me, the benchmark is Databricks Cloud, but it only works in AWS and I 
> need to deploy it onpremise.
> It has all features I need to far, and probably more.
> 
> Do you have plans to add the features below?
> 
> 1- (MUST HAVE) Add jobs scheduler -> the ability to schedule periodic jobs 
> based on time interval (run it every night, so example)
> 2- Login authentication using LDAP
> 3- Add support for python
> 
> 
> Second part:
> 
> What is the relationship between this project and 
> https://twitter.com/SparkNotebook ?
> 
> 
> 
> 
> 

Reply via email to