From what I've seen so far (as a user): 1. No periodic job support.
Are you differentiating between exploratory / presentation work vs production pipelines? In my group's work, we would just use cron 2. No built-in authentication -- I created a JIRA for that. There is a thread about it though. I've had some initial success with nginx as a reverse proxy for authentication. But we are running a separate Zeppelin instance under each user's account. 3. Zeppelin supports PySpark > On Apr 1, 2015, at 6:43 PM, Guilherme Silveira <[email protected]> > wrote: > > Hi folks, > > > I saw this project and it really excited me. > I am considering using it, but I have some philosophical questions... > > For me, the benchmark is Databricks Cloud, but it only works in AWS and I > need to deploy it onpremise. > It has all features I need to far, and probably more. > > Do you have plans to add the features below? > > 1- (MUST HAVE) Add jobs scheduler -> the ability to schedule periodic jobs > based on time interval (run it every night, so example) > 2- Login authentication using LDAP > 3- Add support for python > > > Second part: > > What is the relationship between this project and > https://twitter.com/SparkNotebook ? > > > > >
