This plan sounds great of having a script that can install individual available interpreters. It sounds reasonable to always ship with a spark version for now for lack of complexity in installing that.
The original list of min interpreters seems good, too. The generic jdbc interpreter handles a lot of cases (as far as I understand, PostgreSQL interpreter is actually already a generic jdbc one). Anyhow, my two cents as a new user. Cameron On Fri, Jun 17, 2016 at 11:50 AM, moon soo Lee <m...@apache.org> wrote: > Hi, > > Thanks for bringing this discussion. > it's great idea minimize binary package size. > > Can we set a policy to decide which interpreter goes to > 'zeppelin-bin-min', which is not? > > One alternative is, instead of making 'zeppelin-bin-min', we can make > 'zeppelin-bin-netinst'. > We can provide a shell script such as, 'bin/install-interpreter.sh' and > the script will download interpreters and their dependencies from maven > repository and store under /interpreter dir. By leveraging > DependencyResolver[1], > i think we can make this feature in couple of hours. > > Only spark interpreter can not be installed in simple way, while it > requires some python and R packages under /interpreter dir and they're not > available on maven repository, so it'll need special treatment, but all > other interpreters can be installed in the simple way. > > Then, 'zeppelin-bin-netinst' version can have minimal package size, and > still gives easy way to install all the interpreters. > Also 'bin/install-interpreter.sh' will still useful even if we have > dynamic interpreter loading feature [2], to build offline package. > > what do you think? > > [1] > https://github.com/apache/zeppelin/blob/master/zeppelin-interpreter/src/main/java/org/apache/zeppelin/dep/DependencyResolver.java > [2] https://issues.apache.org/jira/browse/ZEPPELIN-598 > > > On Fri, Jun 17, 2016 at 1:02 AM mina lee <mina...@apache.org> wrote: > >> Hi all! >> >> Zeppelin just started release process. Prior to creating release >> candidate I want to ask users' opinion about how you want it to be packaged. >> >> For the last release(0.5.6), we have released one binary package which >> includes all interpreters. >> The concern with providing one type of binary package is that package >> size will be quite big(~600MB). >> So I am planning to provide two binary packages: >> - zeppelin-0.6.0-bin-all.tgz (includes all interpreters) >> - zeppelin-0.6.0-bin-min.tgz (includes only most used interpreters) >> >> I am thinking about putting *spark(pyspark, sparkr, sql), python, jdbc, >> shell, markdown, angular* in minimized package. >> Could you give your opinion on whether these sets are enough, or some of >> them are ok to be excluded? >> >> Community's opinion will be helpful to make decision not only for 0.6.0 >> but also for 0.7.0 release since we are planning to provide only minimized >> package from 0.7.0 release. From the 0.7.0 version, interpreters those are >> not included in binary package will be able to use dynamic interpreter >> feature [1] which is in progress under [2]. >> >> Thanks, >> Mina >> >> [1] >> http://zeppelin.apache.org/docs/0.6.0-SNAPSHOT/manual/dynamicinterpreterload.html >> [2] https://github.com/apache/zeppelin/pull/908 >> >