Re: Starting to Prepare 0.8.0 RC1

2018-04-26 Thread Mina Lee
Nice! Let me know if you need any help

On Fri, 27 Apr 2018 at 12:17 PM Jun Kim  wrote:

> Awesome!
> 2018년 4월 27일 (금) 오전 10:15, Jongyoul Lee 님이 작성:
>
>> Sounds great!!
>>
>> On Fri, Apr 27, 2018 at 10:13 AM, Xiaohui Liu  wrote:
>>
>>> Fantastic!
>>>
>>> On Fri, 27 Apr 2018 at 9:12 AM, Jeff Zhang  wrote:
>>>
 Hi Folks,

 All the block issues are fixed, I am starting to prepare 0.8.0 RC1



>>
>>
>> --
>> 이종열, Jongyoul Lee, 李宗烈
>> http://madeng.net
>>
> --
> Taejun Kim
>
> Data Mining Lab.
> School of Electrical and Computer Engineering
> University of Seoul
>


Re: Trying to build Zeppelin, hitting the following error

2017-10-16 Thread Mina Lee
Hi Micheal,

have you tried to run `./dev/change_scala_version.sh 2.11` and add
`-Pscala-2.11` profile to mvn command?

When you build Zeppelin, maven doesn't use the scala version that is
pre-installed on your machine,
but it downloads scala version that is specified in active profile.
You will need to add `-Pscala-2.11` in mvn command to activate scala 2.11
since we use scala-2.10 by default.

You can find build command examples in Zeppelin website [1], hope this
helps.

Cheers,
Mina

[1]
https://zeppelin.apache.org/docs/latest/install/build.html#build-command-examples


On Tue, Oct 17, 2017 at 1:40 AM Michael Segel 
wrote:

> Hi,
>
> I’m trying to build 0.7.3 release of zeppeling on a Centos 7 box.
>
> I have /opt/scala and /opt/spark  setup and their bin directories in my
> path.
> (scala 2.11.11 and spark 2.1.1)
>
> I’m trying to build a basic release  ‘mvn clean package -DskipTests’  to
> start…
>
> I’m running in to the following error:
> [ERROR]
> /opt/zeppelin-0.7.3/zeppelin-display/src/main/scala/org/apache/zeppelin/display/angular/AbstractAngularElem.scala:25:
> error: object xml is not a member of package scala
> [INFO] import scala.xml._
> [INFO]  ^
>
>
> This occurs when trying to build the GUI.
>
> Any suggestions?
>
> Thx
>
> -Mike
>
>


[ANNOUNCE] Apache Zeppelin 0.7.3 released

2017-09-22 Thread Mina Lee
The Apache Zeppelin community is pleased to announce the availability of
the 0.7.3 release.

Zeppelin is a collaborative data analytics and visualization tool for
distributed, general-purpose data processing system such as Apache Spark,
Apache Flink, etc.

The community put significant effort into improving Apache Zeppelin since
the last release. 20 contributors provided 30+ patches
for improvements and bug fixes. More than 20+ issues have been resolved.

We encourage you to download the latest release from
http://zeppelin.apache.org/download.html

Release note is available at
http://zeppelin.apache.org/releases/zeppelin-release-0.7.3.html

We welcome your help and feedback. For more information on the project and
how to get involved, visit our website at http://zeppelin.apache.org/

Thank you all users and contributors who have helped to improve Apache
Zeppelin.

Regards,
The Apache Zeppelin community


[ANNOUNCE] Apache Zeppelin 0.7.2 released

2017-06-13 Thread Mina Lee
The Apache Zeppelin community is pleased to announce the availability of
the 0.7.2 release.

Zeppelin is a collaborative data analytics and visualization tool for
distributed, general-purpose data processing system such as Apache Spark,
Apache Flink, etc.

The community put significant effort into improving Apache Zeppelin since
the last release. 25 contributors provided 50+ patches
for improvements and bug fixes. More than 40+ issues have been resolved.

We encourage you to download the latest release from
http://zeppelin.apache.org/download.html

Release note is available at
http://zeppelin.apache.org/releases/zeppelin-release-0.7.2.html

We welcome your help and feedback. For more information on the project and
how to get involved, visit our website at http://zeppelin.apache.org/

Thank you all users and contributors who have helped to improve Apache
Zeppelin.

Regards,
The Apache Zeppelin community


[ANNOUNCE] Apache Zeppelin 0.7.0 released

2017-02-05 Thread Mina Lee
The Apache Zeppelin community is pleased to announce the availability of
the 0.7.0 release.

Zeppelin is a collaborative data analytics and visualization tool for
distributed, general-purpose data processing system such as Apache Spark,
Apache Flink, etc.

The community put significant effort into improving Apache Zeppelin since
the last release, focusing on multi-user support, pluggable visualization,
better interpreter support. More than 100+ contributors provided 700+
patches
for new features, improvements and bug fixes. More than 480+ issues have
been resolved.

We encourage you to download the latest release from
http://zeppelin.apache.org/download.html

Release note is available at
http://zeppelin.apache.org/releases/zeppelin-release-0.7.0.html

We welcome your help and feedback. For more information on the project and
how to get involved, visit our website at http://zeppelin.apache.org/

Thank you all users and contributors who have helped to improve Apache
Zeppelin.

Regards,
The Apache Zeppelin community


[VOTE] Release Apache Zeppelin 0.7.0 (RC4)

2017-02-02 Thread Mina Lee
I propose the following RC to be released for the Apache Zeppelin 0.7.0
release.

The commit id is df007f2284a09caa7c8b35f8b59d5f1993fe8b64 which is
corresponds to the tag v0.7.0-rc4:
*https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=shortlog;h=refs/tags/v0.7.0-rc4
*

The release archives (tgz), signature, and checksums are here
https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc4/

The release candidate consists of the following source distribution archive
zeppelin-0.7.0.tgz

In addition, the following supplementary binary distributions are provided
for user convenience at the same location
zeppelin-0.7.0-bin-all.tgz
zeppelin-0.7.0-bin-netinst.tgz

The maven artifacts are here
https://repository.apache.org/content/repositories/orgapachezeppelin-1027

You can find the KEYS file here:
https://dist.apache.org/repos/dist/release/zeppelin/KEYS

Release notes available at
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12336544=12316221

Vote will be open for next 72 hours (close at 01:00 5/Feb PST).

[ ] +1 approve
[ ] 0 no opinion
[ ] -1 disapprove (and reason why)


[CANCEL][VOTE] Release Apache Zeppelin 0.7.0 (RC3)

2017-01-25 Thread Mina Lee
There are several critical issues found during the rc3 verification.
I am canceling vote to include below bug fixes:
   - Cron scheduling doesn't work after certain number of running [1]
   - Table is not rendered in certain condition [2]
   - Livy not working in anonymous mode [3]
   - Interpreter install script throws "NoClassDefFoundError" [4]

[1] https://github.com/apache/zeppelin/pull/1941
[2] https://github.com/apache/zeppelin/pull/1942
[3] https://github.com/apache/zeppelin/pull/1938
[4] https://github.com/apache/zeppelin/pull/1943

On Wed, Jan 25, 2017 at 3:44 PM Jeff Zhang <zjf...@gmail.com> wrote:

> And seems ZEPPELIN-1982 also need to be included in 0.7
>
>
> Jeff Zhang <zjf...@gmail.com>于2017年1月25日周三 下午1:38写道:
>
> Please help also merge ZEPPELIN-2006 (
> https://github.com/apache/zeppelin/pull/1938) If there's another RC
>
>
> moon soo Lee <m...@apache.org>于2017年1月25日周三 上午11:22写道:
>
> Currently, cron scheduling feature is broken [1] and patch is available at
> [2].
> Shell we include this patch in 0.7.0 release?
>
> [1] http://issues.apache.org/jira/browse/ZEPPELIN-2009
> [2] https://github.com/apache/zeppelin/pull/1941
>
> On Tue, Jan 24, 2017 at 6:45 PM Prabhjyot Singh <prabhjyotsi...@apache.org>
> wrote:
>
> +1
>
> On Jan 24, 2017 6:25 PM, "Ahyoung Ryu" <ahyoung...@apache.org> wrote:
>
> +1
>
> On Tue, Jan 24, 2017 at 8:19 PM, Hyung Sung Shim <hss...@nflabs.com>
> wrote:
>
> +1
> 2017년 1월 24일 (화) 오전 1:36, Windy Qin <windyqinchaof...@163.com>님이 작성:
>
>
> +1
> On 2017-01-24 17:13 (+0800), Sora Lee <sora0...@zepl.com> wrote:
> > +1
> >
> > On Tue, Jan 24, 2017 at 3:09 PM Jongyoul Lee <jongy...@gmail.com> wrote:
> >
> > > +1 (binding)
> > >
> > > On Tue, Jan 24, 2017 at 2:43 PM, Mina Lee <mina...@apache.org> wrote:
> > >
> > > I propose the following RC to be released for the Apache Zeppelin 0.7.0
> > > release.
> > >
> > > The commit id is 48ad70e8c62975bdb00779bed5919eaca98c5b5d which is
> > > corresponds to the tag v0.7.0-rc3:
> > > *
> https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=48ad70e8c62975bdb00779bed5919eaca98c5b5d
> > > <
> https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=48ad70e8c62975bdb00779bed5919eaca98c5b5d
> >*
> > >
> > > The release archives (tgz), signature, and checksums are here
> > > https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc3/
> > >
> > > The release candidate consists of the following source distribution
> > > archive
> > > zeppelin-0.7.0.tgz
> > >
> > > In addition, the following supplementary binary distributions are
> provided
> > > for user convenience at the same location
> > > zeppelin-0.7.0-bin-all.tgz
> > > zeppelin-0.7.0-bin-netinst.tgz
> > >
> > > The maven artifacts are here
> > >
> https://repository.apache.org/content/repositories/orgapachezeppelin-1024
> > >
> > > You can find the KEYS file here:
> > > https://dist.apache.org/repos/dist/release/zeppelin/KEYS
> > >
> > > Release notes available at
> > >
> > >
> https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12336544=12316221
> > >
> > > Vote will be open for next 72 hours (close at 22:00 26/Jan PST).
> > >
> > > [ ] +1 approve
> > > [ ] 0 no opinion
> > > [ ] -1 disapprove (and reason why)
> > >
> > >
> > >
> > >
> > > --
> > > 이종열, Jongyoul Lee, 李宗烈
> > > http://madeng.net
> > >
> >
>
>
>


[VOTE] Release Apache Zeppelin 0.7.0 (RC3)

2017-01-23 Thread Mina Lee
I propose the following RC to be released for the Apache Zeppelin 0.7.0
release.

The commit id is 48ad70e8c62975bdb00779bed5919eaca98c5b5d which is
corresponds to the tag v0.7.0-rc3:
*https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=48ad70e8c62975bdb00779bed5919eaca98c5b5d
*

The release archives (tgz), signature, and checksums are here
https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc3/

The release candidate consists of the following source distribution archive
zeppelin-0.7.0.tgz

In addition, the following supplementary binary distributions are provided
for user convenience at the same location
zeppelin-0.7.0-bin-all.tgz
zeppelin-0.7.0-bin-netinst.tgz

The maven artifacts are here
https://repository.apache.org/content/repositories/orgapachezeppelin-1024

You can find the KEYS file here:
https://dist.apache.org/repos/dist/release/zeppelin/KEYS

Release notes available at
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12336544=12316221

Vote will be open for next 72 hours (close at 22:00 26/Jan PST).

[ ] +1 approve
[ ] 0 no opinion
[ ] -1 disapprove (and reason why)


[CANCEL][VOTE] Release Apache Zeppelin 0.7.0 (RC2)

2017-01-23 Thread Mina Lee
Thank you Vinay for reporting.
The PR#1647 [1] that disables run on select option change is merged
intentionally, but seems like it didn't consider report view.
I made a PR#1932 [2] to make it runnable by hitting enter after selection
so it can be run in report view too.

I am canceling rc2 vote, and will come with rc3 with PR#1932 [2],
PR#1933(livy job cancel bug fix) [3] merged.

[1] https://github.com/apache/zeppelin/pull/1647
[2] https://github.com/apache/zeppelin/pull/1932
[3] https://github.com/apache/zeppelin/pull/1933


On Tue, Jan 24, 2017 at 3:38 AM Felix Cheung <felixcheun...@hotmail.com>
wrote:

> I'm seeing the same behavior with the marital status drop down box on the
> intro notebook on netinst build.
>
> I think this could be a bigger problem.
>
>
> _
> From: Khalid Huseynov <khalid...@zepl.com<mailto:khalid...@zepl.com>>
> Sent: Sunday, January 22, 2017 8:08 PM
> Subject: Re: [VOTE] Release Apache Zeppelin 0.7.0 (RC2)
> To: <d...@zeppelin.apache.org<mailto:d...@zeppelin.apache.org>>
> Cc: <users@zeppelin.apache.org<mailto:users@zeppelin.apache.org>>
>
>
> I've just tried and was running fine when changing age field in report
> mode. However didn't run on change of marital status neither in report nor
> in default mode. Hopefully more people could verify it. Also as far as i
> know mentioned log isn't related to run feature.
>
> On Sun, Jan 22, 2017 at 1:46 PM, Vinay Shukla <vinayshu...@gmail.com
> <mailto:vinayshu...@gmail.com>> wrote:
> Downloaded
> https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc2/
>
> Then ran Basic Feature tutorial successfully and went to *report* mode and
> it does not re-run the paragraph when the marital or age field value is
> changed (even after shift +  enter)
>
>
> INFO [2017-01-22 13:36:21,924] ({qtp1622006612-76}
> NotebookServer.java[sendNote]:704) - New operation from 0:0:0:0:0:0:0:1 :
> 62990 : anonymous : GET_NOTE : 2A94M5J1Z
>
>  WARN [2017-01-22 13:36:21,926] ({qtp1622006612-76}
> GitNotebookRepo.java[revisionHistory]:157) - No Head found for 2A94M5J1Z,
> No HEAD exists and no explicit starting revision was specified
>
>  INFO [2017-01-22 13:36:26,223] ({qtp1622006612-14}
> NotebookServer.java[sendNote]:704) - New operation from 0:0:0:0:0:0:0:1 :
> 62990 : anonymous : GET_NOTE : 2A94M5J1Z
>
>  WARN [2017-01-22 13:36:26,224] ({qtp1622006612-14}
> GitNotebookRepo.java[revisionHistory]:157) - No Head found for 2A94M5J1Z,
> No HEAD exists and no explicit starting revision was specified
>
> Did anyone else try this?
>
> On Sun, Jan 22, 2017 at 2:04 AM, Hyung Sung Shim <hss...@nflabs.com
> <mailto:hss...@nflabs.com>> wrote:
>
> > +1
> > 2017? 1? 22? (?) ?? 12:54, Ahyoung Ryu <ahyoung...@apache.org ahyoung...@apache.org>>?? ??:
> >
> >> +1
> >>
> >> On Sun, Jan 22, 2017 at 5:52 PM, Jeff Zhang <zjf...@gmail.com zjf...@gmail.com>> wrote:
> >>
> >> > +1
> >> >
> >> >
> >> >
> >> > DuyHai Doan 
> >> > <doanduy...@gmail.com<mailto:doanduy...@gmail.com>>?2017?1?22???
> ??4:01???
> >> >
> >> >> +1
> >> >>
> >> >> On Sun, Jan 22, 2017 at 8:59 AM, Mina Lee <mina...@apache.org
> <mailto:mina...@apache.org>> wrote:
> >> >>
> >> >> I propose the following RC to be released for the Apache Zeppelin
> 0.7.0
> >> >> release.
> >> >>
> >> >> The commit id is d0ddf8fdff75b4229543f7ebfd5eb001f685382d which is
> >> >> corresponds to the tag v0.7.0-rc2:
> >> >> *https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=
> >> d0ddf8fdff75b4229543f7ebfd5eb001f685382d
> >> >> <https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=
> >> d0ddf8fdff75b4229543f7ebfd5eb001f685382d>*
> >> >>
> >> >> The release archives (tgz), signature, and checksums are here
> >> >> https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc2/
> >> >>
> >> >> The release candidate consists of the following source distribution
> >> >> archive
> >> >> zeppelin-0.7.0.tgz
> >> >>
> >> >> In addition, the following supplementary binary distributions are
> >> provided
> >> >> for user convenience at the same location
> >> >> zeppelin-0.7.0-bin-all.tgz
> >> >> zeppelin-0.7.0-bin-netinst.tgz
> >> >>
> >> >> The maven artifacts are here
> >> >> https://repository.apache.org/content/repositories/
> >> orgapachezeppelin-1023
> >> >>
> >> >> You can find the KEYS file here:
> >> >> https://dist.apache.org/repos/dist/release/zeppelin/KEYS
> >> >>
> >> >> Release notes available at
> >> >> https://issues.apache.org/jira/secure/ReleaseNote.jspa?
> >> >> version=12336544=12316221
> >> >>
> >> >> Vote will be open for next 72 hours (close at 23:59 24/Jan PST).
> >> >>
> >> >> [ ] +1 approve
> >> >> [ ] 0 no opinion
> >> >> [ ] -1 disapprove (and reason why)
> >> >>
> >> >>
> >> >>
> >>
> >
>
>
>
>


[VOTE] Release Apache Zeppelin 0.7.0 (RC1)

2017-01-21 Thread Mina Lee
Hi folks,

I propose the following RC to be released for the Apache Zeppelin 0.7.0
release.

The commit id is bcbe0855175f55eddd10b918303806eb5e5db091 which is
corresponds to the tag v0.7.0-rc1:
https://git-wip-us.apache.org/repos/asf?p=zeppelin.git;a=commit;h=bcbe0855175f55eddd10b918303806eb5e5db091

The release archives (tgz), signature, and checksums are here
https://dist.apache.org/repos/dist/dev/zeppelin/zeppelin-0.7.0-rc1/

The release candidate consists of the following source distribution archive
zeppelin-0.7.0.tgz

In addition, the following supplementary binary distributions are provided
for user convenience at the same location
zeppelin-0.7.0-bin-all.tgz
zeppelin-0.7.0-bin-netinst.tgz

The maven artifacts are here
https://repository.apache.org/content/repositories/orgapachezeppelin-1022

You can find the KEYS file here:
https://dist.apache.org/repos/dist/release/zeppelin/KEYS

Release notes available at
https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12336544=12316221

Vote will be open for next 72 hours (close at 3:30 am 24/Jan PST).

[ ] +1 approve
[ ] 0 no opinion
[ ] -1 disapprove (and reason why)


Re: [DISCUSS] Release package size

2017-01-20 Thread Mina Lee
Decision making taking more time than I expected and
I think this shouldn't be blocker for 0.7.0.

We can take more time deciding which interpreters can be included or
excluded.
Until then, I am just going to go with our current one: zeppelin-bin-all,
zeppelin-bin-netinst.

Moon's suggestion looks good too.
Here I summarized interpreter lists that can be included for each option:
 a. Min package includes interpreters, binary size less than 10MB
  > angular, bigquery, hdfs, kylin, livy, md, postgresql, python, sh
 b. Min package includes interpreters 5 or more JIRA issue created per
month.
  > Need to track. This can be overload for release process.
 c. Min package includes/exclude interpreter that community decide via
formal vote.
 > md, jdbc, spark (based on this mailing thread)



On Fri, Jan 20, 2017 at 5:57 PM moon soo Lee <m...@apache.org> wrote:

> Hi,
>
> I think we need to have some policy to decide which interpreter goes into
> zeppelin-bin-min package. And make applying that policy as a part of
> release process.
> Because i can not see any consistent rule except for "it seems" or "i
> guess". And i have no idea how i can explain if somebody ask 'why python is
> not in min package?' 'why xxx is not in min package?'.
>
> If we really want to min package, we must have a policy that gives
> everyone same expectation which goes to min package and which goes not.
> Once we agree on policy we can make it part of the release process.
>
> So, why don't we try define policy together? Here's some idea i can throw.
>
>  a. Min package includes interpreters, binary size less than 10MB
>  b. Min package includes interpreters 5 or more JIRA issue created per
> month.
>  c. Min package includes/exclude interpreter that community decide via
> formal vote.
>
> "10MB", "5 or more" they are number i just made up. We can change them to
> more reasonable numbers.
> Also a,b,c are possible examples. We can refine them, we can use only one,
> we can use all three, we can add more.
>
> My point is, we need to give everyone the same expectation which goes min
> package, which goes not.
> What do you think?
>
> Thanks,
> moon
>
> On Thu, Jan 19, 2017 at 12:47 AM Mina Lee <mina...@apache.org> wrote:
>
> Thank you for sharing your opinion guys.
>
> I like Eric's approach.
> We are planning to provide official docker managed by community.
> There is ongoing work [1] around it, I can focus on this after 0.7.0
> release.
>
> It seems that majority prefers binary package with top used interpreters
> such as spark, md, jdbc.
> I think we can gradually move to providing only netinst package once
> docker is ready.
> For upcoming 0.7.0 release, I'd like to distribute two binary packages:
>   - zeppelin-bin-min(spark, jdbc, md)
>   - zeppelin-bin-netinst(spark only)
>
> [1] https://github.com/apache/zeppelin/pull/1761
>
> Thanks,
> Mina
>
> On Thu, Jan 19, 2017 at 1:57 AM Jongyoul Lee <jongy...@gmail.com> wrote:
>
> I like to deploy netinst only. And it's good idea that Apache Zeppelin
> supports official docker image with all possible interpreters.
>
> On Wed, Jan 18, 2017 at 7:42 PM, Eric Pugh <
> ep...@opensourceconnections.com> wrote:
>
> Can I throw out an alternate approach?   I feel like the key value of the
> “-all” option is to simplify the life of someone who is new to Zeppelin.
>  If you’re a sophisticated Zeppelin user, then picking and choosing
> interpreters is easy, and you you grok why you want to do that….
>
> However, for myself, when I want to demo Zeppelin, I go straight to one of
> the Docker images, specifically
> https://github.com/dylanmei/docker-zeppelin because it bundles in
> everything.
>
> Would providing a similar Docker image on the “Get Zeppelin” page that
> bundles in all the dependencies and interpreters solve the “how do I try
> Zeppelin in 5 minutes” challenge?  The “Get Zeppelin” page is rather
> daunting page!
>
> Eric
>
>
> On Jan 18, 2017, at 12:00 AM, Mohit Jaggi <mohitja...@gmail.com> wrote:
>
>  Including ALL interpreters is not feasible, not due to download size as
> that is easily increased but because we wouldn't want to couple the release
> cycles as pointed out by Jeff. IMHO a few of the most popular ones should
> be included. Yes it is just one extra step but if a computer can do it why
> make a human suffer? :-)
> Re: spark-packages, Spark does include important and mature functionality
> in its assembly e.g. Csv parser was merged into core spark when it matured.
> I believe Z should do the same.
>
> Sent from my iPhone
>
> On Jan 17, 2017, at 8:05 PM, Jeff Zhang <zjf...@gmail.com> wrote:
>
>
>

Re: [DISCUSS] Release package size

2017-01-19 Thread Mina Lee
Thank you for sharing your opinion guys.

I like Eric's approach.
We are planning to provide official docker managed by community.
There is ongoing work [1] around it, I can focus on this after 0.7.0
release.

It seems that majority prefers binary package with top used interpreters
such as spark, md, jdbc.
I think we can gradually move to providing only netinst package once docker
is ready.
For upcoming 0.7.0 release, I'd like to distribute two binary packages:
  - zeppelin-bin-min(spark, jdbc, md)
  - zeppelin-bin-netinst(spark only)

[1] https://github.com/apache/zeppelin/pull/1761

Thanks,
Mina

On Thu, Jan 19, 2017 at 1:57 AM Jongyoul Lee <jongy...@gmail.com> wrote:

I like to deploy netinst only. And it's good idea that Apache Zeppelin
supports official docker image with all possible interpreters.

On Wed, Jan 18, 2017 at 7:42 PM, Eric Pugh <ep...@opensourceconnections.com>
wrote:

Can I throw out an alternate approach?   I feel like the key value of the
“-all” option is to simplify the life of someone who is new to Zeppelin.
 If you’re a sophisticated Zeppelin user, then picking and choosing
interpreters is easy, and you you grok why you want to do that….

However, for myself, when I want to demo Zeppelin, I go straight to one of
the Docker images, specifically
https://github.com/dylanmei/docker-zeppelin because
it bundles in everything.

Would providing a similar Docker image on the “Get Zeppelin” page that
bundles in all the dependencies and interpreters solve the “how do I try
Zeppelin in 5 minutes” challenge?  The “Get Zeppelin” page is rather
daunting page!

Eric


On Jan 18, 2017, at 12:00 AM, Mohit Jaggi <mohitja...@gmail.com> wrote:

 Including ALL interpreters is not feasible, not due to download size as
that is easily increased but because we wouldn't want to couple the release
cycles as pointed out by Jeff. IMHO a few of the most popular ones should
be included. Yes it is just one extra step but if a computer can do it why
make a human suffer? :-)
Re: spark-packages, Spark does include important and mature functionality
in its assembly e.g. Csv parser was merged into core spark when it matured.
I believe Z should do the same.

Sent from my iPhone

On Jan 17, 2017, at 8:05 PM, Jeff Zhang <zjf...@gmail.com> wrote:


Another thing I'd like to talk is that should we move most of interpreters
out of zeppelin project to somewhere else just like spark do for
spark-packages, 2 benefits:

1. Keep the zeppelin project much smaller
2. Each interpreter's improvements won't be blocked by the release of
zeppelin. Interpreters can has its own release cycle as long as
zeppelin-interpreter doesn't break the compatibility.

If it make sense, I can open another thread to discuss it.




Jun Kim <i2r@gmail.com>于2017年1月18日周三 上午11:55写道:

+1 for Jeff's idea! I also use the three interpreters mainly :)

2017년 1월 18일 (수) 오후 12:52, Jeff Zhang <zjf...@gmail.com>님이 작성:


How about also include markdown and jdbc interpreter if this won't cause
binary distribution much bigger ? I guess spark, markdown, and jdbc
interpreters are the top 3 interpreters in zeppelin.



Ahyoung Ryu <ahyoung...@apache.org>于2017年1月18日周三 上午11:33写道:

Thanks Mina always!
+1 for releasing only netinst package.

On Wed, Jan 18, 2017 at 12:29 PM, Prabhjyot Singh <prabhjyotsi...@apache.org
> wrote:

+1

I don't think it's a problem now, but if it keeps increasing then in the
subsequent releases we can ship Zeppelin with few interpreters, and mark
others as plugins that can be downloaded later with instructions with how
to configure.

On Jan 18, 2017 8:54 AM, "Jun Kim" <i2r@gmail.com> wrote:

+1

I think it won't be a problem if we notice it clear.
Maybe we can do that next to the download button here (
http://zeppelin.apache.org/download.html)
A message may be "NOTE: only spark interpreter included since 0.7.0. If you
want other interpreters, please see interpreter installation guide"

2017년 1월 18일 (수) 오후 12:14, Jeff Zhang <zjf...@gmail.com>님이 작성:


+1, we should also mention it in release note and in the 0.7 doc



Mina Lee <mina...@apache.org>于2017年1月18日周三 上午11:12写道:

Hi all,

Zeppelin is about to start 0.7.0 release process, I would like to discuss
about binary package distribution.

Every time we distribute new binary package, size of the
zeppelin-0.x.x-bin-all.tgz package is getting bigger:
   - zeppelin-0.6.0-bin-all.tgz: 506M
   - zeppelin-0.6.1-bin-all.tgz: 517M
   - zeppelin-0.6.2-bin-all.tgz: 547M
   - zeppelin-0.7.0-bin-all.tgz: 720M (Expected)

Mostly it is because the number of interpreters supported by zeppelin keeps
growing,
and there is high chance that we support more interpreters in the near
future.
So instead of asking apache infra team to increase limit,
I would like to suggest to have only zeppelin-0.7.0-bin-netinst.tgz, which
only includes spark interpreter from 0.7.0 release.
One concern is that users need one more step to i

[DISCUSS] Release package size

2017-01-17 Thread Mina Lee
Hi all,

Zeppelin is about to start 0.7.0 release process, I would like to discuss
about binary package distribution.

Every time we distribute new binary package, size of the
zeppelin-0.x.x-bin-all.tgz package is getting bigger:
   - zeppelin-0.6.0-bin-all.tgz: 506M
   - zeppelin-0.6.1-bin-all.tgz: 517M
   - zeppelin-0.6.2-bin-all.tgz: 547M
   - zeppelin-0.7.0-bin-all.tgz: 720M (Expected)

Mostly it is because the number of interpreters supported by zeppelin keeps
growing,
and there is high chance that we support more interpreters in the near
future.
So instead of asking apache infra team to increase limit,
I would like to suggest to have only zeppelin-0.7.0-bin-netinst.tgz, which
only includes spark interpreter from 0.7.0 release.
One concern is that users need one more step to install the interpreters
they use,
but I believe it can be done easily with single line of command [1].

FYI, attaching the link of similar discussion [2] we had last June in
mailing list.

Regards,
Mina

[1]
http://zeppelin.apache.org/docs/0.6.2/manual/interpreterinstallation.html#install-specific-interpreters

[2]
https://lists.apache.org/thread.html/4b54c034cf8d691655156e0cb647243180c57a6829d97aa3c085b63c@%3Cusers.zeppelin.apache.org%3E


Apache Zeppelin in Spain

2016-11-09 Thread Mina Lee
Hi,

I'd like to let you know that some of PMCs and committers will be in Spain
for the ApacheCon Big Data Europe 2016. We are planning to hold a small
Zeppelin hands-on session for beginners on 15th or 16th. And also we want
to help you to get involved in Zeppelin development if you have some
missing feature in your mind but don't know where and how to start. We will
post schedule on the white board or somewhere in Melia Sevilla so please
find us!

You are more than welcome to stop by, say hi and get stickers and t-shirts
:)

We will also have a small meetup in Madrid this Saturday(12th 19-22h). If
you are interested visit
https://www.meetup.com/Madrid-Apache-Zeppelin-Meetup/events/234853331/ and
RSVP.

Looking forward to seeing you guys!

Best,
Mina


Re: Why the deprecation of %dep?

2016-09-26 Thread Mina Lee
Hi,

Sorry for the inconvenience, I confirmed that %dep doesn't work on 0.6.1
but it does on master and branch-0.6 branch.
Since [1] all blockers for 0.6.2 have patch, hopefully we can start 0.6.2
release process this week.

[1] https://issues.apache.org/jira/browse/ZEPPELIN-1347


On Fri, Sep 23, 2016 at 12:29 AM Vitaliy_Zinchenko <
vitaliy.zinche...@gigaspaces.com> wrote:

> Here is ticket related to this issue
> https://issues.apache.org/jira/browse/ZEPPELIN-1466
>
> Does anybody know how to fix this issue?
>
>
>
> --
> View this message in context:
> http://apache-zeppelin-users-incubating-mailing-list.75479.x6.nabble.com/Why-the-deprecation-of-dep-tp3976p4201.html
> Sent from the Apache Zeppelin Users (incubating) mailing list mailing list
> archive at Nabble.com.
>


Re: Dependency jars not refreshed after interpreter restart in Zeppelin 0.6.1

2016-09-20 Thread Mina Lee
Hi Niek,

I tried to dig into the problem here, but I couldn't reproduce with
following step:
1. Add dependency '/my/path/a.jar' in GUI
2. Update '/my/path/a.jar' contents
3. go to interpreter page and click edit -> save

It would be helpful if you can point out what I missed to reproduce the
issue.

FYI, Spark interpreter process sets jars under local-repo/${interpreter_id}
to its classpath on start up so if you make changes under this directory
you will need to restart spark interpreter.

On Sat, Sep 3, 2016 at 11:35 AM tolomaus 
wrote:

> Hi,
>
> I just upgraded from zeppelin 0.5.6/spark 1.6.2 to zeppelin 0.6.1/spark
> 2.0.0 and after moving my application's jars %deps to the spark interpreter
> UI dependencies section I noticed that updated jars are not taken into
> account anymore. Instead Zeppelin continues to load the original versions
> of
> my jars.
>
> I see that zeppelin initially moves the jars to
> [zeppelin]/local-repo/2BXAQ6T44 and sends this location to spark. But even
> when I manually remove them there it still doesn't trigger a refresh.
>
> My temporary workaround is to have my CI system put the jars also to
> [zeppelin]/local-repo/2BXAQ6T44.
>
> I'm using a dedicated spark install in local mode.
>
> Regards,
>
> Niek.
>
>
>
>
> --
> View this message in context:
> http://apache-zeppelin-users-incubating-mailing-list.75479.x6.nabble.com/Dependency-jars-not-refreshed-after-interpreter-restart-in-Zeppelin-0-6-1-tp4035.html
> Sent from the Apache Zeppelin Users (incubating) mailing list mailing list
> archive at Nabble.com.
>


Re: Failed to build Zeppelin pulled from Master Branch

2016-09-14 Thread Mina Lee
Hi Afancy,
if you want to build with Scala 2.11 by using -Pscala-2.11 flag, you will
need to run `./dev/change_scala_version.sh 2.11` prior to running mvn
command. Scala dependent modules in Zeppelin have _2.10 suffix in artifact
id by default and running ./dev/change_scala_version.sh will change this
suffix to _2.11.


On Wed, Sep 14, 2016 at 10:01 AM afancy  wrote:

> Hello Folk,
>
> I am using this command "mvn -X clean package -Pbuild-distr -DskipTests
> -Pspark-2.0 -Phadoop-2.4 -Pyarn -Pscala-2.11 -Ppyspark  -Psparkr" to build
> the source code pulled from master branch, but got the following error. Any
> suggestion is appreciated if you encounter the same problem. Thanks a lot!
>
> /Afancy
>
>
> [INFO]
> 
> [INFO] Reactor Summary:
> [INFO]
> [INFO] Zeppelin ... SUCCESS [
>  3.144 s]
> [INFO] Zeppelin: Interpreter .. SUCCESS [
> 10.096 s]
> [INFO] Zeppelin: Zengine .. SUCCESS [
>  5.014 s]
> [INFO] Zeppelin: Display system apis .. SUCCESS [
> 13.960 s]
> [INFO] Zeppelin: Spark dependencies ... SUCCESS [
> 46.806 s]
> [INFO] Zeppelin: Spark  FAILURE [
>  0.087 s]
> [INFO] Zeppelin: Markdown interpreter . SKIPPED
> [INFO] Zeppelin: Angular interpreter .. SKIPPED
> [INFO] Zeppelin: Shell interpreter  SKIPPED
> [INFO] Zeppelin: Livy interpreter . SKIPPED
> [INFO] Zeppelin: HBase interpreter  SKIPPED
> [INFO] Zeppelin: PostgreSQL interpreter ... SKIPPED
> [INFO] Zeppelin: JDBC interpreter . SKIPPED
> [INFO] Zeppelin: File System Interpreters . SKIPPED
> [INFO] Zeppelin: Flink  SKIPPED
> [INFO] Zeppelin: Apache Ignite interpreter  SKIPPED
> [INFO] Zeppelin: Kylin interpreter  SKIPPED
> [INFO] Zeppelin: Python interpreter ... SKIPPED
> [INFO] Zeppelin: Lens interpreter . SKIPPED
> [INFO] Zeppelin: Apache Cassandra interpreter . SKIPPED
> [INFO] Zeppelin: Elasticsearch interpreter  SKIPPED
> [INFO] Zeppelin: BigQuery interpreter . SKIPPED
> [INFO] Zeppelin: Alluxio interpreter .. SKIPPED
> [INFO] Zeppelin: web Application .. SKIPPED
> [INFO] Zeppelin: Server ... SKIPPED
> [INFO] Zeppelin: Packaging distribution ... SKIPPED
> [INFO]
> 
> [INFO] BUILD FAILURE
> [INFO]
> 
> [INFO] Total time: 01:19 min
> [INFO] Finished at: 2016-09-14T09:15:38+02:00
> [INFO] Final Memory: 116M/918M
> [INFO]
> 
> [ERROR] Failed to execute goal on project zeppelin-spark_2.10: Could not
> resolve dependencies for project
> org.apache.zeppelin:zeppelin-spark_2.10:jar:0.7.0-SNAPSHOT: Failure to find
> org.apache.zeppelin:zeppelin-display_2.11:jar:0.7.0-SNAPSHOT in
> http://repository.apache.org/snapshots was cached in the local
> repository, resolution will not be reattempted until the update interval of
> apache.snapshots has elapsed or updates are forced -> [Help 1]
> org.apache.maven.lifecycle.LifecycleExecutionException: Failed to execute
> goal on project zeppelin-spark_2.10: Could not resolve dependencies for
> project org.apache.zeppelin:zeppelin-spark_2.10:jar:0.7.0-SNAPSHOT: Failure
> to find org.apache.zeppelin:zeppelin-display_2.11:jar:0.7.0-SNAPSHOT in
> http://repository.apache.org/snapshots was cached in the local
> repository, resolution will not be reattempted until the update interval of
> apache.snapshots has elapsed or updates are forced
> at
> org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.getDependencies(LifecycleDependencyResolver.java:221)
> at
> org.apache.maven.lifecycle.internal.LifecycleDependencyResolver.resolveProjectDependencies(LifecycleDependencyResolver.java:127)
> at
> org.apache.maven.lifecycle.internal.MojoExecutor.ensureDependenciesAreResolved(MojoExecutor.java:257)
> at
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:200)
> at
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:153)
> at
> org.apache.maven.lifecycle.internal.MojoExecutor.execute(MojoExecutor.java:145)
> at
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:116)
> at
> org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject(LifecycleModuleBuilder.java:80)
> at
> 

Re: Is it possible to have output that is mixed with multiple types (e.g. plain text + html)

2016-09-12 Thread Mina Lee
Zeppelin prints only one type of display system at one time. Since your
example doesn't start with magic keyword(such as %html %table) Zeppelin
will print it as plain text.
There are several rules which type gets priority than others, [1] Comment
on pr #164 will help you to understand how Zeppelin display system works.

[1] https://github.com/apache/zeppelin/pull/164#issuecomment-124231907

On Mon, Sep 12, 2016 at 10:29 AM, Jeff Zhang  wrote:

>
> Part of my interpreter output is plain text, and another part is html. Is
> it possible to display them correctly in zeppelin now ?
>
> e.g. the following code can not display both plain text and html together.
>
> println("Jeff, %html Hello")
>
>
>
> --
> Best Regards
>
> Jeff Zhang
>


Re: Issue loading dependency with SPARK_SUBMIT_OPTIONS w/ 0.6.1

2016-08-16 Thread Mina Lee
I also could reproduce it with Spark 2.0.0, but not with Spark 1.6.1.
If you want to use Zeppelin with Spark 2.0, one alternative you can try is
using [1] "dependencies" in GUI interpreter menu.

[1] http://zeppelin.apache.org/docs/0.6.1/manual/dependencymanagement.html

On Wed, Aug 17, 2016 at 1:46 AM Jeff Zhang  wrote:

> I can reproduce it in 0.6.1 & master branch, please file a ticket for
> that.
>
> On Wed, Aug 17, 2016 at 4:09 AM, Michael Sells  wrote:
>
>> Testing out 0.6.1 with Spark 2.0 and discovered the way we load
>> dependencies doesn't seem to be working with the new update.
>>
>> We pass new dependencies in via a SPARK_SUBMIT_OPTIONS environment
>> variable pass the following flags:
>> --packages com.databricks:spark-avro_2.11:3.0.0
>>
>> Now when I try to import it with:
>> import com.databricks.spark.avro._
>>
>> I get:
>> :25: error: object databricks is not a member of package com
>> import com.databricks.spark.avro._
>>
>> I checked the logs are there is no error retrieving the package. So it
>> seems to be something with the classpath.
>>
>> This works in 0.6.0. Any idea if something changed or if we're doing
>> something wrong? I tried this with a few internal packages as well and it
>> doesn't work with those either.
>>
>> Thanks,
>> Mike
>>
>>
>>
>>
>>
>>
>
>
> --
> Best Regards
>
> Jeff Zhang
>


[ANNOUNCE] Apache Zeppelin 0.6.1 released

2016-08-16 Thread Mina Lee
The Apache Zeppelin community is pleased to announce the availability of
the 0.6.1 release.

Zeppelin is a collaborative data analytics and visualization tool for
distributed, general-purpose data processing system such as Apache Spark,
Apache Flink, etc.

The community put significant effort into improving Apache Zeppelin since
the last release, focusing on supporting Scala 2.11 and Spark 2.0.
22 contributors provided 80+ patches for new features, improvements
and bug fixes. More than 60+ issues have been resolved.

We encourage download the latest release from
http://zeppelin.apache.org/download.html

Release note is available at
http://zeppelin.apache.org/releases/zeppelin-release-0.6.1.html

We welcome your help and feedback. For more information on the project and
how to get involved, visit our website at http://zeppelin.apache.org/

Thanks to all users and contributors who have helped to improve Apache
Zeppelin.

Regards,
The Apache Zeppelin community


Re: How to use R interpreter ?

2016-08-01 Thread mina lee
Have you tried to build it with -Psparkr profile? If you do, you will be
able to use r interpreter either with %r or %spark.r

On Mon, Aug 1, 2016 at 4:05 PM Jeff Zhang  wrote:

>
> I try to use R interpreter using spark.r, r and knitr, but all fails. I
> have already compiled zeppelin with profile sparkr enabled. But I don't see
> R interpreter in interpreter page, I also didn't see spark.r in spark
> interpreter group. Do I miss anything here ? BTW I am using master
>
>
> no spark.r
> [image: image.png]
>
> 3 ways to call r interpreter, but no one succeeded.
>
> [image: 2016-08-01_1458.png]
>
>
>
> The second and third error seems related with ZEPPELIN-1247
>
> https://issues.apache.org/jira/browse/ZEPPELIN-1247
>
>
>
>
>
> --
> Best Regards
>
> Jeff Zhang
>


Re: Spark-sql showing no table

2016-07-22 Thread mina lee
Hi Vikash,

if you want to render dataframe as a table with sqlContext, you will need
to run
z.show(tables)

On Thu, Jul 14, 2016 at 1:22 PM Vikash Kumar 
wrote:

> I am creating a sqlContext from exiting sc.
>
> Var tables = sqlContext.sql(“show tables”)
>
>
>
>
>
> *Thanks and regards,*
>
> *Vikash Kumar*
>
>
>
> *From:* Mohit Jaggi [mailto:mohitja...@gmail.com]
> *Sent:* Wednesday, July 13, 2016 10:24 PM
> *To:* users@zeppelin.apache.org
> *Subject:* Re: Spark-sql showing no table
>
>
>
> make sure you use a hive context
>
>
>
> On Jul 13, 2016, at 12:42 AM, Vikash Kumar 
> wrote:
>
>
>
> Hi all,
>
> I am using spark with scala to read phoenix tables and
> register as temporary table. Which I am able to do.
>
> After that when I am running query :
>
> %sql show tables
>
> Its giving all possible output, but when I am running same
> query with scala sqlContext ,then its not showing any table or neither
> giving any error.
>
> What should I do now because I also copied
> core-site.xml,hdfs-site.xml,hbase-site.xml and hive-site.xml inot zeppelin
> conf folder?
>
>
>


[ANNOUNCE] Apache Zeppelin 0.6.0 released

2016-07-06 Thread mina lee
The Apache Zeppelin community is pleased to announce the availability of
the 0.6.0 release.

Zeppelin is a collaborative data analytics and visualization tool for
distributed, general-purpose data processing system such as Apache Spark,
Apache Flink, etc.

The community put significant effort into improving Apache Zeppelin since
the last release, focusing on having new backend support, implementing
authentication and authorization for enterprise. More than 70+ contributors
provided 360+ patches for new features, improvements and bug fixes. More
than 200+ issues have been resolved.

We encourage download the latest release from
http://zeppelin.apache.org/download.html

Release note is available at
http://zeppelin.apache.org/releases/zeppelin-release-0.6.0.html


We welcome your help and feedback. For more information on the project and
how to get involved, visit our website at http://zeppelin.apache.org/

Thanks to all users and contributors who have helped to improve Apache
Zeppelin.

Regards,
The Apache Zeppelin community


Re: Zeppelin 0.6 Release Date

2016-07-05 Thread mina lee
Hi Kevin,

release process is almost done, we have only one process left which is
publishing release package to mirroring site. We are waiting for Apache
infra team's response for this so 0.6.0 release will be announced sooner or
later.

Thanks,
Mina


On Wed, Jul 6, 2016 at 3:00 AM Kevin Verhoeven 
wrote:

> Hi,
>
>
>
> Is there a release date for Zeppelin 0.6?
>
>
>
> Thanks,
>
>
>
> Kevin
>
>
>