Re: Exporting Spark paragraphs as Spark Applications

2017-01-04 Thread Luciano Resende
Some use cases discussed earlier on this thread:

https://www.mail-archive.com/dev@zeppelin.apache.org/msg06323.html

https://www.mail-archive.com/dev@zeppelin.apache.org/msg06332.html

On Wed, Jan 4, 2017 at 4:51 PM, Jianfeng (Jeff) Zhang <
jzh...@hortonworks.com> wrote:

>
> I don¹t understand why user want to export zeppelin note as spark
> application.
>
> If they want to trigger the running of spark app, why not use zeppelin¹s
> rest api for that. Even user export it as spark application, most of time
> in reality, they need to submit it through spark job server, so why not
> use zeppelin as a spark job server.
> And if the spark app fails, it is pretty hard to debug it, because the
> exporting tool has changed/restructured the source code.
>
>
> If this is a pretty large and complicated spark application, I don¹t think
> zeppelin is a proper tool for that, they¹d better to use IDE for that
> project.
>
> BTW, After https://github.com/apache/zeppelin/pull/1799, user can define
> the dependency between paragraphs, and they can run one whole note which
> contains different interpreters.
>
>
>
> Best Regard,
> Jeff Zhang
>
>
>
>
>
> On 1/5/17, 2:25 AM, "Luciano Resende"  wrote:
>
> >I have made some progress with a tool to handle the points discussed in
> >this thread. It's currently a command line tool and given a Zeppelin
> >notebook (note.json) it generates a Spark scala application, compiles it
> >using the compiler embedded in the scala sdk and then package all these
> >resources into a jar that works with spark-submit command.
> >
> >I would like to start prototyping the integration into the Zeppelin UI and
> >I was wondering if it would be ok to use the above jar as a dependency
> >(e.g. from a maven release) and integrate into zeppelin...
> >
> >Thoughts ?
> >
> >
> >On Mon, Sep 19, 2016 at 7:47 AM, Sourav Mazumder <
> >sourav.mazumde...@gmail.com> wrote:
> >
> >> To Moon's point, This is what my vision is around this feature -
> >>
> >> 1. Use should be able to package 1, more than one, all of the
> >>paragraphs in
> >> a Notebook to create a Jar file which can be used with Spark-Submit.
> >>
> >> 2. The tool should automatically remove the all the interactive
> >>statements
> >> like print, show etc.
> >>
> >> 3. The tool should automatically create a Main class in addition to the
> >>jar
> >> file(s) which will internally call the respective jar. User can then
> >>change
> >> this main class if needed for parameterization through Args.
> >>
> >> Regards,
> >> Sourav
> >>
> >> On Mon, Sep 19, 2016 at 7:33 AM, Sourav Mazumder <
> >> sourav.mazumde...@gmail.com> wrote:
> >>
> >> > I am also pretty much for this.
> >> >
> >> > I have got the similar request from each and every people/group who I
> >> > showcased Zeppelin.Regards,
> >> > Sourav
> >> >
> >> > On Fri, Sep 16, 2016 at 8:06 PM, moon soo Lee 
> wrote:
> >> >
> >> >> Hi Luciano,
> >> >>
> >> >> I've also got a lot of questions about "Productize the notebook"
> >>every
> >> >> time
> >> >> i meet users use Zeppelin in their work.
> >> >>
> >> >> I think it's actually about two different problems that Zeppelin
> >>need to
> >> >> address.
> >> >>
> >> >> *1) Provide way that interactive notebook becomes part of production
> >> data
> >> >> pipeline.*
> >> >>
> >> >> Although Zeppelin does have quite convenient cron-like scheduler for
> >> each
> >> >> Note, built-in cron scheduler is not ready for serious use in the
> >> >> production. Because it lacks some features like actions after
> >> >> success/fail,
> >> >> fault-tolerance, history, and so on. I think community is working on
> >> >> improving it, and it's going to take some time.
> >> >>  Meanwhile, any external enterprise level job scheduler can run Note
> >>or
> >> >> Paragraph via REST api. But we don't have any guide and examples for
> >>it,
> >> >> what are the REST APIs user can use for this purpose, and how to use
> >> them
> >> >> in various cases (e.g. with authentication on, dynamic form
> >>parameters,
> >> >> etc). I think a lot of things need to be improved to make zeppelin
> >> easier
> >> >> to be part of production pipeline.
> >> >>
> >> >> *2) Provide stable way of run spark paragraphs.*
> >> >>
> >> >> Another barrier of using notebook in production pipeline is Scala
> >>REPL
> >> in
> >> >> SparkInterpreter. SparkInterpreter uses Scala REPL to provide
> >> interactive
> >> >> scala session and Scala REPL will eventually hit OOME as it compiles
> >>and
> >> >> runs statements. Current workaround in zeppelin is cron-scheduler
> >>inside
> >> >> of
> >> >> notebook has checkbox that can restart the Note after scheduler runs
> >>it.
> >> >> Of course that option does not apply when external scheduler runs job
> >> >> through REST api.
> >> >>
> >> >> I think what Luciano suggesting, "Export Spark Paragraph as Spark
> >> >> application" is interesting. If Spark Paragraphs can be easily
> >>packaged
> >> >> into jar (spark 

[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
Sorry, I miss your last reply. Do you mean yarn-client mode works for you 
in spark ?
I use the following command to launch pyspark and get the error as 
following:

Launch pyspark ( I am using spark 2.1.0)
```
bin/pyspark --packages 
com.datastax.spark:spark-cassandra-connector_2.10:1.6.2,TargetHolding:pyspark-cassandra:0.3.5
 --exclude-packages org.slf4j:slf4j-api --master yarn-client
```

Fail to import pyspark_cassandra
```
>>> import pyspark_cassandra
Traceback (most recent call last):
  File "", line 1, in 
ImportError: No module named pyspark_cassandra
```



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1847: [MINOR] Restore paragraph control button margin...

2017-01-04 Thread minahlee
GitHub user minahlee opened a pull request:

https://github.com/apache/zeppelin/pull/1847

[MINOR] Restore paragraph control button margin style

### What is this PR for?
Restore paragraph control button margin style.

### What type of PR is it?
Hot Fix

### What is the Jira issue?
none

### How should this be tested?

### Screenshots (if appropriate)
**Before**
![screen shot 2017-01-05 at 4 02 09 
pm](https://cloud.githubusercontent.com/assets/8503346/21671682/59473572-d360-11e6-9a99-cd5766516d7f.png)


**After**
![screen shot 2017-01-05 at 3 58 43 
pm](https://cloud.githubusercontent.com/assets/8503346/21671609/ed543c34-d35f-11e6-8158-cb8dcd1d6b1a.png)


### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/minahlee/zeppelin minor/fix-action-margin

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1847.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1847


commit 2730695eb14f7243e869ad1cb700037d85655bd9
Author: Mina Lee 
Date:   2017-01-05T06:57:45Z

Restore change of paragraph control button margin style




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1809: ZEPPELIN-1866. No meaningful exception for pig ...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1809


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread 1ambda
Github user 1ambda commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
1. I read and replied before.

> Q. README shows that user can use --py-files
> A. Users cannot benefit from --packages. They need to download, find 
location of all transitive deps and provide the paths to --py-files

And even in spark, we can use `--packages` in local, mesos-client. Why do 
you think zeppelin should't do?

2. I tested this PR in yarn-client and it works. How did you test this PR 
in yarn-client?

>  since it doesn't resolve the yarn-client mode 

Could you tell me your env? 

- how did you build (command, env)
- zeppelin, yarn, spark versions.





---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1797: [ZEPPELIN-1854] make Per User mode in interpret...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1797


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1797: [ZEPPELIN-1854] make Per User mode in interpreter sett...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1797
  
Will merge it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1797: [ZEPPELIN-1854] make Per User mode in interpreter sett...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1797
  
@prabhjyotsingh I've misunderstood something. LGTM


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doe...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1843


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't wo...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1843
  
LGTM. Will merge it


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't wo...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1843
  
Tested and it works as expected!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1846: ZEPPELIN-1770. Restart only the client user's i...

2017-01-04 Thread zjffdu
GitHub user zjffdu opened a pull request:

https://github.com/apache/zeppelin/pull/1846

ZEPPELIN-1770. Restart only the client user's interpreter when restarting 
interpreter setting


### What is this PR for?
This PR would only restart the trigger user's interpreter rather than all 
the interpreter. So that restarting won't affect other users. 


### What type of PR is it?
[Improvement]

### Todos
* [ ] - Task

### What is the Jira issue?
* https://issues.apache.org/jira/browse/ZEPPELIN-1770

### How should this be tested?
Tested manually.

### Screenshots (if appropriate)

### Questions:
* Does the licenses files need update? No
* Is there breaking changes for older versions? No
* Does this needs documentation? No


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-1770

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1846.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1846


commit 903322f97f4b719abfd13bfb9e6baeda1da52493
Author: Jeff Zhang 
Date:   2016-12-14T07:04:20Z

ZEPPELIN-1770. Restart only the client user's interpreter when restarting 
interpreter setting




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1845: [ZEPPELIN-1906] Use multiple InterpreterResult ...

2017-01-04 Thread prabhjyotsingh
GitHub user prabhjyotsingh opened a pull request:

https://github.com/apache/zeppelin/pull/1845

[ZEPPELIN-1906] Use multiple InterpreterResult for displaying multiple JDBC 
queries

### What is this PR for?
Use multiple InterpreterResult for displaying multiple JDBC queries. 
IMO since other sql editors allows to execute multiple sql separated with 
";" and ours display mechanism being more powerful, hence, it should also allow 
the same.

### What type of PR is it?
[Improvement]


### What is the Jira issue?
* [ZEPPELIN-1906](https://issues.apache.org/jira/browse/ZEPPELIN-1906)

### How should this be tested?
Try running following in a paragraph and check for output. 

```
%jdbc
create table test_temp_table (id int);
select column_name, data_type, character_maximum_length from 
INFORMATION_SCHEMA.COLUMNS where table_name = 'test_temp_table';
SELECT table_name FROM information_schema.tables WHERE table_schema = 
'public';
drop table test_temp_table;
SELECT table_name FROM information_schema.tables WHERE table_schema = 
'public';
```

### Screenshots (if appropriate)

### Questions:
* Does the licenses files need update? N/A
* Is there breaking changes for older versions? N/A
* Does this needs documentation? N/A


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/prabhjyotsingh/zeppelin ZEPPELIN-1906

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1845.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1845


commit f5ab79687126849778809c9f1851171bacc52e15
Author: Prabhjyot Singh 
Date:   2017-01-05T05:48:29Z

Use multiple InterpreterResult for displaying multiple JDBC queries




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1844: [ZEPPELIN-1825] Use versioned notebook storage ...

2017-01-04 Thread khalidhuseynov
GitHub user khalidhuseynov opened a pull request:

https://github.com/apache/zeppelin/pull/1844

[ZEPPELIN-1825] Use versioned notebook storage by default 

### What is this PR for?
Due to recent work on versioned notes under ZEPPELIN-1044, we can use by 
default Git storage which extends local storage. so the behaviour will be same 
as before with additional ability to use versioning out of box without 
configuring it.


### What type of PR is it?
Improvement |  Refactoring

### Todos
* [x] - change VFS to Git

### What is the Jira issue?
[ZEPPELIN-1825](https://issues.apache.org/jira/browse/ZEPPELIN-1825)

### How should this be tested?
start zeppelin and by default be able to use version control (if git client 
installed)

### Screenshots (if appropriate)

### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/khalidhuseynov/incubator-zeppelin 
default-git-repo

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1844.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1844


commit cb9e9fb394851ae6028003bdf6d9f7af6a7b4ec1
Author: Khalid Huseynov 
Date:   2017-01-05T05:24:50Z

modify default storage to git repo

commit 5edd5e0a321c10b94100ff9dcdca67e35bf5ac2c
Author: Khalid Huseynov 
Date:   2017-01-05T05:32:01Z

fix in zeppelin-site.xml




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (ZEPPELIN-1906) Use multiple InterpreterResult for displaying multiple JDBC queries

2017-01-04 Thread Prabhjyot Singh (JIRA)
Prabhjyot Singh created ZEPPELIN-1906:
-

 Summary: Use multiple InterpreterResult for displaying multiple 
JDBC queries
 Key: ZEPPELIN-1906
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1906
 Project: Zeppelin
  Issue Type: Bug
  Components: Interpreters
Reporter: Prabhjyot Singh
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
As I said before why not using `--py-files`, I check the repository of 
pyspark-cassandra. 
https://github.com/TargetHolding/pyspark-cassandra

README shows that user can use `--py-files`

```
spark-submit \
--jars /path/to/pyspark-cassandra-assembly-.jar \
--driver-class-path /path/to/pyspark-cassandra-assembly-.jar \
--py-files /path/to/pyspark-cassandra-assembly-.jar \
--conf spark.cassandra.connection.host=your,cassandra,node,names \
--master spark://spark-master:7077 \
yourscript.py
```



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread 1ambda
Github user 1ambda commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
@zjffdu 

>  since it doesn't resolve the yarn-client mode

1. PySpark also doens't support extending PYTHONPATH in yarn-client. 
2. You are saying this is not right fix repeatedly without providing any 
other idea. So let me ask

- How you can load  pyspark-cassandra using packages as described in their 
README.md in local, mesos-client mode.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't wo...

2017-01-04 Thread 1ambda
Github user 1ambda commented on the issue:

https://github.com/apache/zeppelin/pull/1843
  
@jongyoul Thanks for opinion :) It works well. I tested like

1. enable shiro
2. add permission per user into a note to check other users can execute 
clear all or not


![1905-in-auth](https://cloud.githubusercontent.com/assets/4968473/21669178/cabc7bc2-d34c-11e6-82ca-1a0cd5119ce0.gif)




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
I still think this is not a correct fix since it doesn't resolve the 
yarn-client mode which I believe most of users use this mode. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1797: [ZEPPELIN-1854] make Per User mode in interpreter sett...

2017-01-04 Thread prabhjyotsingh
Github user prabhjyotsingh commented on the issue:

https://github.com/apache/zeppelin/pull/1797
  
@jongyoul I didn't get you on this. I haven't changed the order, only  
`ng-if="ticket.ticket !== 'anonymous' && ticket.roles !== '[]'"` changed to 
`ng-if="ticket.principal !== 'anonymous'"`. So as, even if there is no role 
assigned to user, it can have option to instantiate an interpreter "Per user".


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1831: [ZEPPELIN-1883] Can't import spark submitted packages ...

2017-01-04 Thread 1ambda
Github user 1ambda commented on the issue:

https://github.com/apache/zeppelin/pull/1831
  
Any update on this?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1839: [WIP] ZEPPELIN-1903. ZeppelinContext can not display p...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1839
  
CI is failed, working on it. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1842: [WIP] [ZEPPELIN-1619] Load js package as a plugin visu...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1842
  
@Leemoonsoo I like this idea to show new menu for helium but how about 
creating a new module like zeppelin-helium? It would be more helpful to 
understand whole structure. What do you think of it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't wo...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1843
  
@1ambda Do you check the personalized mode as well?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1795: ZEPPELIN-1851. LazyOpenInterpreter would open i...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1795


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1807: ZEPPELIN-1861 Support custom interpreter.sh scr...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1807


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1813: [ZEPPELIN-1868]fix to not shows up the login button on...

2017-01-04 Thread AhyoungRyu
Github user AhyoungRyu commented on the issue:

https://github.com/apache/zeppelin/pull/1813
  
LGTM. Merge if there are no more comments  


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't wo...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1843
  
@1ambda Thanks for fixing this bug and detailed explain!
Let me comment after test it :)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1816: [MINOR] Fix broken CSS for plain text paragraph...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1816


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1822: [ZEPPELIN-1789] Make exported data file name mo...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1822


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1797: [ZEPPELIN-1854] make Per User mode in interpreter sett...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1797
  
@prabhjyotsingh When I design this feature, I've researched all possible 
combination for it and I decide to choose "Per note" mode first because I 
thought we should have the first criterium both of "Per note" and "Per user". 
Thus if you want to choose "Per user" at first, I think we should assume "Per 
note" is set to "Shared" and users cannot add "Per note" more anymore. Do you 
agree? If you also think like it, This PR looks good to me.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1840: [ZEPPELIN-1730, 1587] add spark impersonation through ...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1840
  
@khalidhuseynov  Have you try it in secured cluster ? IIRC, `--proxy-user` 
can not work with `--principal` & `--keytab` together, that means in secured 
cluster, user have to run `kinit` instead of using `--principal` & `--keytab'. 
This might not be user expect. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Re: Exporting Spark paragraphs as Spark Applications

2017-01-04 Thread Jianfeng (Jeff) Zhang

I don¹t understand why user want to export zeppelin note as spark
application. 

If they want to trigger the running of spark app, why not use zeppelin¹s
rest api for that. Even user export it as spark application, most of time
in reality, they need to submit it through spark job server, so why not
use zeppelin as a spark job server.
And if the spark app fails, it is pretty hard to debug it, because the
exporting tool has changed/restructured the source code.
 

If this is a pretty large and complicated spark application, I don¹t think
zeppelin is a proper tool for that, they¹d better to use IDE for that
project.

BTW, After https://github.com/apache/zeppelin/pull/1799, user can define
the dependency between paragraphs, and they can run one whole note which
contains different interpreters.
 


Best Regard,
Jeff Zhang





On 1/5/17, 2:25 AM, "Luciano Resende"  wrote:

>I have made some progress with a tool to handle the points discussed in
>this thread. It's currently a command line tool and given a Zeppelin
>notebook (note.json) it generates a Spark scala application, compiles it
>using the compiler embedded in the scala sdk and then package all these
>resources into a jar that works with spark-submit command.
>
>I would like to start prototyping the integration into the Zeppelin UI and
>I was wondering if it would be ok to use the above jar as a dependency
>(e.g. from a maven release) and integrate into zeppelin...
>
>Thoughts ?
>
>
>On Mon, Sep 19, 2016 at 7:47 AM, Sourav Mazumder <
>sourav.mazumde...@gmail.com> wrote:
>
>> To Moon's point, This is what my vision is around this feature -
>>
>> 1. Use should be able to package 1, more than one, all of the
>>paragraphs in
>> a Notebook to create a Jar file which can be used with Spark-Submit.
>>
>> 2. The tool should automatically remove the all the interactive
>>statements
>> like print, show etc.
>>
>> 3. The tool should automatically create a Main class in addition to the
>>jar
>> file(s) which will internally call the respective jar. User can then
>>change
>> this main class if needed for parameterization through Args.
>>
>> Regards,
>> Sourav
>>
>> On Mon, Sep 19, 2016 at 7:33 AM, Sourav Mazumder <
>> sourav.mazumde...@gmail.com> wrote:
>>
>> > I am also pretty much for this.
>> >
>> > I have got the similar request from each and every people/group who I
>> > showcased Zeppelin.Regards,
>> > Sourav
>> >
>> > On Fri, Sep 16, 2016 at 8:06 PM, moon soo Lee  wrote:
>> >
>> >> Hi Luciano,
>> >>
>> >> I've also got a lot of questions about "Productize the notebook"
>>every
>> >> time
>> >> i meet users use Zeppelin in their work.
>> >>
>> >> I think it's actually about two different problems that Zeppelin
>>need to
>> >> address.
>> >>
>> >> *1) Provide way that interactive notebook becomes part of production
>> data
>> >> pipeline.*
>> >>
>> >> Although Zeppelin does have quite convenient cron-like scheduler for
>> each
>> >> Note, built-in cron scheduler is not ready for serious use in the
>> >> production. Because it lacks some features like actions after
>> >> success/fail,
>> >> fault-tolerance, history, and so on. I think community is working on
>> >> improving it, and it's going to take some time.
>> >>  Meanwhile, any external enterprise level job scheduler can run Note
>>or
>> >> Paragraph via REST api. But we don't have any guide and examples for
>>it,
>> >> what are the REST APIs user can use for this purpose, and how to use
>> them
>> >> in various cases (e.g. with authentication on, dynamic form
>>parameters,
>> >> etc). I think a lot of things need to be improved to make zeppelin
>> easier
>> >> to be part of production pipeline.
>> >>
>> >> *2) Provide stable way of run spark paragraphs.*
>> >>
>> >> Another barrier of using notebook in production pipeline is Scala
>>REPL
>> in
>> >> SparkInterpreter. SparkInterpreter uses Scala REPL to provide
>> interactive
>> >> scala session and Scala REPL will eventually hit OOME as it compiles
>>and
>> >> runs statements. Current workaround in zeppelin is cron-scheduler
>>inside
>> >> of
>> >> notebook has checkbox that can restart the Note after scheduler runs
>>it.
>> >> Of course that option does not apply when external scheduler runs job
>> >> through REST api.
>> >>
>> >> I think what Luciano suggesting, "Export Spark Paragraph as Spark
>> >> application" is interesting. If Spark Paragraphs can be easily
>>packaged
>> >> into jar (spark application) that can be one of way to address 1) and
>> 2).
>> >> In case of user already have stable way to schedule spark application
>> jar.
>> >>
>> >> Actually, Flink interactive shell works in similar way internally as
>>far
>> >> as
>> >> i know. i.e. package compiled class into jar and submit.
>> >>
>> >> One idea for prototyping is,
>> >> How about make a interpreter inside of spark interpreter group, say
>>it's
>> >> %spark.build or some better name.
>> >>
>> >> And if user runs some command like
>> >>
>> >> 

[GitHub] zeppelin pull request #1841: [HOTFIX] fix: DON'T run npm clean in maven clea...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1841


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1843: [ZEPPELIN-1905] fix: "Clear All Paragraphs" doe...

2017-01-04 Thread 1ambda
GitHub user 1ambda opened a pull request:

https://github.com/apache/zeppelin/pull/1843

[ZEPPELIN-1905] fix: "Clear All Paragraphs" doens't work in a note

### What is this PR for?

Fixed **Clear all paragraphs** button in a note (I attached a screenshot)

### What type of PR is it?
[Bug Fix]

### Todos

Nothing

### What is the Jira issue?

[ZEPPELIN-1905](https://issues.apache.org/jira/browse/ZEPPELIN-1905)

### How should this be tested?

1. Open a note
2. Run some paragraphs.
3. Click the **Run all paragraphs** button.

### Screenshots (if appropriate)


![clear-all-output-bug](https://cloud.githubusercontent.com/assets/4968473/21660976/391d5010-d315-11e6-9ad0-c4feff32eee6.gif)

### Questions:
* Does the licenses files need update? - NO
* Is there breaking changes for older versions? - NO
* Does this needs documentation? - NO


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/1ambda/zeppelin 
ZEPPELIN-1905/fix-clear-all-paras

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1843.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1843


commit 7f95b9bbab389a860d7618d89c5dca6a4141ccf7
Author: 1ambda <1am...@gmail.com>
Date:   2017-01-04T21:59:11Z

fix: Set note content

commit 1f9257e74642845393a418f7bb430282215af83b
Author: 1ambda <1am...@gmail.com>
Date:   2017-01-04T22:00:29Z

fix: Calling isFocused of null (TypeError)

TypeError: Cannot read property 'isFocused' of null
at ChildScope.ParagraphCtrl.$scope.scrollToCursor 
(paragraph.controller.js:718)
at paragraph.controller.js:708
at Scope.$broadcast (angular.js:17143)
at paragraph.controller.js:1052




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (ZEPPELIN-1905) BUG: "Clear all paragraphs" doens't work in Note

2017-01-04 Thread Hoon Park (JIRA)
Hoon Park created ZEPPELIN-1905:
---

 Summary: BUG: "Clear all paragraphs" doens't work in Note
 Key: ZEPPELIN-1905
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1905
 Project: Zeppelin
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Hoon Park


{{Clear All Paragraphs}} doesn't work due to invalid frontend imp. 

- {{NotebookServer.java}} sends the {{NOTE}} websocket message
- But {{notebook.controller.js}} doens't apply the result properly since it has 
invalid impl

{code}
  $scope.$on('setNoteContent', function(event, note) {
  

  if ($scope.note === null) {
$scope.note = note;
  } else {
$scope.note.config.personalizedMode = note.config.personalizedMode;
  }
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] zeppelin issue #1840: [ZEPPELIN-1730, 1587] add spark impersonation through ...

2017-01-04 Thread khalidhuseynov
Github user khalidhuseynov commented on the issue:

https://github.com/apache/zeppelin/pull/1840
  
this is ready for review. @prabhjyotsingh plz help review as original 
author, also @zjffdu @astroshim @Leemoonsoo as followup from #1566. CI failure 
in first profile is irrelevant and due to rat problem under 
[ZEPPELIN-1850](https://issues.apache.org/jira/browse/ZEPPELIN-1850)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1842: [WIP] [ZEPPELIN-1619] Load js package as a plug...

2017-01-04 Thread Leemoonsoo
GitHub user Leemoonsoo opened a pull request:

https://github.com/apache/zeppelin/pull/1842

[WIP] [ZEPPELIN-1619] Load js package as a plugin visualization

### What is this PR for?
Current helium plugin application api (experimental) requires create 
library in java class, and need to create both backend / frontend code in the 
package. Which is good if your plugin requires both frontend and backend code 
running.

However, when user just want to make new visualization which completely 
runs on front-end side in javascript, creating helium application in java 
project and taking care of backend code can be bit of overhead and obstacle for 
javascript developers.

This PR adds capability to load pure javascript package as a visualization.

### how it works

1. create (copy, download) 'helium package json' (e.g. 
`zeppelin-examples/zeppelin-example-horizontalbar/zeppelin-example-horizontalbar.json`)
 file into `ZEPPELIN_HOME/helium` directory.
  The json file point visualization js package in npm repository or local 
file system in `artifact` field.   
  `type` field in the json file need to be `VISUALIZATION`

2. Go to helium GUI menu. (e.g. http://localhost:8080/#/helium).
  The menu will list all available packages. 

![image](https://cloud.githubusercontent.com/assets/1540981/21654974/bda34abe-d26c-11e6-8d16-db482faef9a2.png)
 
3. click 'enable' in any package want to use. 
Once a visualization package is enabled, `HeliumVisualizationFactory` will 
collect all enabled visualizations and create js bundle on the fly.

4. js bundle will be loaded on notebook and additional visualization 
becomes available

![image](https://cloud.githubusercontent.com/assets/1540981/21655057/27d61740-d26d-11e6-88f2-02c653e102c6.png)


### Programming API to create new plugin visualization.

Simply extends 
[visualization.js](https://github.com/apache/zeppelin/blob/master/zeppelin-web/src/app/visualization/visualization.js)
 and overrides some methods, such as

```
  /**
   * get transformation
   */
  getTransformation() {
// override this
  };

  /**
   * Method will be invoked when data or configuration changed
   */
  render(tableData) {
// override this
  };

  /**
   * Refresh visualization.
   */
  refresh() {
// override this
  };

  /**
   * method will be invoked when visualization need to be destroyed.
   * Don't need to destroy this.targetEl.
   */
  destroy() {
// override this
  };

  /**
   * return {
   *   template : angular template string or url (url should end with 
.html),
   *   scope : an object to bind to template scope
   * }
   */
  getSetting() {
// override this
  };
```

This is exactly the same api that built-in visualization uses.

an example implementation included 
`zeppelin-examples/zeppelin-example-horizontalbar/horizontalbar.js`.
Actually [all built-in 
visualizations](https://github.com/apache/zeppelin/tree/master/zeppelin-web/src/app/visualization/builtins)
 are example


### Packaging and publishing visualization

Each visualization will need `package.json` file (e.g. 
`zeppelin-examples/zeppelin-example-horizontalbar/package.json`) to be packaged.
Package can be published in npm repository or package can be deployed to 
the local filesystem.

`zeppelin-examples/zeppelin-example-horizontalbar/` is an example package 
that is deployed in the local filesystem

### Future work

Development mode is not implemented here. Currently, developer need to 
create a locally deployed package. And click 'disable' -> 'enable' package in 
helium gui menu everytime developer made code change, so Zeppelin can bundle 
and reload updated code and display.

There must be faster way to apply code change in plugin visualization 
development mode. This can be the future work.


Any feedback would be appreciated!


### What type of PR is it?
Feature

### Todos
* [x] - Load plugin visualization js package on runtime
* [ ] - Make the feature works in zeppelin Binary package
* [ ] - Show loading indicator while 'enable' / 'disable' package
* [ ] - Add document
* [ ] - Add license of new dependency

### What is the Jira issue?
https://issues.apache.org/jira/browse/ZEPPELIN-1619

### How should this be tested?
Build Zeppelin with `-Pexamples` flag. That'll install example 
visualization package `horizontalbar`.
You'll able to select `horizontalbar` along with other built-in 
visualizations

### Questions:
* Does the licenses files need update? yes
* Is there breaking changes for older versions? no
* Does this needs documentation? yes


You can 

[GitHub] zeppelin pull request #1841: [ZEPPELIN-1850] fix: DON'T run npm clean in mav...

2017-01-04 Thread 1ambda
GitHub user 1ambda opened a pull request:

https://github.com/apache/zeppelin/pull/1841

[ZEPPELIN-1850] fix: DON'T run npm clean in maven clean phase (CI related)

### What is this PR for?

maven clean executes `npm run clean` after #1824. but running npm commands 
requires installing, it's invalid. We can't execute `npm run clean`  removing 
it. 

**This PR resolve CI break** (see broken RAT build profiles)

- https://travis-ci.org/apache/zeppelin/builds/188895921
- https://travis-ci.org/apache/zeppelin/builds/188782988

### What type of PR is it?
[Bug Fix | Hot Fix]

### Todos

Nothing

### What is the Jira issue?

[ZEPPELIN-1850](https://issues.apache.org/jira/browse/ZEPPELIN-1850)

### How should this be tested?

The first build profile (RAT) should pass CI.

### Screenshots (if appropriate)

```
[INFO] --- frontend-maven-plugin:1.3:npm (npm clean) @ zeppelin-web ---
...
[INFO] Running 'npm run clean' in 
/Users/lambda/github/apache-zeppelin/zeppelin-bug/zeppelin-web
[INFO] 

[INFO] BUILD FAILURE
[INFO] 

[INFO] Total time: 0.701 s
[INFO] Finished at: 2017-01-05T03:55:32+09:00
[INFO] Final Memory: 21M/982M
[INFO] 

[ERROR] Failed to execute goal 
com.github.eirslett:frontend-maven-plugin:1.3:npm (npm clean) on project 
zeppelin-web: Failed to run task: 'npm run clean' failed. java.io.IOException: 
Cannot run 
program"/Users/lambda/github/apache-zeppelin/zeppelin-bug/zeppelin-web/node/node"
 (in directory 
"/Users/lambda/github/apache-zeppelin/zeppelin-bug/zeppelin-web"): error=2, No 
such file or directory -> [Help 1]
org.apache.maven.lifecycle.LifecycleExecutionException: Failed to execute 
goal com.github.eirslett:frontend-maven-plugin:1.3:npm (npm clean) on project 
zeppelin-web: Failed to run task
...
 
[ERROR]
[ERROR]
[ERROR] For more information about the errors and possible solutions, 
please read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
```

### Questions:
* Does the licenses files need update? - NO
* Is there breaking changes for older versions? - NO
* Does this needs documentation? - NO


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/1ambda/zeppelin 
fix/invalid-clean-setting-in-zeppelin-web

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1841.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1841


commit fba308623adf1c3e5cfab7d1e83da9d41b0291de
Author: 1ambda <1am...@gmail.com>
Date:   2017-01-04T18:58:38Z

fix: DON'T run npm command in maven clean phase




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Re: Exporting Spark paragraphs as Spark Applications

2017-01-04 Thread Luciano Resende
I have made some progress with a tool to handle the points discussed in
this thread. It's currently a command line tool and given a Zeppelin
notebook (note.json) it generates a Spark scala application, compiles it
using the compiler embedded in the scala sdk and then package all these
resources into a jar that works with spark-submit command.

I would like to start prototyping the integration into the Zeppelin UI and
I was wondering if it would be ok to use the above jar as a dependency
(e.g. from a maven release) and integrate into zeppelin...

Thoughts ?


On Mon, Sep 19, 2016 at 7:47 AM, Sourav Mazumder <
sourav.mazumde...@gmail.com> wrote:

> To Moon's point, This is what my vision is around this feature -
>
> 1. Use should be able to package 1, more than one, all of the paragraphs in
> a Notebook to create a Jar file which can be used with Spark-Submit.
>
> 2. The tool should automatically remove the all the interactive statements
> like print, show etc.
>
> 3. The tool should automatically create a Main class in addition to the jar
> file(s) which will internally call the respective jar. User can then change
> this main class if needed for parameterization through Args.
>
> Regards,
> Sourav
>
> On Mon, Sep 19, 2016 at 7:33 AM, Sourav Mazumder <
> sourav.mazumde...@gmail.com> wrote:
>
> > I am also pretty much for this.
> >
> > I have got the similar request from each and every people/group who I
> > showcased Zeppelin.Regards,
> > Sourav
> >
> > On Fri, Sep 16, 2016 at 8:06 PM, moon soo Lee  wrote:
> >
> >> Hi Luciano,
> >>
> >> I've also got a lot of questions about "Productize the notebook" every
> >> time
> >> i meet users use Zeppelin in their work.
> >>
> >> I think it's actually about two different problems that Zeppelin need to
> >> address.
> >>
> >> *1) Provide way that interactive notebook becomes part of production
> data
> >> pipeline.*
> >>
> >> Although Zeppelin does have quite convenient cron-like scheduler for
> each
> >> Note, built-in cron scheduler is not ready for serious use in the
> >> production. Because it lacks some features like actions after
> >> success/fail,
> >> fault-tolerance, history, and so on. I think community is working on
> >> improving it, and it's going to take some time.
> >>  Meanwhile, any external enterprise level job scheduler can run Note or
> >> Paragraph via REST api. But we don't have any guide and examples for it,
> >> what are the REST APIs user can use for this purpose, and how to use
> them
> >> in various cases (e.g. with authentication on, dynamic form parameters,
> >> etc). I think a lot of things need to be improved to make zeppelin
> easier
> >> to be part of production pipeline.
> >>
> >> *2) Provide stable way of run spark paragraphs.*
> >>
> >> Another barrier of using notebook in production pipeline is Scala REPL
> in
> >> SparkInterpreter. SparkInterpreter uses Scala REPL to provide
> interactive
> >> scala session and Scala REPL will eventually hit OOME as it compiles and
> >> runs statements. Current workaround in zeppelin is cron-scheduler inside
> >> of
> >> notebook has checkbox that can restart the Note after scheduler runs it.
> >> Of course that option does not apply when external scheduler runs job
> >> through REST api.
> >>
> >> I think what Luciano suggesting, "Export Spark Paragraph as Spark
> >> application" is interesting. If Spark Paragraphs can be easily packaged
> >> into jar (spark application) that can be one of way to address 1) and
> 2).
> >> In case of user already have stable way to schedule spark application
> jar.
> >>
> >> Actually, Flink interactive shell works in similar way internally as far
> >> as
> >> i know. i.e. package compiled class into jar and submit.
> >>
> >> One idea for prototyping is,
> >> How about make a interpreter inside of spark interpreter group, say it's
> >> %spark.build or some better name.
> >>
> >> And if user runs some command like
> >>
> >> %spark.build
> >> package
> >>
> >> then it builds spark application jar based on spark paragraph in the
> Note.
> >> I think it can be the simplest user interface for the prototype.
> >>
> >> Thanks,
> >> moon
> >>
> >> On Fri, Sep 16, 2016 at 1:11 PM Jeremy Anderson <
> >> jer...@objectadjective.com>
> >> wrote:
> >>
> >> > Luciano, I think this would be a terrific feature. I've heard the
> exact
> >> > same workflow you've describe in all of the research we've done.
> >> >
> >> > ...
> >> >
> >> > Jeremy Anderson
> >> > Founder, Object Adjective
> >> > 415.493.8489
> >> > jer...@objectadjective.com
> >> > objectadjective.com 
> >> >
> >> >
> >> >
> >> > This email and any files transmitted with it are confidential and
> >> > intended solely for the use of the individual or entity to whom they
> are
> >> > addressed.
> >> >
> >> > On 16 September 2016 at 12:19, Luciano Resende 
> >> > wrote:
> >> >
> >> > > While talking with a few different users, 

[GitHub] zeppelin pull request #1840: [ZEPPELIN-1730, 1587] add spark impersonation t...

2017-01-04 Thread khalidhuseynov
Github user khalidhuseynov closed the pull request at:

https://github.com/apache/zeppelin/pull/1840


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1840: [ZEPPELIN-1730, 1587] add spark impersonation t...

2017-01-04 Thread khalidhuseynov
GitHub user khalidhuseynov reopened a pull request:

https://github.com/apache/zeppelin/pull/1840

[ZEPPELIN-1730, 1587] add spark impersonation through --proxy-user option

### What is this PR for?
This is to add spark impersonation using --proxy-user option. note that it 
enables also to use spark impersonation without having logged user as system 
user with configured ssh.


### What type of PR is it?
Improvement

### Todos
* [x] - add `--proxy-user`
* [x] - try on standalone spark 1.6.2
* [x] - try on yarn-client mode spark 2.0.1

### What is the Jira issue?
Directly solves 
[ZEPPELIN-1730](https://issues.apache.org/jira/browse/ZEPPELIN-1730) and also 
solves [ZEPPELIN-1587](https://issues.apache.org/jira/browse/ZEPPELIN-1587) 
according to discussion in #1566 since using `--proxy-user` in `spark-submit` 
is preferable method.

### How should this be tested?
1. switch your spark cluster to `per user` and `isolated` mode
2. set up `user impersonation` flag
3. run some job using that spark interpreter
4. spark context should be created with currently logged in user 
credentials on behalf of system user

### Screenshots (if appropriate)
standalone

![spark_sc_impersonation](https://cloud.githubusercontent.com/assets/1642088/21639292/24240286-d224-11e6-8099-9bc74a06f0c2.gif)

yarn-client
https://cloud.githubusercontent.com/assets/1642088/21653117/75410fde-d264-11e6-886f-11d8b5dbd29e.png;>


### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? yes


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/khalidhuseynov/incubator-zeppelin 
feat/spark-proxy-user

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1840.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1840


commit 4c3dba9e95ca23fe42055f2a039fbdb423a7f466
Author: Khalid Huseynov 
Date:   2017-01-04T09:49:24Z

add --proxy-user option for spark

commit c1239726fe322b6d5281589716ce2006a3944095
Author: Khalid Huseynov 
Date:   2017-01-04T17:25:27Z

add note in docs




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1838: [ZEPPELIN-1832] Fixed a bug in zombie process w...

2017-01-04 Thread cloverhearts
GitHub user cloverhearts reopened a pull request:

https://github.com/apache/zeppelin/pull/1838

[ZEPPELIN-1832] Fixed a bug in zombie process when Zeppelin stopped.

### What is this PR for?
When Zeppelin stops, there is a problem.
Sometimes this happens to the test as well.
It is related to releasing the resources of the remote remote interpreter, 
and its order has been modified.

### What type of PR is it?
Bug Fix

### What is the Jira issue?
https://issues.apache.org/jira/browse/ZEPPELIN-1832
### How should this be tested?
1. Zeppelin start
2. run paragraph for interpreter (for example, %spark println("Hello world")
3. Zeppelin stop
4. check to zombie process 
```
#jps 
```
When Zeppelin exits, the RemoteInterpreter process should not exist.

### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/cloverhearts/zeppelin ZEPPELIN-1832

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1838.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1838


commit b282b9445df47d8672a51cd93b44092bb578135b
Author: cloverhearts 
Date:   2017-01-04T08:51:46Z

Fixed a bug in zombie process when Zeppelin stopped.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread cloverhearts
Github user cloverhearts commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
@Leemoonsoo Yes, I will :)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1838: [ZEPPELIN-1832] Fixed a bug in zombie process w...

2017-01-04 Thread cloverhearts
Github user cloverhearts closed the pull request at:

https://github.com/apache/zeppelin/pull/1838


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1814: [ZEPPELIN-1869] changed the API response to generate t...

2017-01-04 Thread Leemoonsoo
Github user Leemoonsoo commented on the issue:

https://github.com/apache/zeppelin/pull/1814
  
@cloverhearts I think you need update related unittests, too.

```
Results :

Failed tests: 
  InterpreterRestApiTest.testAddDeleteRepository:352 Test create method:
Expected: HTTP response <201> from /api/interpreter/repository/
 but: got <200> OK
  InterpreterRestApiTest.testCreatedInterpreterDependencies:171 test create 
method:
Expected: HTTP response <201> from /api/interpreter/setting
 but: got <200> OK
  InterpreterRestApiTest.testSettingsCRUD:124 test create method:
Expected: HTTP response <201> from /api/interpreter/setting/
 but: got <200> OK
  ZeppelinRestApiTest.testCloneNote:329 test note clone method:
Expected: HTTP response <201> from /api/notebook/2C7FSPHN2
 but: got <200> OK
  ZeppelinRestApiTest.testImportNotebook:265 
Expected: HTTP response <201> from /api/notebook/import/
 but: got <200> OK
  ZeppelinRestApiTest.testInsertParagraph:593 Test insert method:
Expected: HTTP response <201> from /api/notebook/2C6FACAQZ/paragraph
 but: got <200> OK
  ZeppelinRestApiTest.testNoteCreateNoName:127->testNoteCreate:175 test 
note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  ZeppelinRestApiTest.testNoteCreateWithName:122->testNoteCreate:175 test 
note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  ZeppelinRestApiTest.testNoteCreateWithParagraphs:140 test note create 
method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  NotebookRestApiTest.testCloneNote:129 
Expected: HTTP response <201> from /api/notebook/2C6JVYEJD
 but: got <200> OK
  
NotebookSecurityRestApiTest.testThatOtherUserCannotAccessNoteIfPermissionSet:81->createNoteForUser:155
 test note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  
NotebookSecurityRestApiTest.testThatUserCanSearchNote:115->createNoteForUser:155
 test note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  
NotebookSecurityRestApiTest.testThatWriterCannotRemoveNote:98->createNoteForUser:155
 test note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  
NotebookSecurityRestApiTest.testThatOtherUserCanAccessNoteIfPermissionNotSet:72->createNoteForUser:155
 test note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK
  
NotebookSecurityRestApiTest.testThatUserCanCreateAndRemoveNote:63->createNoteForUser:155
 test note create method:
Expected: HTTP response <201> from /api/notebook/
 but: got <200> OK

Tests run: 82, Failures: 15, Errors: 0, Skipped: 1
```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1788: ZEPPELIN-1826. Flaky Test: ZeppelinSparkClusterTest.zR...

2017-01-04 Thread jongyoul
Github user jongyoul commented on the issue:

https://github.com/apache/zeppelin/pull/1788
  
LGTM


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1824: [ZEPPELIN-1850] Remove grunt-contrib-clean (grunt)

2017-01-04 Thread Leemoonsoo
Github user Leemoonsoo commented on the issue:

https://github.com/apache/zeppelin/pull/1824
  
Since this PR merge, [first CI test profile 
fails](https://travis-ci.org/apache/zeppelin/jobs/188757727) with

```
[ERROR] Failed to execute goal 
com.github.eirslett:frontend-maven-plugin:1.3:npm (npm clean) on project 
zeppelin-web: Failed to run task: 'npm run clean' failed. java.io.IOException: 
Cannot run program "/home/travis/build/apache/zeppelin/zeppelin-web/node/node" 
(in directory "/home/travis/build/apache/zeppelin/zeppelin-web"): error=2, No 
such file or directory -> [Help 1]
```

@1ambda Could you take a look?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1788: ZEPPELIN-1826. Flaky Test: ZeppelinSparkClusterTest.zR...

2017-01-04 Thread Leemoonsoo
Github user Leemoonsoo commented on the issue:

https://github.com/apache/zeppelin/pull/1788
  
LGTM and merge if no further discussions.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1835: [MINOR][DOCS] Add troubleshooting section to spark_clu...

2017-01-04 Thread Leemoonsoo
Github user Leemoonsoo commented on the issue:

https://github.com/apache/zeppelin/pull/1835
  
LGTM and merge if no further discussions!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread Leemoonsoo
Github user Leemoonsoo commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
multiple ci test profile fails with the same error.

```
09:51:15,163 ERROR 
org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventPoller:245 - Can't 
handle event RemoteInterpreterEvent(type:OUTPUT_UPDATE_ALL, 
data:{"noteId":"2C75F1GM3","paragraphId":"20170104-095114_1076613332","messages":[]})
java.lang.NullPointerException
at 
org.apache.zeppelin.interpreter.remote.RemoteInterpreterEventPoller.run(RemoteInterpreterEventPoller.java:186)
09:51:15,226  INFO org.apache.zeppelin.interpreter.InterpreterGroup:145 - 
Close interpreter group 2C8ACKHEJ:shared_process
09:51:15,226  INFO 
org.apache.zeppelin.interpreter.remote.RemoteInterpreterProcess:121 - shutdown 
interpreter process
09:51:16,226 ERROR 
org.apache.zeppelin.interpreter.remote.AppendOutputRunner:68 - Wait for 
OutputBuffer queue interrupted: null
09:51:17,657  INFO 
org.apache.zeppelin.interpreter.remote.RemoteInterpreterProcess:131 - Exception 
in RemoteInterpreterProcess while synchronized dereference, can safely ignore 
exception while client.shutdown() may terminates remote process
09:51:17,660  INFO 
org.apache.zeppelin.interpreter.remote.RemoteInterpreterManagedProcess:164 - 
Interpreter process exited 0


No output has been received in the last 10m0s, this potentially indicates a 
stalled build or something wrong with the build itself.
Check the details on how to adjust your build configuration on: 
https://docs.travis-ci.com/user/common-build-problems/#Build-times-out-because-no-output-was-received

The build has been terminated
```

@cloverhearts could you take a look?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1836: [ZEPPELIN-1887] fix: DON'T create new para when...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1836


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1799: [ZEPPELIN-1165 : WIP] Code-based job workflow

2017-01-04 Thread rasehorn
Github user rasehorn commented on the issue:

https://github.com/apache/zeppelin/pull/1799
  
@cloverhearts 
I think a picture and some pseudocode tells more than thousand words, so I 
created one.

Also: I'm only talking about the use case to ensure a certain sequence of 
paragraph executions when runAll is called for the notebook. If you explicitely 
call z.run(paragraphId) within a certain notebook after runAll() was called, 
you propably execute those paragraphs twice.

The easiest way to ensure a certain sequence of paragraph execution after 
runAll() was issued is to make the paragraphs wait for the one they depend on 
to finish. 

Lets say we have three paragraphs. 
The first one is necessary to prepare the data and define temporary tables. 
The second and third paragraphs depend on that data, so it does not make sense 
to execute them before paragraph 1 finished.
Since the last two paragraphs are in status "running" and wait in parallel 
for the first paragraph to finish, they will be executed in parallel.

Please see the picture 
![wait 
pseudocode](https://cloud.githubusercontent.com/assets/22585000/21642718/903ba9be-d284-11e6-8efb-958adca7861a.jpg)

From my point of view this would be the easiest way for a ZeppelinUser to 
ensure a certain sequence of paragraph execution including control which 
paragraphs are executed in parallel. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1840: [ZEPPELIN-1730, 1587] WIP add spark impersonati...

2017-01-04 Thread khalidhuseynov
GitHub user khalidhuseynov opened a pull request:

https://github.com/apache/zeppelin/pull/1840

[ZEPPELIN-1730, 1587] WIP add spark impersonation through --proxy-user 
option

### What is this PR for?
This is to add spark impersonation using --proxy-user option. note that it 
enables also to use spark impersonation without having logged user as system 
user with configured ssh.


### What type of PR is it?
Improvement

### Todos
* [x] - add for *nix
* [ ] - add for windows
* [x] - testing for standalone
* [ ] - testing for yarn mode

### What is the Jira issue?
Directly solves 
[ZEPPELIN-1730](https://issues.apache.org/jira/browse/ZEPPELIN-1730) and also 
solves [ZEPPELIN-1587](https://issues.apache.org/jira/browse/ZEPPELIN-1587) 
according to discussion in #1566 since using `--proxy-user` in `spark-submit` 
is preferable method.

### How should this be tested?
1. switch your spark cluster to `per user` and `isolated` mode
2. set up `user impersonation` flag
3. run some job using that spark interpreter
4. spark context should be created with currently logged in user 
credentials on behalf of system user

### Screenshots (if appropriate)

![spark_sc_impersonation](https://cloud.githubusercontent.com/assets/1642088/21639292/24240286-d224-11e6-8099-9bc74a06f0c2.gif)



### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no?


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/khalidhuseynov/incubator-zeppelin 
feat/spark-proxy-user

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1840.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1840


commit b68a4a06218977d94621ef2bde7506da45f821f9
Author: Khalid Huseynov 
Date:   2017-01-04T09:49:24Z

add --proxy-user option for spark




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1797: [ZEPPELIN-1854] make Per User mode in interpreter sett...

2017-01-04 Thread prabhjyotsingh
Github user prabhjyotsingh commented on the issue:

https://github.com/apache/zeppelin/pull/1797
  
Merging this if no more discussion.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1795: ZEPPELIN-1851. LazyOpenInterpreter would open interpre...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1795
  
ping @jongyoul 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1788: ZEPPELIN-1826. Flaky Test: ZeppelinSparkClusterTest.zR...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1788
  
ping @Leemoonsoo @jongyoul @bzz 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1837: ZEPPELIN-1901. Output schema might be null for ...

2017-01-04 Thread zjffdu
GitHub user zjffdu reopened a pull request:

https://github.com/apache/zeppelin/pull/1837

ZEPPELIN-1901. Output schema might be null for PigQueryInterpreter

### What is this PR for?
The output schema might be null if user doesn't specify it explicitly. In 
this PR, I will use 'col_{pos}' to replace the column name if it is null. 


### What type of PR is it?
[Improvement]

### Todos
* [ ] - Task

### What is the Jira issue?
* https://issues.apache.org/jira/browse/ZEPPELIN-1901

### How should this be tested?
Unit test is added.

### Screenshots (if appropriate)

### Questions:
* Does the licenses files need update? No
* Is there breaking changes for older versions? No
* Does this needs documentation? No


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-1901

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1837.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1837


commit a42615644710b7189d690c85d1aa09f773a76b11
Author: Jeff Zhang 
Date:   2017-01-04T08:29:50Z

ZEPPELIN-1901. Output schema might be null for PigQueryInterpreter




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1837: ZEPPELIN-1901. Output schema might be null for ...

2017-01-04 Thread zjffdu
Github user zjffdu closed the pull request at:

https://github.com/apache/zeppelin/pull/1837


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1838: [ZEPPELIN-1832] Fixed a bug in zombie process w...

2017-01-04 Thread cloverhearts
GitHub user cloverhearts reopened a pull request:

https://github.com/apache/zeppelin/pull/1838

[ZEPPELIN-1832] Fixed a bug in zombie process when Zeppelin stopped.

### What is this PR for?
When Zeppelin stops, there is a problem.
Sometimes this happens to the test as well.
It is related to releasing the resources of the remote remote interpreter, 
and its order has been modified.

### What type of PR is it?
Bug Fix

### What is the Jira issue?
https://issues.apache.org/jira/browse/ZEPPELIN-1832
### How should this be tested?
1. Zeppelin start
2. run paragraph for interpreter (for example, %spark println("Hello world")
3. Zeppelin stop
4. check to zombie process 
```
#jps 
```
When Zeppelin exits, the RemoteInterpreter process should not exist.

### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/cloverhearts/zeppelin ZEPPELIN-1832

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1838.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1838


commit b282b9445df47d8672a51cd93b44092bb578135b
Author: cloverhearts 
Date:   2017-01-04T08:51:46Z

Fixed a bug in zombie process when Zeppelin stopped.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1838: [ZEPPELIN-1832] Fixed a bug in zombie process w...

2017-01-04 Thread cloverhearts
Github user cloverhearts closed the pull request at:

https://github.com/apache/zeppelin/pull/1838


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread cloverhearts
Github user cloverhearts commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
@AhyoungRyu @soralee Thank you :)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1839: ZEPPELIN-1903. ZeppelinContext can not display pandas ...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1839
  
@bzz @Leemoonsoo @felixcheung Please help review


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1839: ZEPPELIN-1903. ZeppelinContext can not display ...

2017-01-04 Thread zjffdu
GitHub user zjffdu opened a pull request:

https://github.com/apache/zeppelin/pull/1839

ZEPPELIN-1903. ZeppelinContext can not display pandas DataFrame in 
PySparkInterpreter

### What is this PR for?
I copy some code from `PythonInterpreter` to `PySparkInterpreter` to enable 
display pandas DataFrame in `PySparkInterpreter`. Ideally IMO all the features 
in PythonInterpreter should be available in `PySparkInterpeter`. 
`PySparkInterpreter` should be an extension of PythonInterpreter.  After 
refactoring of PythonInterpreter is done, we can consider about it.


### What type of PR is it?
[Improvement]

### Todos
* [ ] - Task

### What is the Jira issue?
* https://issues.apache.org/jira/browse/ZEPPELIN-1903

### How should this be tested?
Unit test is added and also manually tested. 

### Screenshots (if appropriate)

![image](https://cloud.githubusercontent.com/assets/164491/21637701/17a68a22-d2a3-11e6-9a5c-4ec2183b5951.png)

### Questions:
* Does the licenses files need update? No
* Is there breaking changes for older versions? No
* Does this needs documentation? No


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-1903

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1839.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1839


commit e6c4df41c33c0d3b9f8987d362fa881a157383ec
Author: Jeff Zhang 
Date:   2017-01-04T09:21:46Z

ZEPPELIN-1903. ZeppelinContext can not display pandas DataFrame in 
PySparkInterpreter




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
Test and It works very well!
It was really necessary for me. ! Thanks for taking care of it 
@cloverhearts 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (ZEPPELIN-1903) ZeppelinContext can not display pandas DataFrame in PySparkInterpreter

2017-01-04 Thread Jeff Zhang (JIRA)
Jeff Zhang created ZEPPELIN-1903:


 Summary: ZeppelinContext can not display pandas DataFrame in 
PySparkInterpreter
 Key: ZEPPELIN-1903
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1903
 Project: Zeppelin
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Jeff Zhang
Assignee: Jeff Zhang






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread AhyoungRyu
Github user AhyoungRyu commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
It works nicely. Thanks for the fixing @cloverhearts! 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (ZEPPELIN-1902) Default interpreter select box is null after server started when Shiro is on

2017-01-04 Thread Ahyoung Ryu (JIRA)
Ahyoung Ryu created ZEPPELIN-1902:
-

 Summary: Default interpreter select box is null after server 
started when Shiro is on
 Key: ZEPPELIN-1902
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1902
 Project: Zeppelin
  Issue Type: Bug
Reporter: Ahyoung Ryu
Assignee: Ahyoung Ryu
Priority: Blocker
 Fix For: 0.7.0


A default interpreter select box in "Create new note" dialog is null when Shiro 
is activated. It needs refresh. (This is not happened when Shiro is off) 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] zeppelin issue #1836: [ZEPPELIN-1887] fix: DON'T create new para when run al...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1836
  
Tested and it works very well!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread cloverhearts
Github user cloverhearts commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
@minahlee @AhyoungRyu 
Hello, could you please review for this pr?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1838: [ZEPPELIN-1832] Fixed a bug in zombie process when Zep...

2017-01-04 Thread AhyoungRyu
Github user AhyoungRyu commented on the issue:

https://github.com/apache/zeppelin/pull/1838
  
@cloverhearts Nice! Thanks for the patch. Let me try 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1838: [ZEPPELIN-1832] Fixed a bug in zombie process w...

2017-01-04 Thread cloverhearts
GitHub user cloverhearts opened a pull request:

https://github.com/apache/zeppelin/pull/1838

[ZEPPELIN-1832] Fixed a bug in zombie process when Zeppelin stopped.

### What is this PR for?
When Zeppelin stops, there is a problem.
Sometimes this happens to the test as well.
It is related to releasing the resources of the remote remote interpreter, 
and its order has been modified.

### What type of PR is it?
Bug Fix

### What is the Jira issue?
https://issues.apache.org/jira/browse/ZEPPELIN-1832
### How should this be tested?
1. Zeppelin start
2. run paragraph for interpreter (for example, %spark println("Hello world")
3. Zeppelin stop
4. check to zombie process 
```
#jps 
```
When Zeppelin exits, the RemoteInterpreter process should not exist.

### Questions:
* Does the licenses files need update? no
* Is there breaking changes for older versions? no
* Does this needs documentation? no


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/cloverhearts/zeppelin ZEPPELIN-1832

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1838.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1838


commit b282b9445df47d8672a51cd93b44092bb578135b
Author: cloverhearts 
Date:   2017-01-04T08:51:46Z

Fixed a bug in zombie process when Zeppelin stopped.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1836: [ZEPPELIN-1887] fix: DON'T create new para when run al...

2017-01-04 Thread AhyoungRyu
Github user AhyoungRyu commented on the issue:

https://github.com/apache/zeppelin/pull/1836
  
Tested and works well as before 👍 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1837: ZEPPELIN-1901. Output schema might be null for PigQuer...

2017-01-04 Thread zjffdu
Github user zjffdu commented on the issue:

https://github.com/apache/zeppelin/pull/1837
  
@felixcheung Please help review, I will add document in #1830 to encourage 
user to specify schema explictly. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1775: [ZEPPELIN-1745] View revisions in non-editable ...

2017-01-04 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/zeppelin/pull/1775


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin pull request #1837: ZEPPELIN-1901. Output schema might be null for ...

2017-01-04 Thread zjffdu
GitHub user zjffdu opened a pull request:

https://github.com/apache/zeppelin/pull/1837

ZEPPELIN-1901. Output schema might be null for PigQueryInterpreter

### What is this PR for?
The output schema might be null if user doesn't specify it explicitly. In 
this PR, I will use 'col_{pos}' to replace the column name if it is null. 


### What type of PR is it?
[Improvement]

### Todos
* [ ] - Task

### What is the Jira issue?
* https://issues.apache.org/jira/browse/ZEPPELIN-1901

### How should this be tested?
Unit test is added.

### Screenshots (if appropriate)

### Questions:
* Does the licenses files need update? No
* Is there breaking changes for older versions? No
* Does this needs documentation? No


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/zjffdu/zeppelin ZEPPELIN-1901

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/zeppelin/pull/1837.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #1837


commit a42615644710b7189d690c85d1aa09f773a76b11
Author: Jeff Zhang 
Date:   2017-01-04T08:29:50Z

ZEPPELIN-1901. Output schema might be null for PigQueryInterpreter




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (ZEPPELIN-1901) Output schema might be null for PigQueryInterpreter

2017-01-04 Thread Jeff Zhang (JIRA)
Jeff Zhang created ZEPPELIN-1901:


 Summary: Output schema might be null for PigQueryInterpreter
 Key: ZEPPELIN-1901
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-1901
 Project: Zeppelin
  Issue Type: Bug
Affects Versions: 0.7.0
Reporter: Jeff Zhang
Assignee: Jeff Zhang
 Fix For: 0.7.0






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] zeppelin issue #1833: [ZEPPELIN-1880] Fix shell interpreter output streaming...

2017-01-04 Thread AhyoungRyu
Github user AhyoungRyu commented on the issue:

https://github.com/apache/zeppelin/pull/1833
  
@soralee Thanks for testing it out. Right I think the rerunning issue is 
due to that I mentioned in [this 
comment](https://github.com/apache/zeppelin/pull/1833#issuecomment-270078954). 

@astroshim @Leemoonsoo BTW just to be making sure, did you build this 
branch including shell interpreter like `mvn clean package -DskipTests -pl 
'shell, zeppelin-interpreter, zeppelin-server, zeppelin-zengine'` to test this 
patch? After build and restart Zeppelin, the result should be same with 
@soralee and me. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1834: [ZEPPELIN-1864] Improvement to show folder and note af...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1834
  
Thanks for detailed test @AhyoungRyu! :heart_eyes: 
Sure, actually I missed the filter in the Navbar! 
Let me check and update PR!!



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] zeppelin issue #1833: [ZEPPELIN-1880] Fix shell interpreter output streaming...

2017-01-04 Thread soralee
Github user soralee commented on the issue:

https://github.com/apache/zeppelin/pull/1833
  
I tested and I also faced the problem in this branch. (It is same problem 
with above @AhyoungRyu's comment.)

At first time, It was worked very well. 

![z1833_c](https://cloud.githubusercontent.com/assets/8110458/21635067/d825cc36-d29e-11e6-9246-b3e61506b8a7.gif)

But when I rerun same paragraph, it was not worked. 

![z1833_c_1](https://cloud.githubusercontent.com/assets/8110458/21635077/f6019758-d29e-11e6-8a50-e92556960196.gif)



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---