Eric, There are elements in 702 that I think would be great to see in the final merge! Great to hear your open to Plan B.
Jeff On 3/7/16, 11:20 PM, "Eric Charles" <e...@apache.org> wrote: >Small precisions on #702: > >(snip...) >> >> - 702 >> * CI fails > >Just pushed and it is failing for non R related reasons... > >Most importantly, I have seen since a few days that the test are no more >executed for the spark interpreter for all PR builds > >[INFO] --- maven-surefire-plugin:2.17:test (default-test) @ >zeppelin-spark --- >[INFO] Tests are skipped. > >Will have a look at it. > >> * no tests > >There is some test > >https://github.com/datalayer/zeppelin-datalayer/blob/rscala-z/spark/src/test/java/org/apache/zeppelin/spark/SparkRInterpreterTest.java > >> * no docs >> > >There is some doc > >https://github.com/datalayer/zeppelin-datalayer/blob/rscala-z/docs/interpreter/R.md > > >> That being said, my personal opinion is that we should follow C, and #208 >> there has more chances of being merged first. >> >> Again, the goal is not to compare both contributions in terms of >> features/merit and decide here which is better, but to build a consensus on >> how we as a community proceed in situation of two contributions of same >> pluggable feature. In this thread, it means to have no -1s for for at least >> one option, though a thoughtful compromise from all sides. >> >> What do you guys think? >> > >I would favor b) but this may take too much time, so to get users the >best choice as soon as possible, c) sounds to me like the way to go. > >> >> With PPMC hat on, I feel that we may need to start a separate thread for a >> generalised decidion-making process in such situation, irrigating of >> current state of issue with R interpreter. And after a making a decision >> there, we could use the same guiding principle to resolve this issue, as >> well as any other one in the future. >> >> >> >> -- >> Alex >> >> On Tue, Mar 8, 2016 at 2:45 PM, Jeff Steinmetz <jeffrey.steinm...@gmail.com> >> wrote: >> >>> I should clarify my preference regarding Plan A (to only merge 1 - at >>> least initially). >>> “Which” PR to merge (or merge first) is TBD - at least for myself. I’m >>> still testing both PR options. >>> Since the original request was not to debate the fate\merit\features of >>> any particular contribution in this thread, I’ll post my 702 PR findings >>> separately. >>> >>> >>> >>> ---- >>> Jeff Steinmetz >>> Principal Architect >>> Akili Interactive >>> www.akiliinteractive.com <http://www.akiliinteractive.com/> >>> >>> >>> >>> >>> >>> >>> On 3/2/16, 9:12 PM, "Jeff Steinmetz" <jeffrey.steinm...@gmail.com> wrote: >>> >>>> I too prefer plan A - merging two different R interpreters sounds like a >>> maintenance and documentation headache for end users. >>>> >>>> >>>> Do you or the community feel there are “specific” additional steps from a >>> “technical” or “development” perspective that need to happen in order merge >>> 208? >>>> If we know what’s holding back the merge technically (all history aside) >>> we can work as a community to solve it. >>>> >>>> Olympic spirit! >>>> Looking forward to helping this through. >>>> >>>> ---- >>>> Jeff Steinmetz >>>> Principal Architect >>>> Akili Interactive >>>> >>>> >>>> >>>> >>>> >>>> >>>> On 3/2/16, 8:14 PM, "Amos Elberg" <amos.elb...@gmail.com> wrote: >>>> >>>>> Alex -- the gist of my email is that we already have a consensus, and >>> have had >>>>> a consensus since November. The consensus was to merge 208. That's >>> "Plan A." >>>>> >>>>> With all respect, I don't see that anyone other than you believes we >>> don't >>>>> have a consensus on Plan A already, or has any issue with Plan A. >>>>> >>>>> In fact, I'm going to call now for "lazy consensus" on Plan A: End the >>> debate >>>>> and move rapidly to merge 208, completing whatever work is necessary to >>> do >>>>> that (if any). >>>>> >>>>> For the record, yes, I do object to Plan C. Numerous users have >>> complained >>>>> that with two different PRs, they don't know which interpreter to use. >>> That's >>>>> a strong reason to not merge two. In fact it will confuse people more, >>> because >>>>> one interpreter's R environment won't be shared with the other >>> interpreter, >>>>> and you can't move variables between them. Moreover, no-one has >>> presented any >>>>> benefit to merging the second one. >>>>> >>>>> In addition, while 208 seems to be ready to merge (waiting only on the >>> work >>>>> you're doing on CI), the second PR is nowhere close. So, that's another >>>>> reason: 208 should not have to wait for the other to be ready. >>>>> >>>>> But in any event, I disagree that there is any issue here. >>>>> >>>>> If you intend to continue this thread, then please address the issues >>> raised >>>>> in my e-mail earlier. Please also explain any strong objection to Plan >>> A. >>>>> >>>>> Thanks, >>>>> >>>>> -Amos >>>>> >>>>> On Thursday, March 03, 2016 12:09:33 PM Alexander Bezzubov wrote: >>>>>> Guys, please let's keep the discussion focused on the subject. >>>>>> >>>>>> Amos, I do not understand, are you saying that you do object on the >>>>>> community proceeding with plan C? If not - there is no need to >>> answer\post >>>>>> in this thread right now. >>>>>> >>>>>> Again, we are not debating fate\merit\features of any particular >>>>>> contribution here. >>>>>> >>>>>> Please post in this thread only if you strongly disagree with the >>> suggested >>>>>> plan. >>>>>> I'm calling for a lazy consensus and as soon as there are no >>> objections - >>>>>> will be ready to proceed with the plan above. >>>>>> >>>>>> Sooner we reach a consensus on the topic - sooner we can make further >>>>>> progress. >>>>>> >>>>>> -- >>>>>> Alex >>>>>> >>>>>> On Thu, Mar 3, 2016 at 11:45 AM, Amos Elberg <amos.elb...@gmail.com> >>> wrote: >>>>>>> Alex - What are we still debating at this point? >>>>>>> >>>>>>> I'm starting to feel like Charlie Brown with the football here. >>>>>>> >>>>>>> The PR was submitted in August and originally reviewed at the >>> beginning of >>>>>>> September. >>>>>>> In, I think, early December, it was then extensively reviewed and >>>>>>> discussed. I made a few requested changes, and at that time there >>> was a >>>>>>> decision to merge 208 pending Moon working on the CI problem. >>>>>>> In January the PR was reviewed again, by you and others, and I >>> thought >>>>>>> you'd decided to merge pending some changes from me, and you were >>> going to >>>>>>> work on CI. >>>>>>> In February, when people continued to email the list to ask what was >>> up, >>>>>>> we >>>>>>> said again that the community was moving to merge 208. >>>>>>> The thread started a few days ago. Nobody argued for changing the >>> plan. >>>>>>> The discussion lapsed until, today, I responded to a technical point. >>>>>>> >>>>>>> I'm not sure why this is coming up again. If Eric (or others) feel >>>>>>> strongly about the issues Eric raised with 208, which is things like >>>>>>> whether to link rscala or fork it (or whatever), why can't they just >>>>>>> submit >>>>>>> PRs with those change after 208 is merged? The architectures of the >>> two >>>>>>> PRs have been converging as Eric's been incorporating functionality. >>>>>>> No-one claims that Eric's interpreter provides any additional >>>>>>> functionality, or that its more stable, or anything like that. So >>> why are >>>>>>> we still talking about this? >>>>>>> >>>>>>> If the issue is that Eric put in substantial work, that was a choice >>> he >>>>>>> made after he knew the status of 208. He also had the benefit of >>> seeing >>>>>>> how I solved various technical problems, like using rscala, sharing >>> the >>>>>>> Spark Context, etc. In fact, when I first started on this project, >>> I saw >>>>>>> that Eric had done some preliminary work, and wrote him to see if we >>> could >>>>>>> collaborate. He wasn't interested. In November, when I heard that >>>>>>> Datalayer had produced an interpreter (I didn't realize Datalayer is >>> Eric) >>>>>>> I wrote them offering to work together. No reply. And in December >>> also. >>>>>>> No reply. Eric didn't even submit the PR until after there was >>> already a >>>>>>> consensus to merge 208. His PR only started to approach feature >>> parity in >>>>>>> the last few weeks, after we decided *again* to try to merge 208. >>>>>>> >>>>>>> Someone commented earlier in this thread that we need to get this >>> resolved >>>>>>> so the community can move on. I agree. I want to move on also. >>>>>>> >>>>>>> Is there any substantial reason at this point why we're revisiting >>> the >>>>>>> issue instead of simply trying to merge 208? Is there any reason >>> not to >>>>>>> view the discussion in this email chain as resolved in favor of >>> merging >>>>>>> 208 >>>>>>> and moving forward? Is there anything you're waiting on me for that >>> you >>>>>>> need so 208 can get merged? What, at this point, is left to be done >>> so >>>>>>> 208 >>>>>>> can be merged? >>>>>>> >>>>>>> On Wed, Mar 2, 2016 at 7:53 PM, Alexander Bezzubov <b...@apache.org> >>> wrote: >>>>>>>> Thank you guys for actually answering the question! >>>>>>>> >>>>>>>> My personal opinion on making a progress here, and in further >>> cases like >>>>>>>> that, lies with a plan C. >>>>>>>> >>>>>>>> Please correct me if I'm wrong, but what I can see in this thread >>> is a >>>>>>>> consensus around going further with plan C: merging contribution >>> as soon >>>>>>> >>>>>>> as >>>>>>> >>>>>>>> it is ready, without the need to block another contributions (as >>> they >>>>>>> >>>>>>> have >>>>>>> >>>>>>>> technical merit, of course) and let actual users decide. >>>>>>>> >>>>>>>> At this point, I'd really love to hear only from people that >>> disagree >>>>>>> >>>>>>> with >>>>>>> >>>>>>>> above and have strong opinions about that or think that the >>> concerns >>>>>>>> they >>>>>>>> have raised before were not addressed properly. >>>>>>>> >>>>>>>> Thanks again, >>>>>>>> I really appreciate everyone's time, spent on this issue. >>>>>>>> >>>>>>>> -- >>>>>>>> Alex >>>>>>>> >>>>>>>> >>>>>>>> On Tue, Mar 1, 2016 at 1:02 PM, Jeff Steinmetz < >>>>>>>> jeffrey.steinm...@gmail.com> >>>>>>>> >>>>>>>> wrote: >>>>>>>>> I too was able to use R via PR 208 with success. >>>>>>>>> >>>>>>>>> I have it running as expected within the Virtual Machine >>> outlined in >>>>>>> >>>>>>> this >>>>>>> >>>>>>>>> updated PR >>>>>>>>> https://github.com/apache/incubator-zeppelin/pull/751/ >>>>>>>>> >>>>>>>>> >>>>>>>>> With the `repl` package (also installed via the VM script), >>> plotting >>>>>>> >>>>>>> such >>>>>>> >>>>>>>>> as native R histograms worked within the notebook display system >>> as >>>>>>> >>>>>>> well. >>>>>>> >>>>>>>>> So - this looks good to me. >>>>>>>>> >>>>>>>>> Not to oversimplify things, it “seems” this PR (or this PR and a >>>>>>>>> future >>>>>>>> >>>>>>>> PR >>>>>>>> >>>>>>>>> for packaging) just needs: >>>>>>>>> - the packaging worked out (get the R scripts included in the >>>>>>>>> >>>>>>>>> distribution) >>>>>>>>> >>>>>>>>> - a few license additions to the rscala files (if they are not >>>>>>> >>>>>>> generated >>>>>>> >>>>>>>>> but part of the base requirements) >>>>>>>>> >>>>>>>>> - a profile addition such as -P r to only build with R binaries >>> if >>>>>>>>> >>>>>>>>> desired. >>>>>>>>> >>>>>>>>> Unless I am missing something, it could be merged with one final >>>>>>> >>>>>>> focused >>>>>>> >>>>>>>>> effort. >>>>>>>>> Somebody could tweak the documentation a bit to match the tone >>> of the >>>>>>>>> other interpreter docs post merge. >>>>>>>>> >>>>>>>>> Regards, >>>>>>>>> Jeff Steinmetz >>>>>>>>> Principal Architect >>>>>>>>> Akili Interactive >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On 2/29/16, 6:45 AM, "Sourav Mazumder" < >>> sourav.mazumde...@gmail.com> >>>>>>>>> >>>>>>>>> wrote: >>>>>>>>>> Very similar is my experience too. >>>>>>>>>> >>>>>>>>>> Could run PR 208 with least effort. And so far I am very >>> successful >>>>>>>>>> to >>>>>>>> >>>>>>>> use >>>>>>>> >>>>>>>>>> it to create demonstrations covering end to end machine >>> learning use >>>>>>>> >>>>>>>> cases >>>>>>>> >>>>>>>>>> in Zeppelin (showcasing how data can be shared across scala, >>> SparkR, >>>>>>>>>> R >>>>>>>>>> easily where data preparation/model creation done in >>> SparkR/Scala >>>>>>> >>>>>>> where >>>>>>> >>>>>>>> as >>>>>>>> >>>>>>>>>> visualization in R) using PR 208 in different meetups and other >>>>>>> >>>>>>> forums. >>>>>>> >>>>>>>>>> Regards, >>>>>>>>>> Sourav >>>>>>>>>> >>>>>>>>>> On Mon, Feb 29, 2016 at 5:04 AM, enzo < >>> e...@smartinsightsfromdata.com >>>>>>>>>> >>>>>>>>>> wrote: >>>>>>>>>>> As a keen R user I tried both branches, but I couldn’t make >>> work >>>>>>>>> >>>>>>>>> Charles' >>>>>>>>> >>>>>>>>>>> version (maybe my mistake). I found some issue on Amos' >>> version >>>>>>>> >>>>>>>> (mainly >>>>>>>> >>>>>>>>>>> about charting), reported on his github page (he has >>> suggested to >>>>>>> >>>>>>> test >>>>>>> >>>>>>>>> more >>>>>>>>> >>>>>>>>>>> extensively and report after merge - fair enough). >>>>>>>>>>> >>>>>>>>>>> In conclusion I do not have sound enough elements to judge on >>> which >>>>>>>> >>>>>>>> one >>>>>>>> >>>>>>>>> is >>>>>>>>> >>>>>>>>>>> better. As I’m in favour of competition as a general >>> principle, >>>>>>> >>>>>>> taking >>>>>>> >>>>>>>>> into >>>>>>>>> >>>>>>>>>>> account that they seem to be close to the finishing line I >>> would >>>>>>>>> >>>>>>>>> suggest to >>>>>>>>> >>>>>>>>>>> merge each one and let users decide: I concur with Eran. >>>>>>>>>>> >>>>>>>>>>> It would be useful (just to avoid similar occurrences in the >>>>>>>>>>> future) >>>>>>>> >>>>>>>> to >>>>>>>> >>>>>>>>>>> understand why we arrived here though. How is it possible >>> that a >>>>>>>>>>> fundamental pr as R interpreter takes so long to be >>> integrated? I >>>>>>>> >>>>>>>> would >>>>>>>> >>>>>>>>>>> humbly suggest for the future to give better treatment to the >>> big >>>>>>>>> >>>>>>>>> hitting >>>>>>>>> >>>>>>>>>>> functionalities. Clearly the more a ‘big’ functionality is >>>>>>>>>>> delayed, >>>>>>>> >>>>>>>> the >>>>>>>> >>>>>>>>>>> more will be deemed attractive to develop alternative versions >>>>>>>>>>> (some >>>>>>>>> >>>>>>>>> time >>>>>>>>> >>>>>>>>>>> better versions, some time equally useful). >>>>>>>>>>> >>>>>>>>>>> Another consideration is the over present issue of graphics. >>> From >>>>>>> >>>>>>> an >>>>>>> >>>>>>>> R >>>>>>>> >>>>>>>>>>> standpoint, due to the extreme richness of its graphic >>> offering, so >>>>>>>> >>>>>>>> far >>>>>>>> >>>>>>>>> I >>>>>>>>> >>>>>>>>>>> found that no notebook is entirely satisfactory: for example >>> the >>>>>>>> >>>>>>>> growing >>>>>>>> >>>>>>>>>>> family of htmlwidgets are badly (or not) displayed in many >>> cases. >>>>>>>>>>> It >>>>>>>>> >>>>>>>>> would >>>>>>>>> >>>>>>>>>>> certainly benefit the community to invest time and activities >>> on >>>>>>>>> >>>>>>>>> perfecting >>>>>>>>> >>>>>>>>>>> these issues, but so be it. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> Enzo >>>>>>>>>>> e...@smartinsightsfromdata.com >>>>>>>>>>> >>>>>>>>>>>> On 29 Feb 2016, at 12:36, Eran Witkon <eranwit...@gmail.com >>>> >>>>>>> >>>>>>> wrote: >>>>>>>>>>>> I think we should ask ourselves what is the guiding >>> principle >>>>>>> >>>>>>> here, >>>>>>> >>>>>>>>> for >>>>>>>>> >>>>>>>>>>>> example, if in the future I want to create yet another JDBC >>>>>>>>> >>>>>>>>> interpreter >>>>>>>>> >>>>>>>>>>> or >>>>>>>>>>> >>>>>>>>>>>> Flink interpreter, should I only extend the one that already >>>>>>>>>>>> exist >>>>>>>> >>>>>>>> or >>>>>>>> >>>>>>>>>>> can I >>>>>>>>>>> >>>>>>>>>>>> create my own and let the user community decide? >>> realistically I >>>>>>>> >>>>>>>> don't >>>>>>>> >>>>>>>>>>>> think we can control where people invest their time and >>>>>>> >>>>>>> contribution >>>>>>> >>>>>>>>> and >>>>>>>>> >>>>>>>>>>> as >>>>>>>>>>> >>>>>>>>>>>> long as it has no licencing issues and align with other >>> project >>>>>>>>> >>>>>>>>> guidance >>>>>>>>> >>>>>>>>>>> it >>>>>>>>>>> >>>>>>>>>>>> should be up to the users to decide. >>>>>>>>>>>> >>>>>>>>>>>> Eran W >>>>>>>>>>>> >>>>>>>>>>>> On Mon, Feb 29, 2016 at 2:13 PM DuyHai Doan < >>> doanduy...@gmail.com >>>>>>>>>>> >>>>>>>>>>> wrote: >>>>>>>>>>>>> Hello Alexander >>>>>>>>>>>>> >>>>>>>>>>>>> My opinion is no one, unless being an expert with R, is >>> able to >>>>>>>> >>>>>>>> judge >>>>>>>> >>>>>>>>>>> the >>>>>>>>>>> >>>>>>>>>>>>> quality of both contributions apart from the authors >>> themselves. >>>>>>> >>>>>>> So >>>>>>> >>>>>>>>>>> let's >>>>>>>>>>> >>>>>>>>>>>>> make them work together to merge 2 PR into a good one. >>> Those >>>>>>>>>>>>> PR, >>>>>>>>>>>>> especially the #208 has been there for a while and it's >>> high >>>>>>>>>>>>> time >>>>>>>>> >>>>>>>>> they >>>>>>>>> >>>>>>>>>>> get >>>>>>>>>>> >>>>>>>>>>>>> merged so the community can move on. >>>>>>>>>>>>> >>>>>>>>>>>>> Unless there are R experts in the Zeppelin community and >>> so they >>>>>>>>> >>>>>>>>> should >>>>>>>>> >>>>>>>>>>>>> speak to give their own opinions. >>>>>>>>>>>>> >>>>>>>>>>>>> My 2 cents >>>>>>>>>>>>> >>>>>>>>>>>>> Duy Hai DOAN >>>>>>>>>>>>> >>>>>>>>>>>>> On Mon, Feb 29, 2016 at 1:04 PM, Alexander Bezzubov < >>>>>>>> >>>>>>>> b...@apache.org> >>>>>>>> >>>>>>>>>>>>> wrote: >>>>>>>>>>>>>> Hi fellow Zeppelin community members, >>>>>>>>>>>>>> >>>>>>>>>>>>>> as you know, we have 2 contributions for ZEPPELIN-156 >>>>>>>>>>>>>> <https://issues.apache.org/jira/browse/ZEPPELIN-156> AKA >>> R >>>>>>>>>>>>>> <https://github.com/apache/incubator-zeppelin/pull/208> >>>>>>>> >>>>>>>> interpreter >>>>>>>> >>>>>>>>>>>>>> <https://github.com/apache/incubator-zeppelin/pull/702>. >>>>>>>>>>>>>> Both have merit, so wearing my PPMC hat, I'd like to >>> suggest us >>>>>>> >>>>>>> to >>>>>>> >>>>>>>>>>> make a >>>>>>>>>>> >>>>>>>>>>>>>> decision, how we move forward with it avoiding user >>> confusion. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Here is what we can do: >>>>>>>>>>>>>> - a. pick only one of those and merge it >>>>>>>>>>>>>> - b. ask authors of both of them to collaborate together >>> and >>>>>>> >>>>>>> come >>>>>>> >>>>>>>> up >>>>>>>> >>>>>>>>>>>>> with >>>>>>>>>>>>> >>>>>>>>>>>>>> one >>>>>>>>>>>>>> - c. merge each, as soon as it's ready and let >>>>>>>>>>>>>> users\maintainers >>>>>>>>> >>>>>>>>> decide >>>>>>>>> >>>>>>>>>>>>>> which one is best at the end >>>>>>>>>>>>>> >>>>>>>>>>>>>> This is not an official VOTE (which is possible to >>> arrange, but >>>>>>> >>>>>>> is >>>>>>> >>>>>>>>>>> rather >>>>>>>>>>> >>>>>>>>>>>>>> bad way to build a consensus). >>>>>>>>>>>>>> >>>>>>>>>>>>>> It is a discussion, aimed to see if we all, as community, >>> can >>>>>>>> >>>>>>>> build >>>>>>>> >>>>>>>>> a >>>>>>>>> >>>>>>>>>>>>>> consensus together cooperatively - meaning, *everyone >>>>>>> >>>>>>> compromises >>>>>>> >>>>>>>>>>>>>> something *and* there are no really strong opinions >>> against the >>>>>>>>> >>>>>>>>> final >>>>>>>>> >>>>>>>>>>>>>> plan*. >>>>>>>>>>>>>> >>>>>>>>>>>>>> I specifically DO NOT ask which one is better, have more >>>>>>> >>>>>>> features, >>>>>>> >>>>>>>>> etc, >>>>>>>>> >>>>>>>>>>>>>> etc, etc. >>>>>>>>>>>>>> What I ask for are opinions on a community way of >>> reconciling >>>>>>> >>>>>>> this >>>>>>> >>>>>>>>>>>>>> situation and moving project forward together. >>>>>>>>>>>>>> >>>>>>>>>>>>>> What do you think? >>>>>>>>>>>>>> >>>>>>>>>>>>>> -- >>>>>>>>>>>>>> Kind regards, >>>>>>>>>>>>>> Alexander. >>>>> >>>> >>> >>> >>