Thanks for the great summary. I don't have additional suggestions. Gwen
On Sun, Jul 27, 2014 at 11:03 AM, Arvind Prabhakar <arv...@apache.org> wrote: > Thanks Gwen and Jarcec. It appears that we all agree to the few basic > points below: > > a) Sqoop2 is promising effort although not near completion. We agree that > there is no need to discuss shutting that down at this time. > b) The naming of Sqoop2 is such that it raises expectations in > users/adopters to be better than Sqoop(1) and thus leads to confusion. > > The second point (b) above is the key issue that needs resolution. The > options discussed thus far are as follows: > > 1. Put a code name for Sqoop2 so that it is not confused with Sqoop(1). > This seems to have good community support. > 2. Use a explicit labels such as "stable" vs "beta/alpha/experimental" for > various Sqoop releases. > 3. Use explicit UI messaging to warn Sqoop2 users that it is not the same > as Sqoop(1) and is far behind on feature completeness and stability. There > seems to be some concerns around how this can be done given the > client/server architecture of Sqoop2. > 4. A combination of options 2 and 3 above. > > Are there any suggestions to mitigate this problem? Perhaps we should > cross-post this thread to user list as well to see if they agree with the > options here and/or if they have any other suggestions. > > Regards, > Arvind Prabhakar > > > > On Sat, Jul 26, 2014 at 6:50 PM, Jarek Jarcec Cecho <jar...@apache.org> > wrote: > >> Hi Arvind, >> thank you very much for sharing your concerns! You’ve risen a very good >> points. >> >> I personally see value in Sqoop 2 as the new architecture will allow us to >> cover much more use cases, various compliancy regulations and will >> eventually simplify user’s life. Based on the recent increase in dev >> activity, it seems that I’m not the only one who do believes in that and >> hence I strongly believe that discontinuing the effort doesn’t seem as the >> right way to go. I’m more then happy to discuss this topic further if you >> believe that it’s the right way though. >> >> Having said that I do believe in Sqoop 2, I have to second Gwen that >> current situation is very confusing to our users. I’ve seen significant >> number of users confused about why 1.99.4 do not have Avro/HBase/Hive >> integration when Sqoop 1 already have that. I was anticipating the >> confusion and hence I’ve suggested to use version number 1.99.x instead of >> 2.0.0 back when we were working on getting the first cut out of the door. I >> hoped that version 1.99.x will make obvious to everybody that it’s not >> “2.0.0” quite yet. Sadly it seems that this alone did not helped as much as >> I hoped. >> >> Hence I do see value in changing our public messaging as you’ve suggested >> to make the message more clearer. I personally like the idea with code name >> as that is quite popular in other projects and companies (remember Windows >> Longorn?) and it seems to be conveying the message. I do see a lot of >> variability of using that code name though - I don’t think that we >> necessarily have to remove any possible reference to “Sqoop 2” from the >> face of earth. I believe that the code name is very well suited for our >> webpage, wiki and perhaps a blog posts to make obvious that there is just >> one single stable Sqoop version and then some ongoing effort that do have >> available several cuts. I believe that just by doing that we will decrease >> confusion about what version should user download and use. We can discuss >> to what extent we want to push the code name and to what extent we will >> keep the reference to “Sqoop 2”. After all I’m confident that in not too >> distant future, we will have Sqoop 2 that will offer the comparable >> capability and features as Sqoop 1. >> >> I don’t think that the code name is the only idea that will decrease the >> immediate user confusion and hence I’m happy to hear others thoughts! >> >> Jarcec >> >> On Jul 26, 2014, at 6:00 PM, Gwen Shapira <gshap...@cloudera.com> wrote: >> >> > Thanks Arvind for your detailed explanation. >> > >> > I agree that naming releases stable and alpha is a good idea. I don't >> > agree that it will solve the issue, but we can't know until we try. >> > >> > Considering that Sqoop2 is intentionally a client-server architecture >> > with multiple clients and a REST API as an additional access point, I >> > believe that it is not feasible to mark UI as beta. >> > >> > I want to stress that I personally believe that Sqoop2 is a very >> > viable branch effort, to the extent that I am actively contributing to >> > it. >> > As Sqoop2 becomes more and more viable alternative to Sqoop1, we need >> > to prepare, as a community, to support both versions. >> > >> > Considering the number of features currently in Sqoop1 and the number >> > of production Sqoop1 users, I can see us supporting both versions for >> > the next 2 years. Making it easy for the community to support both is >> > my top concern here. Having to resolve endless confusions for two >> > years doesn't seem like a happy future to me. I see the Python >> > community fighting the same issue since they broke compatibility >> > between versions 2 and 3. I'd like to see us learn from those mistakes >> > and do better. >> > >> > I agree that a discussion would have been better than a vote. I was >> > under the mistaken impression that there is a consensus on the matter. >> > I renamed the thread to make it clear that we are interested in >> > hearing opinions from the rest of the community on this subject. >> > >> > >> > Bike-sheddingly yours, >> > >> > Gwen Shapira >> > >> > >> > >> > On Sat, Jul 26, 2014 at 4:44 PM, Arvind Prabhakar <arv...@apache.org> >> wrote: >> >> Thanks for the detailed pointers Gwen. I understand your concerns better >> >> now. My understanding from these threads as well as what you have >> described >> >> is that the confusion you refer to stems from the fact that Sqoop2 is >> not >> >> at feature parity with Sqoop(1) yet. >> >> >> >> It will be great to *discuss* what are the various ways to address this >> and >> >> then call a vote to decide upon the approach to use. Some other >> approaches >> >> that I can suggest are: >> >> >> >> 1. Calling Sqoop1 explicitly as "stable" in our downloads section, or >> even >> >> within the release label. So instead of Sqoop-1.4.5, it would be >> >> Sqoop-1.4.5-stable. >> >> >> >> 2. Alternatively calling Sqoop2 explicitly "alpha", "beta" or >> >> "experimental". Eg - Sqoop-1.99.4 would become Sqoop-1.99.4-beta. >> >> >> >> 3. Or perhaps a combination of both of these. >> >> >> >> 4. Plus good UI messaging that clearly outlines the critical differences >> >> between these to products. >> >> >> >> Personally, I do not believe that having a code name will solve the >> issue >> >> at hand, and may even make it worse. If the motivation is to call out >> >> Sqoop2 as something "not-Sqoop", then perhaps we should discuss the >> >> viability of this branch effort. If we conclude that it is not going to >> >> progress any further, we could call a vote on discontinuing this effort >> and >> >> instead focusing on the main Sqoop1 branch alone. >> >> >> >> Hope you understand my point of view on this. >> >> >> >> Regards, >> >> Arvind Prabhakar >> >> >> >> >> >> >> >> >> >> On Fri, Jul 25, 2014 at 10:53 AM, Gwen Shapira <gshap...@cloudera.com> >> >> wrote: >> >> >> >>> Hi Arvind, >> >>> >> >>> Here are few more threads from the last month where we had to explain >> >>> Sqoop2 status or explain that you can't use "sqoop import" with >> >>> Sqoop2, etc: >> >>> >> >>> >> >>> >> http://mail-archives.apache.org/mod_mbox/sqoop-user/201407.mbox/%3CCA%2BP7NPNTFuPYqf74M5OFw4e9xKZm2ns%3DZ0ydkkuJ06Jcg31hnw%40mail.gmail.com%3E >> >>> >> >>> >> >>> >> http://mail-archives.apache.org/mod_mbox/sqoop-user/201407.mbox/%3CCAAJ8D%3D9Ho%3DYH7jdavNAb1gwz19Z5dapmS98yR71KmM5LsjCEVw%40mail.gmail.com%3E >> >>> >> >>> >> >>> >> http://mail-archives.apache.org/mod_mbox/sqoop-user/201407.mbox/%3CCAPwc21YtdgAm9jO3%2Bs0asBZ2JkG%3DVCp5PLpkO5xQuuBPKQGuTw%40mail.gmail.com%3E >> >>> >> >>> >> >>> >> http://mail-archives.apache.org/mod_mbox/sqoop-user/201406.mbox/%3CCAOrS3pxWuxL1X9Sb816N_o1Jd==gs9ww6uje2po+fpaw7vh...@mail.gmail.com%3E >> >>> >> >>> In addition, I noticed the problem when talking to users in >> >>> conferences, customers, members of support team, etc (not to mention >> >>> that I got confused personally when I started out.) >> >>> I didn't bring much evidence in my first email because I thought there >> >>> was a wide consensus about the problem. >> >>> >> >>> I have several goals with the code-name: >> >>> >> >>> * We need to remove the impression that the new version is like Sqoop >> >>> only better. It is only somewhat like Sqoop and will not be strictly >> >>> better for many months yet. >> >>> * We need to clarify that this project is not even close to production >> >>> quality. >> >>> * We need to make it easy for us to quickly figure out which version >> >>> the user is talking about. We also need to make it easy for the users >> >>> to describe what they are using. >> >>> * We want to have fun :) >> >>> >> >>> I think the name Pelican Project will help with all goals: >> >>> - It is clearly not the same as Sqoop. So there's no existing >> >>> expectations on what will be supported. >> >>> - It is a "Project" and not a product yet. >> >>> - Sqoop and Pelican don't look or sound similar. No one can expect to >> >>> use Sqoop by running "pelican-shell" or to use Pelican by calling >> >>> "sqoop import". >> >>> - And a cute mascot will make every future presentation and blog post >> >>> on the topic much more fun. >> >>> >> >>> You do bring up good points of concern: >> >>> >> >>> a) Existing releases: I disagree code-names for in-progress >> >>> development cause too much confusion. They seem fairly common in the >> >>> software world. >> >>> >> >>> >> http://royal.pingdom.com/2010/05/27/the-developer-obsession-with-code-names-114-interesting-examples/ >> >>> >> >>> b) "could impact the reproducibility of previous release builds which >> >>> is not very good for the project." >> >>> This sounds fairly serious. Can you elaborate what you mean by >> >>> reproducibility of release build? >> >>> >> >>> Gwen >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> >> >>> On Fri, Jul 25, 2014 at 8:02 AM, Arvind Prabhakar <arv...@apache.org> >> >>> wrote: >> >>>> Hi Gwen, >> >>>> >> >>>> Other than the recent thread [1] on our user list, is there any other >> >>>> precedent regarding the confusion this issue has caused? If so, I >> would >> >>>> appreciate if you could point it out. >> >>>> >> >>>> Personally, I do agree that we ought to have a better mechanism to >> >>>> communicate the completeness (or incompleteness) of a release in >> order to >> >>>> ensure the users understand what benefits or drawbacks they may get. >> >>>> Incidentally, this was the primary reason for numbering the Sqoop2 >> >>> release >> >>>> as 1.99.x, thereby indicating that the release is not quite 2.0 yet, >> >>> which >> >>>> seems to be not working as well as expected. >> >>>> >> >>>> One traditional way to alleviate this issue would be to label the >> release >> >>>> alpha/beta etc. I prefer doing that instead of putting a code name for >> >>> the >> >>>> release for a couple of reasons - a) we have already made releases of >> >>>> Sqoop2 with the previous versioning scheme and hence changing the name >> >>>> could cause more confusion; and b) renaming the branches to the new >> name >> >>>> could impact the reproducibility of previous release builds which is >> not >> >>>> very good for the project. >> >>>> >> >>>> Another alternative to consider would be to have very clear messaging >> in >> >>>> the user-interface of Sqoop2 that it is still work in progress and not >> >>>> considered at par with Sqoop1. >> >>>> >> >>>> [1] http://s.apache.org/TvD >> >>>> >> >>>> Regards, >> >>>> Arvind Prabhakar >> >>>> >> >>>> >> >>>> On Fri, Jul 25, 2014 at 7:30 AM, Venkat Ranganathan < >> >>>> vranganat...@hortonworks.com> wrote: >> >>>> >> >>>>> +1 for Pelican. But documentation should not be called The Pelican >> >>> Brief >> >>>>> :) >> >>>>> >> >>>>> Venkat >> >>>>> >> >>>>> On Thu, Jul 24, 2014 at 8:12 PM, Abraham Elmahrek <a...@cloudera.com> >> >>>>> wrote: >> >>>>>> There's something about schlep (or schlepper) that I'm having >> trouble >> >>>>>> resisting... but... +1 to Pelican. >> >>>>>> >> >>>>>> >> >>>>>> On Thu, Jul 24, 2014 at 7:18 PM, Jarek Jarcec Cecho < >> >>> jar...@apache.org> >> >>>>>> wrote: >> >>>>>> >> >>>>>>> I’m obviously biased, but +1 to Pelican. >> >>>>>>> >> >>>>>>> Jarcec >> >>>>>>> >> >>>>>>> On Jul 24, 2014, at 7:06 PM, Martin, Nick <nimar...@pssd.com> >> wrote: >> >>>>>>> >> >>>>>>>> +1 Pelican >> >>>>>>>> >> >>>>>>>> -----Original Message----- >> >>>>>>>> From: Gwen Shapira [mailto:gshap...@cloudera.com] >> >>>>>>>> Sent: Thursday, July 24, 2014 9:51 PM >> >>>>>>>> To: dev@sqoop.apache.org >> >>>>>>>> Subject: Code name for Sqoop 2 (please vote!) >> >>>>>>>> >> >>>>>>>> Hi, >> >>>>>>>> >> >>>>>>>> As you may have noticed on the user list, Sqoop2 confuses the hell >> >>> out >> >>>>>>> of everyone. >> >>>>>>>> >> >>>>>>>> Part of the problem is the name - Sqoop2 sounds newer and >> therefore >> >>>>>>> better. People expect better quality and more features - which we >> >>> don't >> >>>>>>> deliver :( >> >>>>>>>> >> >>>>>>>> Therefore, I propose finding Sqoop2 a project code name. This way >> >>> it >> >>>>>>> will sound experimental and will not have the number "2" next to >> it. >> >>>>>>>> We can use the code name to mark the branches in the repo, the >> >>>>>>> documentation, the Hue frontend, etc. This will prevent confusion >> as >> >>> the >> >>>>>>> name Sqoop will go back to refer to just one project, and one that >> >>>>> actually >> >>>>>>> works. >> >>>>>>>> >> >>>>>>>> Suggested names: >> >>>>>>>> Project Pelican (Based on the animal on O'Reilly's Sqoop book) >> >>> Project >> >>>>>>> Schlep (Yiddish for "moving heavy package") >> >>>>>>>> >> >>>>>>>> Friends, contributors, committers and PMC members - please respond >> >>>>> with >> >>>>>>> either: >> >>>>>>>> * Vote (+1) on one of the names above >> >>>>>>>> * Your own suggestion >> >>>>>>>> >> >>>>>>>> We'll be looking to close the vote by August 1st (Next week). >> >>>>>>>> >> >>>>>>>> Gwen >> >>>>>>> >> >>>>>>> >> >>>>> >> >>>>> -- >> >>>>> CONFIDENTIALITY NOTICE >> >>>>> NOTICE: This message is intended for the use of the individual or >> >>> entity to >> >>>>> which it is addressed and may contain information that is >> confidential, >> >>>>> privileged and exempt from disclosure under applicable law. If the >> >>> reader >> >>>>> of this message is not the intended recipient, you are hereby >> notified >> >>> that >> >>>>> any printing, copying, dissemination, distribution, disclosure or >> >>>>> forwarding of this communication is strictly prohibited. If you have >> >>>>> received this communication in error, please contact the sender >> >>> immediately >> >>>>> and delete it from your system. Thank You. >> >>>>> >> >>> >> >>