Re: Towards Hadoop 1.0: Stronger API Compatibility from 0.21 onwards

2009-09-25 Thread Allen Wittenauer
On 9/25/09 10:13 AM, "Dhruba Borthakur" wrote: > It is really nice to have wire-compatibility between clients and servers > running different versions of hadoop. The reason we would like this is > because we can allow the same client (Hive, etc) submit jobs to two > different clusters running diff

Re: Towards Hadoop 1.0: Stronger API Compatibility from 0.21 onwards

2009-09-25 Thread Allen Wittenauer
On 9/25/09 12:44 PM, "Sanjay Radia" wrote: > > On Sep 25, 2009, at 12:03 PM, Allen Wittenauer wrote: > >> On 9/25/09 10:13 AM, "Dhruba Borthakur" wrote: >>> It is really nice to have wire-compatibility between clients and >> servers >&

Re: Towards Hadoop 1.0: Stronger API Compatibility from 0.21 onwards

2009-09-25 Thread Allen Wittenauer
On 9/25/09 1:18 PM, "Doug Cutting" wrote: > The question is not whether wire compatibility is a good thing. The > question is whether API compatibility is useless without wire > compatibility and, vice versa, whether wire compatibility is useless > without API compatibility. They're both valu

Re: Towards Hadoop 1.0: Stronger API Compatibility from 0.21 onwards

2009-09-25 Thread Allen Wittenauer
On 9/25/09 2:40 PM, "Doug Cutting" wrote: > Would it be materially better for you if we waited longer before calling > a release 1.0, assuming that the same features are released in the same > order and on the same schedule regardless of the release name? Yes. There is something magic to manager

Re: HDFS-758 in Hadoop-21 , Updates to Namenode health page

2009-11-25 Thread Allen Wittenauer
-1 We're never going to see 0.21 if features keep getting backported. On 11/24/09 9:44 PM, "Owen O'Malley" wrote: > +1 This sounds like useful information that will and has aided debugging. > > -- Owen

Re: HDFS-758 in Hadoop-21 , Updates to Namenode health page

2009-11-25 Thread Allen Wittenauer
Then you'll have no issues patching other things in 0.21 that are actual bug fixes that also meet this criteria, right? Or does this only apply to things that Yahoo! is hitting/deemed worthy? On 11/25/09 12:03 PM, "Tsz Wo (Nicholas), Sze" wrote: > +1 on committing it to 0.21 > > I also a

Re: HDFS-758 in Hadoop-21 , Updates to Namenode health page

2009-12-07 Thread Allen Wittenauer
On 12/7/09 2:00 PM, "Sanjay Radia" wrote: > Allen raises a good point that the rest of the community may not need > some of the features that Yahoo finds useful internally. FWIW, I have no real issues with the change itself. I'm much more concerned that a UI enhancement was deemed so critical. S

Re: how to assign each hadoop user to different group?

2009-12-17 Thread Allen Wittenauer
Group permissions come from id/whoami/etc. So define them that way in UNIX and it should get carried over to Hadoop. That said, it is probably the wrong behavior to have the default when group resolution fails to be supergroup. On 12/17/09 4:11 AM, "Starry SHI" wrote: > Hi. > > My Hadoop clus

Re: [Fwd: about hadoop security design]

2010-01-06 Thread Allen Wittenauer
On 1/5/10 10:57 PM, "gs...@tce.edu" wrote: > i am doing my research in hadoop security design. > Instead of using kerberos for hadoop security is it possible to > use Ldap authentication protocol??? Using LDAP (or NIS+, or NIS, or passwd/shadow files, or ... ) will require a password prompt, so

JIRA Edits?

2010-01-26 Thread Allen Wittenauer
[I'm sending this to common-dev@ cuz I don't think there is a general 'hey all you dev peeps' list anymore.] Am I missing something or have we lost the ability to edit JIRA comments? The inability to edit my stupid spelling mistakes makes me sad. :(

Re: JIRA Edits?

2010-01-27 Thread Allen Wittenauer
On 1/26/10 9:06 PM, "Owen O'Malley" wrote: > Doug decided he didn't like the majority of people editing their comments on > jira and disabled edits. If you want it back, please start a vote on this > list. I think instead I'll just put comment after comment making corrections. Less bureaucrac

Re: [help!] [paper]deconstruct Hadoop Distributed File System

2010-02-22 Thread Allen Wittenauer
On 2/21/10 1:31 AM, "springring" wrote: > > in addition, can administrator help me to send the attached file. thks. Just pop it up on the wiki.

Re: Top ten challenges and opportunities in Hadoop.

2010-02-24 Thread Allen Wittenauer
On 2/24/10 9:35 AM, "Ravi" wrote: >What are the key problems that the Hadoop community will be trying to > solve in the upcoming versions ? I did not find such discussion in the > archives. Please point me to any webpage or archives discussing this issue. Should be or are?

Re: Namespace partitioning using Locality Sensitive Hashing

2010-03-01 Thread Allen Wittenauer
On 3/1/10 8:48 AM, "Ketan Dixit" wrote: > How LSH is better than normal hashing? Because still, a client or a fixed > namenode has to take decision of which namenode to contact in whatever > hashing ? It looks to me that requests to files under same subtree are > directed to the same namenode

Re: Map Reduce in heterogeneous environ..

2010-03-11 Thread Allen Wittenauer
On 3/11/10 4:53 AM, "Segel, Mike" wrote: > Then push them out respective of their machine types? ... and this is the fundamental problem. I'm still a big fan of using LDAP to help us solve this problem. Rather than push configs, read them from an LDAP store. Roughly: cn=mygrid .. define

Re: [DISCUSSION] Release process

2010-03-24 Thread Allen Wittenauer
On 3/15/10 9:06 AM, "Owen O'Malley" wrote: > From our 21 experience, it looks like our old release strategy is > failing. Maybe this is a dumb question but... Are we sure it isn't the community failing? From where I stand, the major committers (PMC?) have essentially forked Hadoop into

Re: [DISCUSSION] Release process

2010-03-31 Thread Allen Wittenauer
On 3/30/10 8:22 PM, "Owen O'Malley" wrote: > > On Mar 30, 2010, at 3:40 PM, Doug Cutting wrote: > >> Another release we might consider is 1.0 based on 0.20. > > It is tempting and I think that 0.20 is *really* our 1.0, but I think > re-labeling a release a year after it came out would be co

Re: [DISCUSSION] Release process

2010-04-01 Thread Allen Wittenauer
On 4/1/10 2:15 PM, "Mattmann, Chris A (388J)" wrote: > In terms of the significance of the 1.0 labeling, I think it's important for > adoption. Companies wanting a 1.0 product could always pay Cloudera and get a v2 product. ;)

Re: gsoc

2010-04-05 Thread Allen Wittenauer
On Apr 5, 2010, at 12:58 PM, Pek Daniel wrote: > As far as i know Google's solution for storing static configuration is > Chubby. Hadoop currently suggests manage-scripts for keeping in sync > the xml-files on the nodes' local fs. I think, it would be a great > summer project to make hadoop servic

Re: [DISCUSSION] Release process

2010-04-05 Thread Allen Wittenauer
On Apr 5, 2010, at 5:06 PM, Chris K Wensel wrote: > > we need a well healed 1.0 sooner than later. Why?

Re: [DISCUSSION] Release process

2010-04-06 Thread Allen Wittenauer
On Apr 6, 2010, at 6:02 AM, Steve Loughran wrote: > Allen Wittenauer wrote: >> On Apr 5, 2010, at 5:06 PM, Chris K Wensel wrote: >>> we need a well healed 1.0 sooner than later. >> Why? > > I think it would be good for a 0.21 with the newly renamed artifacts &g

Re: Plans for a 0.21 Hadoop Release

2010-04-12 Thread Allen Wittenauer
So, does this mean that everything that has been committed for 0.22 is actually going to be in 0.21? For example, symlinks were committed to trunk for 0.22 according to JIRA. Hmm.. because of JIRA, would it be easier to actually call this release 0.22?

Re: Plans for a 0.21 Hadoop Release

2010-04-12 Thread Allen Wittenauer
On Apr 12, 2010, at 4:28 PM, Tom White wrote: > On Mon, Apr 12, 2010 at 4:14 PM, Allen Wittenauer > wrote: >> So, does this mean that everything that has been committed for 0.22 is >> actually going to be in 0.21? For example, symlinks were committed to trunk >> fo

Re: Hadoop on Solaris 10 and in OSGi bundles

2010-04-28 Thread Allen Wittenauer
On Apr 28, 2010, at 9:30 AM, kovachev wrote: > we are trying to set up Hadoop to run on Solaris 10 within Containers. > However, we encounter many problems. > Could you please write here down all the extra settings needed for running > Hadoop on Solaris? The two big ones: - whoami needs

Re: API for LATE scheduler

2010-05-06 Thread Allen Wittenauer
On May 5, 2010, at 11:36 PM, momina khan wrote: > hi could anyone point me to where i can download the LATE scheduler API plug > in for HAdoop ... i distinctly remember reading up on work on the API in > this mailing list but cant locate it on the web! I've only ever seen LATE referenced in pap

Re: hadoop-cluster-1.ic node errors and warnings

2010-06-21 Thread Allen Wittenauer
[Cutting the CC: line down to size ] On Jun 18, 2010, at 3:37 AM, Bikash Singhal wrote: > Hi folks , > > I have received this error in the hadoop cluster. Has anybody anybody > seen this . Any solution. Since you aren't picking anything out and you've shared a bunch of messages, I'm going to

Re: [JIRA] (COLL-117) seeing warnings "attempt to override final parameter: dfs.data.dir" in tasktracker logs

2010-06-21 Thread Allen Wittenauer
Again, removing a bunch of CC:'es. On Jun 21, 2010, at 2:26 AM, Bikash Singhal wrote: > > Hi Hadoopers, > > I have received WARN in the hadoop cluster. Has anybody seen this . Any > solution? > > >> 2010-06-06 01:45:04,079 WARN org.apache.hadoop.conf.Configuration: >> /var/lib/hadoop-0.20/

Re: Support for non-HDFS distributed, shared POSIX FS

2010-07-01 Thread Allen Wittenauer
On Jul 1, 2010, at 11:28 AM, Chris D wrote: > In the new URI I seek to create, each MapReduce slave would look for input > data from a seemingly local file:///, and write output to it as well. Assume > that the distributed FS handles concurrent reads, writes. Assuming > POSIX-compliance, the Local

Re: Support for non-HDFS distributed, shared POSIX FS

2010-07-01 Thread Allen Wittenauer
On Jul 1, 2010, at 12:00 PM, Chris D wrote: > Yes, it is mountable on all machines simultaneously, and, for example, works > properly through file:///mnt/to/dfs in a single node cluster. Then file:// will likely work a multi-node cluster as well. So I doubt you'll need to write anything at a

Re: Java 1.6.0_19 & Hadoop

2010-07-21 Thread Allen Wittenauer
I seem to be ok with the little bit of _20 I've been using. On Jul 21, 2010, at 5:58 AM, Bill Au wrote: > Now that jdk 1.6.0_21 is out, has anyone been running it with Hadoop? We > have also had problem running Hadoop with 1.6.0_18. So what version of > 1.6.0 would people recommend for use wit

Re: Automatic tool for QA checks

2010-07-26 Thread Allen Wittenauer
On Jul 26, 2010, at 5:13 AM, Jorge Rodrigez wrote: >> (though I am not sure of the exact integration point in >> Hudson). I think hudson is subscribed to the various jira mailing lists. The mail is piped via stdin to a script that parses it and fires off the appropriate action in hudson.

Re: why DFSClient.BlockReader doesnot support seek(long) can I use skip() instead?

2010-09-20 Thread Allen Wittenauer
On Sep 19, 2010, at 7:57 PM, steven zhuang wrote: > hi, all, > I have sent this mail in common user list before, duplicate it > here to seek for more help from experts. You'll likely have more luck on hdfs-dev. > I am wondering why seek(long) is disabled in HDFS.BlockReader? > Can I

Re: Terminal Level Authentication

2010-10-13 Thread Allen Wittenauer
On Oct 13, 2010, at 4:00 AM, Sujitha wrote: > > Hi all, >As a part of my research i am trying to authenticate users.For > this i created a Browser Level Kerberos authentication. SPNEGO or something custom? > After that > i have identified issues related to cookies on the brows

Re: Namenode reboot - namespace

2010-10-14 Thread Allen Wittenauer
On Oct 14, 2010, at 12:58 PM, Null Ecksor wrote: > Does the namenode starts a fresh and gather all the namespace information > again from the datanodes? > > I want to know if at all it looses the info written on the disk, is it able > to obtain the same from the datanodes in the cluster. I've ad

Re: Multi-Master Hadoop Configuration

2010-10-27 Thread Allen Wittenauer
On Oct 27, 2010, at 2:55 PM, Wang, Chengwei wrote: > Hi All, > > I've searched for a while on the possibility to configure multiple masters to > govern large number of slaves, but I barely found any except for > multi-masters for backup in HBase. Is one master enough to support thousands > of

Re: Multi-Master Hadoop Configuration

2010-10-27 Thread Allen Wittenauer
On Oct 27, 2010, at 3:28 PM, Wang, Chengwei wrote: > Thanks for pointing it out! > > Because it did not mentioned how may masters in the practice of 4000 nodes, > is there only one master managing 4000 slaves? Hadoop only supports one master, so yes. [At least until the Avata

Re: web access interface for HDFS

2010-12-16 Thread Allen Wittenauer
On Dec 10, 2010, at 5:40 PM, Springring wrote: > hi all, > I want to making sure one thing --if there are web page in HDFS to access > files? > I know that there are command like "fs -put" and "fs -get",even more we can > download > file from web like "slave:50075".But is there a way to put

Re: TaskTracker not starting on all nodes

2011-03-12 Thread Allen Wittenauer
(Removing common-dev, because this isn't a dev question) On Feb 26, 2011, at 7:25 AM, bikash sharma wrote: > Hi, > I have a 10 nodes Hadoop cluster, where I am running some benchmarks for > experiments. > Surprisingly, when I initialize the Hadoop cluster > (hadoop/bin/start-mapred.sh), in many

Re: definition of slots in Hadoop scheduling

2011-03-12 Thread Allen Wittenauer
(Removing common-dev, because this isn't really a dev question) On Feb 25, 2011, at 5:52 AM, bikash sharma wrote: > Hi, > How is task slot in Hadoop defined with respect to scheduling a map/reduce > task on such slots available on TaskTrackers? On a TaskTracker, one sets how many maps

Re: disable pipelining in Hadoop

2011-03-12 Thread Allen Wittenauer
(Removing common-dev, because this isn't a dev question.) On Mar 1, 2011, at 6:13 AM, bikash sharma wrote: > Hi, > Is there a way to disable the use of pipelining , i.e., the reduce phase is > started only after the map phase is completed? Set mapred.reduce.slowstart.completed.maps to 1. Be aw

Re: Problem with compiling and deploying customed hadoop

2011-03-12 Thread Allen Wittenauer
On Feb 24, 2011, at 5:41 PM, 朱韬 wrote: > Hi,guys: > To meet the needs of the current project, I have to modify the policy of > scheduler. So I checked out the source code > fromhttp://svn.apache.org/repos/asf/hadoop/mapreduce/trunk/. Then I modified > some code and compiled using this scr

Re: Hadoop Version 0.20.2, 0.20.100 and 0.21.0.

2011-03-17 Thread Allen Wittenauer
On Mar 16, 2011, at 12:52 PM, Jane Chen wrote: > Hi, > > I'm quite confused of the status and future of recent Hadoop versions. Since I'm sure I'll be struck down by lightening from someone, just let me put in the disclaimer that these are my opinions and do not necessarily (and prob

Re: Hadoop Version 0.20.2, 0.20.100 and 0.21.0.

2011-03-17 Thread Allen Wittenauer
On Mar 17, 2011, at 1:19 PM, Jane Chen wrote: > In 0.20.2, there are some Interface API that got deprecated > (org.apache.hadoop.mapred), and their abstract class counter parts were > introduced (org.apache.hadoop.mapreduce). I read in the mailing list that > now the deprecated APIs are consid

Re: Unable to connect to the url

2011-03-22 Thread Allen Wittenauer
On Mar 17, 2011, at 10:00 PM, James Ram wrote: > Hi, > > I am using a standalone linux machine. Namenode and Datanode are running. > But when I try to access the UI in my browser its showing "unable to > connect" error. I know its a basic question please help me. I have given > below the configu

Re: Starting 0.20-security-204 branch

2011-04-22 Thread Allen Wittenauer
On Apr 22, 2011, at 4:31 PM, Owen O'Malley wrote: > I've just created the 0.20-security-204 branch to start the stabilization > process for 0.20.204. I hope to get the 203 branch ready for a vote next week. Could someone actually take the branch and try to install from scratch? i.e.,

Re: Hadoop native builds fail on ARM due to -m32

2011-05-10 Thread Allen Wittenauer
On May 10, 2011, at 5:13 PM, Trevor Robinson wrote: > Is the native build failing on ARM (where gcc doesn't support -m32) a > known issue, and is there a workaround or fix pending? That's interesting. I didn't realize there was a gcc that didn't support -m. This seems like an odd thin

HADOOP-6835 concat gzip/bzip2

2011-06-13 Thread Allen Wittenauer
Am I correct in thinking that we should: a) change the description of this JIRA to just say gzip? b) file a new JIRA to add bzip2 concat support? Right now, it seems a bit misleading

Re: HADOOP-6835 concat gzip/bzip2

2011-06-13 Thread Allen Wittenauer
On Jun 13, 2011, at 2:24 PM, Chris Douglas wrote: > Yes to both. I haven't tested the concat bzip2 support, but I've heard > it's broken. The commit log and release note for the issue are correct > on HADOOP-6835. OK, thanks. I was looking for a double-check in case I missed something

Re: Hadoop Master and Slave Discovery

2011-07-06 Thread Allen Wittenauer
On Jul 5, 2011, at 2:40 AM, Steve Loughran wrote: > 1. you could use DNS proper, by way of Bonjour/avahi. You don't need to be > running any mDNS server to support .local, and I would strongly advise > against it in a large cluster (because .local resolution puts a lot of CPU > load on every se

Re: Hadoop Master and Slave Discovery

2011-07-06 Thread Allen Wittenauer
On Jul 6, 2011, at 5:05 PM, Eric Yang wrote: > Did you know that almost all linux desktop system comes with avahi > pre-installed and turn on by default? ... which is why most admins turn those services off by default. :) > What is more interesting is > that there are thousands of thos

Re: [VOTE] Release 0.20.204.0-rc0

2011-07-28 Thread Allen Wittenauer
On Jul 25, 2011, at 7:05 PM, Owen O'Malley wrote: > I've created a release candidate for 0.20.204.0 that I would like to release. > > It is available at: http://people.apache.org/~omalley/hadoop-0.20.204.0-rc0/ > > 0.20.204.0 has many fixes including disk fail in place and the new rpm and > de

Re: [VOTE] Release 0.20.204.0-rc0

2011-07-29 Thread Allen Wittenauer
I can't believe we're holding a vote on a release that isn't passing the nightly build. If my vote was binding, I'd -1 it based upon that alone.

Re: [VOTE] Release 0.20.204.0-rc0

2011-08-02 Thread Allen Wittenauer
On Aug 2, 2011, at 4:28 AM, Steve Loughran wrote: > > I'm getting confused about release roadmaps right now branch-20 is the new trunk, given that features keep popping up in it rather than bug fixes.

Re: [VOTE] Release 0.20.204.0-rc0

2011-08-02 Thread Allen Wittenauer
On Aug 2, 2011, at 12:23 PM, Eli Collins wrote: > > However it is disappointing > to see some of the features being developed on branch-20-security, > rather being developed first on trunk and then ported to > branch-20-security. ... which was exactly my point.

Re: hadoop JARs not in lib/ directory of layout

2011-08-04 Thread Allen Wittenauer
On Aug 4, 2011, at 1:06 PM, Alejandro Abdelnur wrote: > [moving to core-dev@] > > A big release note is doable. > > Still, people normally use 'hadoop' script when submitting jobs and 'hadoop' > would take care of having the JAR in the classpath. What other things would > break? > Eve

Re: hadoop JARs not in lib/ directory of layout

2011-08-04 Thread Allen Wittenauer
On Aug 4, 2011, at 1:59 PM, Alejandro Abdelnur wrote: > Pig, Hive bundle Hadoop JARs with distributions, so no issue there. Re-read what I said: >> I suspect lots of pig, hive, and hbase installations will also break. It still remains a potential issue for those of us who

Re: Architectural Questions

2011-08-09 Thread Allen Wittenauer
On Aug 9, 2011, at 2:28 PM, Harsh J wrote: > Mike, > > On Wed, Aug 10, 2011 at 2:18 AM, Segel, Mike wrote: >> Right. >> The problem is how do you distinguish when someone is asking about the >> Hadoop (HDFS, MapReduce) >> And the Hadoop ecosystem. (HDFS, MapReduce, HBase, Hive, Pig, Oozie, Flu

Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-10 Thread Allen Wittenauer
On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote: > All, > Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote > thread. Unfortunately, it had the version as 0.20.204 and didn't update the > release notes. I've updated it, run the regression tests and I think we > should

Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-10 Thread Allen Wittenauer
On Aug 9, 2011, at 8:55 AM, Owen O'Malley wrote: > All, > Matt rolled a 0.20.204.0rc1, but I think it got lost in the previous vote > thread. Unfortunately, it had the version as 0.20.204 and didn't update the > release notes. I've updated it, run the regression tests and I think we > should

Re: [VOTE] Should we release 0.20.204.0rc2?

2011-08-18 Thread Allen Wittenauer
On Aug 18, 2011, at 12:28 AM, Owen O'Malley wrote: > > This vote is still running with no votes other than mine. > > I've tested with and without security on a 60 node cluster and I'm seeing > some failures, but not that many. On a terasort with 15,000 maps and 200 > reduces, I ran the follow

Re: Hadoop 21.1, appends maintenance, roadmap?

2011-08-25 Thread Allen Wittenauer
On Aug 25, 2011, at 1:07 PM, wrote: >> >> >> >> The problem is that the 0.20.2xx releases are neither a superset or subset >> of the 0.21 release. In many ways, the 0.20.2xx.y releases would be better >> named as 1.x.y. > > This confused me totally. Is the thinking to make 0.20.2xx.y as 1.

Re: Hadoop Tools Layout (was Re: DistCpV2 in 0.23)

2011-09-06 Thread Allen Wittenauer
On Sep 6, 2011, at 9:30 AM, Vinod Kumar Vavilapalli wrote: > We still need to answer Amareshwari's question (2) she asked some time back > about the automated code compilation and test execution of the tools module. >>> My #1 question is if tools is basically contrib reborn. If not, what >> ma

Re: Hadoop Tools Layout (was Re: DistCpV2 in 0.23)

2011-09-06 Thread Allen Wittenauer
On Sep 6, 2011, at 4:32 PM, Eli Collins wrote: > > IMO if the tools module only gets stuff like distcp that's maintained > then it's not contrib, if it contains all the stuff from the current > MR contrib then tools is just a re-labeling of contrib. Given that > this proposal only covers moving d

Re: JIRA attachments order

2011-09-09 Thread Allen Wittenauer
On Sep 9, 2011, at 12:57 PM, Eli Collins wrote: > > Patches for trunk should be named: jira-xyz.patch > eg hdfs-123.patch s,patch,txt, since jira doesn't appear to pass a content-type to indicate it is readable by the browser (as you mentioned earlier).

Re: Github integration for Hadoop

2015-10-29 Thread Allen Wittenauer
> On Oct 29, 2015, at 5:14 PM, Andrew Wang wrote: > > However, if we were to use GH for integration, it would be with the > auto-squash to avoid the merge commit. Would this preserve the correct > attribution? FWIW, Yetus *really really really* wants a single commit when it comes to di

Re: Github integration for Hadoop

2015-10-30 Thread Allen Wittenauer
> * Have we tried our precommit on PRs yet? Does it work for multiple > branches? Is there a way to enforce rebase+squash vs. merge on the PR, > since, per Allen, Yetus requires one commit to work? I don’t know about the Jenkins-side of things (e.g., how does Jenkins trigger a build?).

Re: Github integration for Hadoop

2015-11-01 Thread Allen Wittenauer
> On Nov 1, 2015, at 6:05 AM, Tsuyoshi Ozawa wrote: > > Thank you for starting this discussion. It's good for us to rethink > our workflow to grow community. > However, at the moment, my concern is that we can put more pressure on > Yetus community if we move the main workflows into github. > >

Re: [DISCUSS] Looking to a 2.8.0 release

2015-11-11 Thread Allen Wittenauer
> On Nov 11, 2015, at 12:13 PM, Vinod Vavilapalli > wrote: > >— HDFS-6200 Create a separate jar for hdfs-client: Compatible improvement > - no dimension of alpha/betaness here. IMO: this feels like a massive break in backwards compatibility. Anyone who is looking for specific met

Re: [DISCUSS] Looking to a 2.8.0 release

2015-11-11 Thread Allen Wittenauer
> On Nov 11, 2015, at 1:11 PM, Vinod Vavilapalli > wrote: > > I’ll let others comment on specific features. > > Regarding the 3.x vs 2.x point, as I noted before on other threads, given all > the incompatibilities in trunk it will be ways off before users can run their > production workloads

Re: [DISCUSS] Looking to a 2.8.0 release

2015-11-11 Thread Allen Wittenauer
> On Nov 11, 2015, at 2:15 PM, Haohui Mai wrote: > > bq. it basically makes the assumption that everyone recompiles for > every minor release. > > I don't think that the statement holds. HDFS-6200 keeps classes in the > same package. hdfs-client becomes a transitive dependency of the > origina

Re: Need for force-push on feature branches

2015-11-12 Thread Allen Wittenauer
Implementing project-specific patch identification rules are definitely ‘not trivial’. FWIW, the documented ruleset Yetus supports is here: https://yetus.apache.org/documentation/latest/precommit-patchnames/ (Altho, in reality, the code does support more than this but they are sort of

Re: Need for force-push on feature branches

2015-11-12 Thread Allen Wittenauer
> On Nov 12, 2015, at 10:14 AM, Sangjin Lee wrote: > > I don't think we're proposing project-specific rules. It would be a > recognition of the git branch name prefix "feature/". > > If the file name had "HADOOP-67890-HADOOP-12345.001.patch" where > HADOOP-12345 was the feature JIRA but the

Re: Github integration for Hadoop

2015-11-13 Thread Allen Wittenauer
> On Nov 12, 2015, at 10:55 AM, Colin P. McCabe wrote: > > gerrit has a button on the UI to cherry-pick to different branches. > The button creates separate "gerrit changes" which you can then > commit. Eventually we could hook those up to Jenkins-- something > which we've never been able to do

FYI: now using dev-support/docker/Dockerfile

2015-11-20 Thread Allen Wittenauer
I’ve switched the Jenkins builds over to use the Hadoop-bundled Dockerfile. This means that external dependencies are now controlled by Hadoop-itself. If a patch is submitted to a branch that doesn’t have that file, it should rollback to the default Yetus one.

Re: [DISCUSS] Looking to a 2.8.0 release

2015-11-25 Thread Allen Wittenauer
> On Nov 25, 2015, at 11:23 AM, Vinod Kumar Vavilapalli > wrote: > > There are 40 odd incompatible changes in 3.x: > https://issues.apache.org/jira/issues/?jql=project%20in%20%28HADOOP%2C%20YARN%2C%20HDFS%2C%20MAPREDUCE%29%20AND%20resolution%20%3D%20Fixed%20AND%20fixVersion%20%3D%203.0.0%20AND

Re: Disable some of the Hudson integration comments on JIRA

2015-11-26 Thread Allen Wittenauer
> On Nov 25, 2015, at 5:41 PM, Andrew Wang wrote: > > Hi all, > > Right now we get something like 7 comments from Hudson whenever a change is > committed. Would anyone object if I turned off 6 of them? We have > variations like: > > Hadoop-trunk-Commit > Hadoop-Hdfs-trunk-Java8 > Hadoop-Yarn-t

Re: Disable some of the Hudson integration comments on JIRA

2015-11-30 Thread Allen Wittenauer
> On Nov 30, 2015, at 11:33 AM, Andrew Wang wrote: > > Good point Allen. So I guess the broader question is, do we find the > per-commit tracking build and test useful? With our current flakiness > levels, there isn't much signal from a FAILED on one of these integration > jobs. I think Hadoop-t

Re: Disable some of the Hudson integration comments on JIRA

2015-11-30 Thread Allen Wittenauer
> On Nov 30, 2015, at 3:46 PM, Chris Nauroth wrote: > > To be exhaustive, we'd also add the various -Drequire options for full > inclusion of all optional native components. Unfortunately, that gets > tricky because of the various build pre-requisites which would need to be > present on the Jen

Re: [DISCUSS} why is checkstyle harassing me on indentation?

2016-01-18 Thread Allen Wittenauer
> On Jan 18, 2016, at 11:34 AM, Steve Loughran wrote: > > > Yetus checkstyle is going a bit overboard on indentation policy Just the messenger. ;) FWIW: https://wiki.apache.org/hadoop/CodeReviewChecklist

[IMPORTANT] flag day for dev-support/ users

2016-01-21 Thread Allen Wittenauer
This is just a heads up that the contents of dev-support have changed due to HADOOP-12651. This patch effectively removes almost all of the Apache Yetus components and replaces them with wrappers now located in dev-support/bin. Note that in some cases (esp smart-apply-patch), t

Re: TestPatchTips wiki not working for me

2016-01-26 Thread Allen Wittenauer
> On Jan 26, 2016, at 12:19 PM, Ray Chiang wrote: > > Not sure if this should go to Yetus or here, but starting here... > > Maybe I'm missing something, but the commands as documented at > > https://wiki.apache.org/hadoop/TestPatchTips > > are no longer working for me. > > Admittedly, I'm n

Re: [VOTE] Release Apache Hadoop 2.6.4 RC0

2016-02-10 Thread Allen Wittenauer
> On Feb 9, 2016, at 6:27 PM, Junping Du wrote: > > Thanks Yongjun for identifying and proposing this change to 2.6.4. I think > this is the right thing to do and check for following releases. For 2.6.4, it > seems unnecessary to create another release candidate for this issue as we > only ki

node.js and more as dependencies

2016-02-25 Thread Allen Wittenauer
Hey folks. Have any of you looked at YARN-3368? Is adding node.js+a bunch of other stuff as dependencies just for the UI a good idea? Doesn’t that seem significantly heavyweight? How hard is this going to be operationally to manage?

Precommit changed to Yetus 0.2.0 release, src version to follow soon

2016-03-07 Thread Allen Wittenauer
FYI, Apache Yetus 0.2.0 passed vote this morning. In accordance with ASF rules, I’m setting the precommit scripts to use that version rather than Yetus master. As soon as it is officially announced and available on ASF dist servers worldwide, I’ll be filing the necessary jira to update the v

hadoop-integration-tests jenkins job

2016-03-19 Thread Allen Wittenauer
Given that we’re down to half our capacity in build-land, it’s interesting to see what everyone’s running since everything runs slower. You can find some interesting bits. Like ‘hadoop-integration-tests’. Is this job actually providing any value, especially with an aggressive

Branch policy question

2016-03-22 Thread Allen Wittenauer
Since it’s nearly impossible for me to get timely reviews for some build and script changes, is it possible for me to setup a branch, self review+commit to that branch, then request a branch merge? I’m basically looking at doing this for HADOOP-12857 + HADOOP-12930 and their subtasks e

Re: Branch policy question

2016-03-22 Thread Allen Wittenauer
> On Mar 22, 2016, at 10:49 AM, larry mccay wrote: > > That sounds like a reasonable approach and valid use of branches to me. > > Perhaps a set of functional tests could be provided/identified that would > help the review process by showing backward compatibility along with new > extensions fo

Re: Branch policy question

2016-03-22 Thread Allen Wittenauer
> On Mar 22, 2016, at 6:46 PM, Gangumalla, Uma wrote: > >> is it possible for me to setup a branch, self review+commit to that >> branch, then request a branch merge? > Basically this is something like Commit-Then-Review(here review later) > process right. I have not seen we followed this approa

Re: Branch policy question

2016-03-23 Thread Allen Wittenauer
> On Mar 23, 2016, at 10:25 AM, Chris Nauroth wrote: > > 2. Apache feature branches: Sign-off may come from designated branch > committers in addition to full committers. It's OK to break the branch > for work in progress, but it must be fixed before a merge. It's still > review then commit th

Review request: HADOOP-11393

2016-03-26 Thread Allen Wittenauer
This effectively makes the bash code the same as the Windows code with regards to using HADOOP_HOME as the base of operation. It is a large patch, but all it does is: * replace every occurrence of HADOOP_PREFIX with HADOOP_HOME with the standard bits to use and give a deprecation warni

Re: 2.7.3 release plan

2016-04-06 Thread Allen Wittenauer
This is probably a good time to remind/point folks to HADOOP-12893. Apache Hadoop's binary artifacts (with or without native code) and source artifacts are not complying with the licenses of bundled components. I fairly confident this means releases are off the table until someone aud

Re: Looking to a Hadoop 3 release

2016-04-22 Thread Allen Wittenauer
> On Apr 22, 2016, at 5:38 PM, Vinod Kumar Vavilapalli > wrote: > > On an unrelated note, offline I was pitching to a bunch of contributors > another idea to deal with rotting trunk post 3.x: *Make 3.x releases off of > trunk directly*. > > What this gains us is that > - Trunk is always near

Re: Looking to a Hadoop 3 release

2016-04-22 Thread Allen Wittenauer
> On Apr 22, 2016, at 6:10 PM, Vinod Kumar Vavilapalli > wrote: > > Nope. > > I’m proposing making a new 3.x release (as has been discussed in this thread) > off today’s trunk (instead of creating a fresh branch-3) and create a new > trunk-incompt where incompatible changes that we don’t wan

Re: Recent PreCommit Jenkins failures

2016-04-27 Thread Allen Wittenauer
> On Apr 27, 2016, at 3:53 PM, Arun Suresh wrote: > > which I guess is due to the haskell site throwing a 502 and prior to that I > saw a couple of failures with > hackage.haskell.org has been down all day for some reason. I’ve got a patch sitting in HADOOP-13067 which will switch us over t

git commit errors

2016-05-03 Thread Allen Wittenauer
Just a heads up that I accidentally forced push over branch-2 and branch-2.8 (in the process of pushing a feature branch) around 10am PT this morning that dropped several commits. I have since reset us back to those git refs. If anything has been committed since then, please recommit

Guidance needed on HADOOP-13096 and HADOOP-13097

2016-05-04 Thread Allen Wittenauer
When the sub-projects re-merged, maven work was done, whatever, the shell scripts for MR and YARN were placed (effectively) outside of the normal maven hierarchy. In order to add unit tests to the shell scripts for these sub-projects, it means effectively turning hadoop-yarn-project/ha

Re: every JIRA is now getting a no-longer-employed message

2016-05-05 Thread Allen Wittenauer
> On May 5, 2016, at 3:01 AM, Steve Loughran wrote: > > > whenever someone opens a new JIRA, it gains an auto attachment, > "ran...@yahoo-inc.com is no longer with Yahoo! > Inc." ... > > e.g > > https://issues.apache.org/jira/browse/HADOOP-13099 > > Does anyone

Re: Guidance needed on HADOOP-13096 and HADOOP-13097

2016-05-06 Thread Allen Wittenauer
:22 PM, Larry McCay wrote: > > I would vote for C or D with a filed JIRA to clean up the maven structure as > a separate effort. > Before moving to D, could you describe any reason to not go with C? > > On May 4, 2016, at 9:51 PM, Allen Wittenauer wrote: > >> &g

7 build nodes are down

2016-05-06 Thread Allen Wittenauer
Anyone know why? - To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-dev-h...@hadoop.apache.org

[VOTE] Merge feature branch HADOOP-12930

2016-05-09 Thread Allen Wittenauer
Hey gang! I’d like to call a vote to run for 7 days (ending May 16 at 13:30 PT) to merge the HADOOP-12930 feature branch into trunk. This branch was developed exclusively by me as per the discussion two months ago as a way to make what would be a rather large patch hopefully ea

  1   2   3   4   5   6   7   8   9   10   >