[Bug 63470] analytics1012 fails Hadoop applications and jobs

2014-04-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=63470

Andrew Otto  changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution|--- |FIXED

--- Comment #5 from Andrew Otto  ---
YES!  Found it.  /etc/hosts had a bad IP listed on analytics1012 for itself. 
Fixed and things look much better now!

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 63470] analytics1012 fails Hadoop applications and jobs

2014-04-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=63470

--- Comment #4 from Oliver Keyes  ---
(In reply to Toby Negrin from comment #3)
> This matches some anecdotal evidence from Oliver that there were problems
> with the  analytics2012 node.
> 
Yep. I reported this a while ago, but it looks like the bug turned out to be a
pair of bugs ("analytics1012 keeps dropping jobs" and "INSERT OVERWRITE doesn't
work") and the second one masked the first.

> Diederik updated the java version IIRC. I do not know how he made this
> change.
> 

Not sure the details, but I'm pretty sure he just went into the box and
upgraded by hand.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 63470] analytics1012 fails Hadoop applications and jobs

2014-04-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=63470

--- Comment #3 from Toby Negrin  ---
This matches some anecdotal evidence from Oliver that there were problems with
the  analytics2012 node.

Diederik updated the java version IIRC. I do not know how he made this change.

I suspect the fastest way forward with this node is to decommission it and
repave it because we don't really know what Diederik did with it. Perhaps
puppet can tell us if there versions are different?

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 63470] analytics1012 fails Hadoop applications and jobs

2014-04-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=63470

--- Comment #2 from christ...@quelltextlich.at ---
Bug 63472 might be related.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 63470] analytics1012 fails Hadoop applications and jobs

2014-04-03 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=63470

--- Comment #1 from Bingle  ---
Prioritization and scheduling of this bug is tracked on Mingle card
https://wikimedia.mingle.thoughtworks.com/projects/analytics/cards/cards/1522

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l