On 02/16/2012 10:15 AM, Harsh J wrote:
> That is how HBase does it: HBaseConfiguration at driver loads up HBase
> *xml file configs from driver classpath (or user set() entries, either
> way), and then submits that as part of job.xml. These configs should
> be all you need.
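
(For context, the driver-side pattern Harsh describes looks roughly like this; a rough sketch assuming the HBase 0.90-era API, with a made-up job name and quorum:)

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.mapreduce.Job;

    public class DriverSketch {
        public static void main(String[] args) throws Exception {
            // create() pulls hbase-default.xml / hbase-site.xml off the driver classpath
            Configuration conf = HBaseConfiguration.create();
            // explicit set() entries work the same way; either ends up in job.xml at submit time
            conf.set("hbase.zookeeper.quorum", "zk1,zk2,zk3");
            Job job = new Job(conf, "my-job");
            // ... mapper/reducer, input/output formats, then job.waitForCompletion(true)
        }
    }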

It should be, and yet I'm running into sporadic problems. The details are somewhat separate from MapReduce proper, and I'm still not sure of the exact root cause (sporadic bugs are the worst), but it seems to come down to an odd confluence of behaviors among Oozie, ZooKeeper, and Accumulo (another BigTable implementation).

The gist is that occasionally -- seemingly at random -- the Oozie-launched Java program needs to go looking for the Accumulo site configuration, which means searching for an XML file resource on the classpath. Not finding it, it falls back to the defaults, so Accumulo no longer knows where my cluster's ZooKeeper servers are; it keeps trying to reconnect to localhost (the default) and fails in an endless loop.
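
A cheap way to see which case a given launch hit is to probe the classpath from inside the launched main; a small sketch, assuming the site file is named accumulo-site.xml as in a stock install:

    import java.net.URL;

    public class ClasspathProbe {
        public static void main(String[] args) {
            // null here means the resource lookup will fail and the client
            // falls back to built-in defaults, including localhost for ZooKeeper
            URL site = Thread.currentThread().getContextClassLoader()
                             .getResource("accumulo-site.xml");
            System.out.println("accumulo-site.xml -> "
                    + (site == null ? "not on classpath" : site));
        }
    }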

So yes, I've set the relevant properties in my own configuration, which I hand to Oozie, but when "something" happens (my wild guess: a lost ZooKeeper lock?) Accumulo insists on consulting its SiteConfiguration, which means loading the XML resource.
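
Roughly the shape of the approach -- a sketch rather than my actual code: the property names below are invented for illustration, and (if I recall correctly) the oozie.action.conf.xml system property is how a <java> action finds the configuration Oozie passes it:

    import org.apache.accumulo.core.client.Connector;
    import org.apache.accumulo.core.client.Instance;
    import org.apache.accumulo.core.client.ZooKeeperInstance;
    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;

    public class OozieLaunchedMain {
        public static void main(String[] args) throws Exception {
            // load the action configuration Oozie materializes for a <java> action
            Configuration conf = new Configuration(false);
            conf.addResource(new Path("file:///", System.getProperty("oozie.action.conf.xml")));

            // property names below are mine (set in the workflow), not Accumulo's
            String instance = conf.get("accumulo.instance.name");
            String zookeepers = conf.get("accumulo.zookeepers");

            // connect via an explicit ZooKeeperInstance instead of SiteConfiguration
            Instance inst = new ZooKeeperInstance(instance, zookeepers);
            Connector conn = inst.getConnector("user", "secret".getBytes());
            // ... scanners/writers from conn
        }
    }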

For the moment I've placed a symlink in $HADOOP_HOME/conf/ to the needed Accumulo configuration file, but I'm wondering whether I can simply give the task JVMs access to the Accumulo configuration directories as well.
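
One way that might work, sketched against the Hadoop 0.20/1.x API: put accumulo-site.xml on HDFS and have the job add it to each task's classpath through the distributed cache. The HDFS path here is made up, and I'm not certain this covers the Oozie launcher JVM as opposed to the MR tasks:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.filecache.DistributedCache;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.mapreduce.Job;

    public class ShipSiteConfig {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            // the file must already be on HDFS; each task then sees it on its classpath
            DistributedCache.addFileToClassPath(new Path("/config/accumulo-site.xml"), conf);
            Job job = new Job(conf, "accumulo-job");
            // ... rest of job setup, then job.waitForCompletion(true)
        }
    }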
