On Mon, Oct 8, 2012 at 4:29 PM, Arpit Gupta <ar...@hortonworks.com> wrote:
>
> I would recommend using the Oracle JDK.

The Oracle version didn't help.

> Also from your email below you mention that mapred.child.java.opts and
> mapred.child.ulimit were added to try to solve this problem. Are you setting
> memory settings for your map and reduce tasks? It might help if you share
> the full mapred-site.xml.


Attached please find the mapred-site.xml. The configuration is simply
the default configuration generated by the RPM package provided by
Hadoop, plus the extra mapred.child.java.opts and mapred.child.ulimit
settings already discussed.
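
For reference, a rough back-of-the-envelope sketch of what these settings imply per TaskTracker. It assumes every slot is occupied and every child JVM grows to its full -Xmx, which is an upper bound and not a measurement; the thread does not state how much RAM the nodes have. The function names are made up for illustration.

```python
# Values copied from the attached mapred-site.xml.
MAP_SLOTS = 10              # mapred.tasktracker.map.tasks.maximum
REDUCE_SLOTS = 6            # mapred.tasktracker.reduce.tasks.maximum
CHILD_HEAP_MB = 4096        # -Xmx4096M in mapred.child.java.opts
CHILD_ULIMIT_KB = 6291456   # mapred.child.ulimit (virtual memory, in KB)


def worst_case_heap_mb(map_slots, reduce_slots, heap_mb):
    """Total heap if every slot runs a child JVM that fills its -Xmx."""
    return (map_slots + reduce_slots) * heap_mb


def ulimit_to_heap_ratio(ulimit_kb, heap_mb):
    """Ratio of the per-child virtual memory limit to the child's max heap."""
    return (ulimit_kb / 1024) / heap_mb


if __name__ == "__main__":
    total_mb = worst_case_heap_mb(MAP_SLOTS, REDUCE_SLOTS, CHILD_HEAP_MB)
    ratio = ulimit_to_heap_ratio(CHILD_ULIMIT_KB, CHILD_HEAP_MB)
    print(f"worst-case child heap: {total_mb} MB ({total_mb // 1024} GB)")
    print(f"ulimit / heap ratio: {ratio}")
```

With the attached values this comes to 16 slots x 4096 MB = 65536 MB (64 GB) of potential child heap, and a ulimit of 1.5x the heap, as suggested earlier in the thread.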

At this point we have no further clue about what to do except
switch back to 0.20.2.

Any help is appreciated,
Attila

>
>
> --
> Arpit Gupta
> Hortonworks Inc.
> http://hortonworks.com/
>
> On Oct 8, 2012, at 2:45 AM, Attila Csordas <attilacsor...@gmail.com> wrote:
>
> OpenJDK 64-Bit Server VM (build 19.0-b09, mixed mode)
>
> Might the official Oracle Java be better?
>
> Thanks,
> Attila
>
> On Sun, Oct 7, 2012 at 8:37 PM, Arpit Gupta <ar...@hortonworks.com> wrote:
>
> Are you using a 32-bit JDK for your task trackers?
>
> If so, reduce the memory setting in mapred.child.java.opts.
>
> --
> Arpit
>
> On Oct 7, 2012, at 12:29 PM, Attila Csordas <attilacsor...@gmail.com> wrote:
>
> <property>
> <name>mapred.tasktracker.map.tasks.maximum</name>
> <value>10</value>
> </property>
>
> <property>
> <name>mapred.tasktracker.reduce.tasks.maximum</name>
> <value>6</value>
> </property>
>
> Cheers,
> Attila
>
> On Sun, Oct 7, 2012 at 6:34 AM, Harsh J <ha...@cloudera.com> wrote:
>
> Hi,
>
> What is your # of slots per TaskTracker? Your ulimit seems pretty
> high. I'd set it to 1.5x the heap initially, i.e., 6291456 KB (6 GB),
> and try.
>
> On Sun, Oct 7, 2012 at 3:50 AM, Attila Csordas <attilacsor...@gmail.com> wrote:
>
> Some details on this problem:
>
> 12/10/05 12:13:27 INFO mapred.JobClient:  map 0% reduce 0%
> 12/10/05 12:13:40 INFO mapred.JobClient: Task Id : attempt_201210051158_0001_m_000002_0, Status : FAILED
> java.lang.Throwable: Child Error
>       at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271)
> Caused by: java.io.IOException: Task process exit with nonzero status of 134.
>       at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258)
>
> attempt_201210051158_0001_m_000002_0: #
> attempt_201210051158_0001_m_000002_0: # There is insufficient memory for the Java Runtime Environment to continue.
> attempt_201210051158_0001_m_000002_0: # pthread_getattr_np
>
> In mapred-site.xml, the following memory settings were set after a
> couple of trials, in an attempt to get rid of the problem:
>
> <property>
> <name>mapred.child.java.opts</name>
> <value>-server -Xmx4096M -Djava.net.preferIPv4Stack=true</value>
> </property>
>
> <property>
> <name>mapred.child.ulimit</name>
> <value>16777216</value>
> </property>
>
> Cheers,
> Attila
>
>
>
> On Fri, Oct 5, 2012 at 10:50 AM, Steve Lewis <lordjoe2...@gmail.com> wrote:
>
> We get 'There is insufficient memory for the Java Runtime Environment to
> continue.'
> any time we run any job, including the most trivial word count process. It is
> true that I am generating a jar for a larger job, but I am only running a
> version of wordcount that worked well under 0.2.
> Any bright ideas?
> This is a new 1.03 installation and nothing is known to work.
>
> Steven M. Lewis PhD
> 4221 105th Ave NE
> Kirkland, WA 98033
> cell 206-384-1340
> skype lordjoe_com
>
>
>
>
> --
> Harsh J
>
>
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

<!-- Put site-specific property overrides in this file. -->

<configuration>

<property>
<name>mapred.tasktracker.tasks.sleeptime-before-sigkill</name>
<value>250</value>
<description>Normally, this is the amount of time before killing
processes, and the recommended-default is 5.000 seconds - a value of
5000 here. In this case, we are using it solely to blast tasks before
killing them, and killing them very quickly (1/4 second) to guarantee
that we do not leave VMs around for later jobs.
</description>
</property>

<property>
<name>mapred.system.dir</name>
<value>/mapred/mapredsystem</value>
<final>true</final>
</property>

<property>
<name>mapred.job.tracker</name>
<value>hadoop-master-02.ourdomain:9000</value>
<final>true</final>
</property>

<property>
<name>mapred.job.tracker.http.address</name>
<value>hadoop-master-02.ourdomain:50030</value>
<final>true</final>
</property>

<property>
<name>mapred.local.dir</name>
<value>/var/lib/hadoop/mapred</value>
<final>true</final>
</property>

<property>
<name>mapreduce.cluster.administrators</name>
<value>mapred</value>
</property>

<property>
<name>mapred.map.tasks.speculative.execution</name>
<value>false</value>
<description>If true, then multiple instances of some map tasks
may be executed in parallel.</description>
</property>

<property>
<name>mapred.reduce.tasks.speculative.execution</name>
<value>false</value>
<description>If true, then multiple instances of some reduce tasks
may be executed in parallel.</description>
</property>

<property>
<name>mapred.output.compression.type</name>
<value>BLOCK</value>
<description>If the job outputs are to be compressed as SequenceFiles, how
should they be compressed? Should be one of NONE, RECORD or BLOCK.
</description>
</property>

<property>
<name>jetty.connector</name>
<value>org.mortbay.jetty.nio.SelectChannelConnector</value>
</property>

<property>
<name>mapred.task.tracker.task-controller</name>
<value>org.apache.hadoop.mapred.DefaultTaskController</value>
</property>

<property>
<name>mapred.child.root.logger</name>
<value>INFO,TLA</value>
</property>

<property>
<name>mapred.child.java.opts</name>
<value>-server -Xmx4096M -Djava.net.preferIPv4Stack=true</value>
</property>

<property>
<name>mapred.child.ulimit</name>
<value>6291456</value>
</property>

<property>
<name>mapred.job.tracker.persist.jobstatus.active</name>
<value>true</value>
<description>Indicates if persistency of job status information is
active or not.
</description>
</property>

<property>
<name>mapred.job.tracker.persist.jobstatus.dir</name>
<value>file:////var/log/hadoop/mapred/jobstatus</value>
<description>The directory where the job status information is persisted
in a file system to be available after it drops off the memory queue and
between jobtracker restarts.
</description>
</property>

<property>
<name>mapred.job.tracker.history.completed.location</name>
<value>/mapred/history/done</value>
</property>

<property>
<name>mapred.heartbeats.in.second</name>
<value>200</value>
<description>to enable HADOOP:5784</description>
</property>

<property>
<name>mapreduce.tasktracker.outofband.heartbeat</name>
<value>true</value>
<description>to enable MAPREDUCE:270</description>
</property>

<property>
<name>mapred.jobtracker.maxtasks.per.job</name>
<value>200000</value>
<final>true</final>
<description>The maximum number of tasks for a single job.
A value of -1 indicates that there is no maximum.
</description>
</property>

<property>
<name>mapreduce.jobtracker.kerberos.principal</name>
<value>jt/_HOST@${local.realm}</value>
<description>
JT principal
</description>
</property>

<property>
<name>mapreduce.tasktracker.kerberos.principal</name>
<value>tt/_HOST@${local.realm}</value>
<description>
TT principal.
</description>
</property>


<property>
<name>hadoop.job.history.user.location</name>
<value>none</value>
</property>

<property>
<name>mapreduce.jobtracker.keytab.file</name>
<value>/etc/security/keytabs/jt.service.keytab</value>
<description>
The keytab for the jobtracker principal.
</description>
</property>

<property>
<name>mapreduce.tasktracker.keytab.file</name>
<value>/etc/security/keytabs/tt.service.keytab</value>
<description>The filename of the keytab for the task tracker</description>
</property>

<property>
<name>mapreduce.jobtracker.staging.root.dir</name>
<value>/user</value>
<description>The Path prefix for where the staging directories should be
placed. The next level is always the user's
name. It is a path in the default file system.
</description>
</property>


<property>
<name>mapreduce.job.acl-modify-job</name>
<value></value>
</property>

<property>
<name>mapreduce.job.acl-view-job</name>
<value>Dr.Who</value>
</property>

<property>
<name>mapreduce.tasktracker.group</name>
<value>hadoop</value>
<description>The group that the task controller uses for accessing the
task controller. The mapred user must be a member and users should *not*
be members.
</description>
</property>

<property>
<name>mapred.acls.enabled</name>
<value>true</value>
</property>

<property>
<name>mapred.jobtracker.taskScheduler</name>
<value>org.apache.hadoop.mapred.CapacityTaskScheduler</value>
</property>
<property>
<name>mapred.queue.names</name>
<value>default</value>
</property>

<!-- settings for the history server -->
<property>
<name>mapreduce.history.server.embedded</name>
<value>false</value>
</property>
<property>
<name>mapreduce.history.server.http.address</name>
<value>hadoop-master-02.ourdomain:51111</value>
</property>
<property>
<name>mapreduce.jobhistory.kerberos.principal</name>
<value>jt/_HOST@${local.realm}</value>
<description>history server principal</description>
</property>
<property>
<name>mapreduce.jobhistory.keytab.file</name>
<value>/etc/security/keytabs/jt.service.keytab</value>
<description>
The keytab for the jobtracker principal.
</description>
</property>

<property>
<name>mapred.hosts</name>
<value>/etc/hadoop/mapred.include</value>
<description>Names a file that contains the list of nodes that may
connect to the jobtracker. If the value is empty, all hosts are
permitted.</description>
</property>

<property>
<name>mapred.hosts.exclude</name>
<value>/etc/hadoop/mapred.exclude</value>
<description>Names a file that contains the list of hosts that
should be excluded by the jobtracker. If the value is empty, no
hosts are excluded.</description>
</property>
<property>
<name>mapred.jobtracker.retirejob.check</name>
<value>10000</value>
</property>
<property>
<name>mapred.jobtracker.retirejob.interval</name>
<value>0</value>
</property>

<property>
<name>mapred.cluster.map.memory.mb</name>
<value>-1</value>
<description>The size, in terms of virtual memory, of a single map slot
in the Map-Reduce framework, used by the scheduler.
A job can ask for multiple slots for a single map task via
mapred.job.map.memory.mb, up to the limit specified by
mapred.cluster.max.map.memory.mb, if the scheduler supports the feature.
The value of -1 indicates that this feature is turned off.
</description>
</property>

<property>
<name>mapred.cluster.reduce.memory.mb</name>
<value>-1</value>
<description>The size, in terms of virtual memory, of a single reduce slot
in the Map-Reduce framework, used by the scheduler.
A job can ask for multiple slots for a single reduce task via
mapred.job.reduce.memory.mb, up to the limit specified by
mapred.cluster.max.reduce.memory.mb, if the scheduler supports the feature.
The value of -1 indicates that this feature is turned off.
</description>
</property>

<property>
<name>mapred.cluster.max.map.memory.mb</name>
<value>-1</value>
<description>The maximum size, in terms of virtual memory, of a single map
task launched by the Map-Reduce framework, used by the scheduler.
A job can ask for multiple slots for a single map task via
mapred.job.map.memory.mb, up to the limit specified by
mapred.cluster.max.map.memory.mb, if the scheduler supports the feature.
The value of -1 indicates that this feature is turned off.
</description>
</property>

<property>
<name>mapred.cluster.max.reduce.memory.mb</name>
<value>-1</value>
<description>The maximum size, in terms of virtual memory, of a single reduce
task launched by the Map-Reduce framework, used by the scheduler.
A job can ask for multiple slots for a single reduce task via
mapred.job.reduce.memory.mb, up to the limit specified by
mapred.cluster.max.reduce.memory.mb, if the scheduler supports the feature.
The value of -1 indicates that this feature is turned off.
</description>
</property>

<property>
<name>mapred.job.map.memory.mb</name>
<value>-1</value>
<description>The size, in terms of virtual memory, of a single map task
for the job.
A job can ask for multiple slots for a single map task, rounded up to the
next multiple of mapred.cluster.map.memory.mb and up to the limit
specified by mapred.cluster.max.map.memory.mb, if the scheduler supports
the feature.
The value of -1 indicates that this feature is turned off iff
mapred.cluster.map.memory.mb is also turned off (-1).
</description>
</property>

<property>
<name>mapred.job.reduce.memory.mb</name>
<value>-1</value>
<description>The size, in terms of virtual memory, of a single reduce task
for the job.
A job can ask for multiple slots for a single reduce task, rounded up to the
next multiple of mapred.cluster.reduce.memory.mb and up to the limit
specified by mapred.cluster.max.reduce.memory.mb, if the scheduler supports
the feature.
The value of -1 indicates that this feature is turned off iff
mapred.cluster.reduce.memory.mb is also turned off (-1).
</description>
</property>

<property>
  <name>mapred.tasktracker.map.tasks.maximum</name>
  <value>10</value>
</property>

<property>
  <name>mapred.tasktracker.reduce.tasks.maximum</name>
  <value>6</value>
</property>

</configuration>
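
To double-check which memory-related values a file like the one above actually carries, something along these lines could be used. It is only a sketch: the helper names `read_properties` and `xmx_megabytes` are made up for illustration, the embedded sample is a four-property excerpt of the attachment, and the -Xmx parser only understands m/M and g/G suffixes.

```python
import re
import xml.etree.ElementTree as ET

# Four-property excerpt of the attached mapred-site.xml.
SAMPLE = """<configuration>
<property>
<name>mapred.child.java.opts</name>
<value>-server -Xmx4096M -Djava.net.preferIPv4Stack=true</value>
</property>
<property>
<name>mapred.child.ulimit</name>
<value>6291456</value>
</property>
<property>
<name>mapred.tasktracker.map.tasks.maximum</name>
<value>10</value>
</property>
<property>
<name>mapred.tasktracker.reduce.tasks.maximum</name>
<value>6</value>
</property>
</configuration>"""


def read_properties(root):
    """Map each <property>'s <name> to its <value> text ('' if empty)."""
    props = {}
    for prop in root.iter("property"):
        name = prop.findtext("name")
        if name is not None:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def xmx_megabytes(java_opts):
    """Return the -Xmx size in MB, or None if no -Xmx<n>m / -Xmx<n>g is found."""
    match = re.search(r"-Xmx(\d+)([mMgG])", java_opts)
    if match is None:
        return None
    size = int(match.group(1))
    return size * 1024 if match.group(2) in "gG" else size


if __name__ == "__main__":
    props = read_properties(ET.fromstring(SAMPLE))
    print("child heap (MB):", xmx_megabytes(props["mapred.child.java.opts"]))
    print("child ulimit (KB):", props["mapred.child.ulimit"])
    print("map slots:", props["mapred.tasktracker.map.tasks.maximum"])
    print("reduce slots:", props["mapred.tasktracker.reduce.tasks.maximum"])
```

For a file on disk, `ET.parse(path).getroot()` can be passed to `read_properties` in place of `ET.fromstring(SAMPLE)`; the path depends on the installation.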
