What's the value of 'zookeeper.session.timeout' ?

Maybe you can tune it higher.

On Sat, Jan 22, 2011 at 3:13 AM, Wojciech Langiewicz
<[email protected]>wrote:

> Hi,
> I have re-run test with 2 or more mappers and looked into logs more
> closely: The mapreduce job has finished correctly, some map attempts are
> killed, but eventually all of mappers finish. And there's no rule which
> mappers on which servers fail (re-run tests multiple times).
>
> So I suspect that this is not classpath problem, maybe there are settings
> that limit number of connections, because before MasterNotRunning I get this
> in logs:
>
> 2011-01-22 12:06:43,411 INFO org.apache.zookeeper.ZooKeeper: Initiating
> client connection, connectString=hd-master:2181 sessionTimeout=60000
> watcher=org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper@3b5e234c
> 2011-01-22 12:06:43,483 INFO org.apache.zookeeper.ClientCnxn: Opening
> socket connection to server hd-master/10.6.75.212:2181
> 2011-01-22 12:06:43,484 INFO org.apache.zookeeper.ClientCnxn: Socket
> connection established to hd-master/10.6.75.212:2181, initiating session
> 2011-01-22 12:06:43,488 INFO org.apache.zookeeper.ClientCnxn: Unable to
> read additional data from server sessionid 0x0, likely server has closed
> socket, closing socket connection and attempting reconnect
> 2011-01-22 12:06:43,605 INFO
> org.apache.hadoop.hbase.client.HConnectionManager$TableServers: getMaster
> attempt 0 of 10 failed; retrying after sleep of 1000
> java.io.IOException:
> org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
>        at
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:481)
>        at
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readMasterAddressOrThrow(ZooKeeperWrapper.java:377)
>        at
> org.apache.hadoop.hbase.client.HConnectionManager$TableServers.getMaster(HConnectionManager.java:381)
>
>        at
> org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:78)
>        at
> org.apache.hadoop.hbase.PerformanceEvaluation$Test.testSetup(PerformanceEvaluation.java:745)
>        at
> org.apache.hadoop.hbase.PerformanceEvaluation$Test.test(PerformanceEvaluation.java:764)
>        at
> org.apache.hadoop.hbase.PerformanceEvaluation.runOneClient(PerformanceEvaluation.java:1097)
>        at
> org.apache.hadoop.hbase.PerformanceEvaluation$EvaluationMapTask.map(PerformanceEvaluation.java:446)
>        at
> org.apache.hadoop.hbase.PerformanceEvaluation$EvaluationMapTask.map(PerformanceEvaluation.java:399)
>        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
>        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
>        at org.apache.hadoop.mapred.Child$4.run(Child.java:217)
>        at java.security.AccessController.doPrivileged(Native Method)
>        at javax.security.auth.Subject.doAs(Subject.java:396)
>        at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
>        at org.apache.hadoop.mapred.Child.main(Child.java:211)
> Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException:
> KeeperErrorCode = ConnectionLoss for /hbase/master
>        at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:90)
>        at
> org.apache.zookeeper.KeeperException.create(KeeperException.java:42)
>        at org.apache.zookeeper.ZooKeeper.getData(ZooKeeper.java:921)
>        at
> org.apache.hadoop.hbase.zookeeper.ZooKeeperWrapper.readAddressOrThrow(ZooKeeperWrapper.java:477)
>        ... 16 more
>
>
> On 22.01.2011 01:36, Stack wrote:
>
>> Then its odd that PE fails.  Can you figure difference between the two
>> environments?  Perhaps your MR jobs are fat jars that include the conf
>> and all dependencies whereas PE is dump and expects the dependencies
>> and conf on CLASSPATH?
>>
>> St.Ack
>>
>> On Fri, Jan 21, 2011 at 11:56 AM, Wojciech Langiewicz
>> <[email protected]>  wrote:
>>
>>> Hi,
>>> I have other mapreduce tasks that are running on this cluster and using
>>> HBase and they are working correctly. All my servers have the same
>>> configuration.
>>>
>>> --
>>> Wojciech Langiewicz
>>>
>>> On 21.01.2011 19:20, Stack wrote:
>>>
>>>>
>>>> When clients are>    1, then PE tries to run a mapreduce job to host the
>>>> loading clients.
>>>>
>>>> Is it possible that the client out in MR task is trying to connect to
>>>> wrong location?  Perhaps the HBase conf dir is not available to the
>>>> running task?  Have you seen
>>>>
>>>>
>>>> http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/package-summary.html#package_description
>>>> ?
>>>>  Perhaps this will help?
>>>>
>>>> St.Ack
>>>>
>>>> On Fri, Jan 21, 2011 at 8:12 AM, Wojciech Langiewicz
>>>> <[email protected]>    wrote:
>>>>
>>>>>
>>>>> Hello
>>>>> I have problem with running HBase performance tests from
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation package. I'm using
>>>>> version
>>>>> from CDH3,
>>>>> Tests are ok when argument nclients is 1, but in case of greater
>>>>> number,
>>>>> after mappers reaching 100% I get this exception (I didn't test all
>>>>> tests,
>>>>> but all of tested by me failed, 'scan' and 'randomRead' fail for sure):
>>>>>
>>>>> 11/01/21 17:05:19 INFO mapred.JobClient: Task Id :
>>>>> attempt_201101211442_0005_m_000000_0, Status : FAILED
>>>>> org.apache.hadoop.hbase.MasterNotRunningException
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.client.HConnectionManager$TableServers.getMaster(HConnectionManager.java:416)
>>>>>        at
>>>>> org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:78)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation$Test.testSetup(PerformanceEvaluation.java:745)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation$Test.test(PerformanceEvaluation.java:764)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation.runOneClient(PerformanceEvaluation.java:1097)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation$EvaluationMapTask.map(PerformanceEvaluation.java:446)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.hbase.PerformanceEvaluation$EvaluationMapTask.map(PerformanceEvaluation.java:399)
>>>>>        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
>>>>>        at
>>>>> org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639)
>>>>>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
>>>>>        at org.apache.hadoop.mapred.Child$4.run(Child.java:217)
>>>>>        at java.security.AccessController.doPrivileged(Native Method)
>>>>>        at javax.security.auth.Subject.doAs(Subject.java:396)
>>>>>        at
>>>>>
>>>>>
>>>>> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
>>>>>        at org.apache.hadoop.mapred.Child.main(Child.java:211)
>>>>>
>>>>> Do you have any ideas how to solve this?
>>>>>
>>>>> --
>>>>> Wojciech Langiewicz
>>>>>
>>>>>
>>>
>>>
>

Reply via email to