[
https://issues.apache.org/jira/browse/HDFS-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
John George updated HDFS-1782:
------------------------------
Status: Patch Available (was: Open)
> FSNamesystem.startFileInternal(..) throws NullPointerException
> --------------------------------------------------------------
>
> Key: HDFS-1782
> URL: https://issues.apache.org/jira/browse/HDFS-1782
> Project: Hadoop HDFS
> Issue Type: Bug
> Affects Versions: 0.22.0
> Reporter: John George
> Assignee: John George
> Fix For: 0.22.0
>
> Attachments: HDFS-1782.patch
>
>
> I'm observing when there is one balancer running trying to run another one
> results in
> "Java.lang.NullPointerException" error. I was hoping to see message "Another
> balancer is running.
> Exiting.... Exiting ...". This is a reproducible issue.
> Details
> ========
> 1) Cluster ->elrond
> [hdfs@gsbl90568 smilli]$ hadoop version
> Hadoop 0.22.0.1102280202
> Subversion
> git://hadoopre5.corp.sk1.yahoo.com/home/y/var/builds/thread2/workspace/Cloud-HadoopCOMMON-0.22-Secondary
> -r
> c7c9a21d7289e29f0133452acf8b761e455a84b5
> Compiled by hadoopqa on Mon Feb 28 02:12:38 PST 2011
> From source with checksum 9ecbc6f17e8847a1cddca2282dbd9b31
> [hdfs@gsbl90568 smilli]$
> 2) Run first balancer
> [hdfs@gsbl90565 smilli]$ hdfs balancer
> 11/03/09 16:33:56 INFO balancer.Balancer: namenodes =
> [gsbl90565.blue.ygrid.yahoo.com/98.137.97.57:8020,
> gsbl90569.blue.ygrid.yahoo.com/98.137.97.53:8020]
> 11/03/09 16:33:56 INFO balancer.Balancer: p =
> Balancer.Parameters[BalancingPolicy.Node, threshold=10.0]
> Time Stamp Iteration# Bytes Already Moved Bytes Left To Move
> Bytes Being Moved
> 11/03/09 16:33:57 WARN conf.Configuration: mapred.task.id is deprecated.
> Instead, use mapreduce.task.attempt.id
> 11/03/09 16:33:57 INFO balancer.Balancer: Block token params received from
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Balancer will update its block keys
> every 150 minute(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Block token params received from
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Balancer will update its block keys
> every 150 minute(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node:
> /98.137.97.0/98.137.97.62:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node:
> /98.137.97.0/98.137.97.58:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node:
> /98.137.97.0/98.137.97.60:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node:
> /98.137.97.0/98.137.97.59:1004
> 11/03/09 16:33:57 INFO balancer.Balancer: 1 over-utilized:
> [Source[98.137.97.62:1004, utilization=24.152507825759344]]
> 11/03/09 16:33:57 INFO balancer.Balancer: 0 underutilized: []
> 11/03/09 16:33:57 INFO balancer.Balancer: Need to move 207.98 GB to make the
> cluster balanced.
> 11/03/09 16:33:57 INFO balancer.Balancer: Decided to move 10 GB bytes from
> 98.137.97.62:1004 to 98.137.97.58:1004
> 11/03/09 16:33:57 INFO balancer.Balancer: Will move 10 GB in this iteration
> Mar 9, 2011 4:33:57 PM 0 0 KB 207.98 GB
> 10 GB
> .
> .
> .
> 11/03/09 16:34:36 INFO balancer.Balancer: Moving block -63570336576981940
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:34:39 INFO balancer.Balancer: Moving block 2379736326585824737
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:21 INFO balancer.Balancer: Moving block 8884583953927078028
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:24 INFO balancer.Balancer: Moving block -135758138424743964
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:27 INFO balancer.Balancer: Moving block -4598153351946352185
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:33 INFO balancer.Balancer: Moving block 2966087210491094643
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:42 INFO balancer.Balancer: Moving block -5573983508500804184
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:58 INFO balancer.Balancer: Moving block -6222779741597113957
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 3) Run another balancer observe
> [hdfs@gsbl90568 smilli]$ hdfs balancer
> 11/03/09 16:34:32 INFO balancer.Balancer: namenodes =
> [gsbl90565.blue.ygrid.yahoo.com/98.137.97.57:8020,
> gsbl90569.blue.ygrid.yahoo.com/98.137.97.53:8020]
> 11/03/09 16:34:32 INFO balancer.Balancer: p =
> Balancer.Parameters[BalancingPolicy.Node, threshold=10.0]
> Time Stamp Iteration# Bytes Already Moved Bytes Left To Move
> Bytes Being Moved
> 11/03/09 16:34:33 WARN conf.Configuration: mapred.task.id is deprecated.
> Instead, use mapreduce.task.attempt.id
> 11/03/09 16:34:33 INFO balancer.Balancer: Block token params received from
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:34:33 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:34:33 INFO balancer.Balancer: Balancer will update its block keys
> every 150 minute(s)
> 11/03/09 16:34:33 INFO block.BlockTokenSecretManager: Setting block keys
> java.io.IOException: java.lang.NullPointerException
> at
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFileInternal(FSNamesystem.java:1400)
> at
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFile(FSNamesystem.java:1284)
> at
> org.apache.hadoop.hdfs.server.namenode.NameNode.create(NameNode.java:779)
> at sun.reflect.GeneratedMethodAccessor46.invoke(Unknown Source)
> at
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> at java.lang.reflect.Method.invoke(Method.java:597)
> at
> org.apache.hadoop.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:346)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1399)
> at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1395)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1094)
> at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1393)
> . Exiting ...
> Balancing took 1.366 seconds
> [hdfs@gsbl90568 smilli]$
> Pls let me know if you need additional information.
> Thanks,
> Suma
>
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira