[ 
https://issues.apache.org/jira/browse/HDFS-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John George updated HDFS-1782:
------------------------------

    Status: Patch Available  (was: Open)

> FSNamesystem.startFileInternal(..) throws NullPointerException
> --------------------------------------------------------------
>
>                 Key: HDFS-1782
>                 URL: https://issues.apache.org/jira/browse/HDFS-1782
>             Project: Hadoop HDFS
>          Issue Type: Bug
>    Affects Versions: 0.22.0
>            Reporter: John George
>            Assignee: John George
>             Fix For: 0.22.0
>
>         Attachments: HDFS-1782.patch
>
>
> I'm observing when there is one balancer running trying to run another one 
> results in
> "Java.lang.NullPointerException" error. I was hoping to see message "Another 
> balancer is running. 
> Exiting....  Exiting ...". This is a reproducible issue.
> Details
> ========
> 1) Cluster ->elrond
> [hdfs@gsbl90568 smilli]$ hadoop version
> Hadoop 0.22.0.1102280202
> Subversion 
> git://hadoopre5.corp.sk1.yahoo.com/home/y/var/builds/thread2/workspace/Cloud-HadoopCOMMON-0.22-Secondary
>  -r
> c7c9a21d7289e29f0133452acf8b761e455a84b5
> Compiled by hadoopqa on Mon Feb 28 02:12:38 PST 2011
> From source with checksum 9ecbc6f17e8847a1cddca2282dbd9b31
> [hdfs@gsbl90568 smilli]$
> 2) Run first balancer
> [hdfs@gsbl90565 smilli]$ hdfs balancer
> 11/03/09 16:33:56 INFO balancer.Balancer: namenodes = 
> [gsbl90565.blue.ygrid.yahoo.com/98.137.97.57:8020,
> gsbl90569.blue.ygrid.yahoo.com/98.137.97.53:8020]
> 11/03/09 16:33:56 INFO balancer.Balancer: p         = 
> Balancer.Parameters[BalancingPolicy.Node, threshold=10.0]
> Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To Move  
> Bytes Being Moved
> 11/03/09 16:33:57 WARN conf.Configuration: mapred.task.id is deprecated. 
> Instead, use mapreduce.task.attempt.id
> 11/03/09 16:33:57 INFO balancer.Balancer: Block token params received from 
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Balancer will update its block keys 
> every 150 minute(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Block token params received from 
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO balancer.Balancer: Balancer will update its block keys 
> every 150 minute(s)
> 11/03/09 16:33:57 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node: 
> /98.137.97.0/98.137.97.62:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node: 
> /98.137.97.0/98.137.97.58:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node: 
> /98.137.97.0/98.137.97.60:1004
> 11/03/09 16:33:57 INFO net.NetworkTopology: Adding a new node: 
> /98.137.97.0/98.137.97.59:1004
> 11/03/09 16:33:57 INFO balancer.Balancer: 1 over-utilized: 
> [Source[98.137.97.62:1004, utilization=24.152507825759344]]
> 11/03/09 16:33:57 INFO balancer.Balancer: 0 underutilized: []
> 11/03/09 16:33:57 INFO balancer.Balancer: Need to move 207.98 GB to make the 
> cluster balanced.
> 11/03/09 16:33:57 INFO balancer.Balancer: Decided to move 10 GB bytes from 
> 98.137.97.62:1004 to 98.137.97.58:1004
> 11/03/09 16:33:57 INFO balancer.Balancer: Will move 10 GB in this iteration
> Mar 9, 2011 4:33:57 PM            0                 0 KB           207.98 GB  
>             10 GB
> .
> .
> .
> 11/03/09 16:34:36 INFO balancer.Balancer: Moving block -63570336576981940 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:34:39 INFO balancer.Balancer: Moving block 2379736326585824737 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:21 INFO balancer.Balancer: Moving block 8884583953927078028 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:24 INFO balancer.Balancer: Moving block -135758138424743964 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:27 INFO balancer.Balancer: Moving block -4598153351946352185 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:33 INFO balancer.Balancer: Moving block 2966087210491094643 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:42 INFO balancer.Balancer: Moving block -5573983508500804184 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 11/03/09 16:35:58 INFO balancer.Balancer: Moving block -6222779741597113957 
> from 98.137.97.62:1004 to 98.137.97.59:1004
> through 98.137.97.62:1004 is succeeded.
> 3) Run another balancer observe
> [hdfs@gsbl90568 smilli]$ hdfs balancer
> 11/03/09 16:34:32 INFO balancer.Balancer: namenodes = 
> [gsbl90565.blue.ygrid.yahoo.com/98.137.97.57:8020,
> gsbl90569.blue.ygrid.yahoo.com/98.137.97.53:8020]
> 11/03/09 16:34:32 INFO balancer.Balancer: p         = 
> Balancer.Parameters[BalancingPolicy.Node, threshold=10.0]
> Time Stamp               Iteration#  Bytes Already Moved  Bytes Left To Move  
> Bytes Being Moved
> 11/03/09 16:34:33 WARN conf.Configuration: mapred.task.id is deprecated. 
> Instead, use mapreduce.task.attempt.id
> 11/03/09 16:34:33 INFO balancer.Balancer: Block token params received from 
> NN: keyUpdateInterval=600 min(s),
> tokenLifetime=600 min(s)
> 11/03/09 16:34:33 INFO block.BlockTokenSecretManager: Setting block keys
> 11/03/09 16:34:33 INFO balancer.Balancer: Balancer will update its block keys 
> every 150 minute(s)
> 11/03/09 16:34:33 INFO block.BlockTokenSecretManager: Setting block keys
> java.io.IOException: java.lang.NullPointerException
>         at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFileInternal(FSNamesystem.java:1400)
>         at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.startFile(FSNamesystem.java:1284)
>         at 
> org.apache.hadoop.hdfs.server.namenode.NameNode.create(NameNode.java:779)
>         at sun.reflect.GeneratedMethodAccessor46.invoke(Unknown Source)
>         at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at 
> org.apache.hadoop.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:346)
>         at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1399)
>         at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1395)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:396)
>         at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1094)
>         at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1393)
> .  Exiting ...
> Balancing took 1.366 seconds
> [hdfs@gsbl90568 smilli]$
> Pls let me know if you need additional information.
> Thanks,
> Suma
>               

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to