We are running with ulimit -n at 128000.  I am pretty sure the culprit is
the thrift server: it opens up 20k threads under load and crashes all the
other servers by taking away their RAM.

Do you guys disable TCP SYN cookies as well?  In regards to iptables, what
is the best way to disable it?

-Jack

On Sat, Mar 12, 2011 at 8:24 PM, Todd Lipcon <[email protected]> wrote:

> You may also want to look at the value set for ulimit -u - it's
> unlimited on many OSes, but RHEL6 in particular sets it way too low,
> which will cause the "unable to create new native thread" error. What
> OS are you running?
>
> The conntrack error has to do with ip_conntrack, which is an iptables
> module that keeps a table of TCP connections currently open to the
> box. I'd recommend disabling iptables inside your hadoop cluster.
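>
> Something like the following on each node (a sketch - adjust for your
> distro, and make sure you really don't need the firewall rules first):
>
>   service iptables stop
>   chkconfig iptables off
>
> If you have to keep iptables, you can instead raise the conntrack
> table size, e.g. "sysctl -w net.netfilter.nf_conntrack_max=262144"
> (older kernels use net.ipv4.netfilter.ip_conntrack_max).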
>
> -Todd
>
> On Sat, Mar 12, 2011 at 3:03 PM, Jack Levin <[email protected]> wrote:
> > Awesome, thanks... This is similar to mysql max-conn setting.
> >
> > -Jack
> >
> > On Sat, Mar 12, 2011 at 11:29 AM, Stack <[email protected]> wrote:
> >
> >> I opened HBASE-3628 to expose the TThreadPoolServer options on the
> >> command-line for thrift server.
> >> St.Ack
> >>
> >> On Sat, Mar 12, 2011 at 11:20 AM, Stack <[email protected]> wrote:
> >> > Via Bryan (and J-D), by default we use the thread pool server from
> >> > Thrift (unless you choose the non-blocking option):
> >> >
> >> >   LOG.info("starting HBase ThreadPool Thrift server on " + listenAddress + ":" + Integer.toString(listenPort));
> >> >   server = new TThreadPoolServer(processor, serverTransport, transportFactory, protocolFactory);
> >> >
> >> > By default, the thread bounds are:
> >> >
> >> >   public int minWorkerThreads = 5;
> >> >   public int maxWorkerThreads = Integer.MAX_VALUE;
> >> >
> >> > ... so doing some hacking like below in our ThriftServer.java should
> >> > add in a maximum for you:
> >> >
> >> > diff --git a/src/main/java/org/apache/hadoop/hbase/thrift/ThriftServer.java b/src/main/java/org/apache/hadoop/hbase/thrift/ThriftServer.java
> >> > index 06621ab..74856af 100644
> >> > --- a/src/main/java/org/apache/hadoop/hbase/thrift/ThriftServer.java
> >> > +++ b/src/main/java/org/apache/hadoop/hbase/thrift/ThriftServer.java
> >> > @@ -69,6 +69,7 @@ import org.apache.hadoop.hbase.thrift.generated.TRegionInfo;
> >> >  import org.apache.hadoop.hbase.thrift.generated.TRowResult;
> >> >  import org.apache.hadoop.hbase.util.Bytes;
> >> >  import org.apache.thrift.TException;
> >> > +import org.apache.thrift.TProcessorFactory;
> >> >  import org.apache.thrift.protocol.TBinaryProtocol;
> >> >  import org.apache.thrift.protocol.TCompactProtocol;
> >> >  import org.apache.thrift.protocol.TProtocolFactory;
> >> > @@ -911,9 +912,25 @@ public class ThriftServer {
> >> >        } else {
> >> >          transportFactory = new TTransportFactory();
> >> >        }
> >> > -
> >> > -      LOG.info("starting HBase ThreadPool Thrift server on " +
> >> > -        listenAddress + ":" + Integer.toString(listenPort));
> >> > -      server = new TThreadPoolServer(processor, serverTransport,
> >> > -        transportFactory, protocolFactory);
> >> > +      TThreadPoolServer.Options poolServerOptions =
> >> > +        new TThreadPoolServer.Options();
> >> > +      int maxWorkerThreads = Integer.MAX_VALUE;
> >> > +      if (cmd.hasOption("maxWorkerThreads")) {
> >> > +        try {
> >> > +          maxWorkerThreads =
> >> > +            Integer.parseInt(cmd.getOptionValue("maxWorkerThreads",
> >> > +              "" + Integer.MAX_VALUE));
> >> > +        } catch (NumberFormatException e) {
> >> > +          LOG.error("Could not parse maxWorkerThreads option", e);
> >> > +          printUsageAndExit(options, -1);
> >> > +        }
> >> > +      }
> >> > +      poolServerOptions.maxWorkerThreads = maxWorkerThreads;
> >> > +      LOG.info("starting HBase ThreadPool Thrift server on " + listenAddress +
> >> > +        ":" + Integer.toString(listenPort) +
> >> > +        ", maxWorkerThreads=" + maxWorkerThreads);
> >> > +      server = new TThreadPoolServer(processor, serverTransport,
> >> > +        transportFactory, transportFactory, protocolFactory, protocolFactory,
> >> > +        poolServerOptions);
> >> >      }
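> >> >
> >> > Untested sketch - note the diff above doesn't show registering the
> >> > option, so you'd also need something like
> >> > options.addOption("maxWorkerThreads", true, "max thrift worker threads")
> >> > before parsing. With that in place you'd start it with:
> >> >
> >> >   ./bin/hbase thrift -maxWorkerThreads 1000 start
> >> >
> >> > so the pool tops out at 1000 threads instead of Integer.MAX_VALUE.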
> >> >
> >> > Looks like other useful options to set in there.
> >> > St.Ack
> >> >
> >> >
> >> > On Sat, Mar 12, 2011 at 10:47 AM, Stack <[email protected]> wrote:
> >> >> I don't see any bounding in the thrift code.  Asking Bryan....
> >> >> St.Ack
> >> >>
> >> >>> On Sat, Mar 12, 2011 at 10:04 AM, Jack Levin <[email protected]> wrote:
> >> >>> So our problem is this: when we restart a region server, or it goes
> >> >>> down, HBase slows down. Meanwhile our PHP front-end app keeps sending
> >> >>> very-high-frequency Thrift calls, so we spawn 20000+ threads on the
> >> >>> Thrift server, which destroys all the memory on the boxes, causes the
> >> >>> DNs to shut down, and crashes everything else.
> >> >>>
> >> >>> Is there a way to put a thread limiter on Thrift?  Maybe 1000 threads max?
> >> >>>
> >> >>> -Jack
> >> >>>
> >> >>> On Sat, Mar 12, 2011 at 3:31 AM, Suraj Varma <[email protected]> wrote:
> >> >>>
> >> >>>> >> to:java.lang.OutOfMemoryError: unable to create new native thread
> >> >>>>
> >> >>>> This indicates that you are oversubscribed on your RAM to the extent
> >> >>>> that the JVM doesn't have any space left to create native threads
> >> >>>> (which are allocated outside of the JVM heap).
> >> >>>>
> >> >>>> You may actually have to _reduce_ your heap sizes to allow more space
> >> >>>> for native threads (do an inventory of all the JVM heaps on the box
> >> >>>> and don't let the total go over about 75% of available RAM).
> >> >>>> Another option is to use the -Xss JVM arg to reduce the per-thread
> >> >>>> stack size - set it to 512k or 256k (you may have to experiment/perf
> >> >>>> test a bit to see what the optimum size is).
> >> >>>> Or ... get more RAM ...
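> >> >>>>
> >> >>>> For example (a sketch - assumes stock hadoop-env.sh / hbase-env.sh;
> >> >>>> adjust the variable names to your setup):
> >> >>>>
> >> >>>>   # hadoop-env.sh: smaller stacks for the datanode's many threads
> >> >>>>   export HADOOP_DATANODE_OPTS="-Xss256k $HADOOP_DATANODE_OPTS"
> >> >>>>
> >> >>>>   # hbase-env.sh: same idea for the HBase daemons
> >> >>>>   export HBASE_OPTS="$HBASE_OPTS -Xss256k"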
> >> >>>>
> >> >>>> --Suraj
> >> >>>>
> >> >>>> > On Fri, Mar 11, 2011 at 8:11 PM, Jack Levin <[email protected]> wrote:
> >> >>>> > I am noticing following errors also:
> >> >>>> >
> >> >>>> > 2011-03-11 17:52:00,376 ERROR org.apache.hadoop.hdfs.server.datanode.DataNode:
> >> >>>> > DatanodeRegistration(10.103.7.3:50010, storageID=DS-824332190-10.103.7.3-50010-1290043658438,
> >> >>>> > infoPort=50075, ipcPort=50020):DataXceiveServer: Exiting due
> >> >>>> > to:java.lang.OutOfMemoryError: unable to create new native thread
> >> >>>> >        at java.lang.Thread.start0(Native Method)
> >> >>>> >        at java.lang.Thread.start(Thread.java:597)
> >> >>>> >        at org.apache.hadoop.hdfs.server.datanode.DataXceiverServer.run(DataXceiverServer.java:132)
> >> >>>> >        at java.lang.Thread.run(Thread.java:619)
> >> >>>> >
> >> >>>> >
> >> >>>> > and this:
> >> >>>> >
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > net_ratelimit: 10 callbacks suppressed
> >> >>>> > nf_conntrack: table full, dropping packet.
> >> >>>> > possible SYN flooding on port 9090. Sending cookies.
> >> >>>> >
> >> >>>> > This seems like a network stack issue?
> >> >>>> >
> >> >>>> > So, does the datanode need a higher heap than 1GB?  Or did we
> >> >>>> > possibly run out of RAM for other reasons?
> >> >>>> >
> >> >>>> > -Jack
> >> >>>> >
> >> >>>> > On Thu, Mar 10, 2011 at 1:29 PM, Ryan Rawson <[email protected]> wrote:
> >> >>>> >
> >> >>>> >> Looks like a datanode went down.  InterruptedException is how Java
> >> >>>> >> interrupts IO in threads; it's similar to the EINTR errno.  That
> >> >>>> >> means the actual source of the abort is higher up...
> >> >>>> >>
> >> >>>> >> So back to how InterruptedException works... at some point a
> >> >>>> >> thread in the JVM decides that the VM should abort.  So it calls
> >> >>>> >> thread.interrupt() on all the threads it knows/cares about to
> >> >>>> >> interrupt their IO.  That is what you are seeing in the logs.  The
> >> >>>> >> root cause lies above, I think.
> >> >>>> >>
> >> >>>> >> Look for the first "Exception" string or any FATAL or ERROR
> >> >>>> >> strings in the datanode logfiles.
> >> >>>> >>
> >> >>>> >> -ryan
> >> >>>> >>
> >> >>>> >> On Thu, Mar 10, 2011 at 1:03 PM, Jack Levin <[email protected]> wrote:
> >> >>>> >> > http://pastebin.com/ZmsyvcVc  Here is the regionserver log; they
> >> >>>> >> > all have similar stuff.
> >> >>>> >> >
> >> >>>> >> > On Thu, Mar 10, 2011 at 11:34 AM, Stack <[email protected]> wrote:
> >> >>>> >> >
> >> >>>> >> >> What's in the regionserver logs?  Please put up regionserver
> >> >>>> >> >> and datanode excerpts.
> >> >>>> >> >> Thanks Jack,
> >> >>>> >> >> St.Ack
> >> >>>> >> >>
> >> >>>> >> >> > On Thu, Mar 10, 2011 at 10:31 AM, Jack Levin <[email protected]> wrote:
> >> >>>> >> >> > All was well, until this happen:
> >> >>>> >> >> >
> >> >>>> >> >> > http://pastebin.com/iM1niwrS
> >> >>>> >> >> >
> >> >>>> >> >> > and all regionservers went down. Is this the xciever issue?
> >> >>>> >> >> >
> >> >>>> >> >> > <property>
> >> >>>> >> >> > <name>dfs.datanode.max.xcievers</name>
> >> >>>> >> >> > <value>12047</value>
> >> >>>> >> >> > </property>
> >> >>>> >> >> >
> >> >>>> >> >> > this is what I have, should I set it higher?
> >> >>>> >> >> >
> >> >>>> >> >> > -Jack
> >> >>>> >> >> >
> >> >>>> >> >>
> >> >>>> >> >
> >> >>>> >>
> >> >>>> >
> >> >>>>
> >> >>>
> >> >>
> >> >
> >>
> >
>
>
>
> --
> Todd Lipcon
> Software Engineer, Cloudera
>
