Re: Bind hang out when named reach to 5-600 Mb
At Tue, 20 Jul 2010 01:18:54 -0700 (PDT), khanh rua duonghoahoc_k4...@yahoo.com wrote:

> By "hang" I mean that bind is still running but cannot respond to
> queries from users.

I suspect it still responds to queries that don't require recursion, e.g. version.bind txt ch. Is that correct?

---
JINMEI, Tatuya
Internet Systems Consortium, Inc.
___
bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users
Re: Bind hang out when named reach to 5-600 Mb
At Thu, 8 Jul 2010 02:30:25 -0700 (PDT), khanh rua duonghoahoc_k4...@yahoo.com wrote:

> I installed bind as a caching server on Solaris 10, on a Sun SPARC T5140.
> The problem is that bind always hangs when named reaches 500-600 MB
> (checked with 'prstat'). I have several servers and all of them have
> this problem, even when I install bind in a zone or try a 64-bit version.
> The T5140 is a powerful server, but bind can't make use of its power.
> I'm new to bind, so I have only tried a few other things, and none of
> them helped. What should I do to track down this problem?

As others asked, please clarify a bit more what "hang" means. I'd be particularly interested in:

- whether it responds to rndc (e.g. rndc status)
- whether it responds to queries for built-in data, such as version.bind/TXT/CH (try 'dig @server_address version.bind txt ch' from the local host)
- if you enable XML-based statistics, whether it responds to statistics requests over HTTP. If it does, showing the XML statistics while the problem is happening would be useful.

Please also make sure your kernel doesn't have this problem:
http://bugs.opensolaris.org/view_bug.do?bug_id=6724237

If you are not sure, and if the query load is not too heavy (e.g. up to 2000 qps or so), you may also want to try rebuilding named with --disable-devpoll

---
JINMEI, Tatuya
Internet Systems Consortium, Inc.
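The three checks listed above can be scripted so that they are quick to repeat while named is in the hung state. This is only a sketch: the `probe` and `run_checks` helpers are hypothetical, the statistics URL assumes a `statistics-channels` listener on 127.0.0.1 port 8053 (an assumed address, not one given in this thread), and `curl` may not be installed on a stock Solaris 10 host.

```shell
#!/bin/sh
# Sketch: repeat the three "is named alive?" checks.

# probe LABEL COMMAND...: run COMMAND quietly, report whether it succeeded.
probe() {
    label=$1
    shift
    if "$@" >/dev/null 2>&1; then
        echo "$label: ok"
    else
        echo "$label: NO RESPONSE"
    fi
}

run_checks() {
    server=$1
    # Control channel: does named still answer rndc?
    probe "rndc status" rndc status
    # Built-in data: answered without recursion or a cache lookup.
    probe "version.bind" dig @"$server" version.bind txt ch +time=2 +tries=1
    # XML statistics over HTTP; host and port here are assumptions.
    probe "statistics channel" curl -s -m 5 "http://127.0.0.1:8053/"
}

# Run only when a server address is given, e.g.: sh check_named.sh 127.0.0.1
if [ $# -gt 0 ]; then
    run_checks "$1"
fi
```

If the first two probes report ok while ordinary lookups time out, that would point at recursion or the cache rather than at the whole process being stuck.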
Re: Bind hang out when named reach to 5-600 Mb
2010/7/8 khanh rua duonghoahoc_k4...@yahoo.com:

> Hi, I installed bind as a caching server on Solaris 10, on a Sun SPARC
> T5140. The problem is that bind always hangs when named reaches
> 500-600 MB (checked with 'prstat'). I have several servers and all of
> them have this problem, even when I install bind in a zone or try a
> 64-bit version. The T5140 is a powerful server, but bind can't make use
> of its power. I'm new to bind, so I have only tried a few other things,
> and none of them helped. What should I do to track down this problem?

Is this specific to the T5140? Which server type did you use before?

Some time ago I ran some simple benchmarks (dnsperf / queryperf) on a T1000 and a T5240, and the results were bad.

My numbers (bind caching server):

- Sun X2100: can serve 7000 queries/s with 0.6-1 ms response time
- Sun T1000: can serve 600 queries/s with 10-15 ms response time (above 600 queries/s, response time jumps to over 100 ms)

You should run some benchmarks (and make heavy use of rndc stats) before choosing a new architecture.

--
Fabien
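A benchmark run of the kind described above could look roughly like this. It is a sketch under assumptions: the `make_queries` and `run_benchmark` helpers and the file name `queries.txt` are made up for illustration, the server address and name list are yours to supply, and the `dnsperf` options shown (`-s`, `-d`, `-l`) should be checked against the version you have installed.

```shell
#!/bin/sh
# make_queries: turn host names on stdin (one per line) into
# dnsperf/queryperf input lines of the form "name TYPE" (always type A here).
make_queries() {
    while read -r name; do
        if [ -n "$name" ]; then
            echo "$name A"
        fi
    done
}

# run_benchmark SERVER NAMEFILE: 60-second dnsperf run, then dump counters.
run_benchmark() {
    server=$1
    names=$2
    make_queries < "$names" > queries.txt
    dnsperf -s "$server" -d queries.txt -l 60
    # Ask named to append its counters to the statistics file
    # (named.stats in the server's working directory unless configured otherwise).
    rndc stats
}

# Example: sh bench.sh 192.0.2.1 names.txt   (both arguments are placeholders)
if [ $# -eq 2 ]; then
    run_benchmark "$1" "$2"
fi
```

For a caching server the mix of names matters: a list that is mostly cached after the first pass measures something different from one that forces recursion on every query.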
Re: Bind hang out when named reach to 5-600 Mb
Can you tell me how to do this?

> If you enable query log, you might be able to see if it's actually
> serving queries at that time.

By "hang" I mean that the named process is still running but cannot respond to any lookup query. CPU usage stays low, around 5-6% or lower. It hangs both single-threaded and multithreaded.

    datasize is m
    max-cache-size is 2000m;

These are the versions of bind I tried:

    bash-3.00# file /usr/sbin/named
    /usr/sbin/named:ELF 32-bit MSB executable SPARC Version 1, dynamically linked, stripped
    bash-3.00# which named
    /usr/sbin/named
    bash-3.00# file /opt/SUNWbind9.6/sbin/named
    /opt/SUNWbind9.6/sbin/named:ELF 64-bit MSB executable SPARCV9 Version 1, dynamically linked, not stripped
    bash-3.00# file /usr/sbin/bind
    bind-9.7.1b1/ bind9.7.0P1/
    bash-3.00# file /usr/sbin/bind-9.7.1b1/named
    /usr/sbin/bind-9.7.1b1/named: ELF 32-bit MSB executable SPARC Version 1, dynamically linked, not stripped

In my circumstances I can try installing anything, as long as it fixes this bug. I saw on SunSolve that there is a patch for bind, 119783-15. I don't know whether it would help with this bug. Maybe I will check with another version of bind. Hope this helps.

From: Fajar A. Nugraha fa...@fajar.net
To: khanh rua duonghoahoc_k4...@yahoo.com
Cc: bind-users@lists.isc.org
Sent: Thu, July 8, 2010 5:10:02 PM
Subject: Re: Bind hang out when named reach to 5-600 Mb

On Thu, Jul 8, 2010 at 4:30 PM, khanh rua duonghoahoc_k4...@yahoo.com wrote:

> > Hi, I installed bind as a caching server on Solaris 10, on a Sun SPARC
> > T5140. The problem is that bind always hangs when named reaches
> > 500-600 MB (checked with 'prstat').
>
> How did you determine that it hangs? If you enable query log, you might
> be able to see if it's actually serving queries at that time. Also, how
> are the CPU and disk usage at that time? (I'm trying to see whether CPU
> or disk is your bottleneck.) My guess is that:
>
> - one of your CPU threads is at 100% usage
> - named is already busy serving queries, so the new query you issued
>   does not get processed in a timely manner.
>
> > I have several servers and all of them have this problem, even when I
> > install bind in a zone or try a 64-bit version. The T5140 is a powerful
> > server, but bind can't make use of its power.
>
> IMHO, it's not really a named-specific problem. The thing is that
> Sun/Oracle T-series processors don't perform well with single-threaded
> loads. And (last time I checked, anyway) even though bind can make use
> of multiple processors/threads, single-thread performance still matters
> a lot. So you'd probably get much better performance running named on,
> say, a generic x86 server/PC with an Intel Xeon/Core i-series.
>
> --
> Fajar
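On the question of how to enable the query log: in BIND 9 releases of this period, `rndc querylog` toggles query logging at run time, and `rndc status` reports whether it is on. The entries go to the `queries` logging category, which normally ends up in syslog unless named.conf sends it elsewhere. A rough sketch follows. The log file path is whatever your logging setup uses, `count_queries` and `watch_queries` are hypothetical helpers, and the exact log line format varies between versions.

```shell
#!/bin/sh
# count_queries FILE: number of logged client queries in FILE.
# Assumes every query log entry contains the text "query:", as in
#   client 192.0.2.10#5678: query: example.com IN A +
count_queries() {
    grep -c 'query:' "$1"
}

# watch_queries FILE: print a timestamp and the running query count every
# 10 seconds. A count that stops growing while clients are still asking
# suggests named is no longer picking up queries.
watch_queries() {
    while :; do
        echo "$(date '+%H:%M:%S') $(count_queries "$1")"
        sleep 10
    done
}

# Example: sh qlog.sh /var/adm/messages   (path is an assumption)
if [ $# -gt 0 ]; then
    rndc querylog    # toggles logging; confirm the state with 'rndc status'
    watch_queries "$1"
fi
```

Query logging adds load of its own, so it is probably best switched off again (a second `rndc querylog`) once the hang has been observed.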
Re: Bind hang out when named reach to 5-600 Mb
On 07/09/10 07:36, khanh rua wrote:

> Can you tell me how to do this?
>
> > If you enable query log, you might be able to see if it's actually
> > serving queries at that time.
>
> By "hang" I mean that the named process is still running but cannot
> respond to any lookup query. CPU usage stays low, around 5-6% or lower.
> It hangs both single-threaded and multithreaded.
>
>     datasize is m
>     max-cache-size is 2000m;
>
> These are the versions of bind I tried:
>
>     bash-3.00# file /usr/sbin/named
>     /usr/sbin/named:ELF 32-bit MSB executable SPARC Version 1, dynamically linked, stripped
>     bash-3.00# which named
>     /usr/sbin/named
>     bash-3.00# file /opt/SUNWbind9.6/sbin/named
>     /opt/SUNWbind9.6/sbin/named:ELF 64-bit MSB executable SPARCV9 Version 1, dynamically linked, not stripped
>     bash-3.00# file /usr/sbin/bind
>     bind-9.7.1b1/ bind9.7.0P1/
>     bash-3.00# file /usr/sbin/bind-9.7.1b1/named
>     /usr/sbin/bind-9.7.1b1/named: ELF 32-bit MSB executable SPARC Version 1, dynamically linked, not stripped

Running 'named -V' is the best way to show the version and configure options.

> In my circumstances I can try installing anything, as long as it fixes
> this bug. I saw on SunSolve that there is a patch for bind, 119783-15.
> I don't know whether it would help with this bug.

No, that patch provides BIND 9.6.1-P3 with no specific changes.

Could you collect some lock statistics using plockstat(1M)? i.e.:

    # plockstat -e 300 -p `pgrep ^named`

The -e 300 causes it to collect statistics for 5 minutes. Experiment a little and feed back what is reported.

> Maybe I will check with another version of bind. Hope this helps.

--
--Stacey
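The plockstat run and the 'named -V' hint can be wrapped in a small script so that each run is saved for posting back to the list. This is a sketch only: `first_pid`, `collect_locks` and the output file names are made up for illustration. Because a pattern such as `^named` can match more than one process, the sketch uses an exact match and keeps only the first PID. plockstat is built on DTrace, so it generally needs to be run as root or with DTrace privileges.

```shell
#!/bin/sh
# first_pid: print only the first PID from a list on stdin.
first_pid() {
    sed -n '1p'
}

# collect_locks SECONDS: save build information and lock statistics for the
# running named into files in the current directory.
collect_locks() {
    secs=$1
    pid=$(pgrep -x named | first_pid)
    if [ -z "$pid" ]; then
        echo "named is not running" >&2
        return 1
    fi
    # Use the full path here if the running named is not the first in PATH.
    named -V > "named-info.$pid.txt" 2>&1
    plockstat -e "$secs" -p "$pid" > "plockstat.$pid.txt" 2>&1
}

# Example: sh locks.sh 300   (collect for five minutes)
if [ $# -gt 0 ]; then
    collect_locks "$1"
fi
```

Since several named binaries are installed on the machine in question, it is worth checking that the `named -V` output really belongs to the binary that is running.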
Re: Bind hang out when named reach to 5-600 Mb
On Thu, Jul 8, 2010 at 4:30 PM, khanh rua duonghoahoc_k4...@yahoo.com wrote:

> Hi, I installed bind as a caching server on Solaris 10, on a Sun SPARC
> T5140. The problem is that bind always hangs when named reaches
> 500-600 MB (checked with 'prstat').

How did you determine that it hangs? If you enable query log, you might be able to see if it's actually serving queries at that time. Also, how are the CPU and disk usage at that time? (I'm trying to see whether CPU or disk is your bottleneck.) My guess is that:

- one of your CPU threads is at 100% usage
- named is already busy serving queries, so the new query you issued does not get processed in a timely manner.

> I have several servers and all of them have this problem, even when I
> install bind in a zone or try a 64-bit version. The T5140 is a powerful
> server, but bind can't make use of its power.

IMHO, it's not really a named-specific problem. The thing is that Sun/Oracle T-series processors don't perform well with single-threaded loads. And (last time I checked, anyway) even though bind can make use of multiple processors/threads, single-thread performance still matters a lot. So you'd probably get much better performance running named on, say, a generic x86 server/PC with an Intel Xeon/Core i-series.

--
Fajar
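The guess that one thread is pinned at 100% can be checked per LWP with Solaris `prstat -mL`. On a machine with many hardware threads, an overall CPU figure of a few percent can still hide a single saturated thread. The filter below is a sketch: `busy_lwps` is a hypothetical helper, and it assumes the usual `prstat -mL` column order, in which the third and fourth columns are the USR and SYS percentages and the last column is PROCESS/LWPID.

```shell
#!/bin/sh
# busy_lwps LIMIT: read 'prstat -mL' output on stdin and print PROCESS/LWPID
# for every thread whose USR+SYS time is at least LIMIT percent.
# Header and "Total:" lines are skipped because their first field is not a PID.
# On Solaris 10 use nawk or /usr/xpg4/bin/awk; the old /usr/bin/awk lacks -v.
busy_lwps() {
    awk -v limit="$1" '$1 ~ /^[0-9]+$/ && ($3 + $4) >= limit { print $NF }'
}

# Example: sh lwps.sh 1234   (the PID of named; 1234 is a placeholder)
if [ $# -gt 0 ]; then
    # One 5-second sample of all LWPs of the given process.
    prstat -mL -p "$1" 5 1 | busy_lwps 90
fi
```

No output means no thread of that process was above the limit during the sample, which would argue against the single-busy-thread explanation.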
Re: Bind hang out when named reach to 5-600 Mb
khanh rua wrote:

> Hi, I installed bind as a caching server on Solaris 10, on a Sun SPARC
> T5140. The problem is that bind always hangs when named reaches
> 500-600 MB (checked with 'prstat'). I have several servers and all of
> them have this problem, even when I install bind in a zone or try a
> 64-bit version. The T5140 is a powerful server, but bind can't make use
> of its power. I'm new to bind, so I have only tried a few other things,
> and none of them helped. What should I do to track down this problem?
>
> Thanks in advance,
> Duc Tien.

What version of BIND is this?

Are you running multithreaded? (If you are, how many worker threads? The logs when named starts should tell you this.)

When you say 'hang', can you give more detail? For example:

- Is the named process still running?
- Is it consuming CPU or sitting motionless (what does top say)?
- Is it responding to queries (try dig, with queries that you'd expect to be in cache as well as ones that probably aren't)?
- Is it responding to rndc (what do you get from rndc status)?

What sort of query rate is the nameserver receiving?

Have you configured max-cache-size in named.conf?

Kind regards,
Cathy