Re: High memory consumption in bind 9.18.2

2022-07-25 Thread Gregory Sloop
n,
>>>>>>> > >
>>>>>>> > > Gathering the output of named statschannel should be good enough 
>>>>>>> > > for initial assessment (json please).
>>>>>>> > >
>>>>>>> > > For 9.18, make sure the jemalloc is being used at runtime.
>>>>>>> > 
>>>>>>> > Here are commands you asked for:
>>>>>>> > 
>>>>>>> > 1. when running ./configure, make sure the output at the end has this:
>>>>>>> > 
>>>>>>> > Configuration summary:
>>>>>>> > ---
>>>>>>> > Optional features enabled:
>>>>>>> >      Memory allocator: jemalloc
>>>>>>> > 
>>>>>>> > 
>>>>>>> > 2. Then, configure statistics channel in named.conf like this:
>>>>>>> > 
>>>>>>> > statistics-channels {
>>>>>>> >         inet 127.0.0.1 port 8080;
>>>>>>> > };
>>>>>>> > 
>>>>>>> > 
>>>>>>> > 3. With that in place you can grab stats from this URL:
>>>>>>> > http://127.0.0.1:8080/json/v1
>>>>>>> > 
>>>>>>> > Configuration for v9.16 is the same, just skip the jemalloc part.
>>>>>>> > 
>>>>>>> > 4. Bonus points for grabbing /proc//statm content at the same 
>>>>>>> > time
>>>>>>> > as content of the JSON stats endpoint (if you are on Linux).
>>>>>>> > 
>>>>>>> > I hope it helps.
>>>>>>> > Petr Špaček
>>>>>>> > 
>>>>>>> > 
>>>>>>> > 
>>>>>>> > >
>>>>>>> > > Ondrej
>>>>>>> > > --
>>>>>>> > > Ondřej Surý — ISC (He/Him)
>>>>>>> > >
>>>>>>> > > My working hours and your working hours may be different. Please do 
>>>>>>> > > not feel obligated to reply outside your normal working hours.
>>>>>>> > >
>>>>>>> > >> On 18. 5. 2022, at 22:32, Klaus Darilion via bind-users 
>>>>>>> > >>  wrote:
>>>>>>> > >>
>>>>>>> > >> Can you please provide some commands whose output you are 
>>>>>>> > >> interested? I want to collect the statistics for 9.16 before 
>>>>>>> > >> updating to 9.18.
>>>>>>> > >> Thanks
>>>>>>> > >> Klaus
>>>>>>> > >>
>>>>>>> > >>> -Ursprüngliche Nachricht-
>>>>>>> > >>> Von: bind-users  Im Auftrag von 
>>>>>>> > >>> Petr
>>>>>>> > >>> Špacek
>>>>>>> > >>> Gesendet: Mittwoch, 18. Mai 2022 18:20
>>>>>>> > >>> An: bind-users@lists.isc.org
>>>>>>> > >>> Betreff: Re: AW: High memory consumption in bind 9.18.2
>>>>>>> > >>>
>>>>>>> > >>> I would be very interested in hearing more!
>>>>>>> > >>>
>>>>>>> > >>> In majority of our internal testing 9.16 has higher memory 
>>>>>>> > >>> consumption
>>>>>>> > >>> than 9.18, especially when 9.18 is compiled with libjemalloc. And 
>>>>>>> > >>> the
>>>>>>> > >>> differences are not small, for some configurations it can be even 
>>>>>>> > >>> 2x or
>>>>>>> > >>> 3x more on 9.16 than it is on 9.18.
>>>>>>> > >>>
>>>>>>> > >>> If you encounter it again please get back to us so we can 
>>>>>>> > >>> diagnose it.
>>>>>>> > >>>
>>>>>>> > >>> Thank you!
>>>>>>> > >>> Petr Špaček
>>>>>>> > >>>
>>>>>&

Re: Access denied Bind9

2022-03-07 Thread Gregory Sloop
You might search the list archives, as I think this came up recently...
But I think the general consensus is that you shouldn't have a server that is 
both authoritative AND that allows recursive queries. (Security reasons)
And if you do allow both, to limit recursive queries to internal 
(semi-trusted/controlled) hosts only.
 
The options you'll be wanting to look at are:
 
allow-query
allow-recursion
allow-query-cache
 
See the docs.
 
something like;
allow-recursion { local-nets; }; 
 
Where local-nets are the local subnets you want to allow recursion for - 
meaning you trust those hosts on those subnets more than the open internet.
 
 
  

> Just to be clear, the servers are authoritative


> On Tue, Mar 8, 2022 at 5:27 AM Ritah Mulinde  wrote:

>> Thank you Mark

>> Iam abit new to this. How do i fix that??

>> On Tue, Mar 8, 2022 at 5:19 AM Mark Andrews  wrote:

>>> Presumably you are making recursive queries and you are denying them.

 On 8 Mar 2022, at 12:44, Ritah Mulinde  wrote:
 
 Hi Guys
 Just got my primary and secondary name servers  running.
 
 However, when i reload rdnc and tail the syslogs all i get is 
 "(.xx.com): query (cache) '.xx.com/A/IN' denied"
 
 Not sure why.
 
 kindly asking for some pointers on where to start looking
 
 
 Thank you
 -- 
 Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
 from this list
 
 ISC funds the development of this software with paid support 
 subscriptions. Contact us at https://www.isc.org/contact/ for more 
 information.
 
 
 bind-users mailing list
 bind-users@lists.isc.org
 https://lists.isc.org/mailman/listinfo/bind-users-- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Forwarding zone, setup

2022-03-01 Thread Gregory Sloop
>Are you loading the parent domain and trying to zone forward a child domain on 
>the same DNS server? I.e. loading somedomain.local and trying to forward 
>ab.somedomain.local
 
Yup, exactly.
 
That solution was suggested by Jeff Sumner yesterday, but it seemed a little 
nuts to me (BIND behaving that way) - though your explanation makes that 
behavior seem less crazy.
If I get a chance, I'll perhaps try that, just to see if it fixes it - though 
someone at ISC might save me the work, confirming the behavior. (please do!)
 
And, if that's the case, then static-sub is the far superior option - since 
it's much more simple and straight-forward.
 
Consider it solved. 
If ISC can confirm that behavior for forwarding a child domain when the server 
is also auth for the parent zone, that would be very nice!
 
Thanks to everyone, again, for the help!
 
    

> Are you loading the parent domain and trying to zone forward a child domain 
> on the same DNS server? I.e. loading somedomain.local and trying to forward 
> ab.somedomain.local

> If so an NS delegation is required in every instance I have done in my 
> environment. The NS doesn't need to be "right" but it needs to exist. I don't 
> know the internal BIND logic for that but I have always taken it as "I load 
> the parent and I know the child doesn't exist because there isn't a 
> delegation to make it exist so why would I forward something that doesn't 
> exist".


> On Tue, Mar 1, 2022, 1:18 PM Gregory Sloop  wrote:

>> Static-sub fixes the issue.
>>  
>> Any idea why static-sub works when forwarder doesn't?
>>  
>> (Again, the server is using recursion. Dig queries return the RA flag, so I 
>> know it's actually offering recursion in reality.)
>>  
>> I can live with static-sub just fine, since it works - but I'd really love 
>> to understand why forwarder didn't - just so I can avoid getting bitten by 
>> it in some other situation.
>>  
>> Thanks Andrej!
>> -Greg
>>   

>>> Is static-stub something you are looking for?

>>> Reference documentation:
>>> https://bind9.readthedocs.io/en/v9_18_0/reference.html?highlight=static-stub#zone-types

>>> And in human terms:
>>> https://jpmens.net/2011/01/25/binds-new-static-stub-zone-type/

>>> Ondrej
>>> --
>>> Ondřej Surý (He/Him)
>>> ond...@isc.org

>>> My working hours and your working hours may be different. Please do not 
>>> feel obligated to reply outside your normal working hours.

>>>> On 28. 2. 2022, at 21:47, Gregory Sloop  wrote:

>>>> So, I want to forward all queries for 
>>>> *.ab.somedomain.local to some other internal DNS servers.
>>>> (Records in *.ab.somedomain.local actually are our active domain servers)
>>>>  
>>>> (Yes, I know .local is reserved now, but we've been using it a long time 
>>>> and changing would be rather painful. Unless there's some horrible 
>>>> consequences, I think we'll just continue for now. We won't ever use mDNS.)
>>>>  
>>>> zone "ab.somedomain.local" {
>>>> type forward;
>>>> forward only;
>>>> forwarders { 10.0.0.1; 10.0.0.2; 10.0.0.3; };
>>>> };

>>>> But this doesn't appear to do what I want.
>>>>  
>>>> If I add the above to my regular BIND servers configuration, it doesn't 
>>>> return results like it's forwarding them. (I get NXDOMAIN for 
>>>> abc.ab.somedomain.local.)
>>>>  
>>>> If I do a dig @10.0.0.1 abc.ab.somedomain.local from the BIND server, I 
>>>> get a proper result. (force dig to use the AD name servers directly, 
>>>> instead of relying on the forward.)
>>>>  
>>>> (And yes the resolv.conf file has the ip addresses of the main internal 
>>>> BIND servers in it, and those only.)
>>>> I've looked and while I think I'm doing it right, I'm not entirely sure.
>>>> I figured before I beat my head against the wall for too long, I'd ask the 
>>>> real experts! :)
>>>>  

-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x121
EMail: gr...@sloop.net
http://www.sloop.net
- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Forwarding zone, setup

2022-03-01 Thread Gregory Sloop
Static-sub fixes the issue.
 
Any idea why static-sub works when forwarder doesn't?
 
(Again, the server is using recursion. Dig queries return the RA flag, so I 
know it's actually offering recursion in reality.)
 
I can live with static-sub just fine, since it works - but I'd really love to 
understand why forwarder didn't - just so I can avoid getting bitten by it in 
some other situation.
 
Thanks Andrej!
-Greg
  

> Is static-stub something you are looking for?

> Reference documentation:
> https://bind9.readthedocs.io/en/v9_18_0/reference.html?highlight=static-stub#zone-types

> And in human terms:
> https://jpmens.net/2011/01/25/binds-new-static-stub-zone-type/

> Ondrej
> --
> Ondřej Surý (He/Him)
> ond...@isc.org

> My working hours and your working hours may be different. Please do not feel 
> obligated to reply outside your normal working hours.

>> On 28. 2. 2022, at 21:47, Gregory Sloop  wrote:

>> So, I want to forward all queries for 
>> *.ab.somedomain.local to some other internal DNS servers.
>> (Records in *.ab.somedomain.local actually are our active domain servers)
>>  
>> (Yes, I know .local is reserved now, but we've been using it a long time and 
>> changing would be rather painful. Unless there's some horrible consequences, 
>> I think we'll just continue for now. We won't ever use mDNS.)
>>  
>> zone "ab.somedomain.local" {
>> type forward;
>> forward only;
>> forwarders { 10.0.0.1; 10.0.0.2; 10.0.0.3; };
>> };

>> But this doesn't appear to do what I want.
>>  
>> If I add the above to my regular BIND servers configuration, it doesn't 
>> return results like it's forwarding them. (I get NXDOMAIN for 
>> abc.ab.somedomain.local.)
>>  
>> If I do a dig @10.0.0.1 abc.ab.somedomain.local from the BIND server, I get 
>> a proper result. (force dig to use the AD name servers directly, instead of 
>> relying on the forward.)
>>  
>> (And yes the resolv.conf file has the ip addresses of the main internal BIND 
>> servers in it, and those only.)
>> I've looked and while I think I'm doing it right, I'm not entirely sure.
>> I figured before I beat my head against the wall for too long, I'd ask the 
>> real experts! :)
>>  
-- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Forwarding zone, setup

2022-03-01 Thread Gregory Sloop
This got held up in moderation. Let me repost it, from my regular mail client...
 
 
> You didn’t share much of your configuration except the one forwarded zone, 
> not a lot to go on.
 
Fair enough. (I guess I thought you could just infer all the needed 
information!  Oops!)

Let me try Ondrej's static-sub and see if that makes a difference.

As for more detail.
Yeah, it's a recursive resolver, used internally only.
It's also authoritative for the somedomain.local zone.
I simply want(ed) to pass queries for *.ad.somedomain.local to another 
server(s), so thought the forwarder setup was the right way to do that.

Sorry for being so lame in not providing enough detail - I think I just figured 
I must be doing something terribly wrong and the forwarder setup must be wrong 
in some obvious detail I wasn't seeing. 
That doesn't appear to be the case, so we'll look again, try static-sub and 
then re-group if it doesn't work.

Thanks all!
 
> But one thing to check, you do have recursion enabled on the server?
> On Mon, Feb 28, 2022 at 6:34 PM Gregory Sloop  wrote:

>> Wow. I hate to be the guy who looks the gift horse in the mouth - but that 
>> just seems "wrong." :) 
>> (Not the answer, but that that would be the way BIND wants it done.)
>>  
>> So, now I've got two sets of NS and glue records? 
>> Please tell me that's not the way BIND insists you do this!
>>  
>> I guess I should try it, but dang.
>> Does anyone know for sure?
>>  
>>   

>>> Add Delegating NS records:

>>> ab.somedomain.local 3600 NS server1.ab.somedomain.local
>>> .
>>> .
>>> .


>>> And glue records

>>> server1.ab.somedomain.local 3600 A 10.0.0.1
>>> .
>>> .


>>> And see if it works. It’s got something to do with the way the record is 
>>> matched (or not) before the forward statement is hit.

>>> J
>>>> On Feb 28, 2022, at 3:47 PM, Gregory Sloop  wrote:

>>>> So, I want to forward all queries for 
>>>> *.ab.somedomain.local to some other internal DNS servers.
>>>> (Records in *.ab.somedomain.local actually are our active domain servers)
>>>>  
>>>> (Yes, I know .local is reserved now, but we've been using it a long time 
>>>> and changing would be rather painful. Unless there's some horrible 
>>>> consequences, I think we'll just continue for now. We won't ever use mDNS.)
>>>>  
>>>> zone "ab.somedomain.local" {
>>>> type forward;
>>>> forward only;
>>>> forwarders { 10.0.0.1; 10.0.0.2; 10.0.0.3; };
>>>> };

>>>> But this doesn't appear to do what I want.
>>>>  
>>>> If I add the above to my regular BIND servers configuration, it doesn't 
>>>> return results like it's forwarding them. (I get NXOMAIN for 
>>>> abc.ab.somedomain.local.)
>>>>  
>>>> If I do a dig @10.0.0.1 abc.ab.somedomain.local from the BIND server, I 
>>>> get a proper result. (force dig to use the AD name servers directly, 
>>>> instead of relying on the forward.)
>>>>  
>>>> (And yes the resolv.conf file has the ip addresses of the main internal 
>>>> BIND servers in it, and those only.)
>>>> I've looked and while I think I'm doing it right, I'm not entirely sure.
>>>> I figured before I beat my head against the wall for too long, I'd ask the 
>>>> real experts! :)
>>>>  -- 
>> Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
>> this list

>> ISC funds the development of this software with paid support subscriptions. 
>> Contact us at https://www.isc.org/contact/ for more information.


>> bind-users mailing list
>> bind-users@lists.isc.org
>> https://lists.isc.org/mailman/listinfo/bind-users

-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x121
EMail: gr...@sloop.net
http://www.sloop.net
- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Forwarding zone, setup

2022-02-28 Thread Gregory Sloop
Wow. I hate to be the guy who looks the gift horse in the mouth - but that just 
seems "wrong." :) 
(Not the answer, but that that would be the way BIND wants it done.)
 
So, now I've got two sets of NS and glue records? 
Please tell me that's not the way BIND insists you do this!
 
I guess I should try it, but dang.
Does anyone know for sure?
 
  

> Add Delegating NS records:

> ab.somedomain.local 3600 NS server1.ab.somedomain.local
> .
> .
> .


> And glue records

> server1.ab.somedomain.local 3600 A 10.0.0.1
> .
> .


> And see if it works. It’s got something to do with the way the record is 
> matched (or not) before the forward statement is hit.

> J
>> On Feb 28, 2022, at 3:47 PM, Gregory Sloop  wrote:

>> So, I want to forward all queries for 
>> *.ab.somedomain.local to some other internal DNS servers.
>> (Records in *.ab.somedomain.local actually are our active domain servers)
>>  
>> (Yes, I know .local is reserved now, but we've been using it a long time and 
>> changing would be rather painful. Unless there's some horrible consequences, 
>> I think we'll just continue for now. We won't ever use mDNS.)
>>  
>> zone "ab.somedomain.local" {
>> type forward;
>> forward only;
>> forwarders { 10.0.0.1; 10.0.0.2; 10.0.0.3; };
>> };

>> But this doesn't appear to do what I want.
>>  
>> If I add the above to my regular BIND servers configuration, it doesn't 
>> return results like it's forwarding them. (I get NXOMAIN for 
>> abc.ab.somedomain.local.)
>>  
>> If I do a dig @10.0.0.1 abc.ab.somedomain.local from the BIND server, I get 
>> a proper result. (force dig to use the AD name servers directly, instead of 
>> relying on the forward.)
>>  
>> (And yes the resolv.conf file has the ip addresses of the main internal BIND 
>> servers in it, and those only.)
>> I've looked and while I think I'm doing it right, I'm not entirely sure.
>> I figured before I beat my head against the wall for too long, I'd ask the 
>> real experts! :)
>>  -- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Forwarding zone, setup

2022-02-28 Thread Gregory Sloop
So, I want to forward all queries for 
*.ab.somedomain.local to some other internal DNS servers.
(Records in *.ab.somedomain.local actually are our active domain servers)
 
(Yes, I know .local is reserved now, but we've been using it a long time and 
changing would be rather painful. Unless there's some horrible consequences, I 
think we'll just continue for now. We won't ever use mDNS.)
 
zone "ab.somedomain.local" {
type forward;
forward only;
forwarders { 10.0.0.1; 10.0.0.2; 10.0.0.3; };
};

But this doesn't appear to do what I want.
 
If I add the above to my regular BIND servers configuration, it doesn't return 
results like it's forwarding them. (I get NXOMAIN for abc.ab.somedomain.local.)
 
If I do a dig @10.0.0.1 abc.ab.somedomain.local from the BIND server, I get a 
proper result. (force dig to use the AD name servers directly, instead of 
relying on the forward.)
 
(And yes the resolv.conf file has the ip addresses of the main internal BIND 
servers in it, and those only.)
I've looked and while I think I'm doing it right, I'm not entirely sure.
I figured before I beat my head against the wall for too long, I'd ask the real 
experts! :)
 -- 
Visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe from 
this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Strange named freezing

2021-12-27 Thread Gregory Sloop
Entorpy is not RAM or CPU.
 
VM's or Jails can often have difficulty accessing truly good sources of random 
events, and thus have difficulty having enough available entropy to handle 
encryption/cryptography functions in a timely manner.
 
See:
https://www.google.com/search?q=entropy
 
 
  


> More, than enough. During last freeze server has ~30Gb free RAM and ~ 2-3% 
> CPU load and more than 200Gb free storage space for this jail. DC jail dont 
> have any resources limitations. Its very strange, because during using 
> previously DC in the similar jail on this server I dont have this trouble.

> 27.12.2021 11:07, Ondřej Surý пишет:

>> Does the jail have enough entropy? That would be my first guess…

>> --
>> Ondřej Surý — ISC (He/Him)

>> My working hours and your working hours may be different. Please do not feel 
>> obligated to reply outside your normal working hours.
>>> On 13. 12. 2021, at 7:18, Nikita Druba  wrote:

>>> What can be wrong here? How I can more localize the problem?


> ___
> Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
> from this list

> ISC funds the development of this software with paid support subscriptions. 
> Contact us at https://www.isc.org/contact/ for more information.


> bind-users mailing list
> bind-users@lists.isc.org
> https://lists.isc.org/mailman/listinfo/bind-users

-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x121
EMail: gr...@sloop.net
http://www.sloop.net
---___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: No more support for windows

2021-06-04 Thread Gregory Sloop
This feels a lot like responding to trolls, but I'll instead assume that you're 
asking (or making a point) in good faith.

So, we'll stipulate that - you're actually interested in truth and knowledge.

So, it's easily compiled on Mac, Unix, FreeBSD, Linux, SunOS, RaspPi, etc.
And it compiles on a huge range of hardware, CPU's etc.

I'd consider that highly portable.

You're welcome to disagree, but then someone else will complain it's not 
available in Amiga, Atari and under Dos and complain it isn't "portable" 
because there's no dos version.

So how many platforms do you have to support, to call it portable? 
(I've always thought of "portable" code, in this context especially, as code 
that is kept open so it will fairly easily compile on any *nix/posix platform 
without too much drama. And I think that's a pretty universal understanding for 
*nix style code.)

So, it seems you are tilting at windmills, complaining about Windows only.

Yes, the fundamentals of Windows are *VERY* different than any 
Linux/Unix/Solaris etc based platform. As such, making it work across all those 
platforms is really quite a lot of work. 
(Making it work fine, even on the future supported platforms (*nix) isn't 
trivial - obviously adding Windows to the mix is far, far more!)

And, it seems like no-one has stepped up to commit the $$$ needed to keep that 
support going.
Even a cheap dev probably charges $100+ an hour. How many hours/dollars do you 
think, in aggregate, is committed to keeping Windows support? It's not going to 
be like buying a $3 app for your phone - since the market for Windows users is 
far smaller.

And, I suspect, if we reach the end of the road for Windows support, and 
there's a half million users out there that want BIND supported on Windows, and 
they'll all pledge a buck a year, than I'd expect that Windows support will 
roll right out.

But if instead there's 100 people willing to pledge even $100 a year, well I'd 
guess that's not likely to pay for it.

ISC manages to pay the people who write code and do support through support 
contracts. Do you have one of those?

So the last option is; 
You, or someone else to simply give away their time for free. 
You up for that?
If you're not, or you don't have that skill set, then complaining bitterly 
seems a little hypocritical.

ISC already releases a huge set of software that you almost certainly use every 
single day (DHCP server and clients, along with BIND) and they aren't charging 
you a dime for that use. They're not charging your ISP either, or a ton of 
other people. So, IMO, they've really done a ton of free work for the community 
already. 

But it seems like you think it's not enough.

Sigh. 
What. Can. I. Say.
ISC does a lot of really good work.
IMO, this kind of a complaint is really misplaced.

And to be clear, I won't engage in a bunch of back-and-forth arguing this 
position. You're welcome to agree or not.
But *I* think you're obviously wrong, and I want everyone at ISC who does all 
that good work, developing great software that they let us use for free that I 
really appreciate their work.

-Greg



PC> What I find ironic is that here:

PC> https://gitlab.isc.org/isc-projects/bind9/-/blob/main/README.md

PC> the very first line says:

PC> "BIND (Berkeley Internet Name Domain) is a complete, highly portable
PC> implementation of the Domain Name System (DNS) protocol."

PC> If this were truly the case, BIND would work on Windows (or any other
PC> platform that doesn't have a "u" in it's name) with minimal effort
PC> and would not require specific funding to adapt it to any particular
PC> platform.

PC> Can we please have a realistic definition of what BIND is and what
PC> it's objectives are?

PC> I for one would be more likely to contribute to the development of
PC> a non-platform-specific, portable BIND than a single-platform-specific
PC> one.

PC> On the other hand, if it has already been decided that BIND can only
PC> realistically be implemented in the *u* arena and will rely on
PC> facilities only available in this arena, then shouldn't this be stated
PC> clearly instead of also declaring that it is highly portable?

PC> Regards,
PC> Peter Coghlan.


>> Do you understand how ironic is for you to complain about “subscription is
>> not going to happen” while **every** email on the mailing list has this
>> note in the footer:

>> ISC funds the development of this software with paid support subscriptions.
>> Contact us at https://www.isc.org/contact/ for more information.

>> --
>> Ondřej Surý — ISC (He/Him)

>> My working hours and your working hours may be different. Please do not feel 
>> obligated to reply outside your normal working hours.

>>> On 4. 6. 2021, at 19:47, Peter via bind-users  
>>> wrote:

>>> 
>>> On 04/06/2021 6:05 pm, John Thurston wrote:

> On 6/4/2021 8:48 AM, Peter via bind-users wrote: 
> When people find out2024 is the year bind is no longer supported for 
> windows people aregoing to be 

Re: Any interest in a write-up showing how to configure BIND 9.17x with DoH and LetsEncrypt?

2021-06-01 Thread Gregory Sloop
It's not like there's been a paucity of "Yeah, I'm interested" messages, but I 
agree with the rest.
+1000

I'd love to see it!



DNS over HTTPS support appears to be steadily increasing and it looks like the 
next version of Windows 10, Windows 10 21H2, will including support for DoH at 
the operating system level.
 
I spent a little time this weekend setting-up BIND 9.17.13 on Ubuntu 21.04 and 
configuring the system as a recursive resolver offering DNS over HTTPS using a 
LetsEncrypt certificate.
 
Is there any interest in me writing this up as a web article, or has everyone 
who’s interested in DoH already got it running comfortably in their test 
environment?
 
Best.
 
Richard.
 ___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: BIND server; dig vs dig +trace on failing lookup.

2021-03-02 Thread Gregory Sloop
Let me take a swing at it. If I'm wrong, someone correct me.

1) dig the name servers for the 1st level domain. (In this case, it's a .com.)
# dig +short com. NS
c.gtld-servers.net.
f.gtld-servers.net.
j.gtld-servers.net.
l.gtld-servers.net.
k.gtld-servers.net.
d.gtld-servers.net.
g.gtld-servers.net.
i.gtld-servers.net.
a.gtld-servers.net.
e.gtld-servers.net.
m.gtld-servers.net.
b.gtld-servers.net.
h.gtld-servers.net.
---

2) How, ask one of those name servers for the NS for the target domain. 
(cistus.com. in this case.)

# dig +norec @i.gtld-servers.net. cistus.com. NS

; <<>> DiG 9.11.3-1ubuntu1.14-Ubuntu <<>> +norec @i.gtld-servers.net. 
cistus.com. NS
; (2 servers found)
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 49843
;; flags: qr; QUERY: 1, ANSWER: 0, AUTHORITY: 2, ADDITIONAL: 3

;; OPT PSEUDOSECTION:
; EDNS: version: 0, flags:; udp: 4096
;; QUESTION SECTION:
;cistus.com.IN  NS

;; AUTHORITY SECTION:
cistus.com. 172800  IN  NS  srvns.pacifier.com.
cistus.com. 172800  IN  NS  webns.pacifier.com.

;; ADDITIONAL SECTION:
srvns.pacifier.com. 172800  IN  A   216.65.128.5
webns.pacifier.com. 172800  IN  A   216.65.128.1

---

3) Finally, query the records in the additional section to see if they match 
the glue records.

# dig srvns.pacifier.com.

; <<>> DiG 9.11.3-1ubuntu1.14-Ubuntu <<>> srvns.pacifier.com.
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 17445
;; flags: qr rd ra; QUERY: 1, ANSWER: 1, AUTHORITY: 4, ADDITIONAL: 1

;; OPT PSEUDOSECTION:
; EDNS: version: 0, flags:; udp: 4096
; COOKIE: d16ea0cbd520a81cea1154c1603ef7e13d586e8adc0e75f3 (good)
;; QUESTION SECTION:
;srvns.pacifier.com.IN  A

;; ANSWER SECTION:
srvns.pacifier.com. 85728   IN  A   64.255.237.241

;; AUTHORITY SECTION:
pacifier.com.   172126  IN  NS  ns2.iinet.com.
pacifier.com.   172126  IN  NS  ns4.iinet.com.
pacifier.com.   172126  IN  NS  ns1.iinet.com.
pacifier.com.   172126  IN  NS  ns3.iinet.com.


(glue) srvns.pacifier.com. 172800  IN  A   216.65.128.5
vs
(regular lookup) srvns.pacifier.com. 85728   IN  A   64.255.237.241

And they don't - so we know the glue is stale.


Is that right? :)

-Greg


Would you mind showing me how you got there?
[The answer is fab, obviously. But give a man a fish...and all that. :) ]

-Greg



MA> The COM servers have stale glue

MA> srvns.pacifier.com. 172800  IN  A   216.65.128.5
MA> webns.pacifier.com. 172800  IN  A   216.65.128.1

MA> vs

MA> srvns.pacifier.com. 86400   IN  A   64.255.237.241
MA> webns.pacifier.com. 86400   IN  A   64.255.237.240

MA> The later set of servers are what you query when you run dig +trace.
MA> If you prime the cache the plain lookup should work.  Report the out
MA> of date glue to the zone administrator.

MA> Mark

>> On 3 Mar 2021, at 13:06, Gregory Sloop  wrote:

>> I've got a case, (and I see several other similar reports) where BIND is 
>> failing to find an A record for a domain.
>> Yet a dig +trace does.

>> (I'm doing the dig on the BIND server. It's set to be a root resolving 
>> server, not a forwarder.)

>> As I understand this, +trace will also involve resolve.conf options. And in 
>> this case, I've got Google DNS as one of the resolve.conf entries.
>> So, I can see how +trace would deliver different results than simply dig-ing 
>> - provided that +trace does involve resolve.conf.

>> Here's a plain dig, using the BIND server itself - from the console.
>> ---
>> dig cistus.com @10.8.20.5

>> ; <<>> DiG 9.11.3-1ubuntu1.14-Ubuntu <<>> cistus.com @10.8.20.5
>> ;; global options: +cmd
>> ;; Got answer:
>> ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 61786
>> ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1

>> ;; OPT PSEUDOSECTION:
>> ; EDNS: version: 0, flags:; udp: 4096
>> ; COOKIE: 13ec0c9b10770ea12426539e603957900a997f7258962cce (good)
>> ;; QUESTION SECTION:
>> ;cistus.com.IN  A

>> ;; Query time: 0 msec
>> ;; SERVER: 10.8.20.5#53(10.8.20.5)
>> ;; WHEN: Fri Feb 26 12:18:24 PST 2021
>> ;; MSG SIZE  rcvd: 67

>> ---

>> I could post the dig +trace, if it adds any information, but I suspect it 
>> doesn't.

>> So, what methods or steps might I take to figure out why the above 
>> lookup/dig fails?
>> [I intended +trace to do that, but since it's not doing the same thing a 
>> plain dig does, it's not very useful as a diagno

Re: BIND server; dig vs dig +trace on failing lookup.

2021-03-02 Thread Gregory Sloop
Would you mind showing me how you got there? 
[The answer is fab, obviously. But give a man a fish...and all that. :) ]

-Greg



MA> The COM servers have stale glue

MA> srvns.pacifier.com. 172800  IN  A   216.65.128.5
MA> webns.pacifier.com. 172800  IN  A   216.65.128.1

MA> vs

MA> srvns.pacifier.com. 86400   IN  A   64.255.237.241
MA> webns.pacifier.com. 86400   IN  A   64.255.237.240

MA> The later set of servers are what you query when you run dig +trace.
MA> If you prime the cache the plain lookup should work.  Report the out
MA> of date glue to the zone administrator.

MA> Mark

>> On 3 Mar 2021, at 13:06, Gregory Sloop  wrote:

>> I've got a case, (and I see several other similar reports) where BIND is 
>> failing to find an A record for a domain.
>> Yet a dig +trace does.

>> (I'm doing the dig on the BIND server. It's set to be a root resolving 
>> server, not a forwarder.)

>> As I understand this, +trace will also involve resolve.conf options. And in 
>> this case, I've got Google DNS as one of the resolve.conf entries.
>> So, I can see how +trace would deliver different results than simply dig-ing 
>> - provided that +trace does involve resolve.conf.

>> Here's a plain dig, using the BIND server itself - from the console.
>> ---
>> dig cistus.com @10.8.20.5

>> ; <<>> DiG 9.11.3-1ubuntu1.14-Ubuntu <<>> cistus.com @10.8.20.5
>> ;; global options: +cmd
>> ;; Got answer:
>> ;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 61786
>> ;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1

>> ;; OPT PSEUDOSECTION:
>> ; EDNS: version: 0, flags:; udp: 4096
>> ; COOKIE: 13ec0c9b10770ea12426539e603957900a997f7258962cce (good)
>> ;; QUESTION SECTION:
>> ;cistus.com.IN  A

>> ;; Query time: 0 msec
>> ;; SERVER: 10.8.20.5#53(10.8.20.5)
>> ;; WHEN: Fri Feb 26 12:18:24 PST 2021
>> ;; MSG SIZE  rcvd: 67

>> ---

>> I could post the dig +trace, if it adds any information, but I suspect it 
>> doesn't.

>> So, what methods or steps might I take to figure out why the above 
>> lookup/dig fails?
>> [I intended +trace to do that, but since it's not doing the same thing a 
>> plain dig does, it's not very useful as a diagnostic tool.]

>> I've done some searching to see how to accomplish this, but it's a difficult 
>> question to frame without a ton of worthless hits.
>> So, can someone point me at a good source for a how-to/walk-through? A 
>> previous list posting?

>> Again, the question is; what methods or steps (best practices) might I take 
>> to figure out why the above lookup/dig fails?

>> TIA
>> -Greg
>> ___
>> Please visit https://lists.isc.org/mailman/listinfo/bind-users to 
>> unsubscribe from this list

>> ISC funds the development of this software with paid support subscriptions. 
>> Contact us at https://www.isc.org/contact/ for more information.


>> bind-users mailing list
>> bind-users@lists.isc.org
>> https://lists.isc.org/mailman/listinfo/bind-users
___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


BIND server; dig vs dig +trace on failing lookup.

2021-03-02 Thread Gregory Sloop
I've got a case, (and I see several other similar reports) where BIND is 
failing to find an A record for a domain.
Yet a dig +trace does.

(I'm doing the dig on the BIND server. It's set to be a root resolving server, 
not a forwarder.)

As I understand this, +trace will also involve resolve.conf options. And in 
this case, I've got Google DNS as one of the resolve.conf entries.
So, I can see how +trace would deliver different results than simply dig-ing - 
provided that +trace does involve resolve.conf.

Here's a plain dig, using the BIND server itself - from the console.
---
dig cistus.com @10.8.20.5

; <<>> DiG 9.11.3-1ubuntu1.14-Ubuntu <<>> cistus.com @10.8.20.5
;; global options: +cmd
;; Got answer:
;; ->>HEADER<<- opcode: QUERY, status: SERVFAIL, id: 61786
;; flags: qr rd ra; QUERY: 1, ANSWER: 0, AUTHORITY: 0, ADDITIONAL: 1

;; OPT PSEUDOSECTION:
; EDNS: version: 0, flags:; udp: 4096
; COOKIE: 13ec0c9b10770ea12426539e603957900a997f7258962cce (good)
;; QUESTION SECTION:
;cistus.com.IN  A

;; Query time: 0 msec
;; SERVER: 10.8.20.5#53(10.8.20.5)
;; WHEN: Fri Feb 26 12:18:24 PST 2021
;; MSG SIZE  rcvd: 67

---

I could post the dig +trace, if it adds any information, but I suspect it 
doesn't.

So, what methods or steps might I take to figure out why the above lookup/dig 
fails? 
[I intended +trace to do that, but since it's not doing the same thing a plain 
dig does, it's not very useful as a diagnostic tool.]

I've done some searching to see how to accomplish this, but it's a difficult 
question to frame without a ton of worthless hits.
So, can someone point me at a good source for a how-to/walk-through? A previous 
list posting?

Again, the question is; what methods or steps (best practices) might I take to 
figure out why the above lookup/dig fails?

TIA
-Greg___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: How Zone Files Are Read

2020-12-16 Thread Gregory Sloop
This isn't, IMO, very useful as a response to the OP.

To sum up the response; "It's better to never fail!"

Yes, that seems pretty obvious. It *would* be better to never fail. Way, way 
better.
But the big problem in life is; We're always failing! Dammit!

So, learning how to gracefully fail, and understanding what happens and why, 
when something fails, is pretty important to achieve the outcome of; "Not 
failing quite so catastrophically." 

So, while I don't have helpful knowledge to impart to the OP, I think I can say 
that giving the advice of "don't fail" doesn't seem very helpful.





RH> Am 16.12.20 um 17:37 schrieb Tim Daneliuk:
>> I ran into a situation yesterday which got me pondering something about bind.

>> In this case, a single line in a zone file was bad.  The devops automation
>> had inserted a space in the hostname field of a PTR record.

>> What was interesting was that - at startup - bind absolutely refused
>> to load the zone file at all.  I would have expected it to complain
>> about the bad record and ignore it, but load the rest of the
>> good records.

>> Can someone please explain the rationale or logic for this?  Not complaining,
>> just trying to understand for future reference.

RH> it's better not load a invalid zone on a single nameserver at all as you
RH> are supposed to have at least two nameservers and the second one won't
RH> get the failure via master/slave replication

RH> if it has an error something is wrong
RH> if the last version had no error that version is good

RH> for the world *everything* still is good as long there is one slave - 
RH> subtle errors can lead to completly unexpected behavior
RH> ___
RH> Please visit https://lists.isc.org/mailman/listinfo/bind-users to 
unsubscribe from this list___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


rndc valid key types

2020-07-07 Thread Gregory Sloop
So, I've spent some time looking at the man pages and googling without any 
definitive answer.

I'm generating some new rndc keys for my bind9 config. (9.11.3 in this 
particular case, if it matters.)

rndc-confgen has quite a number of options for the key-type - but I'm not sure 
what BIND9 will handle for RNDC.

I've seen reports that only HMAC-MD5 is the only valid key type.

...

Just before posting this, I checked the RNDC man page and found this: 
[At least I saved myself some public embarrassment! :) ]
---
rndc communicates with the name server over a TCP connection, sending commands 
authenticated with digital signatures. In the current versions of rndc and 
named, the only supported authentication algorithms are HMAC-MD5 (for 
compatibility), HMAC-SHA1, HMAC-SHA224, HMAC-SHA256 (default), HMAC-SHA384 and 
HMAC-SHA512. They use a shared secret on each end of the connection. This 
provides TSIG-style authentication for the command request and the name 
server's response. All commands sent over the channel must be signed by a 
key_id known to the server.
---

Still, the root cause for my query
Is there any (security) reason/implications to use something "better" than MD5?

I'd lean toward something like HMAC-SHA256/384/512.

Perhaps there's a discussion somewhere I haven't found - and I'd be glad to be 
pointed to that, instead of taking someone's time re-typing a bunch of details. 
But I can't seem to find anything. 
I assume it might be easier to forge an update for rndc with an MD5 key, right? 
Is there any reason not to select the strongest - HMAC-SHA512?

Just wanting to be sure I understand the implications of any particular choice.

TIA
-Greg___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: BIND Masters and slaves

2020-06-15 Thread Gregory Sloop
This whole discussion seems so misguided.
The TLDR; version, at least for me is;
Does it really cost you that much to use language that's polite and kind to 
those around you, and change that language to live up to those ideals when you 
can?

Sure you _have the right_ to run down the street and yell 
impolite/angry/mysoginistic epithets and racial insults at those around you. 

But rather than focusing on _your rights_, how about spending a few minutes 
considering how you'd feel if people did equally insensitive things to your 
mom, you sister, your wife, or your best friend.

When viewed from that perspective, the "outrage" at not using/changing terms 
like master/slave in BIND seem, at least to my mind, kind of hyperbolic.

You can call it politically correct, but I generally like to call it "being 
nice" or "polite" or "kind." 
When did it suddenly become so wrong to be gentle, kind and caring? [That's a 
rhetorical question, so no need to "answer" it.]

The BIND and DHCP lists have historically been incredible lists for really 
polite and helpful people over the many years I've subscribed and participated. 
IME, these lists have been far superior to nearly any other lists I've been a 
part of. Super helpful. Really nice people> Super respectful. Totally awesome. 
I'm nearly always chuffed to see the behavior here.

I hope that continues.


MDR> Completely aside from the topic at hand, I often like to think that 
MDR> after a few years I mastered something. System administration, 
MDR> electronics, programming, whatever has piqued my interest for several 
MDR> years already and got me to invest in it. It is never true. The first 
MDR> profession I pursued was system administration and Linux in general. 
MDR> Even today I still learn so much on the daily. Mastery? I may be 
MDR> experienced with Linux but mastery is still far ahead... It's quite 
MDR> interesting how deep the rabbit hole can go. What matters is how deep we
MDR> want it to go I guess.

MDR> Crackerjack is an interesting name, if anything I'd just want it for 
MDR> shits and giggles :D

MDR> On 6/15/20 9:07 PM, Brett Delmage wrote:
>> After I feel I have mastered DNS and BIND after slaving over the docs 
>> and code for years (I'm not there yet, and I have not) how am I going 
>> to communicate this to people?

>> How will I be able to master anything technical anymore? Should I just 
>> stop trying?


>> Thesaurus.com suggests that one could call one type of DNS server the 
>> "crackerjack" server instead. I guess that's an improvement over 
>> "cracked". "Ace" server is a suggested alternative too, and it's 
>> nicely terse.

>> https://www.thesaurus.com/browse/master?s=t___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

ISC funds the development of this software with paid support subscriptions. 
Contact us at https://www.isc.org/contact/ for more information.


bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Proposal to adopt a Code of Conduct for the list

2019-08-02 Thread Gregory Sloop
That code of conduct is a good starting place, at minimum.

Regardless of how it may be now, it may not always be a great a place - and 
setting out the expectations and standards expected is a good hedge against 
future tyrants. 

So, I applaud ISC for taking the steps to formally set out the standards they 
expect on the list. Far better to do it now in "fair" weather then when things 
are much worse.

And on that note...
I know I've said it before - but I really must take the time to thank the many 
posters here. I'd name names - but I'm sure I'd miss someone. 

I've been on a lot of forums and lists the last 30+ years, and I don't believe 
I've ever found one that has such great people who show such great knowledge 
and more importantly, to me at least, such grace and tactfulness and elegance 
when posting here.

Other lists likemumblefreeradiusmumble have great technical skill present, but 
are regularly abrasive, and curt. They may offer good technical advice, but do 
it in such a way that it's not very pleasant.

Other; both illiterate and tyrants or just nice illiterates.

But THIS place, and these posters that we regularly see here are pretty 
awesome. 

The only downside is - I sometimes hesitate to post, since the other great 
posters are likely to crank out something so awesome, through and full of 
details I'll probably forget to put in, I'll feel like a worm. :) [But please 
don't stop on my account!]

So, again - I'm always impressed with the posting tone, and technical quality - 
both here and on the BIND list.

This is one of those lists I stay a member of simply because it's technically 
rewarding to read, and reminds me, often, that there ARE still nice people who 
post on the internet.

Thanks! 


This list is a tremendously helpful and generous group that has provided really 
invaluable assistance to many new and experienced users.  I would like to thank 
all of you who share your time and expertise with random users on the Internet, 
as well as giving feedback to ISC via this list.

We had some discussion within ISC and there are mixed opinions about the need 
for and utility of adopting a Code of Conduct (CoC) to ensure that this list 
remains welcoming, useful and tolerant.  We have considered statements used by 
other projects, lists and open source communities. We agreed that I would begin 
by reiterating a statement posted on this list back in 2016, asking for respect 
and civility, to see what people think of that as a code of conduct for this 
list. 

Here it is:

https://lists.isc.org/pipermail/bind-users/2016-October/097925.html
We should like to remind list users that irrespective of the basic
levels of experience or knowledge of some of the posters here, that they
are asking for community help and that advice should be given politely
and with respect shown both towards the original poster and other
contributors to a thread.

The same courtesy and respect is also expected of list members towards
others when discussing more advanced and complex topics. Please
remember that:

- English is not the first language for all - this can lead to
misunderstandings

- Beginners don't always 'get it' from the start (but with gentle
guidance may become the experts of the future and surprise us all)

If you disagree with another poster on a technical matter, please
explain your position clearly, thoughtfully, and with appropriate
support for your viewpoint.

If you believe that you are the target of an insulting or inappropriate
post, please alert the list administrators* rather than retaliating on
the mailing list.

If you have any other concerns about a poster, please bring them to the
attention of the list administrators.

Insulting and derogatory posts will not be tolerated and will result in
future posts from those list members who are posting in this manner
being held for moderation or suspended indefinitely from this community.

Cathy Almond
ISC Support
* contact the list administrators directly by emailing to 
bind-users-ow...@lists.isc.org

--
Victoria Risk
Product Manager
Internet Systems Consortium
vi...@isc.org
___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Bind9 stops responding for some clients

2019-06-05 Thread Gregory Sloop
Thanks for the idea.
I did resolve this a day or two ago.

The story is; 
This server was a fairly recent replacement for an older Ubuntu setup. The new 
server as well as the old one are/were VM's - yet on different VM platforms. 
The old VM was turned off, and was marked never to start except unless manually 
started. [There were a few other things on the VM host that had yet to be 
migrated - so we didn't want it entirely off quite yet.]

The problem happened again in the last day or two - and packet captures showed 
that no packets were even arriving at the new VM.
Since there really wasn't anything that should be blocking that traffic, I 
checked the arp table on a problem client. 
The arp table showed an "incorrect" MAC address for the current BIND server. 
[The MAC in the arp table didn't match the MAC for the new VM.]

While I didn't have the MAC address for the "old" deactivated server handy, it 
was the first obvious problem/solution to check.
Sure enough, after connection to the VM hypervisor, I could see that the "old" 
BIND vm was active.

I killed it, and service returned to normal.

So, the "solution" was pretty routine.
What made it more "interesting" and perhaps odd is how seemingly randomly the 
problem would crop up. 
And it would only impact some clients, not all. There was no pattern that 
seemed to explain why some got the current/correct BIND server and others 
didn't. [The arp poisoning certainly wasn't anywhere nearly universal.]
And why was it so infrequent - it would go many days between issues.
I have to assume the bad VM had been up for some time, at least since the 
problems started.
There are quite a number of odd-ish other things too, but not worth detailing.

Probably it's just one of those "undefined" situations where you can't 
anticipate some predictable order to what happens when you screw it up. Rather 
than burn additional time trying to grok what was going on - it's simply best 
to say "don't do that - bad things happen, though I can't say what bad things 
will happen and in which logical order. They just will - so DON'T DO THAT!"

[And yeah, I obviously knew all about not doing that. But it happened anyway, 
in spite of specific steps to prevent it. I'm still not sure why.]

In the end, it's a somewhat complicated story with a very obvious cause - but 
it wasn't so clear at the outset.

TLDR version; 
Don't run your old and new bind servers on the same IP address - ether by 
accident or intentionally. Bad stuff will happen! 
It might be really odd, or it might be plain as day -  but in either case it 
won't be good! :)

Thanks all for the suggestions! 
Here's hoping I don't need to ask for BIND assistance for another 20 years! :)

-Greg



I just randomly spotted this post, and thought I would toss in 2¢

How many nics and how many it's are on the servers?  Are the failing clients on 
the same subnet as the server?

--
___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users


Re: Bind9 stops responding for some clients

2019-05-30 Thread Gregory Sloop
Ugh. Not wanting to packet capture. :)
[Yeah, not that hard, but it always seems to suck up so much time - it's like 
the black hole for time, I think.]
But, yeah, absent some other smoking gun, that's probably where we're headed. 

As for rate limiting - "rndc recursing" didn't show anything being rate 
limited. [No output. I assume that means there's nothing there. And ISC claims 
that rate limiting isn't turned on by default - I certainly haven't enabled 
anything related to rate limits.]

There's no firewalling/filtering between any of the affected clients and the 
DNS servers.
Without going into too much detail, the network's pretty flat. 
There are (4) /24 subnets, but they're all passing through a L3 switch for 
"routing." 
Essentially all filtering occurs at the border only, so, SPI or other stuff 
shouldn't be in the mix here.

Sigh. 
As my colleague said...
"Heh. How, um, fun?"

-Greg



Whilst you mentioned 150 seats and you mentioned 'no firewalls', you didn't 
mention the network topology at all, in particular is traffic passing through a 
commercial firewall/router (hardware or virtualized) to get to the DNS server? 
If there is, it may be worth checking what packet inspection is turned on for 
DNS traffic (Cisco, Juniper and Checkpoint have been known to have buggy 
inspection routines in the past).
 
I might also be worthwhile to see what your open filehandles are like and 
whether there's any rate limiting configured in the distributed BIND 
configuration.
 
Stuart
 
From: bind-users [mailto:bind-users-boun...@lists.isc.org] On Behalf Of John W. 
Blue
Sent: Friday, 31 May 2019 11:47 AM
To: bind-users@lists.isc.org
Subject: Re: Bind9 stops responding for some clients
 
Good job on the amount of troubleshooting work done so far.

Next steps should be to run tcpdump on the interface for port 53 to see what is 
happening when an outage is in progress.  What you will be looking for 
specifically is the query packet in and the response packet out.

Use the following command:

tcpdump -n -i eth0 port domain and host 172.24.67.32

Swap out eth0 for whatever you have configured and the host IP address for a 
host that is having problems.

John
 
Sent from Nine

From: Gregory Sloop 
Sent: Thursday, May 30, 2019 7:11 PM
To: bind-users@lists.isc.org
Subject: Bind9 stops responding for some clients
 
So, this is a very odd situation and I'm kind of grasping at straws here.
So, I've come to see if any of you have any good straws!

The setup.
---
Ubuntu 18.04 LTS is the distro we're running on. 
All software is packaged [from the distro] - not compiled from sources.
Bind9 acting as a recursive resolver for a smallish network. 150 seats.
They're also handling DHCP and Chrony/NTP requests.
[I actually have a pair of these handling DNS/DHCP/NTP this is the master.]

They are running on a Xen/XCP VM.

The one I'm having problems is the master for several internal zones - the one 
that's working fine is the slave for those same zones. None of the zones are 
large.

Intermittently, Bind9 simply stops handling queries from *some* hosts. 
Meaning, it simply times out for responses for those hosts.
Yet BIND *is* working fine for lots of other machines on the same networks. 
It's working fine doing dig queries locally on the server, and handles dns 
queries fine for lots of other machines. Yet, again, some machines simply get 
time-outs. I can't find any pattern to which machines get timeouts and which 
don't.

I've checked - no firewalls, fail2ban or the like that might be causing this. 
No selinux/apparmour.
Hosts that can't do dns queries can ping the dns server fine. 
[So, there's at least some network pathway to the DNS machine.]

Review of the logs for bind don't show anything that looks like a problem to me.
[But I'm not sure what keywords I ought to be looking for, in an effort to find 
symptoms/problems.]

Finally, the two bind/dhcp/ntp servers are currently running on the same Xen 
host, so if it's somehow host related, I'd expect both to have problems, but 
they don't.

Top doesn't show any CPU distress.
Processes look fine
Memory in use is far below what allocated to the machine. [1G allocated, like 
<400M used.]
Restart of BIND doesn't do anything, at least in the cases I've seen - which 
aren't all that many yet.
A restart of the whole VM does appear to fix the issue immediately.
These appear to occur every 3-5 days.
Oh, and if you simply wait, it eventually starts handling queries for all hosts 
again - but it might be a couple+ hours.

Any suggestions on things I might hunt for in the logs in an attempt to figure 
out what's happening?
Other suggestions for things to look for/consider?

TIA
-Greg

-- 
Gregory Sloop, Principal: Sloop Network & Computer Consulting
Voice: 503.251.0452 x82
EMail: gr...@sloop.net
http://www.sloop.net
---___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
fr

Bind9 stops responding for some clients

2019-05-30 Thread Gregory Sloop
So, this is a very odd situation and I'm kind of grasping at straws here.
So, I've come to see if any of you have any good straws!

The setup.
---
Ubuntu 18.04 LTS is the distro we're running on. 
All software is packaged [from the distro] - not compiled from sources.
Bind9 acting as a recursive resolver for a smallish network. 150 seats.
They're also handling DHCP and Chrony/NTP requests.
[I actually have a pair of these handling DNS/DHCP/NTP this is the master.]

They are running on a Xen/XCP VM.

The one I'm having problems is the master for several internal zones - the one 
that's working fine is the slave for those same zones. None of the zones are 
large.

Intermittently, Bind9 simply stops handling queries from *some* hosts. 
Meaning, it simply times out for responses for those hosts.
Yet BIND *is* working fine for lots of other machines on the same networks. 
It's working fine doing dig queries locally on the server, and handles dns 
queries fine for lots of other machines. Yet, again, some machines simply get 
time-outs. I can't find any pattern to which machines get timeouts and which 
don't.

I've checked - no firewalls, fail2ban or the like that might be causing this. 
No selinux/apparmour.
Hosts that can't do dns queries can ping the dns server fine. 
[So, there's at least some network pathway to the DNS machine.]

Review of the logs for bind don't show anything that looks like a problem to me.
[But I'm not sure what keywords I ought to be looking for, in an effort to find 
symptoms/problems.]

Finally, the two bind/dhcp/ntp servers are currently running on the same Xen 
host, so if it's somehow host related, I'd expect both to have problems, but 
they don't.

Top doesn't show any CPU distress.
Processes look fine
Memory in use is far below what allocated to the machine. [1G allocated, like 
<400M used.]
Restart of BIND doesn't do anything, at least in the cases I've seen - which 
aren't all that many yet.
A restart of the whole VM does appear to fix the issue immediately.
These appear to occur every 3-5 days.
Oh, and if you simply wait, it eventually starts handling queries for all hosts 
again - but it might be a couple+ hours.

Any suggestions on things I might hunt for in the logs in an attempt to figure 
out what's happening?
Other suggestions for things to look for/consider?

TIA
-Greg___
Please visit https://lists.isc.org/mailman/listinfo/bind-users to unsubscribe 
from this list

bind-users mailing list
bind-users@lists.isc.org
https://lists.isc.org/mailman/listinfo/bind-users