Zitat von Leo Bush <[email protected]>:

Dear all,

Since one month our company uses unbound-1.4.8-1 on two RH6 servers as caching and resolving servers with IPv6 and DNSSec enabled. These two servers deal with all our DNS traffic, generated by all our customers (2x 5Mbps peak traffic). They work as stand alone servers, no complicated network components (Load balancer...) around.

At the beginning we used to activate the option use-caps-for-id, but since we got complaints from customers that certain domains were available everywhere in the world except at us, we preferred to deactivate.

Currently we face the following rather strange problem:
Under normal working conditions, in 70-90% of the time our two production servers cannot resolve domains registered at register.be and lying on the three authoritative name servers ns1.register.be, ns3.register.be, ns2.register.be (example: leonidas.be, estates.lu). They return me a SERVFAIL. register.be itself works all the time. By chance it sometimes works correctly for a brief period of time. Even though it was not easy due to the thousands of packets passing through in a second, I succeeded to trace the packets the server sends to the authoritative servers and it gets correct answers back.

I tried to install unbound 1.4.8 with the same configuration file (see attachment) on a desktop machine and there was no issue. All resolutions against domains at register.be were immediate and correct.

As customers continued to complain I was forced to take one server out of production and to replace it with bind which works correctly. Now I have one server with unbound that has the problem and one server with bind, that works fine in production. The formerly faulty unbound server that is now offloaded currently responds correctly at all tests (no restart done, no reboot done, just IP address switched).

Does anybody have an idea how I can solve this problem? Shall I offer you more technical information? Do you have further tests to suggest?


Looks for me like EDNS problem. At least some part of the .be zone is DNSSEC signed an the replies get bigger than 512 Byte like with "dig x.dns.be A +dnssec". Bind has a feature to reduce the EDNS size in case of trouble, not sure if Unbound does the same. What you should check: - Do the trouble domain/names resolve with unbound if you use checking disabled (+cdflag) - Do you have any firewall device in front of your resolvers maybe some Cisco inspecting DNS traffic
- Do you have disabled Unbound tcp

For some hints on the problem have a look here:
https://www.dns-oarc.net/oarc/services/replysizetest

Regards

Andreas



_______________________________________________
Unbound-users mailing list
[email protected]
http://unbound.nlnetlabs.nl/mailman/listinfo/unbound-users

Reply via email to