After updating to latest Consul I'm getting "error serializing DNS results" errors in my logs

After updating my Consul agents this morning I’m seeing a flood of errors in my logs of two types:

[ERROR] agent.dns: error serializing DNS results: error="no data"

and

[ERROR] agent.dns: error processing discovery query: error="not found"

These errors are coming from the agents on the compute nodes rather than the server quorum nodes. I assume they have something to do with DNS requests, but I have no clue what might be causing them.

Anyone have any ideas?

Hello,

I have exactly the same problem.
I was on version 1.18.2, and even after an upgrade to version 1.19, this message persists.

For example, if I try to resolve this:
..service.consul.

the quorum log reports this message.

I welcome your feedback.

I also see this error on Consul 1.19, and I’m not sure what is causing it either.

We had a similar problem after upgrading to v1.19, and service lookups didn’t work correctly. v1.19 has some known issues with DNS: 1.19.x | Consul | HashiCorp Developer.

Upgrading to v1.19.1 resolved the problems for us. See: Release v1.19.1 · hashicorp/consul · GitHub

So I did some digging today. This is all on version 1.19.1.

If you add --log-level=debug to your startup command, you’ll get to see where this error comes from.
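For anyone unsure where that flag goes, it’s just appended to the agent invocation (the config path here is only an example; adjust to your setup):

$ consul agent --config-dir=/etc/consul.d --log-level=debug

With debug logging on, I can now see: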

2024-07-17T18:04:29.437-0700 [ERROR] agent.dns: error serializing DNS results: error="no data"
2024-07-17T18:04:29.437-0700 [DEBUG] agent.dns: no data available: name=myservice.service.consul.

So for some reason, this node is answering that it doesn’t have any DNS data for myservice. What’s weird is that myservice is definitely a legitimate service, and the consul node knows about it:

$ dig @127.0.0.1 -p 8600 +short myservice.service.consul
100.104.105.106

It’s also not specific to one service; different services are named (seemingly at random), though the affected services appear to be a subset of all the services we have.
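In case anyone wants a quick cross-check, the stock CLI can list everything the catalog knows about:

$ consul catalog services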

I was wondering if there was some race condition, so I ran this in a loop:

$ while true; do  dig @127.0.0.1 -p 8600 +short myservice.service.consul; sleep 1; done

and I always seem to get results, even as I watch the Consul node output this message.

Here’s my config, if that’s at all helpful to anyone who stumbles upon this:

advertise_addr = "x.x.x.x"
advertise_addr_ipv4 = "x.x.x.x"

auto_reload_config = true
bind_addr = "0.0.0.0"
bootstrap_expect = 6
check_update_interval = "60s"
client_addr = "0.0.0.0"
data_dir = "/consul"
datacenter = "dc1"

dns_config = {
  allow_stale = true
  max_stale = "45s"

  service_ttl {
    "*" = "60s"
  }
  node_ttl = "300s"

  only_passing = true
}

autopilot {
  min_quorum = 4.0
}

retry_join = ["node1.internal", "node2.internal", "node3.internal", "node4.internal", "node5.internal", "node6.internal"]
server = true

node_name = "node1"

ui_config = {
  enabled = true
}

I’m at a loss as to what the root cause is here, but I’m at least becoming convinced this error is mostly a red herring and doesn’t actually affect anything (as far as I can tell). I’d still love to know what’s causing it, though.

I figured it out!

I had a service that was asking Consul for an IPv6 address (an AAAA DNS record). Consul doesn’t know an IPv6 address for the node because advertise_addr_ipv6 isn’t set in the config, and that is the source of this error. Very specifically, this line asserts that Consul can answer an AAAA request with an IPv6 address; when it can’t, it returns an empty answer section, which is what produces this error text.
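If you want to reproduce it, an explicit AAAA query (reusing the myservice name from my earlier posts) should trigger the log line even while the plain A lookup keeps working:

$ dig @127.0.0.1 -p 8600 +short AAAA myservice.service.consul

The answer section comes back empty, and the agent logs the error at the same moment.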

I don’t think this error message actually affected anything since we just use IPv4 everywhere, which explains why I couldn’t reproduce any failures in my earlier posts.

To fix it, either remove IPv6 from the querying service (which in our case was an external Docker container that had IPv6 networking configured) or set advertise_addr_ipv6 in the config to an address.
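For the second option, the config change is a one-liner; the address below is from the IPv6 documentation range (2001:db8::/32), so substitute your node’s real address:

advertise_addr_ipv6 = "2001:db8::10"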

It would be nice if Consul were a little more helpful in its log output. I’ve opened a PR for that here: Add a debug level logging message for mismatched DNS Records and IPv4/v6 Addresses by keefertaylor · Pull Request #21552 · hashicorp/consul · GitHub.


Ah, this is really helpful, thanks! Now I just need to track down which of my many containers are trying to resolve AAAA records 🙂 I guess this also means it’s not actually an error worth worrying about.
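If anyone else needs to do the same hunt, one low-tech way (assuming the agent serves DNS on the default port 8600) is to watch which clients are sending queries; with -A the queried names are usually readable in the payload:

$ sudo tcpdump -i any -n -A udp port 8600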

Looks like this is going to be fixed in 1.19.2.