feat: allow setting Datadog syslog.hostname attribute #87

btkostner · 2022-05-23T20:59:29Z

What is breaking

When logging to Datadog from a kubernetes cluster, the syslog hostname is set to the container hostname. This means Datadog will try to use the container hostname as the actual host name for metrics, traces, etc. Because of this, nothing will link up successfully. Sadly you can't rewrite that or do some sort of pipeline fix for this.

Why this change fixes it

Removing the syslog hostname means that dd-agent will fall back to its native hostname logic. This will pick up the same hostname when running on a regular server (non containerized), and it will pick up the correct (node) host name when running in a container (kubernetes).

coveralls · 2022-05-23T21:36:45Z

Coverage increased (+0.2%) to 75.494% when pulling fe26628 on btkostner:patch-1 into 349c817 on Nebo15:master.

AndrewDryga · 2022-05-30T22:45:54Z

Hello @btkostner, I'm not sure about this one because it would be a breaking change for users of DataDog agents that rely on a hostname to be an Erlang one. How DD-agent hostname is different? Are you sure that we can't read it in the same way as DD-agent does and give you an option to override the hostname?

btkostner · 2022-05-30T23:35:26Z

The only time it differs is when running Elixir in a container, because it grabs the container hostname instead of the host's hostname. Sadly this isn't something I can overwrite in the agent configuration.

AndrewDryga · 2022-05-31T16:31:53Z

@btkostner do you run it in Kubernetes or some other environment?

btkostner · 2022-05-31T16:32:47Z

@AndrewDryga Yes, this is in kubernetes. I can show some screenshots in Datadog of some different setups if you want more clarity of what's happening.

AndrewDryga · 2022-05-31T16:36:28Z

@btkostner I think there is a way to do this without introducing breaking changes: you can expose the hostname using an environment variable and then read it and pass it to the logger (https://kubernetes.io/docs/tasks/inject-data-application/environment-variable-expose-pod-information/) so that both DD-agent values and Logger one would be equal. In logger we can allow overriding the host with anything you want.

btkostner · 2022-05-31T16:42:20Z

@AndrewDryga That would work. We could also just make a bool to include the hostname and have it default to true. Both options work and I can update the PR with the solution. Which one would you prefer?

AndrewDryga · 2022-05-31T17:33:48Z

@btkostner Great! I can share the plan that I have in my head that should fit all the use-cases:

We can add a formater_state here and a formatter_opts to the configuration. It would allow passing any formatter-specific options when the logger is started and make sure that people won't do expensive calls for each of the log entries. It can be used like so (eg. in release.exs):

config :logger_json, :backend,
  json_encoder: Jason,
  formatter: LoggerJSON.Formatters.GoogleCloudLogger,
  formatter_opts: [hostname: System.get_env("KUBERNETES_NODE_HOSTNAME")]

To make it work we probably would need to extend the behavior (https://github.com/Nebo15/logger_json/blob/master/lib/logger_json/formatter.ex) and add an init callback that will receive formatter_opts and return formatter_state which would be saved and passed to format_event. New init callback should be called somewhere here to make sure it would support runtime reconfiguration.

When the backend initializes we can check if such an option was given and:

if it's a string just write it to state variable formatter_opts and use it when we format the JSON;
if it's a :from_system_agent atom DataDog formatter can skip writing it entirely (your use-case);
if it's unset we read the hostname as it's done now, persist it to formatter_opts and use that, so by default behavior would not change.

AndrewDryga · 2022-05-31T17:34:12Z

I know it shoulds like a lot, if you feel like it's too much I'll handle this myself but a bit later

btkostner · 2022-05-31T17:37:38Z

That looks like a solid well thought out plan. I'll start working on it today 👍

btkostner · 2022-05-31T23:12:45Z

Alrighty, PR is updated. Let me know how it looks 👍

btkostner · 2022-06-21T02:19:12Z

As a stop gap, I found a way to unset that value in Datadog itself. You can remove syslog.hostname from the hostname attributes by editing the "Preprocessing for JSON logs" pipeline.

AndrewDryga · 2022-06-21T15:10:48Z

lib/logger_json/formatter.ex

@@ -16,6 +23,7 @@ defmodule LoggerJSON.Formatter do
              msg :: Logger.message(),
              ts :: Logger.Formatter.time(),
              md :: [atom] | :all,
-              state :: map
+              state :: map,
+              formatter_state :: Keyword.t()


I think we should use a map for the state. Keywords have higher complexity for reading operations because sometimes we need to traverse the full list to get a value, which would be harmful as the state grows. Maps are highly optimized for that in Erlang:

Suggested change

formatter_state :: Keyword.t()

formatter_state :: map()

AndrewDryga

One small change and looks good, thank you!

AndrewDryga · 2022-06-21T15:12:44Z

Hey @btkostner sorry for taking it so long, was distracted by life and work :). I think this still is a great change that can be used by other providers so let's merge it anyways, even with a workaround.

btkostner · 2022-06-21T16:01:25Z

@AndrewDryga No problem. I figured there was more important stuff happening in your life. No need to apologize.

AndrewDryga · 2022-06-21T17:04:21Z

Thank you @btkostner, great work! Can you run this in your project and verify it's what you need? I'll release a new hex version once I get feedback from you and make sure it's stable.

btkostner · 2022-06-21T17:49:21Z

Alright. Aside from the small doc fix I opened above, it looks to be working for me. I'm going to put it up on our staging cluster to make sure there are no side effects, but it should be good from my end.

Remove syslog hostname from datadog formatter

0869594

btkostner added 2 commits May 31, 2022 16:37

feat: support init/1 callback and formatter_state

4cf3d70

feat: allow setting datadog syslog.hostname attribute

a07bf0b

btkostner changed the title ~~Remove syslog hostname from datadog formatter~~ feat: allow setting Datadog syslog.hostname attribute May 31, 2022

AndrewDryga reviewed Jun 21, 2022

View reviewed changes

convert formatter_opts to a map

fe26628

AndrewDryga approved these changes Jun 21, 2022

View reviewed changes

AndrewDryga merged commit cad53fe into Nebo15:master Jun 21, 2022

btkostner deleted the patch-1 branch June 21, 2022 17:36

btkostner mentioned this pull request Jun 21, 2022

Update formatter_opts documentation #88

Merged

AndrewDryga mentioned this pull request Oct 21, 2022

App not passing FQDNs to datadog. #97

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: allow setting Datadog syslog.hostname attribute #87

feat: allow setting Datadog syslog.hostname attribute #87

btkostner commented May 23, 2022

coveralls commented May 23, 2022 •

edited

Loading

AndrewDryga commented May 30, 2022 •

edited

Loading

btkostner commented May 30, 2022

AndrewDryga commented May 31, 2022

btkostner commented May 31, 2022

AndrewDryga commented May 31, 2022 •

edited

Loading

btkostner commented May 31, 2022

AndrewDryga commented May 31, 2022 •

edited

Loading

AndrewDryga commented May 31, 2022

btkostner commented May 31, 2022

btkostner commented May 31, 2022

btkostner commented Jun 21, 2022

AndrewDryga Jun 21, 2022

AndrewDryga left a comment

AndrewDryga commented Jun 21, 2022

btkostner commented Jun 21, 2022

AndrewDryga commented Jun 21, 2022

btkostner commented Jun 21, 2022

feat: allow setting Datadog syslog.hostname attribute #87

feat: allow setting Datadog syslog.hostname attribute #87

Conversation

btkostner commented May 23, 2022

What is breaking

Why this change fixes it

coveralls commented May 23, 2022 • edited Loading

AndrewDryga commented May 30, 2022 • edited Loading

btkostner commented May 30, 2022

AndrewDryga commented May 31, 2022

btkostner commented May 31, 2022

AndrewDryga commented May 31, 2022 • edited Loading

btkostner commented May 31, 2022

AndrewDryga commented May 31, 2022 • edited Loading

AndrewDryga commented May 31, 2022

btkostner commented May 31, 2022

btkostner commented May 31, 2022

btkostner commented Jun 21, 2022

AndrewDryga Jun 21, 2022

Choose a reason for hiding this comment

AndrewDryga left a comment

Choose a reason for hiding this comment

AndrewDryga commented Jun 21, 2022

btkostner commented Jun 21, 2022

AndrewDryga commented Jun 21, 2022

btkostner commented Jun 21, 2022

coveralls commented May 23, 2022 •

edited

Loading

AndrewDryga commented May 30, 2022 •

edited

Loading

AndrewDryga commented May 31, 2022 •

edited

Loading

AndrewDryga commented May 31, 2022 •

edited

Loading