Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tracing and diagnostics logging #489

Open
9 tasks
BjarkeMeier opened this issue Aug 21, 2023 · 0 comments
Open
9 tasks

Tracing and diagnostics logging #489

BjarkeMeier opened this issue Aug 21, 2023 · 0 comments
Assignees
Labels

Comments

@BjarkeMeier
Copy link
Contributor

BjarkeMeier commented Aug 21, 2023

Synopsis

As a DataHub stakeholder
I want operations (the developers) to be able to efficiently identify the cause of system issues
So that we can quickly fix issues

"An application is properly instrumented when developers don’t need to add more instrumentation to troubleshoot an issue, because they have all of the information they need."
-- https://opentelemetry.io/docs/concepts/observability-primer/

Acceptance Criteria

  • ADR-123 has been approved
  • All domains adhere to ADR-123
  • All logging from DataHub 3 code targets Application Insights
  • All operationally connected cloud instances are also connected on the Application Map in Application Insights (for example a completed calculation in wholesale must be traceable over the Service Bus to the EDI domain)
  • All teams have been introduced to how to write good tracing and diagnostics logging
  • All domains have been extended with selected logging (hosted services started? use cases committed? ...? what are the general guidelines - if any?)
  • All teams have been introduced to how to use Application Insights efficiently for trouble-shooting tracing and diagnostics problems
  • An Azure dashboard shows the system-wide diagnostics state of DataHub 3. It provides a first-attempt solution for a single place to look to see if there are any issues that need attention from operations. It's useful to the 3rd-level support.
  • Log-level can be changed in production to e.g. enable debug level for further details (how easy should this be and at what granularity level?)

Tech Notes

Remember Databricks and please note that IHostedServices doesn't hook into the ASP.NET middleware pipeline.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants