
dynatrace-oss/nvidia-observability


Overview

This repository provides guides that show how to set up and monitor the key tiers and components of the NVIDIA enterprise stack, LLMs, and generative AI applications with Dynatrace.

These guides are structured according to the visual shown below.

NVIDIA Enterprise Stack

1. AI Application Observability

1.1 Dynatrace AI Observability

Dynatrace AI Observability brings end-to-end visibility into user interactions, prompt flows, and AI/LLM model performance for your generative AI, agentic, and LLM services.

1.2 NVIDIA NeMo Agent toolkit

The NVIDIA NeMo Agent toolkit is a flexible, lightweight, unifying library that lets you connect existing enterprise agents to data sources and tools across any framework. It uses a plugin-based observability system that supports configuring logging, tracing, and metrics for workflows.

2. NVIDIA NIM, NeMo, and related technologies

Expanded observability coverage of the Prometheus metrics and OpenTelemetry data exposed by the various NVIDIA generative AI and lifecycle management tools and technologies.
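To make "exposed Prometheus metrics" concrete, here is a minimal sketch of reading the Prometheus text exposition format with only the Python standard library. It is an illustration, not part of the guides in this repository: the function `parse_prometheus_text` and the metric names in `EXAMPLE` are hypothetical, and a real setup would normally rely on a collector or on Dynatrace ingestion instead of hand-written parsing.

```python
import re

# One sample line: metric name, optional {labels}, then the value.
# An optional trailing timestamp is ignored.
_SAMPLE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
# One label pair inside the braces, e.g. gpu="0".
_LABEL = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_prometheus_text(text):
    """Return a list of (name, labels, value) tuples from exposition text.

    Simplification: escape sequences inside label values are kept as-is.
    """
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and # HELP / # TYPE comment lines.
        if not line or line.startswith("#"):
            continue
        match = _SAMPLE.match(line)
        if match is None:
            continue
        name, raw_labels, raw_value = match.groups()
        labels = dict(_LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(raw_value)))
    return samples


# Hypothetical payload; these metric names are made up for the example.
EXAMPLE = """\
# HELP example_gpu_utilization_percent Hypothetical GPU utilization.
# TYPE example_gpu_utilization_percent gauge
example_gpu_utilization_percent{gpu="0",node="node-a"} 87
example_gpu_utilization_percent{gpu="1",node="node-a"} 12.5
example_requests_total 42
"""

SAMPLES = parse_prometheus_text(EXAMPLE)

if __name__ == "__main__":
    for name, labels, value in SAMPLES:
        print(name, labels, value)
```

Each sample is a metric name, a set of label dimensions, and a numeric value, which is the shape an observability backend works with when it ingests such endpoints.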

3. Kubernetes

Real-time auto-discovery and analysis of applications, NVIDIA platform components, and infrastructure.

4. GPU telemetry

Understand workload behavior and monitor GPUs in clusters.
