Bright Cluster Manager leverages the latest NVIDIA Tesla™ V100 GPUs based on the new "Volta" architecture to offer administrators and owners of GPU clusters maximum insight and control.
Bright Cluster Manager can sample and monitor metrics from supported GPU cards and GPU Computing Systems, such as the NVIDIA Tesla V100 and Tesla P100. Examples of supported metrics include GPU temperatures, GPU exclusivity modes, GPU fan speeds, system fan speeds, PSU voltages and currents, system LED states, and GPU ECC memory statistics.
The frequency of metric sampling is fully configurable and so is the consolidation of the metrics data over time. Metrics data is stored in Bright Cluster Manager's central SQL database and can be visualized in value/time graphs, as well as in Bright Cluster Manager's unique Rackview. Bright Cluster Manager leverages NVIDIA’s Data Center GPU Manager (DCGM) for GPU health monitoring, diagnostics and validation, beginning with Version 8.0.
Read more on the NVIDIA – Bright technology partnership here.
-Andy Keane, General Manager of the Tesla Business at NVIDIA
NVIDIA is the pioneer of GPU-accelerated computing. They specialize in products and platforms for the large, growing markets of gaming, professional visualization, data center, deep learning, and automotive.
For more information, visit nvidia.com