-
Notifications
You must be signed in to change notification settings - Fork 9
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add DCGMMonitor plugin document (#34)
* docs: add anomaly detection algorithm docs * docs: format anomaly detection algorithm docs * docs: add OpenAIMonitor and LangChainMonitor plugin document * docs: add DCGMMonitor plugin document * docs: add DCGMMonitor plugin document --------- Co-authored-by: wsy327643 <[email protected]>
- Loading branch information
1 parent
94d02c5
commit 0850fc2
Showing
14 changed files
with
47 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
# dcgmMonitor 插件 | ||
在您GPU机器上部署k8s环境,并且安装dcgm-exporter和Holoinsight-agent,具体安装方法见文档 | ||
|
||
[**dcgm-exporter**](https://github.com/NVIDIA/dcgm-exporter#quickstart-on-kubernetes) | ||
|
||
[**holoinsight-agent**](https://traas-stack.github.io/holoinsight-docs/en/operations/deployment/k8s.html#deploy-holoinsight-agent) | ||
|
||
安装好之后默认会采集GPU数据 | ||
打开页面 http://localhost:8080/integration/agentComp?tenant=default. | ||
|
||
在集成组件页面安装DCGMMonitor插件 | ||
![dcgm1.png](dcgm1.png) | ||
点击预览 | ||
![dcgm2.png](dcgm2.png) | ||
|
||
可以自动生成dcgm监控仪表盘,监控GPU信息 | ||
![dcgm3.png](dcgm3.png) | ||
|
||
![dcgm4.png](dcgm4.png) | ||
|
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# dcgmMonitor 插件 | ||
Deploy the k8s environment on your GPU machine, and install dcgm-exporter and Holoinsigh-Agent, as described in the documentation | ||
|
||
[**dcgm-exporter**](https://github.com/NVIDIA/dcgm-exporter#quickstart-on-kubernetes) | ||
|
||
[**holoinsight-agent**](https://traas-stack.github.io/holoinsight-docs/en/operations/deployment/k8s.html#deploy-holoinsight-agent) | ||
|
||
By default, GPU data is collected after installation | ||
|
||
Open page http://localhost:8080/integration/agentComp?tenant=default. | ||
|
||
Install the DCGMMonitor plug-in on the Integration Components page | ||
|
||
![dcgm1.png](dcgm1.png) | ||
Click to preview | ||
![dcgm2.png](dcgm2.png) | ||
|
||
DCGMMonitor dashboards can be automatically generated to monitor GPU information | ||
![dcgm3.png](dcgm3.png) | ||
|
||
![dcgm4.png](dcgm4.png) | ||
|
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.