The Azure ML Observability solution accelerator provides libraries for model monitoring and data drift analysis.
Read our official blog post at: https://techcommunity.microsoft.com/t5/ai-machine-learning-blog/azureml-observability-a-scalable-and-extensible-solution-for-ml/ba-p/3474066
How to trust a deployed model, a model in operation?
In order to answer the question, we need to have an Observability. The observability is a qualitative indicator that a monitoring solution helps the monitoring consumer achieve the satisfactory level of control of a defined service, where monitoring provides service consumers with a suitable range of monitoring capabilities and perspectives.
To have the observability, we need to monitor metrics and track logs. Monitoring is about using various metrics to measure the model behavior. And to compute the metrics we need collect records of data in logs that are produced during the prediction of models. So tracking logs has to be done before computing the metrics.
If you are new to these tasks, it is not easy to start. So this solution accelerator will provide easy way to provision associated Azure Resources to collect logs and calculate metrics. Even you can use dashboard templates that are included in this repo to monitor metrics.
In addition to that, this solution will provide a library to help you to monitor data drift.
Library components
Name | Description |
---|---|
Data Collection |
Capture data from multiple sources, Spark, Real-time and Batch |
Monitoring |
Visualize metrics |
Drift Analysis |
Detect data drift |
Management |
Provision Azure Resources (infrastructure) for AML-Observability |
The solution is built on top of Azure Data Explorer and Azure ML
To install early release version:
-
For Data Collection to ingest data:
pip install --upgrade git+https://github.com/microsoft/AzureML-Observability#subdirectory=aml-obs-collector
-
For client library:
pip install --upgrade git+https://github.com/microsoft/AzureML-Observability#subdirectory=aml-obs-client
pip install azure-ai-ml==0.0.62653692 --extra-index-url https://azuremlsdktestpypi.azureedge.net/sdk-cli-v2
This notebook will help you to understand how to provision Azure Resources
This notebook will help you to understand how to collect data. To monitor metrics, you need to import ADX dashboard template to ADX that you created from the previous notebook _1. Solution provisioning_
This notebook will help you to understand how to run data drift detection job
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.
When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.