Deploy a machine learning model with the designer

Create a real-time inference pipeline

To deploy your pipeline, you must first convert the training pipeline into a real-time inference pipeline. This process removes training modules and adds web service inputs and outputs to handle requests.

Create a real-time inference pipeline

  1. Above the pipeline canvas, select Create inference pipeline > Real-time inference pipeline.Screenshot showing where to find the create pipeline buttonYour pipeline should now look like this:Screenshot showing the expected configuration of the pipeline after preparing it for deploymentWhen you select Create inference pipeline, several things happen:
    • The trained model is stored as a Dataset module in the module palette. You can find it under My Datasets.
    • Training modules like Train Model and Split Data are removed.
    • The saved trained model is added back into the pipeline.
    • Web Service Input and Web Service Output modules are added. These modules show where user data enters the pipeline and where data is returned.
     NoteBy default, the Web Service Input will expect the same data schema as the training data used to create the predictive pipeline. In this scenario, price is included in the schema. However, price isn’t used as a factor during prediction.
  2. Select Run, and use the same compute target and experiment that you used in part one.
  3. Select Deploy.

Create an inferencing cluster

In the dialog box that appears, you can select from any existing Azure Kubernetes Service (AKS) clusters to deploy your model to. If you don’t have an AKS cluster, use the following steps to create one.

  1. Select Compute in the dialog box that appears to go to the Compute page.
  2. On the navigation ribbon, select Inference Clusters > + New.Screenshot showing how to get to the new inference cluster pane
  3. In the inference cluster pane, configure a new Kubernetes Service.
  4. Enter aks-compute for the Compute name.
  5. Select a nearby region that’s available for the Region.
  6. Select Create. NoteIt takes approximately 15 minutes to create a new AKS service. You can check the provisioning state on the Inference Clusters page.

Deploy the real-time endpoint

After your AKS service has finished provisioning, return to the real-time inferencing pipeline to complete deployment.

  1. Select Deploy above the canvas.
  2. Select Deploy new real-time endpoint.
  3. Select the AKS cluster you created.
  4. Select Deploy.Screenshot showing how to set up a new real-time endpointA success notification above the canvas appears after deployment finishes. It might take a few minutes.

Test the real-time endpoint

After deployment finishes, you can test your real-time endpoint by going to the Endpoints page.

  1. On the Endpoints page, select the endpoint you deployed.Screenshot showing the real-time endpoints tab with the recently created endpoint highlighted
  2. Select Test.
  3. You can manually input testing data or use the autofilled sample data, and select Test.The portal submits a test request to the endpoint and shows the results. Although a price value is generated for the input data, it isn’t used to generate the prediction value.Screenshot showing how to test the real-time endpoint with the scored label for price highlighted

Clean up resources

 Important

You can use the resources that you created as prerequisites for other Azure Machine Learning tutorials and how-to articles.

Delete everything

If you don’t plan to use anything that you created, delete the entire resource group so you don’t incur any charges.

  1. In the Azure portal, select Resource groups on the left side of the window.Delete resource group in the Azure portal
  2. In the list, select the resource group that you created.
  3. Select Delete resource group.

Deleting the resource group also deletes all resources that you created in the designer.

Delete individual assets

In the designer where you created your experiment, delete individual assets by selecting them and then selecting the Delete button.

The compute target that you created here automatically autoscales to zero nodes when it’s not being used. This action is taken to minimize charges. If you want to delete the compute target, take these steps:

Delete assets

You can unregister datasets from your workspace by selecting each dataset and selecting Unregister.

Unregister dataset

To delete a dataset, go to the storage account by using the Azure portal or Azure Storage Explorer and manually delete those assets.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s