Cassie Breviu

Posted on Jun 25, 2019

How to Get a Dataset and Create a ML Model to Classify Yoga Poses

#machinelearning #azure #yoga #cognitiveservices

This tutorial will go over how to use Azure Cognitive Services to automatically get an image dataset and train a Custom Vision machine learning model. The github repo has a simple nodejs console app that uses Bing Image Search to scrape images to a local directory based on search term(s). Then use the images scraped to create a Custom Vision model with Azure Cognitive Services.

Prerequisites

First create an Azure account (if you dont have one already)

Click here to create your account

Next we will create a Bing Search Api Resource needed to get our dataset

Go to portal.azure.com
Select "Create Resource"
Select "AI + Machine Learning"
Select "Bing Search v7"
Complete the required fields
Select "Create"
When the deployment succeeds you will get a notification in the top right corner.
Select "Go to resource"
Select "Keys" from the left hand navigation for the resource.
Copy the Subscription Key and paste it into the script.

Cognitive Service Bing Search Docs

Image Scraper using Bing Search Cognitive Service

We are going to use the image search function of this service however you can use this same endpoint to search web, news, image, video and entity. To run the Image Scraper Script with the Bing Search Cognitive Service resource we created in Azure follow the below steps.

Clone the repo, install packages, and open in VS Code

git clone https://github.com/cassieview/Use-ML-To-Classify-Yoga-Poses.git
cd Use-ML-To-Classify-Yoga-Poses
npm install
code .

Add the key into the app.js script javascript const subscriptionKey = 'addyourkeyhere'
Run the script node app.js

When the script completes you will have a new folder named poses with subfolders for each yoga pose specified in the search term collection. Within each pose folder there will be a test and train folder. The data is split at aproximately 70% to train and 30% to test.

Now that we have a dataset lets create our Custom Vision API Resource to train a model that will recognize yoga poses

Go to portal.azure.com
Select "Create Resource"
Search "Custom Vision"
Select "Create"
Complete the required fields
Select "Create"
When the deployment succeeds you will get a notification in the top right corner.
Select "Go to resource"

Create a Custom Vision Project and Train Model

Select "Quick Start"
Select "Custom Vision Portal"
Select 'Sign in"
Select "New Project"
- Enter a Name and Resource Group
- Project Types: Classificaion
- Classification Types: MultiClass General
- Domains: General
Click Create project
Now we need to add and tag images
- Select "Add Images" and navigate to the train folder for the scraped images
- Ctrl + A to select all the images in the folder
- Select "Open"
- Add tag for photos to indicate the yoga pose
- Select "Upload photos"
- Repeat these steps for each yoga pose since each yoga pose is a class in our classification model.
- Review each class to see if the images are correct. THere are some images that are returned that are either incorrect or not good examples. If you dont have good data you wont get good results. Review and update the data so that the classification data for each pose is a good, clear dataset.
Once all the data is uploaded and tagged select "Train"
They API will provide some stats on how your model is performing. Use this is an indication of how its preforming but not and absolute truth. Test your model with the data from the scarped images in the test folder. Its important to test your model on unseen data.
Steps to upload a test image and test your model.
- Select "Quick Test" from the upper right corner
- Browse to local files and select a image from the test folder
- The predication will appear on the right side of the window
Once you have tested your model and feel you have a good dataset. Its time to operationalize the selected model. This is super easy with the Azure endpoints. No need to deploy anything. In the next steps I will show you how to grab the URLs to integrate your model into your project.

Get the Prediction Url

Select "Performace" from the top navigation in the Custom Vision portal
Select "Prediction Url"
Copy the Url to either use an image url or image file
Additionally by selecting the settings icon in the right corner will bring you to the keys and urls to call the endpoint.

Consume the API in an App

Once the model is trained and at a satisfactory accuracy its ready to be used in your App

Go to the Resource Group that contains the Custom Vision Resource
There should be a resource named "YourCustomVisionResourceName_Prediction - Quick start"
The Select "Api Reference" from the list of helpful links
This will open a page with docs on how to call the api to get predictions from the model
If you scoll all the way to the bottom there are code samples in multiple languages to get you started

I love to test APIs with Postman. Here are steps so use postman

Get the predction url as described above
Select "Post" from the dropdown and paste in the url
Go to the settings and get the Prediction Key
In the headers add the Prediction-key as the key and the actual key as the value
Auth should be set to no auth
Select Body > Raw > Json
Paste in the following Json for the body of the request

{ "Url": "UrlForImageGoesHere" }

Send Request

This same logic can be applied to all sorts of image classification problems. If you have quesitons or comments please leave them below. Thanks!

Helpful Doc Links

Quickstart: Search for images with the Bing Image Search SDK for Node.js
Quickstart: Create an image classification project with the Custom Vision Node.js SDK
Cognitive Service Bing Search Docs
Cognitive Service Custom Vision Docs
Create a Node.js app in Azure App Service on Linux

Top comments (1)

berlin • Sep 6 '23

Your input helps us continually improve how to use yoga wheel our offerings.Your input helps us continually improve our offerings.

DEV Community