
Computer Pointer Controller

The project allows controlling the mouse cursor with human gaze. Four pretrained models are used to detect the human face, head pose, facial landmarks and, finally, the eye gaze direction.

Project Set Up and Installation

This project uses the Intel OpenVINO distribution to perform inference. Please find the installation instructions for this dependency at the following link: https://docs.openvinotoolkit.org/2020.1/_docs_install_guides_installing_openvino_windows.html

The models used for inference must be downloaded locally; please follow the steps described below:

  1. Clone the repository

  2. Go to the directory of the OpenVINO installation to reach the downloader tool (your path may differ depending on the location of the OpenVINO distribution on your computer as well as the version of OpenVINO you have installed; please note that the installation instructions are given for Windows): cd C:\Program Files (x86)\IntelSWTools\openvino_2020.3.194\deployment_tools\open_model_zoo\tools\downloader

  3. Install dependencies for downloader.py: pip install requests pyyaml

  4. Download each of the 4 models using the downloader, specifying the output directory:

python downloader.py --name head-pose-estimation-adas-0001 -o C:\Users\frup75275\Documents\OpenVinoProject3
python downloader.py --name face-detection-adas-binary-0001 -o C:\Users\frup75275\Documents\OpenVinoProject3
python downloader.py --name landmarks-regression-retail-0009 -o C:\Users\frup75275\Documents\OpenVinoProject3
python downloader.py --name gaze-estimation-adas-0002 -o C:\Users\frup75275\Documents\OpenVinoProject3
  5. Create a virtual environment in your project directory:
python -m venv env
  6. Activate the newly created virtual environment from your project directory:
.\env\Scripts\activate
  7. Install dependencies:
pip install -r requirements.txt
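
Optionally, once the OpenVINO environment has been initialized (see the Demo section below), you can verify from the activated virtual environment that the OpenVINO Python API is importable and lists your inference devices; this check is a suggestion, not part of the original setup:

python -c "from openvino.inference_engine import IECore; print(IECore().available_devices)"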

Demo

  1. Initialize the OpenVINO environment:
cd C:\Program Files (x86)\IntelSWTools\openvino_2020.3.194\bin
setupvars.bat
  2. Activate the virtual environment from your project directory:
.\env\Scripts\activate
  3. Go to the src subfolder inside your project directory:
cd src
  4. Run the main.py file:
python main.py
  5. You may specify optional arguments if you intend to use a different input, device or models.

5.1. Specify your models:

python main.py ^
-fd ../intel/face-detection-adas-binary-0001/FP32-INT1/face-detection-adas-binary-0001.xml ^
-lr ../intel/landmarks-regression-retail-0009/FP32-INT8/landmarks-regression-retail-0009.xml ^
-hp ../intel/head-pose-estimation-adas-0001/FP32-INT8/head-pose-estimation-adas-0001.xml ^
-ge ../intel/gaze-estimation-adas-0002/FP32-INT8/gaze-estimation-adas-0002.xml

5.2. Specify your input. A video source:

-i video ../bin/demo.mp4

Or a webcam source:

-i cam

5.3. Specify your device (CPU by default), which can be CPU, GPU, FPGA or MYRIAD (a VPU device):

-d GPU

5.4. Specify the probability threshold (default is 0.6):

-prob 0.8

5.5. Specify the output folder for the output video file (default is /results):

-o /your_path

5.6. You may also use -flags to visualize the output of the models used in the project (by default it is set to visualize the output from GazeEstimationModel):

-flags fd

Below is a list of the possible arguments with the corresponding model's output to be visualized:

  • fd: for FaceDetectionModel
  • lr: for LandmarkRegressionModel
  • hp: for HeadPoseEstimationModel
  • ge: for GazeEstimationModel
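
Putting it together, a complete run combining several of the options above might look as follows; the values are illustrative and any omitted flags fall back to their defaults:

python main.py -i cam -d CPU -prob 0.8 -o /your_path -flags fd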

Documentation

The project contains 6 subfolders (src, intel, bin, benchmarks, env, results), where:

  • /src holds the code,
  • /intel contains the models,
  • /bin contains the example video file,
  • /benchmarks contains the graphs produced by the benchmark tests,
  • /env holds the virtual environment setup,
  • /results holds the output video files with masks.

The main script is maintained in the main.py file, whereas the classes that run inference on the models live in the model_ prefixed files. Two additional classes - input_feeder.py and mouse_controller.py - provide handling of the batch feed and mouse manipulation respectively. The root directory contains the README.md and requirements.txt files, which should help with the required installations and running the project.
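
As an illustration, the model_ prefixed classes in projects of this kind usually wrap the OpenVINO 2020.x Python API in a small load/predict interface. The sketch below is a hypothetical example of that pattern; the exact class and method names in this repository may differ:

import cv2
from openvino.inference_engine import IECore

# Hypothetical wrapper sketch following the common OpenVINO 2020.x pattern;
# names and signatures are illustrative, not this repository's exact API.
class FaceDetectionModel:
    def __init__(self, model_xml, device="CPU", prob_threshold=0.6):
        self.ie = IECore()
        # Read the IR pair (.xml topology, .bin weights) and load it on the device.
        self.net = self.ie.read_network(model=model_xml,
                                        weights=model_xml.replace(".xml", ".bin"))
        self.exec_net = self.ie.load_network(network=self.net, device_name=device)
        self.input_name = next(iter(self.net.input_info))
        self.output_name = next(iter(self.net.outputs))
        self.prob_threshold = prob_threshold

    def predict(self, image):
        # Resize the BGR frame to the model's NCHW input shape and run
        # synchronous inference, returning the raw detection tensor.
        n, c, h, w = self.net.input_info[self.input_name].input_data.shape
        blob = cv2.resize(image, (w, h)).transpose((2, 0, 1)).reshape((n, c, h, w))
        result = self.exec_net.infer({self.input_name: blob})
        return result[self.output_name]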

Inference pipeline description

The gaze estimation model requires three parameters - the arrays of the 2 eye images and the head pose angles - which are received from the landmark detection and head pose models respectively. Those two models in turn require the array of the cropped human face image produced by the face detection model. Here are the prediction outputs after inference on a single frame:

face_coords, cropped_image.shape:  [[790, 102, 1036, 469]] (367, 246, 3)
left_eye_image.shape, right_eye_image.shape, eye_coords:  (20, 20, 3) (20, 20, 3) [[62, 137, 82, 157], [178, 137, 198, 157]]
pose_output [5.980412, -9.124736, -1.4861512]
mouse_coords, gaze_vector (0.49180450284510924, 0.1335856156718123) [ 0.49510366  0.12078557 -0.7978018 ]
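
The flow can be summarized with the following per-frame sketch, consistent with the shapes printed above; the class and method names are assumptions for illustration, not the repository's exact API:

def run_pipeline(frame, face_model, landmarks_model, head_pose_model,
                 gaze_model, mouse_controller):
    # Hypothetical per-frame pipeline; names are illustrative.
    # 1. Detect and crop the face from the raw frame.
    face_coords, face_image = face_model.predict(frame)
    # 2. Landmarks yield the two 20x20x3 eye crops; head pose yields
    #    the [yaw, pitch, roll] angles.
    left_eye, right_eye, eye_coords = landmarks_model.predict(face_image)
    pose_angles = head_pose_model.predict(face_image)
    # 3. The gaze model takes both eye crops plus the pose angles and returns
    #    a 3D gaze vector; its x/y components are mapped to mouse coordinates.
    mouse_coords, gaze_vector = gaze_model.predict(left_eye, right_eye, pose_angles)
    mouse_controller.move(*mouse_coords)
    return gaze_vector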

Benchmarks

The benchmark tests were performed in the DL Workbench metrics tool developed by Intel, using the CelebA dataset (http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html). The pretrained model from the OpenVINO model zoo available in DL Workbench (face-detection-adas-0001) was tested with 1 to 4 parallel streams and batch sizes from 1 to 30 (with a batch step of 10) on a CPU device.

Results

The graph below shows that the lowest latency was achieved with a batch size of 1 (42.22 ms), whereas latency for batch sizes of 10 (884.9 ms), 20 (1,811.86 ms) and 30 (2,633.22 ms) scaled roughly proportionally with batch size (about 88-91 ms per frame), showing no significant difference in per-frame latency. The throughput for those 3 cases showed only minor fluctuations (11.36 fps - 12.12 fps). A significant increase in throughput was achieved by raising the number of parallel streams to 4, where the batch size mostly influenced latency: 1,824.31 ms for a batch size of 30 vs 1,189.09 ms for a batch size of 10, with only minor fluctuations in throughput (31.96 fps vs 32.59 fps respectively).

Group inference results

The execution time by layer also shows that convolution took the most time for this model; therefore, any optimization effort should first address the possibilities for optimizing the convolution layers.

face-detection-adas-0001
