By: Geoffrey Gao
An accessibility application that allows for hands-free control of a web browser using webcam input. Uses face recognition through face-api.js to determine cursor movement and actions. Includes functionality that allows users to click, scroll, and input text using various head/face motions and expressions. Can be run in the browser as a Chrome Extension or locally/server-side (using Node.js).
The application requires a webcam in order to function. You will be prompted to allow access to the webcam when the application starts. You can confirm that the webcam is functioning correctly via the webcam display in the top right of the application. The facial recognition package should also overlay the face landmark prediction over your image [1]. For proper functionality, please use the application in a well-lit environment with minimal background noise (especially other faces).
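For reference, here is a minimal sketch of what the webcam and detection pipeline might look like with face-api.js; the element id, model path, and polling interval are illustrative, not the project's actual values:

```js
// Illustrative sketch only: "webcam-preview" and "/models" are
// placeholder names, not identifiers from this project.
const video = document.getElementById('webcam-preview');

async function startWebcam() {
  // Triggers the browser's camera permission prompt mentioned above.
  video.srcObject = await navigator.mediaDevices.getUserMedia({ video: true });
  await video.play();
}

async function startDetection() {
  // Load a lightweight detector and the 68-point landmark model
  // before entering the detection loop.
  await faceapi.nets.tinyFaceDetector.loadFromUri('/models');
  await faceapi.nets.faceLandmark68Net.loadFromUri('/models');

  setInterval(async () => {
    const result = await faceapi
      .detectSingleFace(video, new faceapi.TinyFaceDetectorOptions())
      .withFaceLandmarks();
    if (result) {
      // result.landmarks holds the 68 face points that get
      // overlaid on the webcam preview.
      console.log(result.landmarks.positions.length, 'landmarks detected');
    }
  }, 100);
}

startWebcam().then(startDetection);
```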
The cursor is controlled based on the location of the face relative to the webcam input. Clicking is triggered when the user opens their mouth widely. An indicator of a click is the blue cursor [2] turning red. There are two control schemes for cursor movement: Absolute and Relative mode. The two modes can be toggled using the Cursor Control Mode button [3].
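As an illustration of the click gesture, a mouth-open check could be derived from the face-api.js 68-point landmarks as sketched below; the 0.5 aspect-ratio threshold is a hypothetical value, not the application's tuned parameter:

```js
// Hedged sketch of mouth-open detection from face-api.js landmarks.
function isMouthOpen(landmarks) {
  const mouth = landmarks.getMouth(); // points 48-67 of the 68-point model
  // Inner-lip points: landmark 62 (top) and 66 (bottom) are indices
  // 14 and 18 within the mouth array.
  const vertical = Math.abs(mouth[18].y - mouth[14].y);
  // Mouth corners (landmarks 48 and 54) give the mouth width.
  const horizontal = Math.abs(mouth[6].x - mouth[0].x);
  // A tall-relative-to-wide mouth counts as "opened widely".
  return vertical / horizontal > 0.5; // threshold is illustrative
}

// One plausible way to fire the resulting click at the cursor position:
function clickAt(x, y) {
  const target = document.elementFromPoint(x, y);
  if (target) {
    target.dispatchEvent(
      new MouseEvent('click', { bubbles: true, clientX: x, clientY: y })
    );
  }
}
```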
Absolute Control Mode moves the cursor to the position of the user's head. For example, if the user's head is located in the top right of the webcam input, the cursor will be in the same approximate position on the web page. This is the default cursor control setting.
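A minimal sketch of this mapping, assuming the preview acts like a mirror (so the horizontal axis is flipped) and the face position is normalized against the video dimensions:

```js
// Scale the face position in the video frame to page coordinates.
// The horizontal mirroring is an assumption about the preview.
function absoluteCursor(faceX, faceY, videoWidth, videoHeight) {
  return {
    x: (1 - faceX / videoWidth) * window.innerWidth, // mirror horizontally
    y: (faceY / videoHeight) * window.innerHeight,
  };
}
```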
Relative Control Mode utilizes motion control similar to how a game controller joystick would function. At the center of the webcam input is a "deadzone". If the center of the user's face is within the deadzone, the cursor will not move. If the user moves their face out of the deadzone, the cursor will move in that relative direction. The webcam input is divided into 8 sections, each corresponding to a direction of movement (e.g., up, down-right, left, etc.). The magnitude of the cursor movement is determined by the distance from the deadzone (i.e., the farther out the user moves their head, the faster the cursor will move in that direction).
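A sketch of one way this scheme could be computed; the deadzone radius, speed cap, and exact sector snapping below are illustrative values rather than the application's actual parameters:

```js
const DEADZONE = 0.1;  // deadzone radius as a fraction of the frame (hypothetical)
const MAX_SPEED = 15;  // pixels per tick at the frame edge (hypothetical)

function relativeVelocity(faceX, faceY, videoWidth, videoHeight) {
  // Offset of the face center from the frame center, roughly in [-0.5, 0.5].
  const dx = faceX / videoWidth - 0.5;
  const dy = faceY / videoHeight - 0.5;
  const distance = Math.hypot(dx, dy);
  if (distance < DEADZONE) return { vx: 0, vy: 0 }; // inside the deadzone: no movement

  // Snap the direction to one of the 8 sectors (up, up-right, right, ...).
  const sector = Math.round(Math.atan2(dy, dx) / (Math.PI / 4)) * (Math.PI / 4);

  // Speed grows with distance past the deadzone edge, capped at MAX_SPEED.
  const speed = Math.min((distance - DEADZONE) / (0.5 - DEADZONE), 1) * MAX_SPEED;
  return { vx: Math.cos(sector) * speed, vy: Math.sin(sector) * speed };
}
```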
For long web pages, the user can scroll up and down using the two scroll buttons [4]. The user should interact with these buttons as they would any other page element (i.e., by opening and closing their mouth).
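One plausible wiring for these buttons (the element ids and the 300-pixel step are illustrative):

```js
// A mouth-triggered "click" on either button scrolls the page by a fixed step.
document.getElementById('scroll-up')
  .addEventListener('click', () => window.scrollBy({ top: -300, behavior: 'smooth' }));
document.getElementById('scroll-down')
  .addEventListener('click', () => window.scrollBy({ top: 300, behavior: 'smooth' }));
```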
Text forms can be filled in using a virtual keyboard. Simply interact with a text input element (i.e., by opening and closing the mouth) to open the virtual keyboard. Note: Currently the virtual keyboard only works with websites that implement text inputs as HTML input elements.
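The sketch below illustrates why native inputs are required: the keyboard would only open when the clicked target is a real `<input>` or `<textarea>`, and each virtual key writes into `element.value`. Custom widgets that are not real input elements never match the check. The `openVirtualKeyboard` helper is hypothetical:

```js
// Only native input elements pass this check, which matches the
// limitation noted above.
function handleCursorClick(target) {
  if (target.matches('input, textarea')) {
    openVirtualKeyboard(target); // hypothetical helper for this sketch
  }
}

function pressKey(target, char) {
  target.value += char;
  // Dispatch an 'input' event so page scripts observe the programmatic change.
  target.dispatchEvent(new Event('input', { bubbles: true }));
}
```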
- Clone the repository and locate the folder `extension` within the `Chrome Extension` folder
- Open Google Chrome and navigate to `chrome://extensions` in the address bar
- Turn on the switch on the top right of the page that says "Developer Mode" [1]
- Click on the button on the top left of the page that says "Load unpacked", then select the location of the `extension` folder [2]
- Confirm the extension is loaded [3]
Note: The extension files can be rebuilt if any modifications are to be made. The source code can be found in the `handsfreecontroller` folder. The dependencies will need to be installed using `npm install`. The `content.js` file can be regenerated using `npm run build` in the terminal.
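For orientation, a minimal Manifest V3 declaration along these lines would load the built content script; the name, version, and match pattern below are placeholders, and the extension's actual manifest may declare more:

```json
{
  "manifest_version": 3,
  "name": "Hands-Free Controller",
  "version": "1.0",
  "content_scripts": [
    {
      "matches": ["<all_urls>"],
      "js": ["content.js"]
    }
  ]
}
```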
- Clone the repository and navigate into the `NodeJS` folder
- Install the dependencies by running the command `npm install` in the terminal
- Run the application using `npm run start`
Note: The web page is injected as an HTML component (this was implemented for testing purposes). If running your own application, the `HTML.js` file will need to be modified.
- JavaScript/React
- Node.js
- Manifest V3 (Google Chrome Extension)
This is an ongoing project and will be continually developed. Here is a current list of areas of interest for improvement:
- Improve the relative cursor control motion scheme; the current relative motion parameters are somewhat difficult to use
- Add functionality using facial expressions (e.g., click and drag, copy/paste, etc.)
- Calibration tools to allow for a personalized experience
- Implement predictive text engine for virtual keyboard
- Allow client-wide interaction (e.g., opening a new tab, entering a new URL)
- UI/UX improvements for accessibility
- Occasional console warning regarding the `willReadFrequently` attribute. Note: this bug does not hinder performance
- Cursor cannot interact with highly stylized/compartmentalized elements. For instance, some text inputs are not implemented as HTML text or input elements
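If this warning stems from repeated pixel readbacks during frame processing (an assumption), it can typically be silenced by opting into the hint when the canvas context is created:

```js
// Passing willReadFrequently when creating the 2D context tells the
// browser to optimize for frequent getImageData() calls and removes
// the warning.
const canvas = document.createElement('canvas');
const ctx = canvas.getContext('2d', { willReadFrequently: true });
```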
Send bug reports to [email protected]
Copyright (c) 2023 Geoffrey Gao