adi_action_segmentation_tcn

This project was produced during an internship at Analog Devices Inc.

A low-cost Temporal Conv. Network (TCN) solution targeting action segmentation of videos from Kinetics dataset. This code can train and evaluate several models, reports logs and TensorBoard outputs. Most of the candidate models have seperate CNN layers (processing frames) and TCN structure on top (processing in time).

The challenges are:

Target hardware is an edge-AI board, meaning it is low in processing power and capacity. Owing to target limitation, we implemented custom efficient PyTorch layers by MaximIntegratedAI/ai8x-training.
Kinetics dataset is challeging due to non-consistent photometric features and occasionally non-informative content.
Video processing is always expensive, as it deals with 4-dimensions (RGB frames w/ time axis).

Fig. 1: Basic TCN structure [ref]

📚 Main Dependencies

ai8x-training
PyTorch v1.8.1
torchvision v0.9.1
NumPy, Pickle, PyYAML, distiller.

Plus, all the dependencies indicated in ai8x repo.

🔧 Hardware

Trained models eventually targets Analog Devices MAX78000FTHR. To make it compatible with the hardware, use layers presented in ai8x-training repo, enable QAT (Quantization Aware Training), then sythesize machine code following this repo before deployment.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
bin/model		bin/model
dat		dat
doc		doc
log		log
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

adi_action_segmentation_tcn

📚 Main Dependencies

🔧 Hardware

About

Releases

Packages

Languages

volkbay/adi_action_segmentation_tcn

Folders and files

Latest commit

History

Repository files navigation

adi_action_segmentation_tcn

📚 Main Dependencies

🔧 Hardware

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages