Skip to content

A low-cost Temporal Conv. Network (TCN) solution targeting action segmentation in videos from Kinetics dataset.

Notifications You must be signed in to change notification settings

volkbay/adi_action_segmentation_tcn

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

adi_action_segmentation_tcn

This project was produced during an internship at Analog Devices Inc.

A low-cost Temporal Conv. Network (TCN) solution targeting action segmentation of videos from Kinetics dataset. This code can train and evaluate several models, reports logs and TensorBoard outputs. Most of the candidate models have seperate CNN layers (processing frames) and TCN structure on top (processing in time).

The challenges are:

  • Target hardware is an edge-AI board, meaning it is low in processing power and capacity. Owing to target limitation, we implemented custom efficient PyTorch layers by MaximIntegratedAI/ai8x-training.
  • Kinetics dataset is challeging due to non-consistent photometric features and occasionally non-informative content.
  • Video processing is always expensive, as it deals with 4-dimensions (RGB frames w/ time axis).

image Fig. 1: Basic TCN structure [ref]

📚 Main Dependencies

Plus, all the dependencies indicated in ai8x repo.

🔧 Hardware

Trained models eventually targets Analog Devices MAX78000FTHR. To make it compatible with the hardware, use layers presented in ai8x-training repo, enable QAT (Quantization Aware Training), then sythesize machine code following this repo before deployment.

About

A low-cost Temporal Conv. Network (TCN) solution targeting action segmentation in videos from Kinetics dataset.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published