ViT-Video-Feature-Extraction

This repository contains scripts for extracting keyframes from video files, extracting features using a Vision Transformer (ViT) model, and utilizing a Long Short-Term Memory (LSTM) network for classification.

Keyframe Extraction (`key_frame_extraction.py`)

Overview

The key_frame_extraction.py script extracts keyframes from video files. Keyframes are sampled from the video, either by duplicating frames for videos with fewer frames than required or by extracting exactly n keyframes for larger videos.

Usage

Set the video_path variable in the script to the path of your video file.
Run the script:
```
python key_frame_extraction.py
```

Vision Transformer Feature Extraction (`image_feature_extraction_with_ViT.py`)

Overview

The image_feature_extraction_with_ViT.py script extracts features from image frames using a pre-trained Vision Transformer (ViT) model. The script utilizes the timm library for model creation.

Usage

Set the path variable in the script to the path of your image file.
Adjust the image_size variable as needed.

Run the script:

python image_feature_extraction_with_ViT.py

LSTM Classification (`lstm.py`)

Overview

The lstm.py script uses an LSTM network for classification based on features extracted from keyframes. It loads features from CSV files, preprocesses the data, builds an LSTM model, trains the model, evaluates its performance, and saves the model for future use.

Usage

Ensure CSV files with extracted features are available in the specified folder_path.
Run the script:
```
python lstm.py
```

Requirements

Python
Libraries: numpy, pandas, keras, scikit-learn, matplotlib, seaborn, timm

You can install the required libraries using pip:

pip install keras opencv-python numpy matplotlib seaborn pandas scikit-learn timm

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
KeyFrameFeatureExtraction.py		KeyFrameFeatureExtraction.py
README.md		README.md
image_feature_extraction_with_ViT.py		image_feature_extraction_with_ViT.py
key_frame_extraction.py		key_frame_extraction.py
label_encoder.pkl		label_encoder.pkl
lstm.py		lstm.py
model.h5		model.h5
model_weights.h5		model_weights.h5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ViT-Video-Feature-Extraction

Keyframe Extraction (`key_frame_extraction.py`)

Overview

Usage

Vision Transformer Feature Extraction (`image_feature_extraction_with_ViT.py`)

Overview

Usage

LSTM Classification (`lstm.py`)

Overview

Usage

Requirements

About

Releases

Packages

Languages

jeslinpjames/ViT-Video-Feature-Extraction

Folders and files

Latest commit

History

Repository files navigation

ViT-Video-Feature-Extraction

Keyframe Extraction (key_frame_extraction.py)

Overview

Usage

Vision Transformer Feature Extraction (image_feature_extraction_with_ViT.py)

Overview

Usage

LSTM Classification (lstm.py)

Overview

Usage

Requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Keyframe Extraction (`key_frame_extraction.py`)

Vision Transformer Feature Extraction (`image_feature_extraction_with_ViT.py`)

LSTM Classification (`lstm.py`)

Packages