Inspired by the transformative impact of self-attention-based Transformer models across computer vision tasks such as image classification and object detection, ActionFormer emerged as a pioneering solution for temporal action localization in videos. The original ActionFormer, however, is computationally expensive because it relies on a single-shot design in which one transformer handles both action classification and boundary regression.
In response to these challenges, this repository presents an enhanced version of ActionFormer tailored to temporal action localization, with a primary focus on badminton videos. Our approach addresses the limitations of the original model by splitting the task into two distinct stages: initial boundary regression, followed by refined action classification and boundary regression using an improved ActionFormer architecture.
The first stage employs a separate model dedicated to boundary regression, which efficiently approximates action boundaries. Its output is then fed into a streamlined ActionFormer with enhanced attention mechanisms for precise action classification and boundary refinement.
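To make the two-stage design concrete, below is a minimal PyTorch-style sketch of the data flow. The module names (`BoundaryRegressor`, `RefinedActionFormer`), feature dimensions, and head designs are illustrative assumptions, not the repository's actual implementation.

```python
import torch
import torch.nn as nn

class BoundaryRegressor(nn.Module):
    """Stage 1 (illustrative): coarse boundary regression over clip features."""
    def __init__(self, feat_dim=512):
        super().__init__()
        self.head = nn.Sequential(
            nn.Conv1d(feat_dim, 256, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv1d(256, 2, kernel_size=1),  # per-timestep (start, end) offsets
        )

    def forward(self, feats):        # feats: (B, feat_dim, T)
        return self.head(feats)      # (B, 2, T) coarse boundary estimates

class RefinedActionFormer(nn.Module):
    """Stage 2 (illustrative): transformer encoder that classifies the action
    and refines the boundaries produced by stage 1."""
    def __init__(self, feat_dim=512, num_classes=10, num_layers=4):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=feat_dim, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)
        self.cls_head = nn.Linear(feat_dim, num_classes)   # action classification
        self.reg_head = nn.Linear(feat_dim, 2)             # boundary refinement

    def forward(self, feats):        # feats: (B, T, feat_dim)
        h = self.encoder(feats)
        return self.cls_head(h), self.reg_head(h)

# Illustrative two-stage inference on pre-extracted clip features.
feats = torch.randn(1, 512, 128)                      # (B, C, T)
coarse_bounds = BoundaryRegressor()(feats)            # stage 1
logits, refined_bounds = RefinedActionFormer()(feats.transpose(1, 2))  # stage 2
```

In the actual pipeline, stage 2 would presumably attend to features cropped around the stage-1 proposals rather than the full sequence, which is where the computational savings over the single-shot design come from.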
Through this two-stage approach, our model achieves a mean Average Precision (mAP) of 93 on badminton video datasets, a significant improvement over the original ActionFormer, even after transfer learning. This demonstrates the effectiveness of combining specialized boundary regression with ActionFormer's attention mechanisms for highly precise temporal action localization.
For further details on self-attention mechanisms, refer to Attention Is All You Need. To learn more about the original model, see the paper ActionFormer: Localizing Moments of Actions with Transformers. Additionally, details about the task of temporal action localization (TAL) can be found in Temporal Action Localization.
- Point and Winner Prediction: Combines 2D/3D CNNs with an enhanced transformer model to predict rally outcomes (a minimal sketch follows this list).
- Shot Detection: Identifies shot types played by each player during rallies.
- Player Analysis: Analyzes each player's strengths and weaknesses and suggests strategies based on AI insights.
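As an illustration of the CNN-plus-transformer combination used for point and winner prediction, the sketch below uses torchvision's `r3d_18` as a stand-in 3D-CNN backbone and a small transformer encoder to aggregate clip features over a rally. The class name, backbone choice, and tensor shapes are assumptions for illustration only.

```python
import torch
import torch.nn as nn
from torchvision.models.video import r3d_18

class RallyOutcomePredictor(nn.Module):
    """Illustrative: 3D-CNN clip embeddings aggregated by a transformer."""
    def __init__(self, num_outcomes=2):
        super().__init__()
        backbone = r3d_18(weights=None)
        backbone.fc = nn.Identity()          # expose 512-d clip embeddings
        self.backbone = backbone
        layer = nn.TransformerEncoderLayer(d_model=512, nhead=8, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(512, num_outcomes)  # e.g. which player wins

    def forward(self, clips):                # clips: (B, N, 3, T, H, W)
        b, n = clips.shape[:2]
        feats = self.backbone(clips.flatten(0, 1)).view(b, n, -1)
        pooled = self.encoder(feats).mean(dim=1)   # aggregate the rally's clips
        return self.head(pooled)
```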
Backend:
- Python
- PyTorch (Deep Learning framework)
- OpenCV (Computer Vision library)
- Flask (Web framework)
Frontend:
- React (JavaScript framework)
The AI model is trained on annotated badminton video data (an illustrative annotation record follows this list), including:
- Video footage of badminton singles matches
- Annotations for point outcomes and shot types by each player
- Player positions on the court
- Shuttlecock location
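For concreteness, the annotations for a single rally might look like the following. All field names here are hypothetical and do not reflect the dataset's actual schema.

```python
# Illustrative annotation record for one rally. Field names are hypothetical
# and do not reflect the dataset's actual schema.
rally_annotation = {
    "video_id": "match_001",
    "rally": {"start_sec": 1325.4, "end_sec": 1347.9},
    "point_winner": "player_a",
    "shots": [
        {"t_sec": 1326.1, "player": "player_a", "type": "serve"},
        {"t_sec": 1327.8, "player": "player_b", "type": "clear"},
        {"t_sec": 1330.2, "player": "player_a", "type": "smash"},
    ],
    "player_positions": {                 # court coordinates per sampled frame
        "player_a": [[2.1, 4.3], [2.4, 4.0]],
        "player_b": [[3.0, 9.8], [2.7, 9.5]],
    },
    "shuttle_track": [[2.2, 5.0], [2.9, 8.7]],   # sampled shuttlecock locations
}
```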
- Upload a Badminton Match Video: Users upload a video of a badminton singles match (duration: 40 minutes to 2 hours) via the web interface.
- Backend Processing: The backend receives the video and, via Flask, runs the following steps (a minimal endpoint sketch follows this list):
- Video pre-processing with OpenCV
- Shot detection and player tracking using the AI model
- Player analysis generation
- Email Notification: Upon completion of processing, users receive an email notification.
- Results Visualization: Users access the web interface to view AI-generated insights, including:
- Predicted points and winner
- Shot detection by player for each rally
- Player strengths, weaknesses, and potential strategies
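A minimal sketch of the upload-process-notify flow is shown below, assuming a single `/upload` route. The route name, helper functions (`run_inference`, `send_email`), and file-handling details are hypothetical, not the repository's actual API.

```python
from threading import Thread

import cv2
from flask import Flask, jsonify, request

app = Flask(__name__)

def analyze_match(video_path, user_email):
    """Hypothetical long-running job: OpenCV pre-processing, then inference."""
    cap = cv2.VideoCapture(video_path)
    fps = cap.get(cv2.CAP_PROP_FPS) or 30.0
    stride = int(fps)                     # sample roughly one frame per second
    frames, idx = [], 0
    ok, frame = cap.read()
    while ok:
        if idx % stride == 0:
            frames.append(cv2.resize(frame, (224, 224)))
        idx += 1
        ok, frame = cap.read()
    cap.release()
    # run_inference(frames)         # hypothetical: shot detection, tracking, analysis
    # send_email(user_email, ...)   # hypothetical: completion notification

@app.route("/upload", methods=["POST"])
def upload():
    video = request.files["video"]
    path = f"/tmp/{video.filename}"
    video.save(path)
    # A full match takes a long time to analyze, so the job runs off the
    # request thread and the user is notified by email when it completes.
    Thread(target=analyze_match, args=(path, request.form["email"])).start()
    return jsonify({"status": "processing"})
```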
Refer to the respective documentation for installation instructions.
- Clone the repository:

  ```bash
  git clone https://github.com/Team-Phoenix-Force/ShuttleSensei.git
  ```

- Install dependencies:

  ```bash
  cd ShuttleSensei
  pip install -r requirements.txt
  ```

- Run the application:

  ```bash
  python backend/app.py
  ```

  This starts the backend Flask server.

- Frontend setup: follow the instructions in the frontend directory to set up the React application.
A collection of research papers related to badminton AI coaching is available in the references
folder of the repository. Some notable papers include:
- "Actionformer" - Actionformer: Transformer-Based Framework for Action Recognition
- "SlowFastNetwork" - SlowFast Networks for Video Recognition
- "VisionTransformer" - Vision Transformer
- "Attention is all you Need" - Attention is All You Need
Please refer to the respective papers for more detailed information.
You can access the Google Colab notebook here: https://colab.research.google.com/drive/1lyBA13h7edDPfFpxoLN0Ngh403g6TYBv?usp=sharing