Skip to content

This is the PyTorch implementation of our ACM Multimedia 2021 paper titled "Video Transformer for Deepfake Detection with Incremental Learning".

License

Notifications You must be signed in to change notification settings

sohailahmedkhan/Video-Transformer-for-Deepfake-Detection

Repository files navigation

Video Transformer for Deepfake Detection with Incremental Learning

This is a PyTorch implementation of our ACM Multimedia 2021 paper titled, "Video Transformer for Deepfake Detection with Incremental Learning". Paper is available here: https://arxiv.org/abs/2108.05307

deepfakedetectionpipeline

Usage

We heavily rely on 3D dense face alignment (3DDFA) from here: https://github.com/cleardusk/3DDFA_V2 for UV texture map generation. We already packaged the 3DDFA_V2 repository into this repository. To build 3DDFA code within our repository, please follow instructions as below.

Tested on Linux and macOS. For Windows, please refer to this issue: https://github.com/cleardusk/3DDFA_V2#FQA.

1: Clone this repo

git clone https://github.com/sohailahmedkhan/Video-Transformer-for-Deepfake-Detection.git
cd Video-Transformer-for-Deepfake-Detection

2: Build the cython version of NMS, Sim3DR

sh ./build.sh

3: Train Image ViT

python train_Image_ViT.py

To train image transformer, run following command. NOTE: You need to add the location of train and validation directories. You can do this by manually editing "train_Image_ViT.py" file.

To easily run training using Jupyter Notebook, you can use ImageTransformer.ipynb file.

For any questions, please raise "Issues".

Acknowledgements

I would like to thank, (1) cleardusk for the 3DDFA implementation, (2) LukeMelas for transformer architecture and pre-trained ViT models, (3) Ross Wightman for XceptionNet.

Cite

If you find this code useful, please cite the following:

@article{Khan2021VideoTF,
  title={Video Transformer for Deepfake Detection with Incremental Learning},
  author={Sohail Ahmed Khan and Hang Dai},
  journal={Proceedings of the 29th ACM International Conference on Multimedia},
  year={2021}
}

@inproceedings{guo2020towards,
    title =        {Towards Fast, Accurate and Stable 3D Dense Face Alignment},
    author =       {Guo, Jianzhu and Zhu, Xiangyu and Yang, Yang and Yang, Fan and Lei, Zhen and Li, Stan Z},
    booktitle =    {Proceedings of the European Conference on Computer Vision (ECCV)},
    year =         {2020}
}

@misc{3ddfa_cleardusk,
    author =       {Guo, Jianzhu and Zhu, Xiangyu and Lei, Zhen},
    title =        {3DDFA},
    howpublished = {\url{https://github.com/cleardusk/3DDFA}},
    year =         {2018}
}

@misc{rw2019timm,
  author = {Ross Wightman},
  title = {PyTorch Image Models},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  doi = {10.5281/zenodo.4414861},
  howpublished = {\url{https://github.com/rwightman/pytorch-image-models}}
}

About

This is the PyTorch implementation of our ACM Multimedia 2021 paper titled "Video Transformer for Deepfake Detection with Incremental Learning".

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published