Video Transformer for Deepfake Detection with Incremental Learning

This is a PyTorch implementation of our ACM Multimedia 2021 paper titled, "Video Transformer for Deepfake Detection with Incremental Learning". Paper is available here: https://arxiv.org/abs/2108.05307

Usage

We heavily rely on 3D dense face alignment (3DDFA) from here: https://github.com/cleardusk/3DDFA_V2 for UV texture map generation. We already packaged the 3DDFA_V2 repository into this repository. To build 3DDFA code within our repository, please follow instructions as below.

Tested on Linux and macOS. For Windows, please refer to this issue: https://github.com/cleardusk/3DDFA_V2#FQA.

1: Clone this repo

git clone https://github.com/sohailahmedkhan/Video-Transformer-for-Deepfake-Detection.git
cd Video-Transformer-for-Deepfake-Detection

2: Build the cython version of NMS, Sim3DR

sh ./build.sh

3: Train Image ViT

python train_Image_ViT.py

To train image transformer, run following command. NOTE: You need to add the location of train and validation directories. You can do this by manually editing "train_Image_ViT.py" file.

To easily run training using Jupyter Notebook, you can use ImageTransformer.ipynb file.

For any questions, please raise "Issues".

Acknowledgements

I would like to thank, (1) cleardusk for the 3DDFA implementation, (2) LukeMelas for transformer architecture and pre-trained ViT models, (3) Ross Wightman for XceptionNet.

Cite

If you find this code useful, please cite the following:

@article{Khan2021VideoTF,
  title={Video Transformer for Deepfake Detection with Incremental Learning},
  author={Sohail Ahmed Khan and Hang Dai},
  journal={Proceedings of the 29th ACM International Conference on Multimedia},
  year={2021}
}

@inproceedings{guo2020towards,
    title =        {Towards Fast, Accurate and Stable 3D Dense Face Alignment},
    author =       {Guo, Jianzhu and Zhu, Xiangyu and Yang, Yang and Yang, Fan and Lei, Zhen and Li, Stan Z},
    booktitle =    {Proceedings of the European Conference on Computer Vision (ECCV)},
    year =         {2020}
}

@misc{3ddfa_cleardusk,
    author =       {Guo, Jianzhu and Zhu, Xiangyu and Lei, Zhen},
    title =        {3DDFA},
    howpublished = {\url{https://github.com/cleardusk/3DDFA}},
    year =         {2018}
}

@misc{rw2019timm,
  author = {Ross Wightman},
  title = {PyTorch Image Models},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  doi = {10.5281/zenodo.4414861},
  howpublished = {\url{https://github.com/rwightman/pytorch-image-models}}
}

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
FaceBoxes		FaceBoxes
Sim3DR		Sim3DR
__pycache__		__pycache__
bfm		bfm
configs		configs
dataset_utils		dataset_utils
docs/images		docs/images
examples/inputs		examples/inputs
models		models
nms		nms
utils		utils
weights		weights
ImageTransformer.ipynb		ImageTransformer.ipynb
LICENSE		LICENSE
README.md		README.md
TDDFA.py		TDDFA.py
TDDFA_ONNX.py		TDDFA_ONNX.py
__init__.py		__init__.py
box_utils.py		box_utils.py
build.py		build.py
build.sh		build.sh
config.py		config.py
functions.py		functions.py
latency.py		latency.py
nms_wrapper.py		nms_wrapper.py
onnx.py		onnx.py
prior_box.py		prior_box.py
readme.md		readme.md
speed_cpu.py		speed_cpu.py
timer.py		timer.py
train_image_ViT.py		train_image_ViT.py
train_video_ViT.py		train_video_ViT.py
utils_ViT.py		utils_ViT.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Transformer for Deepfake Detection with Incremental Learning

Usage

Acknowledgements

Cite

About

Releases

Packages

Languages

License

sohailahmedkhan/Video-Transformer-for-Deepfake-Detection

Folders and files

Latest commit

History

Repository files navigation

Video Transformer for Deepfake Detection with Incremental Learning

Usage

Acknowledgements

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages