HeSer.Pytorch

unofficial implementation of Few-Shot Head Swapping in the Wild
you can find official version here
I did not use the discriminator from the paper and just follow DCT-NET

enviroment

torch
opencv-python
tensorboardX
imgaug
face-alignment

# download pretrain model
cd process
bash download_weight.sh

How to RUN

train

I only train one ID for driving

Data Process

download voxceleb2
a. I just download voxceleb2 test dataset, you can use this website
b. You can unzip this file like this:

+--- dataset
|   +--- vox2_test_txt
|   |   +--- txt
|   |   |   +--- id00017
|   |   |   |   +--- 01dfn2spqyE
|   |   |   |   |   +--- 00001.txt
|   |   |   |   +--- 5MkXgwdrmJw
|   |   |   |   |   +--- 00002.txt
|   |   |   |   +--- 7t6lfzvVaTM
|   |   |   |   |   +--- 00003.txt
|   |   |   |   |   +--- 00004.txt
|   |   |   |   |   +--- 00005.txt
|   |   |   |   |   +--- 00006.txt
|   |   |   |   |   +--- 00007.txt

c. Install yt-dlp and aria2c by yourself. I think you can do that through internet.

cd process
python download_and_process.py

d. the dataset is like:

voceleb2/
|-- id00017
|   |-- 01dfn2spqyE
|   |   `-- 00.npy
|   |-- 5MkXgwdrmJw
|   |   |-- 00.npy
|   |   `-- 5MkXgwdrmJw.mp4
|   |-- 7t6lfzvVaTM
|   |   |-- 00.npy
|   |   |-- 01.npy
|   |   |-- 02.npy
|   |   |-- 03.npy
|   |   |-- 04.npy
|   |   |-- 05.npy
|   |   |-- 06.npy
|   |   |-- 07.npy
|   |   |-- 08.npy
|   |   |-- 09.npy
|   |   `-- 7t6lfzvVaTM.mp4

crop and align

cd process 
python process_raw_video.py

the dataset is like:

process/
|-- img
|   |-- id00017
|   |   |-- 5MkXgwdrmJw-0000
|   |   |   |-- 1273.png
|   |   |   |-- 1274.png
|   |   |   |-- 1275.png
|   |   |   |-- 1276.png
|   |   |   |-- 1277.png
|   |   |   |-- 1278.png
|   |   |   |-- 1279.png
|   |   |   |-- 1280.png
|   |   |   |-- 1281.png
|   |   |   |-- 1282.png
|   |   |   |-- 1283.png
|   |   |   |-- 1284.png
|   |   |   |-- 1285.png
|   |   |   |-- 1286.png
|   |   |   |-- 1287.png
|   |   |   |-- 1288.png
|   |   |   |-- 1289.png

Remove data below threshold
```
cd process
python filter_idfiles.py
```

face parsing
follow LVT to get face parsing
the mask data is like:

process/mask/
|-- id00017
|   |-- 5MkXgwdrmJw-0000
|   |   |-- 1273.png
|   |   |-- 1274.png
|   |   |-- 1275.png
|   |   |-- 1276.png
|   |   |-- 1277.png
|   |   |-- 1278.png
|   |   |-- 1279.png
|   |   |-- 1280.png

Train Align

I just use id00061 to train align
check model/AlignModule/config.py to put your own path and params
for single gpu

python  train.py --model align --batch_size 8 --checkpoint_path checkpoint --lr 2e-4 --print_interval 100 --save_interval 100 --dist

for multi gpu

python -m torch.distributed.launch train.py --model align --batch_size 8 --checkpoint_path checkpoint --lr 2e-4 --print_interval 100 --save_interval 100

Train Blend

check model/BlendModule/config.py to put your own path and params
for single gpu

python  train.py --model blend --batch_size 8 --checkpoint_path checkpoint --lr 2e-4 --print_interval 100 --save_interval 100 --dist

for multi gpu

python -m torch.distributed.launch train.py --model blend --batch_size 8 --checkpoint_path checkpoint --lr 2e-4 --print_interval 100 --save_interval 100

Inference

follow inference.py, change your own model path and input images

python inference.py

Show

The result is just overfitting

Credits

latent-pose-reenactment model and implementation:
https://github.com/shrubb/latent-pose-reenactment Copyright © 2020, shrubb.
License https://github.com/shrubb/latent-pose-reenactment/blob/master/LICENSE.txt

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
assets		assets
dataloader		dataloader
model		model
process		process
trainer		trainer
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
ReadMe.md		ReadMe.md
inference.py		inference.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HeSer.Pytorch

enviroment

How to RUN

train

Data Process

Train Align

Train Blend

Inference

Show

Credits

About

Releases

Packages

Languages

License

JackZhouSz/HeSer.Pytorch

Folders and files

Latest commit

History

Repository files navigation

HeSer.Pytorch

enviroment

How to RUN

train

Data Process

Train Align

Train Blend

Inference

Show

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages