chiaraalbi46/ViT

This repo contains the work done for the Machine Learning course project at the University of Florence.

A simple, personal implementation of the Vision Transformer (ViT) is provided here.

Environment and packages

The following paragraphs explain the steps to recreate a usable environment. The conda package manager and Python 3.8 were used.

  • A usable conda environment can be created directly from the requirements.txt file, using the command:

    conda create --name <env> --file requirements.txt

    The requirements.txt file was exported from an environment on Windows, so some packages may not resolve correctly on other operating systems. In that case, a new conda environment can of course be created from scratch with these commands:

    conda create --name ViT python=3.8
    conda activate ViT
    conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch
    conda install -c anaconda -c conda-forge -c comet_ml comet_ml 
    

    PyTorch is used as the machine learning framework. We suggest looking at this link for installation commands for different setups. Comet.ml support is included to monitor the training and validation metrics. Registration is required (you can use your GitHub account). There are many ways to integrate Comet support in a project, but we suggest saving the generated API key, as described here, in a .comet.config file and copying the following lines into your script:

    import comet_ml

    # Experiment() reads the API key from the .comet.config file automatically
    experiment = comet_ml.Experiment()
    

    The .comet.config file has to be placed in the same folder as the script. A blank .comet.config file is provided in the repo.
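
    For reference, a filled-in .comet.config typically looks like the following (the key value is a placeholder; use the API key generated from your Comet account):

    [comet]
    api_key = YOUR-API-KEY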

Download CIFAR-10 and CIFAR-100

On Windows 10 we encountered some difficulties downloading the CIFAR datasets through the standard torchvision functions. We solved the problem by executing two additional lines, needed only the first time the datasets are downloaded. Please look at download_cifar.py to get the datasets; a sketch of the idea follows.
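
The sketch below shows a common workaround for this torchvision download issue on Windows, namely disabling HTTPS certificate verification. This is an assumption about what the two extra lines do, so treat download_cifar.py as the authoritative version:

    import ssl
    import torchvision

    # Assumed workaround: disable certificate verification so torchvision
    # can fetch the CIFAR archives (needed only for the first download)
    ssl._create_default_https_context = ssl._create_unverified_context

    torchvision.datasets.CIFAR10(root='./data', download=True)
    torchvision.datasets.CIFAR100(root='./data', download=True)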

Experiments

To experiment with Vision Transformer performance, execute main.py for training from scratch or pretraining, and fine_tune.py for fine-tuning a pretrained model. Look inside the files for more information on the hyper-parameters.
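
For example, from the activated ViT environment (hyper-parameters are set inside the scripts themselves; any command-line flags are not covered here):

    python main.py       # training from scratch or pretraining
    python fine_tune.py  # fine-tuning a pretrained model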

"Inspecting ViT"

Interesting visualizations of some parts of the model are provided, to better understand what happens during the training process. In particular, embedding_weights_plot.py implements:

  • Linear embedding weights visualization
  • Position embeddings similarities visualization (see the sketch after this list)
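
The idea behind the position embedding similarity plot can be sketched as follows; here pos_embed is a stand-in for the learned position embeddings, which in the repo would come from a trained ViT:

    import torch
    import matplotlib.pyplot as plt

    # Stand-in for learned position embeddings of shape (1, num_patches + 1, dim)
    pos_embed = torch.randn(1, 65, 128)

    pos = pos_embed[0, 1:]                      # drop the class-token embedding
    pos = pos / pos.norm(dim=-1, keepdim=True)  # L2-normalize each embedding
    sim = pos @ pos.T                           # cosine-similarity matrix

    plt.imshow(sim.detach().numpy())
    plt.colorbar()
    plt.title('Position embedding cosine similarities')
    plt.show()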

attention_plots.py implements:

  • Heads visualization (this was not in the original paper)
  • Attention rollout visualization (sketched below)
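
Attention rollout (Abnar & Zuidema, 2020) combines the attention matrices of all layers into a single token-to-token relevance map. A minimal sketch of the standard formulation, not necessarily the repo's exact implementation:

    import torch

    def attention_rollout(attn_maps):
        # attn_maps: list of per-layer attention tensors, shape (heads, tokens, tokens)
        rollout = torch.eye(attn_maps[0].size(-1))
        for attn in attn_maps:
            a = attn.mean(dim=0)                 # average attention over heads
            a = a + torch.eye(a.size(-1))        # account for residual connections
            a = a / a.sum(dim=-1, keepdim=True)  # re-normalize rows
            rollout = a @ rollout                # propagate through the layers
        return rollout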
