GitHub - cole-group/FEgrow_AL_data: Data to reproduce the study "Active learning driven prioritisation of compounds from on-demand libraries targeting the SARS-CoV-2 main protease"

Data and scripts required to generate the plots for:

Active learning driven prioritisation of compounds from on-demand libraries targeting the SARS-CoV-2 main protease, by Ben Cree, Mateusz Bieniek, Siddique Amin, Akane Kawamura and Daniel Cole.

Please see the preprint for further details.

Reproducing plots in the paper:

Install the environment via:

mamba create --name fegrow_al_data python=3.11 jupyter rdkit numpy==1.24.4 scikit-learn umap-learn seaborn chemplot -c conda-forge -c anaconda
conda activate fegrow_al_data
pip install useful_rdkit_utils==0.2.7

To run the jupyter notebook:

python -m ipykernel install --user --name=fegrow_al_data
jupyter notebook

and open the notebook file plots.ipynb.

Description of data:

The file cs_49k.csv contains the SMILES and predicted affinity of the oracle dataset.

The folders rep_* contain five replicas of active learning hyperparameter tuning. For each experiment, folders contain SMILES and predicted affinity of compounds selected at each active learning cycle.

The same information is provided for the four prospective runs in the folders mpro-*, for the four different objective functions used.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
mpro-al-cs/generated		mpro-al-cs/generated
mpro-al-pK-beta01		mpro-al-pK-beta01
mpro-al-pK-beta10/generated		mpro-al-pK-beta10/generated
mpro-al-plip		mpro-al-plip
rep_1		rep_1
rep_2		rep_2
rep_3		rep_3
rep_4		rep_4
rep_5		rep_5
LICENSE		LICENSE
README.md		README.md
cs_49k.csv		cs_49k.csv
onebyone_it14_over6cnnaffinity.sdf		onebyone_it14_over6cnnaffinity.sdf
plots.ipynb		plots.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reproducing plots in the paper:

Description of data:

About

Releases

Packages

Languages

License

cole-group/FEgrow_AL_data

Folders and files

Latest commit

History

Repository files navigation

Reproducing plots in the paper:

Description of data:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages