Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some questions #1

Open
legendaryInnovation opened this issue May 2, 2023 · 19 comments
Open

some questions #1

legendaryInnovation opened this issue May 2, 2023 · 19 comments
Assignees
Labels
help wanted Extra attention is needed question Further information is requested

Comments

@legendaryInnovation
Copy link

image
how to solve it

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

I will check what arguments I used, and I will respond here.

Also, I will try to update the repo as soon as I have enough time with that kind of information. I will also add deployment for this later.

@legendaryInnovation
Copy link
Author

I will check what arguments I used, and I will respond here.

Also, I will try to update the repo as soon as I have enough time with that kind of information. I will also add deployment for this later.

Thank you

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

The code for handling the dataset is from https://github.com/andreeadeac22/graph_coattention
I will list the steps that worked for me on Windows 10 with PowerShell.
I will add requirements.txt for this repo soon, so you can install them.

The first step is to download the dataset into the /data/decagon folder.
cd .\data\
mkdir decagon
cd .\decagon\

Download the dataset with wget (if it works, it has some problems if you are using Windows)
wget http://snap.stanford.edu/decagon/bio-decagon-combo.tar.gz;

Or try to use curl
curl http://snap.stanford.edu/decagon/bio-decagon-combo.tar.gz

If this doesn't work, just download the dataset manually by visiting the link http://snap.stanford.edu/decagon/bio-decagon-combo.tar.gz and chose the folder /data/decagon/ for the download location.

The next step is to unpack the data with tar
tar -xvzf bio-decagon-combo.tar.gz;

After that, you can run the data_download.py script with the following arguments
decagon -p decagon/

image

The progress bar doesn't seem to work in this case. Just wait until the script finishes executing. It will probably take around 10 minutes.

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

The next step is to run data_preprocess.py with these arguments
decagon -p ./

image

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

Finally, run split_cv_data with these arguments
decagon -p ./

image

@njmarko njmarko self-assigned this May 2, 2023
@njmarko njmarko added help wanted Extra attention is needed question Further information is requested labels May 2, 2023
@njmarko
Copy link
Owner

njmarko commented May 2, 2023

Executing split_cv_data.py takes around 20 to 30 minutes.
I added requirements.txt to the repo for now.
Also, there should be a folds directory in /data/decagon. I added a fix for that now, but you can also create it manually.

Let me know if these steps worked for you.

@legendaryInnovation
Copy link
Author

legendaryInnovation commented May 2, 2023 via email

@legendaryInnovation
Copy link
Author

Executing split_cv_data.py takes around 20 to 30 minutes. I added requirements.txt to the repo for now. Also, there should be a folds directory in /data/decagon. I added a fix for that now, but you can also create it manually.

Let me know if these steps worked for you.

image
Thank you. The previous ones are very normal. May I ask what the situation is

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

Executing split_cv_data.py takes around 20 to 30 minutes. I added requirements.txt to the repo for now. Also, there should be a folds directory in /data/decagon. I added a fix for that now, but you can also create it manually.
Let me know if these steps worked for you.

image Thank you. The previous ones are very normal. May I ask what the situation is

Just create a new folder named folds in the decagon folder.
Structure should look something like this
/data/decagon/folds/

@njmarko
Copy link
Owner

njmarko commented May 2, 2023

Also, remember to pull new changes from the repo. It should automatically create that folder now. And requirements.txt should be at the top level also. I don't see requirements.txt in your screenshot right now, so you may not have the newest version.

@legendaryInnovation
Copy link
Author

Also, remember to pull new changes from the repo. It should automatically create that folder now. And requirements.txt should be at the top level also. I don't see requirements.txt in your screenshot right now, so you may not have the newest version.
Thank you for your reply, but now I'm really sorry that I have a new question.

image

@njmarko
Copy link
Owner

njmarko commented May 3, 2023

Also, remember to pull new changes from the repo. It should automatically create that folder now. And requirements.txt should be at the top level also. I don't see requirements.txt in your screenshot right now, so you may not have the newest version.
Thank you for your reply, but now I'm really sorry that I have a new question.

image

You should run the train.py with the following parameters, and maybe it will solve your issue
-dataset decagon -input_data_path data/decagon/

@legendaryInnovation
Copy link
Author

Thank you very much
I want to know if this error means I need to reinstall Pytorch

AttributeError: module 'torch' has no attribute 'sparse_csc'

1 similar comment
@legendaryInnovation
Copy link
Author

Thank you very much
I want to know if this error means I need to reinstall Pytorch

AttributeError: module 'torch' has no attribute 'sparse_csc'

@njmarko
Copy link
Owner

njmarko commented May 4, 2023

Thank you very much I want to know if this error means I need to reinstall Pytorch

AttributeError: module 'torch' has no attribute 'sparse_csc'

Maybe. Try creating a virtual environment, and then install the requirements that are located in the requirements.txt

@legendaryInnovation
Copy link
Author

Thank you very much I want to know if this error means I need to reinstall Pytorch
AttributeError: module 'torch' has no attribute 'sparse_csc'

Maybe. Try creating a virtual environment, and then install the requirements that are located in the requirements.txt

OK,I will try it at once.

@njmarko
Copy link
Owner

njmarko commented May 4, 2023

Since you are using PyCharm, you can do like this:

File > Settings > Project: gnn-polypharmacy-ddi > Python Interpreter > Add Interpreter > Add Local Interpreter > Virtualenv Environment

Then simply select a python version. I think I was using python 3.10

@legendaryInnovation
Copy link
Author

Since you are using PyCharm, you can do like this:

File > Settings > Project: gnn-polypharmacy-ddi > Python Interpreter > Add Interpreter > Add Local Interpreter > Virtualenv Environment

Then simply select a python version. I think I was using python 3.10

I just reinstalled the environment, but there were more problems. Then, I used the previous environment again and displayed a new problem. May I ask what the problem is? Thank you.
BrokenPipeError: [Errno 32] Broken pipe

@njmarko
Copy link
Owner

njmarko commented May 4, 2023

Since you are using PyCharm, you can do like this:
File > Settings > Project: gnn-polypharmacy-ddi > Python Interpreter > Add Interpreter > Add Local Interpreter > Virtualenv Environment
Then simply select a python version. I think I was using python 3.10

I just reinstalled the environment, but there were more problems. Then, I used the previous environment again and displayed a new problem. May I ask what the problem is? Thank you. BrokenPipeError: [Errno 32] Broken pipe

Not sure why that happens. Try to disable logging maybe, by commenting out wandb logging.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants