Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproducible error using peptide_coefficient_predictor.py #2

Open
Arthfael opened this issue Jul 27, 2022 · 4 comments
Open

Reproducible error using peptide_coefficient_predictor.py #2

Arthfael opened this issue Jul 27, 2022 · 4 comments

Comments

@Arthfael
Copy link

Arthfael commented Jul 27, 2022

Hi,
I am trying to run the scripts on my own data, but since I am encountering an error with it, I also just tried on each of the 12 example pre-processed tsv datasets provided in the download. All give me the same error, so this cannot be coming from my data:
One of the dimensions in the output is <= 0 due to downsampling in conv2d_17. Consider increasing the input size. Received input shape [None, 40, 20, 1] which would produce output shape with a zero or negative value in a dimension.
I suspect that this has to do with my configuration, in particular I know that python version can be critical. I am using python 3.10.5. Which version was used in the paper? I tried a few other 3.5 variants but on some I could not install all requirements.

Note:
This was done from R using package reticulate to create a python 3.10.5 virtual environment, install all requirements listed in the two scripts I am using, namely numpy, pandas, argparse, matplotlib, seaborn, tensorflow, keras, sklearn and scipy.
Because reticulate does not let me run scripts with arguments, a copy of the relevant scripts was made then edited so that the inputs, outputs and if necessary relevant parameters would reflect the correct paths/values already within the script. The datasets were successfully one-hot-encoded this way, then I called on each results file the edited peptide_coefficient_predictor.py script, applying default parameters except n_runs and seq_length.

@justin-a-sanders
Copy link
Contributor

I think your python versions are fine given you were able to install all the dependencies. Looking at the R script you sent, it looks like you are not passing in the --filter_size argument to peptide_coefficient_predictor.py. The default value for filter_size when one isn't provided is 103 (a mistake we should correct) while your sequence length is only 40, so the two are incompatible. Try passing in a filter size of 3 instead and seeing if that resolves the issue.

In general, you can refer to train_models.sh for some example calls to peptide_coefficient_predictor.py using our recommended parameters.

@Arthfael
Copy link
Author

Arthfael commented Oct 19, 2022 via email

@wsnoble
Copy link
Contributor

wsnoble commented Oct 19, 2022

You are right: the parameters are defined in this script, but many had no "help" field defined. I have added them now. However, a detailed discussion of how to tune the hyperparameters of a deep neural network is outside the scope of Pepper documentation.

@Arthfael
Copy link
Author

Arthfael commented Oct 19, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants