Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generated many same sequences but with differnet scores #116

Open
hqdream opened this issue Oct 30, 2024 · 1 comment
Open

Generated many same sequences but with differnet scores #116

hqdream opened this issue Oct 30, 2024 · 1 comment

Comments

@hqdream
Copy link

hqdream commented Oct 30, 2024

I used ProteinMPNN to generate 10000 sequences with some residues fixed with the script submit_example_4_non_fixed.sh. However, I found that in the output results there were many sequences that were exactly the same. More strangely, these same sequences had different scores. How can I solve this problem? Thanks.

@wyattxuanyang
Copy link

Maybe that's from batch_clones = [copy.deepcopy(protein) for i in range(BATCH_COPIES)],which makes copies of sequences. The copied sequences are then processed by function tied_featurize and given random decoding order.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants