Add standalone PDB ↔ token inference scripts #6

mahdip72 · 2025-10-03T16:10:39Z

Description

This PR adds two lightweight standalone scripts for PDB ↔ token conversion, addressing issue #5.

Changes Made

foldtoken/pdb_to_token.py: Converts PDB files to VQ-VAE token IDs
foldtoken/token_to_pdb.py: Converts VQ-VAE token IDs back to PDB files

These scripts follow the logic from the original reconstruct.py but with simplified I/O and no batching for easier standalone use.

Motivation

These scripts enable independent validation and benchmarking of FoldToken's tokenization pipeline, as discussed in issue #5. They maintain accuracy while providing a more accessible interface for researchers.

@gaozhangyang Please review and let me know if any adjustments are needed!

…entical names

mahdip72 added 8 commits May 25, 2025 20:38

feat: pdb to token code

3ea134f

feat: token to pdb code

f20db9a

fix: issues

69edd76

doc: installation.sh on python env

3c4ca50

feat: progress bar

d059f74

ref: make the pdb names identical

f7bb648

feat: recursively collect PDB files from subdirectories

d85f125

feat: ensure unique output filenames for PDB files if find already id…

c185da5

…entical names

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add standalone PDB ↔ token inference scripts #6

Add standalone PDB ↔ token inference scripts #6

Uh oh!

mahdip72 commented Oct 3, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add standalone PDB ↔ token inference scripts #6

Are you sure you want to change the base?

Add standalone PDB ↔ token inference scripts #6

Uh oh!

Conversation

mahdip72 commented Oct 3, 2025

Description

Changes Made

Motivation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant