Issue with output-format pdb vs output-format mmcif #209

ameya98 · 2025-03-17T18:50:46Z

Hi, thanks for the wonderful implementation and repo!

I was experimenting with Boltz1 on a kinase:

version: 1  # Optional, defaults to 1
sequences:
  - protein:
      id: ABL
      sequence: GAMDPSSPNYDKWEMERTDITMKHKLGGGQYGEVYEGVWKKYSLTVAVKTLKEDTMEVEEFLKEAAVMKEIKHPNLVQLLGVCTREPPFYIITEFMTYGNLLDYLRECNRQEVNAVVLLYMATQISSAMEYLEKKNFIHRDLAARNCLVGENHLVKVADFGLSRLMTGDTYTAHAGAKFPIKWTAPESLAYNKFSIKSDVWAFGVLLWEIATYGMSPYPGIDLSQVYELLEKDYRMERPEGCPEKVYELMRACWQWNPSDRPSFAEIHQAFETMFQES

With --output_format mmcif, I get something very reasonable as visualized in PyMOL:

but with --output_format pdb:

Am I doing something wrong?


ameya98 · 2025-03-17T19:18:52Z

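Ahh, it turns out the issue is in this line, which messes up the column offset (PDB records are fixed-width with a single column for the chain ID, so a multi-character chain name like ABL shifts every field after it): https://github.com/jwohlwend/boltz/blob/main/src/boltz/data/write/pdb.py#L37C1-L38C1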

        chain_tag = chain["name"]

which should be:

        chain_tag = chain["name"][0]

I'll send in a PR to fix!
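
To illustrate why a multi-character chain name breaks the output, here is a minimal sketch of a fixed-width ATOM record. The helpers `format_atom_line` and `parse_x` are hypothetical and are not the code in `pdb.py`; the coordinates are made up. The sketch only assumes the standard PDB layout, in which the chain ID occupies column 22 and the x coordinate occupies columns 31-38.

```python
def format_atom_line(serial, atom_name, res_name, chain_tag, res_seq, x, y, z):
    """Hypothetical, simplified ATOM record writer (not Boltz's implementation).

    Columns: 1-6 record name, 7-11 serial, 13-16 atom name, 18-20 residue name,
    22 chain ID, 23-26 residue number, 31-38 x, 39-46 y, 47-54 z.
    """
    return (
        f"ATOM  {serial:>5} {atom_name:<4} {res_name:>3} {chain_tag}"
        f"{res_seq:>4}    {x:8.3f}{y:8.3f}{z:8.3f}"
    )


def parse_x(line):
    """Read the x coordinate the way a fixed-column PDB reader would."""
    return float(line[30:38])


# Made-up coordinates for a single CA atom.
args = dict(serial=1, atom_name="CA", res_name="GLY", res_seq=1,
            x=11.104, y=6.134, z=-6.504)

good = format_atom_line(chain_tag="A", **args)
bad = format_atom_line(chain_tag="ABL", **args)        # full chain name
fixed = format_atom_line(chain_tag="ABL"[0], **args)   # first character only

print(good)
print(bad)
print(len(bad) - len(good))  # every field after the chain ID moves right by this much
```

With the three-character name, the residue number and coordinates no longer sit in the columns a viewer reads, so it picks up truncated or misaligned values. Taking only the first character restores the layout, at the cost that two chains whose names share a first letter would get the same chain ID.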

ameya98 linked a pull request Mar 17, 2025 that will close this issue

Use only first character of chain name for PDB #210

Open
