Skip to content

Releases: datagato/proteogram

Incorporation of atomistic molecular dynamics simulations - Proteogram v2

Choose a tag to compare

@datagato datagato released this 17 May 20:40
f2407ab

What's Changed

  • Molecular dynamics simulations for new energy calculations and structure refinement - Proteogram v2
  • Reorg repo for v1 and v2 approaches
  • Dealing better with PTMs
  • Better evaluation logic
  • Inclusion of Foldseek as a benchmark method
  • Single chain inference, update licenses and improved documentation

Full Changelog: v0.0.2...v0.0.3

Attached to this release

  • Demo model file for testing - proteogram_demo_resnet18_finetuned_lr0.001_bs8_e29_85.5acc.pt
  • Demo embedding vectors for all proteins used in the v2 study (2724 proteogram embeddings) - proteogram_demo_embeddings_scope2.08-nr60_20-200.pkl (Note: these are a limited set of embeddings to be able to demo the scripts only, not for production workflows) - a limited, non-redundant (60% sequence identity cutoff) set of Proteogram embeddings for proteins 20-200 residues long (set that was used for training and eval in the e2e pipeline)

Acknowledgements

  • Thank you to @charms-research for advising on the molecular dynamics simulation application for this release
  • Thank you to @swapnilashtekar for contributing a path for reproducibility and validating the codebase and workflows

ISCB Paper Submission Release

Choose a tag to compare

@datagato datagato released this 09 Feb 03:14

The primary purpose of this release is to add a model file for measuring similarity. Place the name of this model file with its path into your config.yml, e.g., model_file: /your/path/resnet18_scratch_without_sampler_bs8_lr0.01.pt.