Rework chirality detection in restype determination for PDB loading#698
Open
roccomoretti wants to merge 10 commits into
Open
Rework chirality detection in restype determination for PDB loading#698roccomoretti wants to merge 10 commits into
roccomoretti wants to merge 10 commits into
Conversation
It bases its decisions on the type returned by get_representative type, which may be a misnomer. Also, as it stands now, it's superflous to the decide_is_l_aa/decide_is_d_aa/decide_is_known_achiral (None of the types which would trigger d_l_threeletter_codes_are_same_for_aa() are listed in those sets.
Member
Author
|
Aside from cosmetic changes in debug-level tracers, the swm_dna_bridge and rna_denovo_dna_bridge integration tests change as the input PDBs type the first nucleotide in the chain as |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
A side quest from PR #687
Currently, we have a rather brittle atom-name-based approach to chirality detection on PDB read-in, which isn't as robust as it could be, in large part due to trying to decide chirality early on. Instead of imposing L/D identity early, we can defer it until later with the atom name matching heuristics. This way, the D & L version of amino acids can co-exist with non-aa residues with the same three letter code (e.g. when we're loading a CCD type with the same three letter code as an unrelated Rosetta residue type), and the ResidueTypeFinder should pick the best match.
I've also reworked the atom name matching heuristics, implementing the previously proposed idea to separate out virtual atom counts from non-virtual counts.