Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Macro: Enumeration of linear and branch chains. #3222

Closed
isidoraotasevic opened this issue Aug 30, 2023 · 2 comments · Fixed by #3634
Closed

Macro: Enumeration of linear and branch chains. #3222

isidoraotasevic opened this issue Aug 30, 2023 · 2 comments · Fixed by #3634

Comments

@isidoraotasevic
Copy link
Collaborator

isidoraotasevic commented Aug 30, 2023

Background

This task covers enumeration of linear and branch chains.
AP - attachment point

Peptide linear chain - several (>=2) peptide monomers connected by bonds between R2 attachment point of one monomer and R1 attachment point of the next monomer. Several peptide chains could be connected to each other by the bonds that are distinct from R2-R1. 

RNA linear chain - several RNA monomers (phosphates and sugars), connected by bonds  between R2 attachment point of one monomer and R1 attachment point of the next monomer, AND bases, connected to sugars by bond established between R3 AP of sugar and R1 AP of base. 
Several RNA chains could be connected to each other by the R1-R1 bond between two phosphates or by the connections to additional AP on sugars, phosphates and bases. 

Requirements

For peptides:

  1. Enumeration of linear chain (all other connections except R2-R1 are ignored)
    1.0 Starting from version 2.23 we decided to start enumeration for single peptide as well - approved by @olganaz and @ljubica-milovic
    image

1.1 When at least two monomers are connected to each other, the system shall start with enumeration process.
1.2 The system shall start enumeration from the monomer, which has occupied R2 attachment point and:

  • no R1 attachment point
  • free R1 attachment point
  • occupied R1 attachment point with the bond R1-Ri, i<>2
    image
    image

THIS POINT IS NO LONG VALID, DELETED - 1.3 If there are no monomers mentioned in 1.2 enumeration shouldn't be performed.
1.4 If two monomers are connected through R2-R1 bond, then the number of next monomer should be n+1, where n is the number of previous monomer. The number of monomer should be displayed according to mockups (see comments).
image

  1. Other connections
    2.1 All connections except R2->R1 are considered as branch connections. Branch connections are ignored in enumeration process.
    2.2 The chains forming a branch should be enumerated independently.
    image
    2.3 Two monomers connected with any bond except R2-R1 are considered as two chains made up from single monomer which are connected to each other.

For RNA:

  1. Linear chain
    1.1 A set of RNA monomers (phosphates, sugars) with the bonds established between R2 and R1 attachment points are considered as core of RNA chain.
    1.2 Enumeration should be started from the phosphate or sugar, which has:
    -no R1 attachment point
    -free R1 attachment point
    -occupied R1 attachment point with the bond R1-Ri, i<>2
    1.3 Enumeration should start even single RNA (added as result of conversation with @olganaz)

1.3 If there are no monomers mentioned in 1.2 enumeration shouldn't be performed.
1.4 The system should walk along sugar-phosphate core from R2 to R1 until it meets the sugar with the base (attached to R3 AP via R3-R1 bond). Such base should be counted. It's number should be displayed according to mockups (see in comments).
image

  1. Branch chain
    2.1 All connections except R2-R1 for core and R3-R1 for bases are considered as branch connections. Branch connections are ignored in enumeration process.
    2.2 The chains forming a branch should be enumerated independently.
    image

Mixed chains

  1. When RNA chain is connected to peptide chain, each chain shall be counted separately.
  2. If there is CHEM, connecting different monomers, then we should consider as different chains connected via chem. The chains are enumerated separately. Chem shouldn't be enumerated
@OlZhur OlZhur added this to the Polymer Editor MVP milestone Aug 30, 2023
@OlZhur OlZhur assigned Andrey-Antonov and unassigned Nitvex Sep 1, 2023
@isidoraotasevic isidoraotasevic changed the title Enumeration of peptide chains (linear format): Enumeration of peptide chains (linear and branch format): Sep 8, 2023
@isidoraotasevic isidoraotasevic changed the title Enumeration of peptide chains (linear and branch format): Enumeration of linear chains Sep 8, 2023
@isidoraotasevic isidoraotasevic changed the title Enumeration of linear chains Enumeration of linear and branch chains. Sep 8, 2023
@Andrey-Antonov
Copy link
Collaborator

image
image
image

@olganaz olganaz self-assigned this Nov 17, 2023
@rrodionov91 rrodionov91 changed the title Enumeration of linear and branch chains. Macro: Enumeration of linear and branch chains. Nov 28, 2023
rrodionov91 added a commit that referenced this issue Nov 28, 2023
- done enumeration for peptides and rna
@olganaz olganaz assigned rrodionov91 and unassigned Andrey-Antonov and olganaz Nov 29, 2023
rrodionov91 added a commit that referenced this issue Nov 29, 2023
- done enumeration for peptides and rna
@rrodionov91 rrodionov91 linked a pull request Nov 30, 2023 that will close this issue
9 tasks
rrodionov91 added a commit that referenced this issue Dec 4, 2023
* #3222 - Macro: Enumeration of linear and branch chains.
- done enumeration for peptides and rna

---------

Co-authored-by: Roman Rodionov <roman_rodionov@epam.com>
@AlexeyGirin
Copy link
Collaborator

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment