Explicitly permit eg N+m in MM tag (PR#799)
The text already states that an unmodified base of N means we count
any base type, but the "base N, code N" row in the table is a little
misleading as to the intention.  It was intended to mean any unspecified
modification, in the same way that C+C is any unspecified C mod, but in
this case it applies to all bases rather than a specific base type.

However, that doesn't settle whether we can record specific
mods against any "fundamental" source base.  Clarified this by adding
an extra line to the table and some text.  (Note, however, that
downstream processing tools may still perform compatibility checks
and reject N+m when the SEQ base is a T.)

Fixes #785
jkbonfield committed Oct 28, 2024
1 parent 5a6f5e9 commit 03505ac
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions SAMtags.tex
@@ -532,6 +532,7 @@ \subsection{Base modifications}

An unmodified base of `{\tt N}' means count any base in {\sf SEQ}, not only those of `{\tt N}'.
Thus `{\tt N+n,100;}' means the 101st base is Xanthosine (n), irrespective of the sequence composition.
A fundamental base of `{\tt N}' may also be used with a base-specific modification code to force the counting to be applied per base rather than per base-type.

The standard code types and their associated ChEBI values are listed
below, taken from Viner {\it et al.}%
@@ -567,6 +568,7 @@ \subsection{Base modifications}
\hline
N & n & Xao & Xanthosine & 18107 \\
N & N & & Ambiguity code; any mod & \\
N & any & & Mod applied to any base & \\
\end{tabular}
\end{center}
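
As an illustration of the counting rule this change describes, the following is a minimal Python sketch, not code from the specification or from any existing tool. The function name `mm_positions` is hypothetical, and the sketch assumes a single MM entry on the `+` strand of a forward-mapped read, with no validation that the modification code is compatible with the SEQ base.

```python
def mm_positions(seq, entry):
    """Return (codes, 0-based SEQ positions) for one MM entry such as 'C+m,1,0;'.

    Hypothetical helper for illustration only: handles a single entry on
    the '+' strand and assumes SEQ is in its original orientation.
    """
    head, *deltas = entry.rstrip(";").split(",")
    base, strand = head[0], head[1]
    codes = head[2:].rstrip("?.")  # drop the optional '?' or '.' status flag
    if strand != "+":
        raise ValueError("this sketch only handles the '+' strand")

    # A fundamental base of N counts every base in SEQ, not only literal Ns.
    candidates = [i for i, b in enumerate(seq) if base == "N" or b == base]

    positions = []
    idx = -1
    for d in deltas:
        idx += int(d) + 1  # skip d candidate bases, then take the next one
        positions.append(candidates[idx])
    return codes, positions


seq = "ACGTCCA"
per_type = mm_positions(seq, "C+m,1,0;")  # counts only the C bases in SEQ
per_base = mm_positions(seq, "N+m,1,0;")  # counts every base in SEQ
```

With the same skip counts, `C+m` is resolved against the C bases only, while `N+m` is resolved against every position in SEQ, which is the per-base counting the added sentence describes. A real tool may additionally check that the base found at each position is plausible for the modification code.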
