Commit

readme - add issues and limitations
vmikk committed Dec 7, 2024
1 parent 7fe94db commit 5a3d403
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions README.md
@@ -224,4 +224,10 @@ In contrast, SHA-3 is the slowest hash function in this benchmark, generating th
> of the hash algorithms used in `seqhasher`. Other implementations may yield different results,
> and these values should not be interpreted as a definitive ranking of the algorithms themselves.
## Known issues and limitations

- Seqhasher does not take line wrapping in FASTA files into account: whitespace characters are stripped from the sequence before processing, so wrapped and unwrapped copies of the same sequence produce the same hash;
- The tool may not work correctly with sequences containing non-ASCII characters;
- IUPAC ambiguity codes (R, Y, S, W, K, M, B, D, H, V, N), characters denoting gaps (`-` or `.`), **and any other non-DNA characters** are handled as is, so the resulting hash depends on them;
- Empty sequences return an empty hash.
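
The behaviour described in the list above can be illustrated with a minimal Python sketch. This is not the `seqhasher` implementation: the function name `hash_sequence`, the choice of SHA-1, and the UTF-8 encoding step are assumptions made only for illustration. It shows whitespace being stripped before hashing, ambiguity codes and gap characters changing the hash, and an empty sequence yielding an empty hash.

```python
import hashlib


def hash_sequence(seq: str) -> str:
    """Hypothetical stand-in for sequence hashing (not seqhasher's code)."""
    # Remove all whitespace, including line breaks from wrapped FASTA records.
    cleaned = "".join(seq.split())
    # An empty sequence gives an empty hash instead of the digest of "".
    if not cleaned:
        return ""
    # Every remaining character is hashed as is: ambiguity codes, gaps, etc.
    # SHA-1 is an assumption here; any hash function would show the same effect.
    return hashlib.sha1(cleaned.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    # Wrapped and unwrapped copies of a sequence hash identically.
    print(hash_sequence("ACGT\nACGT") == hash_sequence("ACGTACGT"))
    # An ambiguity code or a gap character changes the hash.
    print(hash_sequence("ACGN") == hash_sequence("ACGT"))
    print(hash_sequence("AC-GT") == hash_sequence("ACGT"))
    # An empty (or whitespace-only) sequence gives an empty string.
    print(repr(hash_sequence(" \n")))
```

If ambiguity codes or gaps should not affect the result, the sequences need to be filtered or normalized before hashing, since nothing in this scheme does that automatically.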
