Skip to content

Commit

Permalink
Merge pull request #244 from nextstrain/clade-i-updates
Browse files Browse the repository at this point in the history
Tweak clade I definitions and exclude misaligned/overdiverged
  • Loading branch information
corneliusroemer authored Apr 30, 2024
2 parents c7adcb9 + 64bd225 commit 22835a9
Show file tree
Hide file tree
Showing 3 changed files with 18 additions and 9 deletions.
18 changes: 11 additions & 7 deletions phylogenetic/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,24 +16,28 @@ If you're unfamiliar with Nextstrain builds, you may want to follow our
The easiest way to run this pathogen build is using the Nextstrain
command-line tool from within the `phylogenetic/` directory:

cd phylogenetic/
nextstrain build .
```bash
cd phylogenetic/
nextstrain build .
```

Once you've run the build, you can view the results with:

nextstrain view .
```bash
nextstrain view .
```

### Example build

You can run an example build using the example data provided in this repository via:

```
```bash
nextstrain build . --configfile build-configs/ci/config.yaml
```

When the build has finished running, view the output Auspice trees via:

```
```bash
nextstrain view .
```

Expand All @@ -53,7 +57,7 @@ If you analyze and plan to publish using these data, please contact these labs f
Within the analysis pipeline, these data are fetched from data.nextstrain.org and written to `data/` with:

```bash
nextstrain build . data/sequences.fasta data/metadata.tsv
nextstrain build . data/sequences.fasta.xz data/metadata.tsv.gz
```

### Run analysis pipeline
Expand Down Expand Up @@ -107,7 +111,7 @@ It can also be used as a small subset of real-world data.
Example data should be updated every time metadata schema is changed or a new clade/lineage emerges.
To update, run:

```sh
```bash
nextstrain build . update_example_data -F \
--configfiles build-configs/ci/config.yaml build-configs/chores/config.yaml
```
Expand Down
4 changes: 2 additions & 2 deletions phylogenetic/defaults/clades.tsv
Original file line number Diff line number Diff line change
Expand Up @@ -2,8 +2,8 @@
clade gene site alt
outgroup nuc 179226 T

clade I nuc 86502 T
clade I nuc 35352 A
clade I nuc 87560 T
clade I nuc 136015 A

clade II nuc 86502 G
clade II nuc 150970 A
Expand Down
5 changes: 5 additions & 0 deletions phylogenetic/defaults/exclude_accessions.txt
Original file line number Diff line number Diff line change
Expand Up @@ -74,3 +74,8 @@ PP098595
PP098578

HM172544 # cidofovir-resistant lab strain that is derived from DQ011155 (h/t Andrew Rambaut)

TMP0003 # Overdiverged 23MPX1786C
TMP0045 # Overdiverged RDC-NKV-GOM-MPOX-004

NC_003310 # Overdiverged RefSeq NC_003310

0 comments on commit 22835a9

Please sign in to comment.