-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathorganism_verification.Rmd
126 lines (94 loc) · 1.84 KB
/
organism_verification.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
---
title: "organismVerification"
output: html_document
---
# purpose: identify samples/variables with non-human descriptors from the dataset (dataset21and22)
# isolation source
```{r}
# values of interest:
# feces from 52 yr old caucasian male
# feces from 53 yr old caucasian female
# feces of 51 yr old caucasian female with stage I colorectal cancer
# infant under 1 year of age, breastfeeding, had prelonged diarrhea
# ^ circle back and add ages for these values
# other than that, no values of interest with non-human descriptors
```
# sample type
```{r}
View(table(dataset21and22$`sample type`))
"
values of interest:
whole organism
cell culture
pure culture
Cell culture
Bacterial culture
Culture
mixed culture
culture
Pure culture
cultured bacterial isolate
single bacterium
Microbe isolate
Pure cell culture
soil # remove this one for sure
SINGLE STRAIN
"
```
# investigation type
```{r}
View(table(dataset21and22$`investigation type`))
# nothing of interest
```
# isolate
```{r}
" values of interest:
Clostridioides difficile
Salmonella wangata
Enterobacteriaceae
"
```
# strain
```{r}
"
values of interest:
Vancomycin resistant Enterococcus faecium
Salmonella enterica subsp. enterica serovar Enteritidis
"
```
# serovar
```{r}
"
values of interest:
Typhimurium
"
# this one is a headache. circle back to it after you've removed samples based on other variables
# same with sub species
```
# lab host
```{r}
"
Sus scrofa (gnotobiotic piglets)
"
```
```{r}
```
# common name
```{r}
"
many descriptors here for uncultured bacteria.
check with Jess for how to proceed with these samples
if the sample is from an identified species, should it be excluded?
same with `Inferred taxonomy`, `genus`, and `species`
"
```
# organism modifier note
```{r}
"
vancomycin resistant enterococci
"
```
# component organism
```{r}
# Aeromonas caviae
```