-
Notifications
You must be signed in to change notification settings - Fork 3
/
slides.Rpres
64 lines (45 loc) · 1.94 KB
/
slides.Rpres
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
ChromeQC
========================================================
author: S. Jackman, J. Chu, E. Erhan, N. Keivanfar, S. La, S. Menon, T. Mozgacheva, B. Orabi, C. Yang, H. Younesy
date: 2017-10-22
autosize: true
### Summarize sequencing library quality of 10x Genomics Chromium linked reads
Inspiration
========================================================
Loupe from 10x Genomics
- Reports inferred DNA molecule sizes
- Number of barcodes (GEMs)
- Number of molecules per barcode
Some Loupe Stats
========================================================
Input DNA Stats | Barcode Stats
------------- | -------------
![plot of chunk unnamed-chunk-2](slides-figure/loupe_summary_page_input_DNA.png) | ![plot of chunk unnamed-chunk-2](slides-figure/loupe_summary_page_GEM.png)
Inspiration
========================================================
FastQC & MultiQC:
- FastQC: Reports base qualities, sequence distribution, GC content, etc
- MultiQC: Aggregate multiple FastQC reports
MultiQC Example
========================================================
![plot of chunk unnamed-chunk-2](slides-figure/mutltiQC.png)
========================================================
# ChromeQC Pipeline
![plot of chunk unnamed-chunk-2](slides-figure/chromeQC.png)
Pipeline: Subsample
========================================================
- From subset of fastq files, and subset of read pairs
- Randomly select 4000 out of ~4M whitelisted barcodes
- Extract reads with selected barcodes for downstream analysis
- Report histrograms of unmatched and of whitelisted barcodes
Pipeline: Read Alignment
========================================================
- minimap
- GRCh38 reference genome
- Group by barcode, sort by position
Pipeline: Molecule Size Extraction
========================================================
Heuristic:
- Reads with
- Any two reads < 60Kbp away are in the same molecule
- Any reads with same position and orientation are discarded except for one