================================================================================
MOSAIK 2.2 Release Distribution Documentation                         2013-09-20
Wan-Ping Lee & Michael Stromberg    Marth Lab, Boston College Biology Department
================================================================================
News:
1. 2014-03-26
A bug that produced incorrect bases in reverse-complement alignments has been
fixed. Please use a version later than 2.2.19 to get the fix.
2. Version 2.2 is about four-fold faster than the previous version, thanks to an
improved Smith-Waterman (SW) strategy and the use of SIMD SW [1]. This is also
the version that we use on 1000 Genomes Project [2] Phase-III datasets.
Overview:
MOSAIK is a stable, sensitive, open-source program for mapping second- and
third-generation sequencing reads to a reference genome. Uniquely among current
mapping tools, MOSAIK can align reads generated by all the major sequencing
technologies, including Illumina, Applied Biosystems SOLiD, Roche 454,
Ion Torrent and Pacific Biosciences SMRT.
License:
MOSAIK is open source software, licensed under the GPL 2.0+, and is provided
free-of-charge to all users. All we ask in return is that you acknowledge its
use in publications, talks, etc.
Citation:
http://dx.plos.org/10.1371/journal.pone.0090581
We ask that you cite this paper if you use MOSAIK in work that leads to
publication.
Obtaining:
To download MOSAIK, use git to clone the most recent development tree:
% git clone https://github.com/wanpinglee/MOSAIK.git
Alternatively, download it from the Google Code page:
https://code.google.com/p/mosaik-aligner/downloads/list
If you encounter issues with the development HEAD, or simply wish to obtain the
most recent stable revision (2.2.3), check out that commit:
% git checkout 915f6feca4d321a79c29de34f6512f92b751b457
Compilation:
MOSAIK requires g++ and the standard C and C++ development libraries.
To build, enter the src directory, glance at the README file, and type "make".
If static libraries are not installed, edit includes/linux.inc and delete
"-static".
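The "-static" removal described above can also be scripted. The following is a
minimal sketch, not part of the MOSAIK distribution: the function name
strip_static_flag is hypothetical, and the path in the usage comment assumes
that includes/linux.inc is reached from the src directory, which may differ in
your checkout.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with MOSAIK): delete the standalone
# "-static" flag from a make include file, keeping a .bak copy of the original.
strip_static_flag() {
    inc_file=$1
    if [ ! -f "$inc_file" ]; then
        echo "no such file: $inc_file" >&2
        return 1
    fi
    # Match "-static" only as a whole word, so that flags such as
    # "-static-libgcc" are left alone.
    sed -i.bak -E 's/(^|[[:space:]])-static([[:space:]]|$)/\2/g' "$inc_file"
}

# Assumed usage, run from the src directory (the path is an assumption):
#   strip_static_flag includes/linux.inc && make
```

The backup copy (includes/linux.inc.bak in the assumed usage) makes it easy to
restore the original flags if static linking becomes available later.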
Running the demo:
To see a full list of the available command line options, enter the bin
directory and run any of the programs without parameters.
Included in this release is an Illumina data set of E. coli reads.
To test MOSAIK with this data set, enter the demo directory and run the
Build.sh and Align.sh scripts, in that order.
Example:
> cd demo
> ./Build.sh
> ./Align.sh
The resulting BAM file (read.mka.bam) will be found in the
demo/fastq directory.
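As a quick sanity check after the demo finishes, one can confirm that the
output file exists and starts with the gzip magic bytes (BAM files are
BGZF-compressed, and BGZF is a gzip variant). This is a sketch only: the
function name looks_like_bam is hypothetical, and the check does not validate
the alignments themselves.

```shell
#!/bin/sh
# Hypothetical helper (not part of MOSAIK): succeed if the file is non-empty
# and begins with the gzip magic bytes 1f 8b, as every BGZF/BAM file does.
looks_like_bam() {
    bam_file=$1
    [ -s "$bam_file" ] || return 1
    magic=$(head -c 2 "$bam_file" | od -An -tx1 | tr -d ' \n')
    [ "$magic" = "1f8b" ]
}

# Assumed usage from the top-level directory, after running the demo scripts:
#   looks_like_bam demo/fastq/read.mka.bam && echo "BAM output looks OK"
```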
References:
[1]: https://github.com/mengyao/Complete-Striped-Smith-Waterman-Library
[2]: http://www.1000genomes.org/