license updates and help proofreading
Brian Ondov committed Oct 22, 2015
1 parent cd47c5c commit 989c955
Showing 33 changed files with 96 additions and 80 deletions.
18 changes: 14 additions & 4 deletions LICENSE.txt
@@ -3,15 +3,25 @@ PURPOSE
Mash is a fast sequence distance estimator that uses the MinHash
algorithm and is designed to work with genomes and metagenomes in the
form of assemblies or reads. It is implemented in C++ and is
distributed with KSeq (lh3lh3.users.sourceforge.net/kseq.shtml) and
MurmurHash3 (code.google.com/p/smhasher/wiki/MurmurHash3).
distributed with:

KSeq
lh3lh3.users.sourceforge.net/kseq.shtml
MIT License

MurmurHash3
code.google.com/p/smhasher/wiki/MurmurHash3
Public domain

Open Bloom Filter
https://code.google.com/p/bloom/source/browse/trunk/bloom_filter.hpp
Common Public License

COPYRIGHT LICENSE

Copyright © 2015, Battelle National Biodefense Institute (BNBI);
all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
Adam Phillippy
all rights reserved. Authored by: Brian Ondov, Todd Treangen,
Sergey Koren, and Adam Phillippy

This Software was prepared for the Department of Homeland Security
(DHS) by the Battelle National Biodefense Institute, LLC (BNBI) as
6 changes: 3 additions & 3 deletions src/mash/Command.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

@@ -162,7 +162,7 @@ Command::Command()
addAvailableOption("individual", Option(Option::Boolean, "i", "Sketch", "Sketch individual sequences, rather than whole files.", ""));
addAvailableOption("warning", Option(Option::Number, "w", "Sketch", "Probability threshold for warning about low k-mer size.", "0.01", 0, 1));
addAvailableOption("unique", Option(Option::Boolean, "u", "Sketch", "Remove (most) unique kmers using a Bloom Filter. This is useful for reducing noise from sequencing errors in read sets. See Bloom filter options below. Incompatible with -i.", ""));
addAvailableOption("genome", Option(Option::Size, "g", "Bloom", "Expected genome size (raw bases, or with K/M/G/T). Helps pick the Bloom Filter size. Should be within an order of magnitude of the true size. Implies -u.", "5M"));
addAvailableOption("genome", Option(Option::Size, "g", "Bloom", "Expected genome size (raw bases or with K/M/G/T). Helps pick the Bloom Filter size. Should be within an order of magnitude of the true size. Implies -u.", "5M"));
addAvailableOption("memory", Option(Option::Size, "m", "Bloom", "Maximum Bloom Filter memory usage (raw bytes or with K/M/G/T). More memory will allow more thorough detection of unique kmers, so this should be as high as is practical for the computing environment (though it may not actually be used). Implies -u.", "1G"));
addAvailableOption("bloomError", Option(Option::Number, "e", "Bloom", "Target false-negative rate for Bloom filtering unique kmers with. Implies -u.", "0.1", 0, 1));
addAvailableOption("noncanonical", Option(Option::Boolean, "n", "Sketch", "Non-canonical. By default, canonical DNA kmers (alphabetical minima of forward-reverse pairs) are used, and kmers with non-acgtACGT characters are ignored. This option uses kmers as they appear and allows all characters.", ""));
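The `noncanonical` option text above describes canonical k-mers as the alphabetical minimum of a k-mer and its reverse complement, with k-mers containing non-acgtACGT characters ignored. A minimal C++ sketch of that definition follows. It is an illustration only, not code from this commit: the names `complementBase` and `canonicalKmer`, the choice to uppercase before comparing, and the use of an empty string to signal an ignored k-mer are all assumptions.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper: complement of an uppercase DNA base.
// Returns 0 for any character outside ACGT.
char complementBase(char base)
{
    switch (base)
    {
        case 'A': return 'T';
        case 'C': return 'G';
        case 'G': return 'C';
        case 'T': return 'A';
        default:  return 0;
    }
}

// Hypothetical helper: canonical form of a k-mer, taken here as the
// alphabetically smaller of the k-mer and its reverse complement.
// Assumption: input is uppercased first, and an empty string is
// returned when the k-mer contains a character outside acgtACGT.
std::string canonicalKmer(const std::string & kmer)
{
    std::string forward(kmer.size(), ' ');
    std::string reverse(kmer.size(), ' ');

    for (std::string::size_type i = 0; i < kmer.size(); i++)
    {
        char base = static_cast<char>(std::toupper(static_cast<unsigned char>(kmer[i])));
        char comp = complementBase(base);

        if (comp == 0)
        {
            return ""; // non-acgtACGT character: this k-mer is ignored
        }

        forward[i] = base;
        // The complement of position i lands at the mirrored position.
        reverse[kmer.size() - 1 - i] = comp;
    }

    return std::min(forward, reverse);
}
```

With this sketch, `canonicalKmer("TTTA")` gives `"TAAA"` because the reverse complement sorts first, while a k-mer containing `N` gives an empty string.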
4 changes: 2 additions & 2 deletions src/mash/Command.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandContain.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandContain.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

6 changes: 3 additions & 3 deletions src/mash/CommandDistance.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

@@ -25,7 +25,7 @@ CommandDistance::CommandDistance()
{
name = "dist";
summary = "Estimate the distance of query sequences to references.";
description = "Estimate the distance of each query sequence to the reference. Both the reference and queries can be fasta or fastq, gzipped or not, or mash sketch files (.msh) with matching k-mer sizes. Query files can also be files of file names (see -l). Whole files are compared by default (see -i). The output fields are [reference-ID, query-ID, distance, p-value, shared-hashes].";
description = "Estimate the distance of each query sequence to the reference. Both the reference and queries can be fasta or fastq, gzipped or not, or Mash sketch files (.msh) with matching k-mer sizes. Query files can also be files of file names (see -l). Whole files are compared by default (see -i). The output fields are [reference-ID, query-ID, distance, p-value, shared-hashes].";
argumentString = "<reference> <query> [<query>] ...";

useOption("help");
4 changes: 2 additions & 2 deletions src/mash/CommandDistance.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandFind.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandFind.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandInfo.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandInfo.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

30 changes: 18 additions & 12 deletions src/mash/CommandList.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

@@ -36,12 +36,7 @@ void CommandList::print()

cout << endl << "Mash version " << version << endl << endl;

cout << "Copyright © 2015, Battelle National Biodefense Institute (BNBI); all rights" << endl;
cout << "reserved. Authored by: Brian Ondov, Todd Treangen, and Adam Phillippy." << endl;

cout << "This program is free software and comes with ABSOLUTELY NO WARRANTY, though you" << endl;
cout << "are welcome to redistribute it under certain conditions. Source code is" << endl;
cout << "available at github.com/marbl/mash. For more details, type 'mash --license'." << endl;
cout << "Type 'mash --license' for license and copyright information." << endl;

cout << endl << "Usage:" << endl << endl;

@@ -103,8 +98,19 @@ PURPOSE\n\
Mash is a fast sequence distance estimator that uses the MinHash\n\
algorithm and is designed to work with genomes and metagenomes in the\n\
form of assemblies or reads. It is implemented in C++ and is\n\
distributed with KSeq (lh3lh3.users.sourceforge.net/kseq.shtml) and\n\
MurmurHash3 (code.google.com/p/smhasher/wiki/MurmurHash3).\n\
distributed with:\n\
\n\
KSeq\n\
lh3lh3.users.sourceforge.net/kseq.shtml\n\
MIT License\n\
\n\
MurmurHash3\n\
code.google.com/p/smhasher/wiki/MurmurHash3\n\
Public domain\n\
\n\
Open Bloom Filter\n\
https://code.google.com/p/bloom/source/browse/trunk/bloom_filter.hpp\n\
Common Public License\n\
\n";
#ifdef DIST_LICENSE
cout << "\n\
@@ -118,8 +124,8 @@ cout << "\n\
COPYRIGHT LICENSE\n\
\n\
Copyright © 2015, Battelle National Biodefense Institute (BNBI);\n\
all rights reserved. Authored by: Brian Ondov, Todd Treangen, and\n\
Adam Phillippy\n\
all rights reserved. Authored by: Brian Ondov, Todd Treangen,\n\
Sergey Koren, and Adam Phillippy\n\
\n\
This Software was prepared for the Department of Homeland Security\n\
(DHS) by the Battelle National Biodefense Institute, LLC (BNBI) as\n\
4 changes: 2 additions & 2 deletions src/mash/CommandList.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandPaste.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandPaste.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandSketch.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/CommandSketch.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashList.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashList.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashPriorityQueue.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashPriorityQueue.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashSet.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/HashSet.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/Sketch.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/Sketch.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/ThreadPool.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/ThreadPool.hxx
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/capnp/MinHash.capnp
@@ -1,6 +1,6 @@
# Copyright © 2015, Battelle National Biodefense Institute (BNBI);
# all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
# Adam Phillippy
# all rights reserved. Authored by: Brian Ondov, Todd Treangen,
# Sergey Koren, and Adam Phillippy
#
# See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/hash.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/hash.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/mash.cpp
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/memcpyLink.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.

4 changes: 2 additions & 2 deletions src/mash/version.h
@@ -1,6 +1,6 @@
// Copyright © 2015, Battelle National Biodefense Institute (BNBI);
// all rights reserved. Authored by: Brian Ondov, Todd Treangen, and
// Adam Phillippy
// all rights reserved. Authored by: Brian Ondov, Todd Treangen,
// Sergey Koren, and Adam Phillippy
//
// See the LICENSE.txt file included with this software for license information.
