Improvement: allow both num and scaled to be set #538
Comments
argh, this leads to all sorts of problems with searching, I think. will
articulate later if there's interest.
but, you could imagine allowing a search and then just saying "well, here
are places where sensitivity would be limited/nonexistent"

Hmm, it wouldn't change anything with searching if

After writing the docs in #436 I now understand this idea. Yes, I like!

This involves changes in the C++ code, so I don't think we should implement it before #424 is merged =]

This would also allow a

relevant: marbl/Mash#133

selectors #1072 could help with framing how to do the automatic conversion.

after #1420, this should be much easier (maybe even straightforward?) to implement!

In the current implementation `n` and `scaled` are exclusive: if you set one, the other must be unset.

If we have enough hashes, we can convert from a scaled minhash back to a regular mash sketch (by truncating the hashes list at `n`). In cases where we do have enough hashes (like in a metagenome), it doesn't make much difference, but for cases where setting a scaled value leads to only a handful of hashes (think: viruses), we can't apply this conversion.

I think we can support the conversion anyway, if we change a bit how we save the data internally:

- `n` and `scaled`
- `n`
- `n`, apply the `max_hash` comparison (derived from `scaled`)
- `scaled` set, discard any hash over `max_hash`
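
To make the conversion discussed above concrete, here is a rough sketch in plain Python. It is not the project's actual code. It assumes 64-bit hash values, takes `max_hash` to be roughly `2**64 / scaled`, and reads the proposal as "always keep the `n` smallest hashes, and past those apply the `max_hash` cutoff". The function names (`max_hash_for_scaled`, `scaled_to_num`, `combined_sketch`) are made up for illustration.

```python
HASH_SPACE = 2**64  # assumption: hashes are 64-bit unsigned integers


def max_hash_for_scaled(scaled):
    """Cutoff implied by a scaled value (assumed formula: hash space / scaled)."""
    if scaled < 1:
        raise ValueError("scaled must be >= 1")
    return HASH_SPACE // scaled


def scaled_to_num(hashes, n):
    """Convert a scaled sketch to a num sketch by truncating at n.

    Fails when the scaled sketch holds fewer than n hashes, which is the
    small-genome (e.g. virus) case described in the issue.
    """
    unique = sorted(set(hashes))
    if len(unique) < n:
        raise ValueError(f"need {n} hashes, only have {len(unique)}")
    return unique[:n]


def combined_sketch(hashes, n, scaled):
    """Hypothetical combined rule with both n and scaled set.

    The n smallest hashes are always kept; any further hash is kept only
    if it is not over max_hash.
    """
    max_hash = max_hash_for_scaled(scaled)
    unique = sorted(set(hashes))
    head = unique[:n]
    tail = [h for h in unique[n:] if h <= max_hash]
    return head + tail
```

Under this reading, a sketch built with both values set always holds at least `n` hashes (when the input has that many), so truncating it at `n` gives a regular num sketch even for small genomes, while dropping everything over `max_hash` gives the scaled view.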