The release version of refseq.genomes.k21.s1000.msh #177

@guosongjia

Dear developers and other users,
I'm trying to use mash screen to detect potential contaminants in my NGS data, following the tutorial provided by the developers: https://mash.readthedocs.io/en/latest/tutorials.html#screening-a-read-set-for-containment-of-refseq-genomes
For my analysis, I downloaded the pre-sketched RefSeq archive from: https://gembox.cbcb.umd.edu/mash/refseq.genomes.k21s1000.msh
When I manually inspect the results, I cannot find any reliable hits (identity >= 0.95) in the output for some of my samples, and the expected organism is missing as well. One possible reason is that the pre-sketched RefSeq database provided by the developers is too old, so that neither my expected organism nor the potential contaminant is included.
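For reference, the manual inspection described above can be sketched in Python roughly as follows. This is only an illustration, assuming the tab-separated column order given in the Mash documentation for mash screen output (identity, shared-hashes, median-multiplicity, p-value, query-ID, query-comment). The function name `filter_screen_hits` and the two example rows are made up for this sketch and are not real results.

```python
# Column order assumed from the Mash documentation for `mash screen` output.
COLUMNS = ["identity", "shared_hashes", "median_multiplicity",
           "p_value", "query_id", "query_comment"]


def filter_screen_hits(lines, min_identity=0.95):
    """Return hits with identity >= min_identity, highest identity first.

    `lines` is any iterable of tab-separated text lines (an open file works).
    """
    hits = []
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # skip blank or malformed lines
        # Pad with an empty comment if the last column is absent.
        fields += [""] * (len(COLUMNS) - len(fields))
        record = dict(zip(COLUMNS, fields))
        record["identity"] = float(record["identity"])
        if record["identity"] >= min_identity:
            hits.append(record)
    hits.sort(key=lambda r: r["identity"], reverse=True)
    return hits


# Hypothetical rows, for illustration only (not real mash output).
example = (
    "0.99957\t991/1000\t24\t0\tGCF_000001.1_genomic.fna.gz\tHypothetical organism A\n"
    "0.91234\t150/1000\t2\t1e-50\tGCF_000002.1_genomic.fna.gz\tHypothetical organism B\n"
)

for hit in filter_screen_hits(example.splitlines()):
    print(hit["identity"], hit["query_id"], hit["query_comment"])
```

With real data, the same function could be called on an open file handle, for example one pointing at a (hypothetically named) screen.tab file to which the mash screen output was redirected. An empty result at the 0.95 threshold corresponds to the "no reliable hits" situation described above.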
My question: can anyone tell me which RefSeq release this pre-sketched database was built from?
According to a previous issue from 2020 (#139), the RefSeq version was release 93.
A related question: has anyone tried to build a sketched RefSeq database manually from the latest release? I would welcome any suggestions on this idea!
Best,
Guo-Song
