Today we added automated Multi-Locus Sequence Typing (MLST) to One Codex. MLST is a powerful epidemiological tool that is based on curated collections of conserved mutations in core marker genes. This common reference standard is used to differentiate closely-related isolates of the same species, with many common species having hundreds or thousands of defined MLST profiles. Datasets that are identified as being isolates or single-genome assemblies will be automatically analyzed and tagged with the detected ST label.
Today is a big day for One Codex, where we’re launching out of beta! This marks the introduction of a number of new features (and many under-the-hood improvements), and incorporates much of the feedback our users have provided this year. You will notice some changes to our website, and the platform can now be found at app.onecodex.com (note that all previous links to the beta site will still work).
Today we’re launching our new sample comparison tool on the One Codex Beta Platform. The tool enables quick selection and comparison of any of your samples and displays the abundance of taxonomic groups in each sample as a stacked bar graph: The comparison view supports large side-by-side comparisons, viewing data at a particular taxonomic level and/or abundance, filtering to specific clades, and relative vs. absolute scaling modes. See an example and try out the tool here
We’re pleased to announce that we’re launching a new feature on One Codex today: instant sequence search. With this new tool, simply enter a nucleotide sequence – whether a read from a FASTQ file or a section of a reference genome – and we’ll instantly compare it against our One Codex database. How it works Our instant classifier works by splitting up the input sequence into all overlapping k-mers and then comparing each against our reference database, using a k=31.
We’re happy to announce that we’ve just released larger, improved microbial reference libraries on One Codex. More data The RefSeq Database now includes over 7,000 reference and representative genomes from NCBI, while the One Codex Database holds nearly 34,000 different bacterial, viral, fungal, and archeal genomes. This is more than a 45% and 20% increase, respectively, from the last releases of the RefSeq and One Codex databases. As with all of our data releases, you should continue to see improvements in the specificity of your analyses by using our more comprehensive reference libraries.